Refactor full-text search on several tables (comments, dmails,
forum_posts, forum_topics, notes, and wiki_pages) to use to_tsvector
expression indexes instead of dedicated tsvector columns. This makes
full-text search work the same way across all tables.
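A sketch of what one of these looks like (the index name and the choice
of the `english` search configuration are illustrative, not necessarily
what the real migrations use):

    class AddNoteBodyIndex < ActiveRecord::Migration[6.1]
      def up
        # The expression index is only usable when queries repeat the exact
        # same expression, e.g.:
        #   to_tsvector('english', body) @@ websearch_to_tsquery('english', 'touhou')
        execute "CREATE INDEX index_notes_on_body_tsvector ON notes USING gin (to_tsvector('english', body))"
      end

      def down
        execute "DROP INDEX index_notes_on_body_tsvector"
      end
    end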
API changes:
* Changed /wiki_pages.json?search[body_matches] to match against only
the body. Before, `body_matches` matched against both the title and the body.
* Added /wiki_pages.json?search[title_or_body_matches] to match against
both the title and the body.
* Fixed /dmails.json?search[message_matches] to match against both the
title and the body when doing a wildcard search. Before, a wildcard
search only matched against the body.
* Added /dmails.json?search[body_matches] to match against only the dmail body.
Drop the final dependency on the Postgres test_parser extension.
We also have to remove references to test_parser in the migration where
it was first defined, otherwise replaying all migrations from the
beginning will fail. Replaying all migrations from the beginning is
normally only done in testing.
After this, it should be possible to use a vanilla install of Postgres
with Danbooru. It's still recommended to use Danbooru's Docker image for
Postgres (https://ghcr.io/danbooru/postgres), as other Postgres extensions
may be necessary in the future.
Change the wiki_pages tsvector_update_trigger to use
`pg_catalog.english` instead of `public.danbooru`. This changes how wiki
page text is parsed for full-text search to use the standard English
parser instead of test_parser. This is to prepare for dropping
test_parser. Using test_parser here was wrong anyway because it meant
that punctuation wasn't removed from words when indexing wiki pages for
full-text search.
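For reference, the trigger now looks something like this (the trigger
and column names here are assumptions):

    class ChangeWikiPageSearchParser < ActiveRecord::Migration[6.1]
      def up
        execute <<~SQL
          DROP TRIGGER IF EXISTS trigger_wiki_pages_on_update ON wiki_pages;

          -- tsvector_update_trigger is a built-in Postgres helper that keeps
          -- the body_index column up to date using the given configuration.
          CREATE TRIGGER trigger_wiki_pages_on_update
          BEFORE INSERT OR UPDATE ON wiki_pages
          FOR EACH ROW EXECUTE PROCEDURE
            tsvector_update_trigger(body_index, 'pg_catalog.english', body);
        SQL
      end
    end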
Merge the 100 favorite subtables into a single table.
Previously the favorites table was partitioned by user id into 100
subtables to try to make searching by user id faster. This wasn't really
necessary, and it was probably slower than just making an index on
(favorites.user_id, favorites.id) to satisfy ordfav searches. BTree
indexes are logarithmic, so splitting an index into 100 pieces doesn't
make it 100 times faster to search; it just removes a layer or two from
the tree.
This also adds a uniqueness index on (user_id, post_id) to prevent
duplicate favorites. Previously we had to check for duplicates at the
application layer, which required careful locking to do it correctly.
Finally, this adds an index on favorites.id, which was surprisingly
missing before. This made ordering and deleting favorites by id really
slow because it degraded to a sequential scan.
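A sketch of the resulting indexes, written as a Rails migration (the
real migration also has to copy the rows out of the 100 subtables):

    class MergeFavoriteSubtables < ActiveRecord::Migration[6.1]
      def change
        # Satisfies ordfav: searches: walk one user's favorites in id order.
        add_index :favorites, [:user_id, :id]

        # Rejects duplicate favorites in the database instead of relying on
        # careful locking at the application layer.
        add_index :favorites, [:user_id, :post_id], unique: true

        # The previously missing id index; without it, ordering or deleting
        # favorites by id degraded to a sequential scan.
        add_index :favorites, :id, unique: true
      end
    end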
Add a model for storing image and video metadata for uploaded files.
Metadata is extracted using ExifTool. You will need to install ExifTool
after this commit. ExifTool 12.22 is the minimum required version
because we use the `--binary` option, which was added in this release.
The MediaMetadata model is separate from the MediaAsset model because
some files contain tons of metadata, and most of it is non-essential.
The MediaAsset model represents an uploaded file and contains essential
metadata, like the file's size and type, while the MediaMetadata model
represents all the other non-essential metadata associated with a file.
Metadata is stored as a JSON column in the database.
ExifTool returns all the file's metadata, not just the EXIF metadata.
EXIF is only one of several types of image metadata, which is why we
call it MediaMetadata instead of EXIFMetadata.
A MediaAsset represents an image or video file uploaded to Danbooru. It
stores the metadata associated with the image or video. This is a step
toward decoupling files from posts, so that images can be uploaded
separately from posts.
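A sketch of the two models and the extraction step (the associations,
the `metadata` column name, and the extraction snippet are assumptions;
`-json` makes ExifTool emit its output as a JSON array):

    class MediaAsset < ApplicationRecord
      # Essential metadata lives here: file size, file type, dimensions, etc.
      has_one :media_metadata
    end

    class MediaMetadata < ApplicationRecord
      belongs_to :media_asset
      # `metadata` is a JSON column holding everything else ExifTool returns.
    end

    # Hypothetical extraction step:
    output = `exiftool -json --binary #{file.path}`
    MediaMetadata.create!(media_asset: asset, metadata: JSON.parse(output).first)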
Make it so that when a user removes their own vote, the vote is soft
deleted (the is_deleted flag is set) instead of hard deleted.
Changes:
* Add is_deleted flag to comment votes.
* Relax uniqueness constraint so you can have multiple deleted votes on
the same comment. You can still only have one active vote on the comment.
* Add a `soft_delete` method to the Deletable concern (sketched below).
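A sketch of both pieces (the names are illustrative):

    module Deletable
      extend ActiveSupport::Concern

      # Soft deletion just flips the flag instead of destroying the row.
      def soft_delete(updates = {})
        update!(is_deleted: true, **updates)
      end
    end

    # The relaxed constraint can be a partial unique index, so uniqueness
    # is only enforced for active (non-deleted) votes:
    #
    #   add_index :comment_votes, [:comment_id, :user_id],
    #             unique: true, where: "is_deleted = false"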
Changes to bans:
* Change the `expires_at` field to `duration`.
* Make moderators choose from a fixed set of standard ban lengths,
instead of allowing arbitrary ban lengths.
* List `duration` in seconds in the /bans.json API.
* Dump bans to BigQuery.
Note that some old bans have a negative duration: their expiration date
was before their creation date, because bans were migrated to Danbooru 2
in 2013 and the original ban creation dates were lost.
Rework the rate limit implementation to make it more flexible:
* Allow setting different rate limits for different actions. Before we
had a single rate limit for all write actions. Now different
controller endpoints can have different limits.
* Allow actions to be rate limited by user ID, by IP address, or both.
Before, actions were only limited by user ID, which meant non-logged-in
actions like creating new accounts or attempting to log in couldn't be
rate limited. Also, because actions were limited by user ID only, you
could use multiple accounts with the same IP to get around limits.
Other changes:
* Remove the API Limit field from user profile pages.
* Remove the `remaining_api_limit` field from the `/profile.json` endpoint.
* Rename the `X-Api-Limit` header to `X-Rate-Limit` and change it from a
number to a JSON object containing all the rate limit info
(including the refill rate, the burst factor, the cost of the call,
and the current limits).
* Fix a potential race condition where, if you flooded requests fast
enough, you could exceed the rate limit. This was because we checked
and updated the rate limit in two separate steps, which was racy;
simultaneous requests could pass the check before the update happened.
The new code uses some tricky SQL to check and update multiple limits
in a single statement.
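The single-statement check-and-update amounts to a token bucket
maintained in SQL. A sketch for a single limit (the table and column
names are hypothetical, and the real statement handles several limits
at once):

    def try_consume(key, cost)
      # Refill the bucket based on elapsed time, then deduct the cost, all
      # in one statement. The UPDATE matches no rows when not enough points
      # remain, so the check and the update can't race.
      points = ActiveRecord::Base.connection.select_value(<<~SQL)
        UPDATE rate_limits
        SET points = LEAST(burst_limit, points + refill_rate * EXTRACT(EPOCH FROM now() - updated_at)) - #{cost.to_i},
            updated_at = now()
        WHERE key = #{ActiveRecord::Base.connection.quote(key)}
          AND LEAST(burst_limit, points + refill_rate * EXTRACT(EPOCH FROM now() - updated_at)) >= #{cost.to_i}
        RETURNING points
      SQL

      !points.nil? # false means the request was rate limited
    end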
Add the ability to restrict API keys so that they can only be used with
certain IP addresses or certain API endpoints.
Restricting your key is useful to limit damage in case it gets leaked
or stolen, for example if your key is on a remote server that gets
hacked, or if you accidentally check your key in to GitHub.
Restricting your key's API permissions is useful if a third-party app or
script wants your key, but you don't want to give it full access to
your account.
If you're an app or userscript developer, and your app needs an API key
from the user, you should only request a key with the minimum
permissions needed by your app.
If you have a privileged account, and you have scripts running under
your account, you are highly encouraged to restrict your key to limit
damage in case your key gets leaked or stolen.
Add indexes to speed up rating:e and rating:q searches. Rating:s
isn't indexed because Postgres is unlikely to use an index for rating:s
searches (the filter isn't selective enough: ~77% of all posts are
rating:s).
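Presumably these are partial indexes, along these lines (a sketch; the
indexed column may differ):

    class AddPostRatingIndexes < ActiveRecord::Migration[6.1]
      def change
        # Only rows matching the WHERE clause are indexed, so the indexes
        # stay small and Postgres will actually consider using them.
        add_index :posts, :id, where: "rating = 'e'", name: "index_posts_on_id_where_rating_e"
        add_index :posts, :id, where: "rating = 'q'", name: "index_posts_on_id_where_rating_q"
      end
    end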
Fix various minor inconsistencies between the production database schema
and the declared schema in db/structure.sql.
* tags.category was a smallint instead of an integer in production.
* The unique_schema_migrations index didn't exist outside production.
* The index_posts_on_tag_index index was called index_posts_on_tags_index
outside production.
* The posts.tag_index column didn't have a statistics target defined
outside production.
* ID sequences didn't have `AS integer` defined in production.
Specify the default settings for new users inside the User model instead
of inside the database. This makes it easier to change defaults, and it
makes the code clearer.
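A sketch of what this looks like in the model (the attribute names and
default values here are illustrative):

    class User < ApplicationRecord
      # Rails applies these defaults when a new record is instantiated,
      # instead of relying on column defaults in the database schema.
      attribute :comment_threshold, :integer, default: -8
      attribute :per_page, :integer, default: 20
      attribute :default_image_size, :string, default: "large"
    end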
Optimize searches for non-English phrases in autocomplete. These
searches were pretty slow, and could sometimes cause sitewide lag spikes
when users typed long strings of non-English text into the search box
and caused an unintentional DoS.
The trick is to use a gin index on the expression
`array_to_tsvector(other_names)`. This supports fast string prefix
matching against all
elements of the array. The downside is that it doesn't allow infix or
suffix matches, so we can't support wildcards in general. Wildcards
didn't quite work anyway, since artist and wiki other names can contain
literal '*' characters.
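A sketch of the index and a matching prefix query (the `:*` suffix in a
tsquery requests prefix matching; the model and index names are
illustrative):

    class AddArtistOtherNamesIndex < ActiveRecord::Migration[6.1]
      def change
        # array_to_tsvector indexes the raw array elements, with no stemming.
        add_index :artists, "array_to_tsvector(other_names)", using: :gin,
                  name: "index_artists_on_other_names_tsvector"
      end
    end

    # Query sketch: quote_literal escapes the search term so it's parsed
    # as a single tsquery lexeme.
    Artist.where("array_to_tsvector(other_names) @@ (quote_literal(?) || ':*')::tsquery", name)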
Add tracking of certain important user actions. These events include:
* Logins
* Logouts
* Failed login attempts
* Account creations
* Account deletions
* Password reset requests
* Password changes
* Email address changes
This is similar to the mod actions log, except it tracks account
activity related to a single user.
The information tracked includes the user, the event type (login,
logout, etc), the timestamp, the user's IP address, IP geolocation
information, the user's browser user agent, and the user's session ID
from their session cookie. This information is visible to mods only.
This is done with three models. The UserEvent model tracks the event
type (login, logout, password change, etc) and the user. The UserEvent
is tied to a UserSession, which contains the user's IP address and
browser metadata. Finally, the IpGeolocation model contains the
geolocation information for IPs, including the city, country, ISP, and
whether the IP is a proxy.
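A sketch of how the three models hang together (the associations are
assumptions based on the description above):

    class UserEvent < ApplicationRecord
      belongs_to :user
      belongs_to :user_session  # the IP, user agent, and session ID
    end

    class UserSession < ApplicationRecord
      belongs_to :ip_geolocation
    end

    class IpGeolocation < ApplicationRecord
      # One row per IP: city, country, ISP, and whether it's a proxy.
    end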
This tracking will be used for a few purposes:
* Letting users view their account history, to detect things like logins
from unrecognized IPs, failed login attempts, password changes, etc.
* Rate limiting failed login attempts.
* Detecting sockpuppet accounts using their login history.
* Detecting unauthorized account sharing.
In Danbooru 1, aliases (and implications) had a `reason` field where
either the admin or the alias requester gave a reason for the alias.
This field was removed from the code and the database schema, but it
still existed in the production database. This adds the field back, so
that the dev schema is consistent with the production schema, and so
that legacy reasons can be viewed on site again.
* Add back the legacy tag_aliases.reason and tag_implications.reason fields.
* Make /tag_aliases and /tag_implications show legacy reasons.
* Add the reason field to the search form.
* Remove the PostRegeneration model. Instead just use a mod action
to log when a post is regenerated.
* Change it so that IQDB is also updated when the image samples are
regenerated. This is necessary because when the image samples are
regenerated, the thumbnail may change, which means IQDB needs to be
updated too. This can happen when regenerating old images with
transparent backgrounds where the transparency was flattened to black
instead of white in the thumbnail.
* Only display one "Regenerate image" option in the post sidebar, to
regenerate both the images and IQDB. Regenerating IQDB only can be
done through the API. Having two options in the sidebar is too much
clutter, and it's too confusing for Mods who don't know the difference
between an IQDB-only regeneration and a full image regeneration.
* Add a confirm prompt to the "Regenerate image" link.
Add a model to store the status of user upgrades.
* Store the upgrade purchaser and the upgrade receiver (these are
different for a gifted upgrade, the same for a self upgrade).
* Store the upgrade type: gold, platinum, or gold-to-platinum upgrades.
* Store the upgrade status:
** pending: User is still on the Stripe checkout page, no payment
received yet.
** processing: User has completed checkout, but the checkout status in
Stripe is still 'unpaid'.
** complete: We've received notification from Stripe that the payment
has gone through and the user has been upgraded.
* Store the Stripe checkout ID, to cross-reference the upgrade record on
Danbooru with the checkout record on Stripe.
This is the upgrade flow:
* When the user clicks the upgrade button on the upgrade page, we call
POST /user_upgrades and create a pending UserUpgrade.
* We redirect the user to the checkout page on Stripe.
* When the user completes checkout on Stripe, Stripe sends us a webhook
notification at POST /webhooks/receive.
* When we receive the webhook, we check the payment status, and if it's
paid we mark the UserUpgrade as complete and upgrade the user.
* After Stripe sees that we have successfully processed the webhook,
they redirect the user to the /user_upgrades/:id page, where we show
the user their upgrade receipt.
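A sketch of the webhook step (the Stripe event and field names follow
Stripe's API; the controller, the signing-secret accessor, and
`process_upgrade!` are illustrative):

    class WebhooksController < ApplicationController
      def receive
        # Verify the webhook signature so forged requests can't upgrade users.
        event = Stripe::Webhook.construct_event(
          request.body.read,
          request.headers["Stripe-Signature"],
          webhook_signing_secret # hypothetical config accessor
        )

        if event.type == "checkout.session.completed"
          checkout = event.data.object
          upgrade = UserUpgrade.find_by!(stripe_id: checkout.id)
          upgrade.process_upgrade! if checkout.payment_status == "paid"
        end

        head 200
      end
    end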
Rename the following post replacement attributes:
* file_size_was -> old_file_size
* file_ext_was -> old_file_ext
* image_width_was -> old_image_width
* image_height_was -> old_image_height
* md5_was -> old_md5
In Rails 6.1, having attributes named `file_size` and `file_size_was` on
the same model breaks things because it conflicts with Rails' dirty
attribute tracking.
Tune autocorrect to produce fewer false positives. Before we used
trigram similarity. Now we use Levenshtein edit distance with a dynamic
typo threshold. Trigram similarity was able to correct large
transpositions (e.g. `miku_hatsune` -> `hatsune_miku`), but it was bad
at correcting small typos. Levenshtein is good at small typos, but can't
correct large transpositions.
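A sketch of a dynamic typo threshold (the exact cutoffs are
illustrative): allow more edits for longer tags, since short tags
produce too many false positives.

    # Maximum allowed Levenshtein distance for a correction candidate.
    def typo_threshold(word)
      case word.size
      when 0..3 then 0  # too short to correct safely
      when 4..6 then 1
      when 7..9 then 2
      else 3
      end
    end

    # Usage sketch: levenshtein() is available in SQL via Postgres's
    # fuzzystrmatch extension, or in Ruby via a gem.
    candidates.select { |tag| levenshtein(word, tag.name) <= typo_threshold(word) }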
Remove the pending status from tag aliases and implications.
Previously, aliases were first created in the pending state, then
changed to active when the alias was later processed in a delayed job.
This meant that BURs weren't processed completely sequentially: first
all the aliases in a BUR would be created in one go, then later they
would be processed and set to active one at a time.
This was problematic in complex BURs that tried to reverse or swap
around aliases, since new pending aliases could be created before old
conflicting aliases were removed.
* Add an index on posts.is_deleted. The modqueue was slow because the
appeal condition wasn't constrained to deleted posts, so it degraded to
a full table scan.
* Avoid extra queries for calculating the page count and disapproval counts.
* Include appealed posts in the modqueue.
* Add `status` field to appeals. Appeals start out as `pending`, then
become `rejected` if the post isn't approved within three days. If the
post is approved, the appeal's status becomes `succeeded`.
* Add `status` field to flags. Flags start out as `pending` then become
`rejected` if the post is approved within three days. If the post
isn't approved, the flag's status becomes `succeeded`.
* Leave behind an "Unapproved in three days" dummy flag when an appeal
goes unapproved, just like when a pending post is unapproved.
* Only allow deleted posts to be appealed. Don't allow flagged posts to be appealed.
* Add `status:appealed` metatag. `status:appealed` is separate from `status:pending`.
* Include appealed posts in `status:modqueue`. Search `status:modqueue order:modqueue`
to view the modqueue as a normal search.
* Retroactively set old flags and appeals as succeeded or rejected. This
may not be correct for posts that were appealed or flagged multiple
times. This is difficult to set correctly because we don't have
approval records for old posts, so we can't tell the actual outcome of
old flags and appeals.
* Deprecate the `is_resolved` field on post flags. A resolved flag is a
flag that isn't pending.
* Known bug: appealed posts have a black border instead of a blue
border. Checking whether a post has been appealed would require either
an extra query on the posts/index page, or an is_appealed flag on
posts, neither of which is very desirable.
* Known bug: you can't use `status:appealed` in blacklists, for the same
reason as above.
Several fixes for the "This tag is under discussion" notice on the post
index page:
* Fix the notice appearing for BURs that aren't pending.
* Fix the notice never going away because of the cache never expiring.
* List all topics when a tag is involved in multiple BURs.
* Link to the forum post instead of the forum topic (fixes #4421).
* Optimization: don't check for BURs when the search isn't a simple
single tag search.
* Add a `tags` field to the bulk update requests table for tracking all
tags involved in the request (excluding tags in mass updates that are
negated/optional/wildcards). Known issue: doesn't handle tag type
prefixes in mass updates correctly (e.g. `mass update foo -> artist:bar`
doesn't detect the tag `bar`).
* Allow searching the /bulk_update_requests page by tags.
We don't really need to cache the notice here, but we do it anyway to
reduce queries on the post index page.
* Make IP bans soft deletable.
* Add a hit counter to track how many times an IP ban has blocked someone.
* Add a last hit timestamp to track when the IP ban last blocked someone.
* Add a new type of IP ban, the signup ban. Signup bans restrict new
signups from editing anything until they've verified their email
address.
* Move emails from users table to email_addresses table.
* Validate that addresses are formatted correctly and are unique across
users. Existing invalid emails are grandfathered in.
* Add is_verified flag (the address has been confirmed by the user).
* Add is_deliverable flag (an undeliverable address is an address that bounces).
* Normalize addresses to prevent registering multiple accounts with the
same email address (using tricks like Gmail's plus addressing).
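A sketch of the normalization step (the rules shown are Gmail-specific
and illustrative):

    def normalized_address(address)
      local, domain = address.downcase.split("@", 2)

      if domain == "gmail.com"
        local = local.split("+", 2).first  # drop plus addressing: foo+bar -> foo
        local = local.delete(".")          # Gmail ignores dots: f.o.o -> foo
      end

      "#{local}@#{domain}"
    end

    # "John.Doe+danbooru@gmail.com" and "johndoe@gmail.com" both normalize
    # to "johndoe@gmail.com", so they can't register two accounts.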
The old password reset flow:
* User requests a password reset.
* Danbooru generates a password reset nonce.
* Danbooru emails user a password reset confirmation link.
* User follows link to password reset confirmation page.
* The link contains a nonce authenticating the user.
* User confirms password reset.
* Danbooru resets user's password to a random string.
* Danbooru emails user their new password in plaintext.
The new password reset flow:
* User requests a password reset.
* Danbooru emails user a password reset link.
* User follows link to password edit page.
* The link contains a signed_user_id param authenticating the user.
* User changes their own password.
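One way to implement the `signed_user_id` param, assuming Rails 6.1's
signed ids (the purpose, expiry, and route helper are illustrative):

    # In the mailer: a tamper-proof, expiring token identifying the user.
    url = edit_password_url(signed_user_id: user.signed_id(purpose: :password_reset, expires_in: 1.hour))

    # On the password edit page: returns nil if the token is invalid or expired.
    user = User.find_signed(params[:signed_user_id], purpose: :password_reset)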
Rename is_active to is_deleted. This is for better consistency with
other models, and to reduce confusion over what "active" means for
artists. Sometimes users think "active" refers to whether the artist
is actively producing work.