danbooru

Author	SHA1	Message	Date
evazion	1518c3c4be	posts: fix search queries not being logged to NewRelic in some cases (#4900 ) Fix the /posts index controller not logging the normalized search query to NewRelic when the search failed, either because of a tag limit error, a search timeout, or a RSS feed rate limit error. Also don't log the number of search results when it's an API request or failed search. This is to avoid doing a potentially slow full post count when it's not otherwise needed.	2022-01-11 13:39:30 -06:00
evazion	72ea78e697	searchable: replace find_ordered with in_order_of. Rails 7 added an `in_order_of` method that does what our `find_ordered` method did before.	2022-01-07 14:24:57 -06:00
evazion	a7dc05ce63	Enable frozen string literals. Make all string literals immutable by default.	2021-12-14 21:33:27 -06:00
evazion	0baca68a37	search: make `order:random` truly random; add `random:N` metatag. Make the `order:random` metatag truly randomize the search. Add a `random:N` metatag that returns up to N random posts, like what `order:random` did before. `order:random` now returns the entire search in random order. Before it just returned a pageful of pseudorandom posts. This will be more accurate for small searches, but slower for large searches. If `order:random` times out, try `random:N` instead. The `random:N` metatag returns up to N pseudorandom posts. This is faster than `order:random` for large searches, but for small searches, it may return less than N posts, and the randomness may be biased. Some posts may be more likely than others to appear. N must be between 0 and 200. Also, `/posts?tags=touhou&random=1` now redirects to `/posts?tags=touhou+random:N`. Before the `random=1` param acted like a free `order:random` tag; now it redirects to a `random:N` search, so it counts against your tag limit.	2021-11-25 18:14:34 -06:00
evazion	5dc67613e6	search: optimize username metatags. Optimize metatag searches involving usernames, including user:, approver:, appealer:, commenter:, upvoter:, etc. Do `User.find_by_name` instead of `User.name_matches` because this fetches the user upfront instead of doing it inside a subquery. Using a subquery makes the SQL more complicated and leads to worse query plans. This especially helps searches involving multiple username metatags.	2021-11-25 00:40:53 -06:00
evazion	353e708538	votes: allow admins to remove post votes. Allow admins to remove votes on posts. This is for fixing vote abuse. Votes can be removed by going to the vote list on the /post_votes page, or by clicking on a post's score, then using the "Remove" option in the "..." dropdown menu next to the vote. Votes are soft-deleted - they're marked as deleted in the database, but not fully deleted. Removed votes are only visible to admins, not to regular users. When a vote is removed by an admin, it leaves a mod action. Technically it's possible to undelete votes, but there's no UI for it.	2021-11-23 23:18:54 -06:00
evazion	43c2870664	Fix #4917 : Add down_score/up_score orders and metasearches. Add `upvotes:N`, `downvotes:N`, `order:upvotes`, `order:downvotes`, `order:upvotes_asc`, `order:downvotes_asc` metatags. In the API, the field is called up_score / down_score. Here it's called `upvotes` and `downvotes` because this should be easier to understand for end users. Note that internally, `down_score` is negative. A post that matches `downvotes:>5` will have down_score < -5 internally.	2021-11-16 03:52:38 -06:00
evazion	148752d3c4	PostQueryBuilder: remove useless code. The workaround for `unaliased:fav:1` is no longer needed since favorites are no longer included in the post's tag_index.	2021-11-02 04:07:21 -05:00
evazion	a5ed8c72c9	search: fix parsing of invalid metatag values. * Change `age:` metatag to require time units. This means e.g. `age:<600` no longer works; instead you have to say `age:<600sec`. * Allow time units in the `age:` metatag to be abbreviated as long as they're unambiguous. This means `age:<60sec`, `age:<5min`, and `age:<5mon` now work, in addition to `age:<60s` and `age:<60seconds`. * Allow the `ratio:` metatag to be written like `ratio:16/9` in addition to `ratio:16:9`. * Fix invalid date searches like `date:foo` or `date:05-15-2021` to return nothing instead of raising an "undefined method 'beginning_of_day' for nil" exception. (`date:05-15-2021` is invalid because it's parsed as DD-MM-YYYY). * Fix invalid searches like `score:foo`, `ratio:foo`, and `mpixels:foo` to return nothing instead of being treated like `score:0`, `ratio:0`, `mpixels:0`. * Fix `age:<60m` to return nothing instead of silently being treated like `age:<60seconds`. * Fix `age:foo` to return nothing instead of silently being treated like `age:0d` (return all uploads from today). Fixes #4389.	2021-11-02 01:54:05 -05:00
evazion	84212acfae	Merge pull request #4905 from nottalulah/remove-locks-from-autocomplete remove references to locks	2021-10-25 21:18:36 -05:00
evazion	f1b5c34b4d	posts: show length of videos and animations in thumbnails. Show the length of videos and animated posts in the thumbnail. The length is shown the top left corner in MM:SS format. This replaces the play button icon. Show a speaker icon instead of a music note icon for posts with sound. Doing this requires doing `.includes(:media_asset)` in a bunch of places to avoid N+1 queries when we access the post's duration.	2021-10-25 02:56:55 -05:00
Lily	647848b499	remove references to locks	2021-10-24 15:16:48 -03:00
evazion	587a9d0c8f	tags: move tag category definitions out of the config file. Move all the code for defining tag categories from the config file to TagCategory. It didn't belong in the config because it's not possible to add new tag categories purely in the config without editing other things like the CSS. Also change it so that tag colors are hardcoded in the CSS instead of generated using ERB. Generating the CSS in ERB meant that the Docker build had to recompile the CSS on every commit, even when it didn't change, because it relied on Ruby code outside the CSS that we couldn't guarantee didn't change.	2021-10-12 21:17:17 -05:00
evazion	92e20713e3	search: fixup hardcoded small search threshold. Fixup for `f6abf39eb`.	2021-10-12 19:01:31 -05:00
evazion	f6abf39ebc	search: try to optimize slow searches. Try to optimize certain types of common slow searches: * Searches for mutually-exclusive tags (e.g. `1girl multiple_girls`, `touhou solo -1girl -1boy`) * Relatively large tags that are heavily skewed towards old posts (e.g. lucky_star, haruhi_suzumiya_no_yuuutsu, inazuma_eleven_(series), imageboard_desourced). * Mid-sized tags in the <30k post range that Postgres thinks are big enough for a post id index scan, but a tag index scan is faster. The general pattern is Postgres not using the tag index because it thinks scanning down the post id index would be faster, but it's actually much slower because it degrades to a full table scan. This usually happens when Postgres thinks a tag is larger or more common than it really is. Here we try to force Postgres into using the tag index when we know the search is small. One case that is still slow is `2girls -multiple_girls`. This returns no results, but we can't know that without searching all of `2girls`. The general case is searching for `A -B` where A is a subset of B and A and B are both large tags. Hopefully fixes #581, #654, #743, #1020, #1039, #1421, #2207, #4070, #4337, #4896, and various other issues raised over the years regarding slow searches.	2021-10-12 02:30:30 -05:00
evazion	0b22e873c9	search: cache timed out search counts. When a search is performed, we cache the post count so we don't have to calculate it again every time the user switches pages. However, if the count times out, we didn't cache it before, causing us to do a slow count on every page load. This usually happens on multi-tag searches that return a lot of results, `1girl solo` for example. This changes it so that the count is cached even when it times out. This will speed up large multi-tag searches. This also changes it so that the count is cached for a fixed 5 minutes. Before it was variable based on the size of the count, but this probably didn't make much difference.	2021-10-12 01:33:21 -05:00
evazion	f155023b77	posts: remove unused exception classes.	2021-10-11 18:58:15 -05:00
evazion	37a8dc5dbd	posts: use string_to_array index for tag searches. Use the `string_to_array(tag_string, ' ')` index instead of the `tag_index` for tag searches. The string_to_array index lets us treat the tag_string as an array for searching purposes. This lets us get rid of the tag_index column and the test_parser dependency in the future.	2021-10-10 22:00:10 -05:00
evazion	1653392361	posts: stop updating fav_string attribute. Stop updating the fav_string attribute on posts. The column still exists on the table, but is no longer used or updated. Like the pool_string in `7d503f08`, the fav_string was used in the past to facilitate `fav:X` searches. Posts had a hidden fav_string column that contained a list of every user who favorited the post. These were treated like fake hidden tags on the post so that a search for `fav:X` was treated like a tag search. The fav_string attribute has been unused for search purposes for a while now. It was only kept because of technicalities that required departitioning the favorites table first (`340e1008e`) before it could be removed. Basically, removing favorites with `@favorite.destroy` was slow because Rails always deletes object by ID, but we didn't have an index on favorites.id, and we couldn't easily add one until the favorites table was departitioned. Fixes #4652. See https://github.com/danbooru/danbooru/issues/4652#issuecomment-754993802 for more discussion of issues caused by the fav_string (in short: write amplification, post table bloat, and favorite inconsistency problems).	2021-10-09 22:36:26 -05:00
evazion	c4eeeb8531	search: optimize counting posts for fav: and pool: searches. Optimize counting the number of posts returned by fav:<name> and pool:<name> searches. Use cached counts to avoid slow count(*) queries for users with lots of favorites.	2021-10-08 21:26:42 -05:00
evazion	340e1008e9	favorites: merge favorites subtables. Merge the 100 favorite subtables into a single table. Previously the favorites table was partitioned by user id into 100 subtables to try to make searching by user id faster. This wasn't really necessary and probably slower than just making an index on (favorites.user_id, favorites.id) to satisfy ordfav searches. BTree indexes are logarithmic so dividing an index by 100 doesn't make it 100 times faster to search; instead it just removes a layer or two from the tree. This also adds a uniqueness index on (user_id, post_id) to prevent duplicate favorites. Previously we had to check for duplicates at the application layer, which required careful locking to do it correctly. Finally, this adds an index on favorites.id, which was surprisingly missing before. This made ordering and deleting favorites by id really slow because it degraded to a sequential scan.	2021-10-08 21:26:42 -05:00
evazion	595e02ab45	posts: add duration:<x> and order:duration metatags. Add duration:<x> and order:duration metatags for searching animated posts by duration. https://danbooru.donmai.us/posts?tags=animated+duration:<5.0 https://danbooru.donmai.us/posts?tags=animated+duration:>60 https://danbooru.donmai.us/posts?tags=animated+order:duration	2021-10-07 03:21:08 -05:00
evazion	126046cb69	posts: remove rating, note, and status locks. Remove the ability for users to lock ratings, note, and post statuses. Historically the majority of locked posts were from 10+ years ago when certain users habitually locked ratings and notes on every post they touched for no reason. Nowadays most posts have been unlocked. Only a handful of locked posts are left, none of which deserve to be locked. The is_rating_locked, is_note_locked, and is_status_locked columns still exist in the database, but aren't used.	2021-09-27 22:32:30 -05:00
evazion	313257b771	posts: add `exif:<value>` search metatags. Examples: * https://danbooru.donmai.us/posts?tags=exif:File:ColorComponents * https://danbooru.donmai.us/posts?tags=exif:GIF:GIFVersion * https://danbooru.donmai.us/posts?tags=exif:PNG:ColorType * https://danbooru.donmai.us/posts?tags=exif:PNG:ColorType=RGB * https://danbooru.donmai.us/posts?tags=exif:GIF:GIFVersion=89a * https://danbooru.donmai.us/posts?tags=exif:File:ColorComponents=3	2021-09-16 02:13:15 -05:00
evazion	d00aa847ae	search: allow mods to search `disapproved:<user>` for other users. Allow moderators to search `disapproved:<username>` with any user. Before mods could only search for their own disapprovals, even though they could see disapprovals by others.	2021-09-01 01:39:14 -05:00
Thayol	b9068b8a3e	Fix #4435 : Search: wildcards with no matches should return no results	2021-06-24 04:04:13 -05:00
evazion	00ca7526bb	docs: add remaining docs for classes in app/logical.	2021-06-24 01:31:41 -05:00
evazion	28c0a48117	discord: fix tag search commands being limited to 2 tags.	2021-03-14 16:42:07 -05:00
evazion	81fe68d392	bans: change `expires_at` field to `duration`. Changes: * Change the `expires_at` field to `duration`. * Make moderators choose from a fixed set of standard ban lengths, instead of allowing arbitrary ban lengths. * List `duration` in seconds in the /bans.json API. * Dump bans to BigQuery. Note that some old bans have a negative duration. This is because their expiration date was before their creation date, which is because in 2013 bans were migrated to Danbooru 2 and the original ban creation dates were lost.	2021-03-11 02:59:58 -06:00
evazion	93f6e935a8	search: fix underestimation of page count for blank searches. Bug: In Postgres 13, getting the count of a blank search underestimated the page count by a large margin (~700,000 posts). The query we were executing was this: EXPLAIN (FORMAT JSON) SELECT * FROM posts ORDER BY id DESC The `ORDER BY id DESC` clause triggered a parallel seq scan query plan in Postgres 13, which for some reason causes Postgres to underestimate the row count by large amount in each parallel branch. Getting rid of the ORDER BY clause makes it do a regular seq scan, which gives an accurate estimate.	2021-02-18 04:37:27 -06:00
evazion	19974d3ab1	Fix #4688 : Malformed SQL when searching `-status:any`. Negating `Post.all` produced the invalid SQL fragment `WHERE NOT ()`. Use `Post.where("TRUE")` instead to produce `WHERE NOT (TRUE)`.	2021-02-03 21:15:58 -06:00
evazion	27cac98516	posts: eliminate pool_id, favgroup_id url params. Replace the `pool_id` and `favgroup_id` url params: https://danbooru.donmai.us/posts/123?pool_id=456 https://danbooru.donmai.us/posts/123?favgroup_id=456 with the `q` param: https://danbooru.donmai.us/posts/123?q=pool:456 https://danbooru.donmai.us/posts/123?q=favgroup:456	2021-01-29 23:01:44 -06:00
evazion	054ac51d47	policies: remove current request from context. This refactors Pundit policies to only rely on the current user, not on the current user and the current HTTP request. In retrospect, it was a bad idea to include the current request in the Pundit context. It bleeds out everywhere and there are many contexts (in tests and models) where we only have the current user, not the current request. The previous commit got rid of the only two places where we used it.	2021-01-17 00:57:59 -06:00
evazion	1e7a5ba49d	Fix ruby warnings about deprecated keyword arguments.	2021-01-11 05:12:09 -06:00
evazion	98ee6c31c1	favorites: refactor fav:/ordfav: searches to not use fav_string. Refactor fav:<name> and ordfav:<name> searches to use the favorites table instead of the posts.fav_string. This may be slower for fav:<name> searches. The fav_string effectively treats favorites like secret tags on the post, so fav:<name> searches were effectively the same as tag searches. Now they do a subquery on the favorites table, which may not perform as well for things like multiple fav:<name> metatags or negated fav:<name> metatags. For ordfav:<name> searches, this may be faster. ordfav: searches had a tag match clause (`tag_index @@ 'fav:123'`) in addition to a join on the favs table. This was redundant, and in some cases it inhibited the query planner from choosing a more optimal plan. Partially addresses #4652 by eliminating another place where we depended on the fav_string.	2021-01-03 19:18:31 -06:00
evazion	36f95891bd	search: let wildcard searches match up to 100 tags. Let searching for things like *_legwear match up to 100 tags. Previously the limit was 25.	2021-01-02 04:09:43 -06:00
evazion	7e8f859b24	tags: eliminate Tag.category_for method. Tag.category_for looked up a tag's category in the Redis cache. This was only used in a few places (in related tags, and on the popular/missed search pages). Get rid of this method so we can work towards getting rid of caching tag categories in Redis.	2020-12-27 21:03:26 -06:00
evazion	35134abe8f	post query builder: fix incompatibilities with Rails 6.1. * Rename the `#negate` and `#and` methods that we monkey patch into ActiveRecord::Relation. These methods are now defined in Rails 6.1, but they shadow our methods and have slightly different behavior. * Fix a call to `invert`. It no longer accepts an argument.	2020-12-13 04:10:48 -06:00
nonamethanks	4070354322	Return empty search for ordfav of fake user	2020-12-04 01:03:34 +01:00
evazion	c002166801	Fix #4584 : Disable deleted post filter when searching for status:appealed.	2020-08-14 22:05:19 -05:00
evazion	d884cb6642	search: fix parsing of quoted metatag values. * Fix #4552: Multiple quoted search terms not parsed correctly. * Allow quotes to be escaped in quoted metatags. * Allow spaces to be escaped in unquoted metatags. * Allow the empty string to be used in metatags. Examples: * `source:""` and `source:''` (same as `source:none`) * `source:foo\ bar\ baz` (same as `source:"foo bar baz"`) * `source:"don't say \"lazy\""` (use \" to write a literal ") * `source:'don\'t say "lazy"'` (use \' to write a literal ') * `source:"C:\\Windows"` (use \\ to write a literal \)	2020-08-12 18:12:16 -05:00
evazion	3a17b5a13e	flags/appeals: replace is_resolved flag with statuses. Replace references to the `is_resolved` field with the `status` field. Post flags were marked as resolved when a post was approved (but not when the post was deleted because it went unapproved). The status field supercedes the resolved field.	2020-08-07 19:24:57 -05:00
evazion	0a0a85ee70	Fix #4568 : Send appealed posts back to the mod queue * Include appealed posts in the modqueue. * Add `status` field to appeals. Appeals start out as `pending`, then become `rejected` if the post isn't approved within three days. If the post is approved, the appeal's status becomes `succeeded`. * Add `status` field to flags. Flags start out as `pending` then become `rejected` if the post is approved within three days. If the post isn't approved, the flag's status becomes `succeeded`. * Leave behind a "Unapproved in three days" dummy flag when an appeal goes unapproved, just like when a pending post is unapproved. * Only allow deleted posts to be appealed. Don't allow flagged posts to be appealed. * Add `status:appealed` metatag. `status:appealed` is separate from `status:pending`. * Include appealed posts in `status:modqueue`. Search `status:modqueue order:modqueue` to view the modqueue as a normal search. * Retroactively set old flags and appeals as succeeded or rejected. This may not be correct for posts that were appealed or flagged multiple times. This is difficult to set correctly because we don't have approval records for old posts, so we can't tell the actual outcome of old flags and appeals. * Deprecate the `is_resolved` field on post flags. A resolved flag is a flag that isn't pending. * Known bug: appealed posts have a black border instead of a blue border. Checking whether a post has been appealed would require either an extra query on the posts/index page, or an is_appealed flag on posts, neither of which are very desirable. * Known bug: you can't use `status:appealed` in blacklists, for the same reason as above.	2020-08-06 20:55:45 -05:00
BrokenEagle	4acd89838b	Add ability to search on status of parent/child	2020-06-21 22:39:21 +00:00
evazion	484eacfd3b	config: remove `is_unlimited_metatag?` config option.	2020-06-02 18:25:42 -05:00
evazion	ad02e0f62c	posts/index: fix rating:s being included in page title in safe mode. Fixes bug described in `d3e4ac7c17 (commitcomment-39049351)` When dealing with searches, there are several variables we have to keep in mind: * Whether tag aliases should be applied. * Whether search terms should be sorted. * Whether the rating:s and -status:deleted metatags should be added by safe mode and the hide deleted posts setting. Which of these things we need to do depends on the context: * We want to apply aliases when actually doing the search, calculating the count, looking up the wiki excerpt, recording missed/popular searches in Reportbooru, and calculating related tags for the sidebar, but not when displaying the raw search as typed by the user (for example, in the page title or in the tag search box). * We want to sort the search when calculating cache keys for fast_count or related tags, and when recording missed/popular searches, but not in the page title or when displaying the raw search. * We want to add rating:s and -status:deleted when performing the search, calculating the count, or recording missed/popular searches, but not when calculating related tags for the sidebar, or when displaying the page title or raw search. Here we introduce normalized_query and try to use it in contexts where query normalization is necessary. When to use the normalized query versus the raw unnormalized query is still subtle and prone to error.	2020-05-12 21:47:00 -05:00
evazion	438186a75a	search: fix user-dependent searches showing incorrect paginators. Some searches, such as searches for private favorites or for the status:unmoderated tag, return different results for different users. These searches need to have their counts cached separately for each user so that we don't return incorrect page counts when two different users perform the same search. This can also potentially leak private information, such as the number of posts flagged, downvoted, or disapproved by a given user. Partial fix for #4280.	2020-05-07 21:02:22 -05:00
evazion	41c6c882c2	search: refactor fast_count to return nil on timeout. * Refactor fast_count to return nil instead of 1,000,000 if the exact count times out. * Remove the estimate_post_counts and blank_tag_search_fast_count global config options. * Replace the hardcoded post count estimates inside fast_count with a method that parses Postgres's estimated row count from EXPLAIN. * /counts/posts.json: Remove the `raise_on_timeout` parameter. Add an `estimate_count=<true\|false>` parameter. ** Return null instead of 1,000,000 if the exact count times out.	2020-05-07 21:02:22 -05:00
evazion	d3e4ac7c17	search: clean up safe_mode / hide_deleted_posts settings. Change PostQueryBuilder to add rating:s and -status:deleted to the search inside the constructor instead of inside `#build` and `#fast_count`. This lets up clean up `#fast_count` so it doesn't have to reparse the query after adding these tags. This caused aliases to be evaluated more than once on the post index page.	2020-05-07 21:02:22 -05:00
evazion	67aab0236d	search: apply aliases after parsing searches. Make PostQueryBuilder apply aliases earlier, immediately after parsing the search. On the post index page there are multiple places where we need to apply aliases: * When running the search with PostQueryBuilder#build. * When calculating the search count with PostQueryBuilder#fast_count. * When calculating the related tags for the sidebar. * When tracking missed searches and popular searches for Reportbooru. * When looking up wiki excerpts. Applying aliases after parsing ensures we only have to apply aliases once for all of these things. We also normalize the order of tags in searches and strip repeated tags. This is so that we have consistent cache keys for fast_count. * Fixes searches for aliased tags being counted as missed searches (fixes #4433). * Fixes wiki excerpts not showing up when searching for aliased tags.	2020-05-07 13:53:35 -05:00

1 2 3 4 5 ...

254 Commits