Add bar charts for non-timeseries data. For example, a bar chart of the
top 10 uploaders overall in the last month, rather than a timeseries
chart of the number of uploads per day for the last month.
Add ability to group reports by various columns. For example, you can see
the posts by the top 10 uploaders over time, or posts grouped by rating
over time.
Fix not being able to use the full set of search operators on the polymorphic `model_id` and
`model_type` attributes. Before, things like `search[model_type]=Post` worked, but
`search[model_type_not_eq]=Post` and other `model_type_*` operators didn't.
* Optimize status:modqueue and status:unmoderated searches. This brings them down from
taking 500ms-1000ms per search to ~5ms.
* Change status:unmoderated so that it only filters out the user's disapproved posts, not
the user's own uploads or past approvals. Now it's equivalent to `status:modqueue -disapproved:evazion`,
whereas before it was equivalent to `status:modqueue -disapproved:evazion -approver:evazion -user:evazion`.
Filtering out the user's own uploads and approvals was slow and usually unnecessary,
since for most users it's rare for their own uploads or approvals to reenter the modqueue.
Before, status:modqueue did this:
SELECT * FROM posts
WHERE is_pending = TRUE
   OR is_flagged = TRUE
   OR (is_deleted = TRUE AND id IN (SELECT post_id FROM post_appeals WHERE status = 0))
Now we do this:
SELECT * FROM posts
WHERE id IN (
  SELECT id FROM posts WHERE is_pending = TRUE
  UNION ALL
  SELECT id FROM posts WHERE is_flagged = TRUE
  UNION ALL
  SELECT id FROM posts WHERE id IN (SELECT post_id FROM post_appeals WHERE status = 0)
)
Postgres had a bad time with the "pending or flagged or has a pending appeal" clause because
it didn't know that posts can only be in one state at a time, so it overestimated how many
posts would be returned and chose a seq scan. Replacing the OR with a UNION avoids this.
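As a rough sketch, the same rewrite can be expressed in ActiveRecord like this (the scope and model
names follow the SQL above; the string interpolation is illustrative, not the actual Danbooru code):
# Each branch is a simple, indexable condition; UNION ALL spares Postgres from
# having to estimate the selectivity of the combined OR clause.
pending  = Post.where(is_pending: true).select(:id)
flagged  = Post.where(is_flagged: true).select(:id)
appealed = Post.where(id: PostAppeal.where(status: 0).select(:post_id)).select(:id)
union_sql = [pending, flagged, appealed].map(&:to_sql).join(" UNION ALL ")
Post.where("posts.id IN (#{union_sql})")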
Support searches like the following:
* score:<0,>100 (equivalent to `score:<0 or score:>100`)
* score:5,10..15,>20 (equivalent to `score:5 or score:10..15 or score:>20`)
* id:5,10..15,20..25,30 (equivalent to `id:5 or id:10..15 or id:20..25 or id:30`)
This also works inside the `search[id]` URL parameter, and inside other numeric
URL search parameters.
Factor out the code that parses metatag values (e.g. `score:>5`) and
search URL params (e.g. `search[score]=>5`) into a RangeParser class.
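As a rough illustration of the kind of parsing involved (the method name and return values here are
hypothetical, not the actual RangeParser API):
# Split a value like "5,10..15,>20" into an array of Ruby ranges.
def parse_ranges(string)
  string.split(",").map do |part|
    case part
    when /\A(\d+)\.\.(\d+)\z/ then $1.to_i..$2.to_i
    when /\A>(\d+)\z/ then ($1.to_i + 1..)
    when /\A<(\d+)\z/ then (..$1.to_i - 1)
    when /\A(\d+)\z/ then $1.to_i..$1.to_i
    else raise ArgumentError, "invalid range: #{part}"
    end
  end
end
parse_ranges("5,10..15,>20") # => [5..5, 10..15, 21..]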
Also fix a bug where using the `search[order]=custom` param without a
`search[id]` param raised an exception. Fix another bug where, if an
invalid `search[id]` was provided, the custom order was ignored and the
search was returned in the default order instead. Now, if you use
`search[order]=custom` without a valid `search[id]` param, the search
returns no results.
Fix the approver:, parent:, and pixiv: metatags not working correctly when negated:
* Fix -approver:<name> not including posts that don't have an approver (the approver_id is NULL)
* Fix -parent:<id> not including posts that don't have a parent (the parent_id is NULL)
* Fix -pixiv:<id> not including posts that aren't from Pixiv (the pixiv_id is NULL)
The problem lies in how the equality operator is negated when the column contains NULL values:
`approver_id != 52664` doesn't match posts where the `approver_id` is NULL.
The search `approver:evazion` boils down to:
# Post.where(approver_id: 52664).to_sql
SELECT * FROM posts WHERE approver_id = 52664;
When that is negated with `-approver:evazion`, it becomes:
# Post.where(approver_id: 52664).invert_where.to_sql
SELECT * FROM posts WHERE approver_id != 52664;
But in SQL, `approver_id != 52664` doesn't match when the approver_id IS NULL, so the search doesn't
include posts without an approver.
We could use `a IS NOT DISTINCT FROM b` instead of `a = b`:
# Post.where(Post.arel_table[:approver_id].is_not_distinct_from(52664)).to_sql
SELECT * FROM posts WHERE approver_id IS NOT DISTINCT FROM 52664;
This way when it's inverted it becomes `IS DISTINCT FROM`:
# Post.where(Post.arel_table[:approver_id].is_not_distinct_from(52664)).invert_where.to_sql
SELECT * FROM posts WHERE approver_id IS DISTINCT FROM 52664;
`approver_id IS DISTINCT FROM 52664` is like `approver_id != 52664`, except it matches when
approver_id is NULL [1].
This works correctly. The problem, however, is that `IS NOT DISTINCT FROM` can't use indexes, due to
a long-standing Postgres limitation [2], which makes searches too slow. So instead we do this:
# Post.where(approver_id: 52664).where.not(approver_id: nil).to_sql
SELECT * FROM posts WHERE approver_id = 52664 AND approver_id IS NOT NULL;
That way when negated it becomes:
# Post.where(approver_id: 52664).where.not(approver_id: nil).invert_where.to_sql
SELECT * FROM posts WHERE approver_id != 52664 OR approver_id IS NULL;
Which is the correct behavior.
[1] https://modern-sql.com/feature/is-distinct-from
[2] https://www.postgresql.org/message-id/6FC83909-5DB1-420F-9191-DBE533A3CEDE@excoventures.com
Standardize it so that all fields of type `text` are searchable with
`search[<field>_matches]`.
Before, the `<field>_matches` param was handled manually and some fields
were left out or handled inconsistently. Now it applies to all columns
of type `text`.
This does a full-text search on the field. For example, searching
`/artist_commentaries?search[translated_description_matches]=smiling`
will match translated commentaries containing any of the words "smiling",
"smiles", "smiled", or "smile".
Note that this only applies to columns defined as type `text`, not to
columns defined as `character varying`. The difference is that `text` is
used for fields containing free-form natural language, such as comments,
notes, forum posts, wiki pages, pool descriptions, etc, while `character
varying` is used for short strings not containing free-form language,
such as tag names, wiki page titles, urls, status fields, etc.
API changes:
* Add the `search[original_title_matches]`, `search[original_description_matches]`,
`search[translated_title_matches]`, `search[translated_description_matches]` params
to /artist_commentaries and /artist_commentary_versions.
* Remove the `search[name_matches]` and `search[group_name_matches]` params from /artist_versions.
* Remove the `search[title_matches]` param from /wiki_page_versions.
* Change the `search[name_matches]` param on /pools, /favorite_groups, and /pool_versions
to do a full-text search instead of a substring match.
Add a `visible_for_search` method to ApplicationPolicy that lets us
define which records a user is allowed to see when searching by certain fields.
For example, when a normal user searches for post flags by flagger name,
they're only allowed to see their own flags, not flags by other users.
But when a mod searches for flags by flagger name, they're allowed to
see all flags, except for flags on their own uploads.
This framework lets us define these rules in the `visible_for_search`
method in the model's policy class, rather than as special cases in the
`search` method of each model.
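A loose sketch of the idea, with a hypothetical method body rather than the real PostFlagPolicy code:
class PostFlagPolicy < ApplicationPolicy
  # Called when a search filters on the given attribute; returns the subset of
  # records the current user is allowed to see for that search.
  def visible_for_search(relation, attribute)
    case attribute
    when :creator_id, :creator_name
      if user.is_moderator?
        relation.where.not(post: user.posts) # mods see all flags except those on their own uploads
      else
        relation.where(creator_id: user.id)  # normal users only see their own flags
      end
    else
      relation
    end
  end
end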
Move the `search_attributes` helper methods inside the Searchable
concern into a SearchContext helper class. This is so we don't have to
pass `params` and `current_user` down through a bunch of different
helper methods, and so that these private helper methods don't pollute
the object's namespace.
Track the history of the tag `category` and `is_deprecated` fields in
the `tag_versions` table.
Add generic Versionable and VersionFor concerns that encapsulate most
of the history-tracking logic. These concerns are designed to make it
easy to add history tracking to any model.
There are a couple of notable differences between tag versions and other versions:
* There is no 1 hour edit merge window. Every change to the `category`
and `is_deprecated` fields produces a new version in the tag history.
* Versions aren't created when a tag is created. The tag's initial
version isn't created until *after* the tag is edited for the first time.
For example, if you change the category of a tag that was last updated
10 years ago, that will create an initial version of the tag backdated
to 10 years ago, plus a new version for your edit.
This is for a few reasons:
* So that we don't have to create new tag versions every time a new tag
is created. This would be wasteful because most tags never have their
category or deprecation status change.
* So that if you make a typo tag, your name isn't recorded in the tag's
history forever.
* So that we can create new tags in various places without having to know
who created the tag (which may be unknown if the current user isn't set).
* Because we don't know the full history of most tags, we have to
deal with incomplete histories anyway.
This has a few important consequences:
* Most tags won't have any tag versions. They only gain tag versions if
they're edited.
* You can't track /tag_versions to see newly created tags. It only
shows changes to already existing tags.
* Tag version IDs won't be in strict chronological order. Higher IDs may
have created_at timestamps before lower IDs. For example, if you
change the category of a tag that is 10 years old, that will create an
initial version with a high ID, but with a created_at timestamp dated
to 10 years ago.
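A loose sketch of the backdating logic (assuming a `versions` association; the real Versionable and
VersionFor concerns are more general and their API differs):
# Called after saving a tag whose category or is_deprecated changed.
def create_tag_versions!(tag)
  if tag.versions.none?
    # First edit ever: record the tag's previous state as the initial version,
    # backdated to the time the tag was last updated before this edit.
    tag.versions.create!(
      category: tag.category_before_last_save,
      is_deprecated: tag.is_deprecated_before_last_save,
      created_at: tag.updated_at_before_last_save
    )
  end
  # Then record a new version for the edit itself.
  tag.versions.create!(category: tag.category, is_deprecated: tag.is_deprecated)
end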
Fixes #4402: Track tag category changes
Switch autocomplete to match individual words in the tag, instead of
only matching the start of the tag.
For example, "hair" matches any tag containing the word "hair", not just tags
starting with "hair". "long_hair" matches all tags containing the words "long"
and "hair", which includes "very_long_hair" and "absurdly_long_hair".
Words can be in any order and words can be left out. So "closed_eye" matches
"one_eye_closed". "asuka_langley_souryuu" matches "souryuu_asuka_langley".
This has several advantages:
* You can search characters by first name. For example, "miku" matches "hatsune_miku".
"zelda" matches both "princess_zelda" and "the_legend_of_zelda".
* You can find the right tag even if you get the word order wrong, or forget a word.
For example, "eyes_closed" matches "closed_eyes". "hair_over_eye" matches "hair_over_one_eye".
* You can find more related tags. For example, searching "skirt" shows all tags
containing the word "skirt", not just tags starting with "skirt".
The downside is this may break muscle memory by changing the autocomplete order of
some tags. This is an acceptable trade-off.
You can get the old behavior by writing a "*" at the end of the tag. For
example, searching "skirt*" gives the same results as before.
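Conceptually, the word matching described above works along these lines (a simplified Ruby sketch;
the real matching runs as a database query and also handles ranking, aliases, etc.):
# Each word of the query must match the start of some word in the tag.
def word_match?(query, tag)
  query.split("_").all? do |query_word|
    tag.split("_").any? { |tag_word| tag_word.start_with?(query_word) }
  end
end
word_match?("closed_eye", "one_eye_closed")                   # => true
word_match?("asuka_langley_souryuu", "souryuu_asuka_langley") # => true
word_match?("hair", "very_long_hair")                         # => true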
Add a database model for storing AI-predicted tags, and add a UI for browsing and searching these tags.
AI tags are generated by the Danbooru Autotagger (https://github.com/danbooru/autotagger). See that
repo for details about the model.
The database schema is `ai_tags (media_asset_id integer, tag_id integer, score smallint)`. This is
designed to be as space-efficient as possible, since in production we have over 300 million
AI-generated tags (6 million images with ~50 tags each). This amounts to over 10GB of data, plus
indexes.
You can search for AI tags using e.g. `ai:scenery`. You can do `ai:scenery -scenery` to find posts
where the scenery tag is potentially missing, or `scenery -ai:scenery` to find posts that are
potentially mistagged (or more likely where the AI missed the tag).
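Conceptually, an `ai:scenery -scenery` search boils down to something like this (the join columns
here are assumptions; the real query is generated by the post search engine):
-- Posts the AI tagged as scenery that don't actually have the scenery tag.
SELECT posts.*
FROM posts
JOIN ai_tags ON ai_tags.media_asset_id = posts.media_asset_id
JOIN tags ON tags.id = ai_tags.tag_id
WHERE tags.name = 'scenery'
  AND NOT ('scenery' = ANY (string_to_array(posts.tag_string, ' ')));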
You can browse AI tags at https://danbooru.donmai.us/ai_tags. On this page you can filter by
confidence level. You can also search unposted media assets by AI tag.
To generate tags, use the `autotag` script from the Autotagger repo, something like this:
docker run --rm -v ~/danbooru/public/data/360x360:/images ghcr.io/danbooru/autotagger ./autotag -c -f /images | gzip > tags.csv.gz
To import tags, use the fix script in script/fixes/. Expect a Danbooru-size dataset to take
hours to days to generate tags, then 20-30 minutes to import. Currently this all has to be done by hand.
Switch the post search engine over to the new PostQuery parser. The new
engine fully supports AND, OR, and NOT operators, as well as grouping
expressions with parentheses.
Highlights:
New OR operator:
* `skirt or dress` (same as `~skirt ~dress`)
Tags can be grouped with parentheses:
* `1girl (skirt or dress)`
* `(blonde_hair blue_eyes) or (red_hair green_eyes)`
* `~(blonde_hair blue_eyes) ~(red_hair green_eyes)` (same as above)
* `(pantyhose or thighhighs) (black_legwear or brown_legwear)`
* `(~pantyhose ~thighhighs) (~black_legwear ~brown_legwear)` (same as above)
Metatags can be OR'd together:
* `user:evazion or fav:evazion`
* `~user:evazion ~fav:evazion`
Wildcard tags can be combined with either AND or OR:
* `black_* white_*` (find posts with at least one black_* tag AND one white_* tag)
* `black_* or white_*` (find posts with at least one black_* tag OR one white_* tag)
* `~black_* ~white_*` (same as above)
See 4c7cfc73 for more syntax examples.
Fixes #4949: And+or search?
Fixes #5056: Wildcard searches return unexpected results when combined with OR searches
Add the ability to search jobs on the /jobs page by job type or by status.
Fixes #2577 (Search filters for delayed jobs). This wasn't possible
before with DelayedJob because it stored the job data in a YAML string,
which made it difficult to search jobs by type. GoodJob stores job data
in a JSON object, which is easier to search in Postgres.
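For example, with GoodJob the job class lives inside a jsonb column, so filtering by type is a plain
JSON lookup (a sketch assuming GoodJob's `serialized_params` jsonb column; the job class name here is
made up):
SELECT * FROM good_jobs
WHERE serialized_params->>'job_class' = 'ProcessUploadJob';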
Optimize searches using the `search[user_name]=...` URL parameter. If
we're not doing a wildcard search, then do a regular user lookup, which
generates better SQL.
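A rough sketch of the idea (the method name here is illustrative, not the actual Danbooru code):
# Only fall back to the slower wildcard name match when the value contains a wildcard.
# NOTE: a real implementation would also escape LIKE metacharacters in the name.
def search_by_user_name(relation, name)
  if name.include?("*")
    relation.where(user_id: User.where("name ILIKE ?", name.tr("*", "%")).select(:id))
  else
    user = User.find_by(name: name)
    user ? relation.where(user_id: user.id) : relation.none
  end
end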
Make private favorites and upvotes a Gold-only account option.
Existing Members with private favorites enabled are allowed to keep it
enabled, as long as they don't disable it. If they disable it, they
can't re-enable it without upgrading to Gold first.
This is a Gold-only option to prevent uploaders from creating multiple
accounts to upvote their own posts. If private upvotes were allowed for
Members, then it would be too easy to use fake accounts and private
upvotes to upvote your own posts.
Refactor full-text search on several tables (comments, dmails,
forum_posts, forum_topics, notes, and wiki_pages) to use to_tsvector
expression indexes instead of dedicated tsvector columns. This way
full-text search works the same way across all tables.
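A sketch of the expression-index approach for one of these tables (the index name and the text search
configuration are illustrative):
-- The index is built on the same expression the searches use, so the planner can
-- use it without a dedicated tsvector column or an update trigger.
CREATE INDEX index_comments_on_to_tsvector_body
  ON comments USING gin (to_tsvector('english', body));
SELECT * FROM comments
WHERE to_tsvector('english', body) @@ plainto_tsquery('english', 'touhou');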
API changes:
* Changed /wiki_pages.json?search[body_matches] to match against only
the body. Before `body_matches` matched against both the title and the body.
* Added /wiki_pages.json?search[title_or_body_matches] to match against
both the title and the body.
* Fixed /dmails.json?search[message_matches] to match against both the
title and body when doing a wildcard search. Before, a wildcard search
only matched against the body.
* Added /dmails.json?search[body_matches] to match against only the dmail body.
Change the wiki_pages tsvector_update_trigger to use
`pg_catalog.english` instead of `public.danbooru`. This changes how wiki
page text is parsed for full-text search to use the standard English
parser instead of test_parser. This is to prepare for dropping
test_parser. Using test_parser here was wrong anyway because it meant
that punctuation wasn't removed from words when indexing wiki pages for
full-text search.
Use the `string_to_array(tag_string, ' ')` index instead of the
`tag_index` for tag searches. The string_to_array index lets us treat
the tag_string as an array for searching purposes. This lets us get rid
of the tag_index column and the test_parser dependency in the future.
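That is, tag membership tests become array containment tests that can use a GIN index on the
expression, roughly like this (the index name is illustrative):
CREATE INDEX index_posts_on_string_to_array_tag_string
  ON posts USING gin (string_to_array(tag_string, ' '));
-- A single-tag search then looks something like:
SELECT * FROM posts
WHERE string_to_array(tag_string, ' ') @> ARRAY['long_hair'];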
Fix exception during https://danbooru.donmai.us/posts/random?tags=ordfav:nonamethanks
Before, we were doing a query like this:
SELECT
"posts".*
FROM
"posts"
INNER JOIN
"favorites" ON "favorites"."post_id" = "posts"."id"
WHERE
(favorites.user_id % 100 = 64 AND favorites.user_id = 52664)
AND "posts"."id" = 343894
ORDER BY
favorites.id DESC,
posts.id DESC,
ID=343894 DESC
but `ID=? DESC` is ambiguous during an ordfav: search because of the
join on the favorites table. The fix is to qualify the reference as
`posts.id`.
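With the fix, the ORDER BY clause comes out along these lines (a sketch of the corrected query, not
verbatim output):
ORDER BY
  favorites.id DESC,
  posts.id DESC,
  posts.id = 343894 DESC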
Make it so that when a user removes their own vote, the vote is soft
deleted (the is_deleted flag is set) instead of hard deleted.
Changes:
* Add is_deleted flag to comment votes.
* Relax uniqueness constraint so you can have multiple deleted votes on
the same comment. You can still only have one active vote on the comment.
* Add `soft_delete` method to Deletable concern.
Changes to bans:
* Change the `expires_at` field to `duration`.
* Make moderators choose from a fixed set of standard ban lengths,
instead of allowing arbitrary ban lengths.
* List `duration` in seconds in the /bans.json API.
* Dump bans to BigQuery.
Note that some old bans have a negative duration. This is because their
expiration date was before their creation date, which is because in 2013
bans were migrated to Danbooru 2 and the original ban creation dates
were lost.