Refactor full-text search on several tables (comments, dmails,
forum_posts, forum_topics, notes, and wiki_pages) to use to_tsvector
expression indexes instead of dedicated tsvector columns, so that
full-text search works consistently across all tables.
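As a rough illustration, the expression-index approach looks something
like this (Rails migration syntax; the index name and query shape are
assumptions, not the exact code):

    # Hypothetical sketch, inside a migration: replace the dedicated
    # tsvector column with a GIN expression index over the text itself.
    add_index :comments, "to_tsvector('english', body)",
              using: :gin, name: "index_comments_on_to_tsvector_body"

    # Searches then match against the same expression, which Postgres
    # satisfies with the index:
    Comment.where("to_tsvector('english', body) @@ websearch_to_tsquery('english', ?)", "touhou")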
API changes:
* Changed /wiki_pages.json?search[body_matches] to match against only
the body. Previously, `body_matches` matched against both the title and
the body.
* Added /wiki_pages.json?search[title_or_body_matches] to match against
both the title and the body.
* Fixed /dmails.json?search[message_matches] to match against both the
title and body when doing a wildcard search. Previously, a wildcard
search matched only the body.
* Added /dmails.json?search[body_matches] to match against only the dmail body.
Make it so that when a database call inside a `with_timeout` block times
out, the error logged to New Relic is marked as expected. This is so
that expected timeouts, such as those from calculating search counts or
generating related tags for the sidebar, don't count against the error
rate.
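A rough sketch of the idea (the real `with_timeout` and its error
handling may differ; `expected: true` is the actual New Relic option):

    # Hypothetical sketch: run a block under a Postgres statement_timeout
    # and report timeouts to New Relic as expected errors.
    def with_timeout(milliseconds)
      ActiveRecord::Base.connection.execute("SET statement_timeout = #{milliseconds.to_i}")
      yield
    rescue ActiveRecord::QueryCanceled => e
      # Marking the error as expected keeps it out of the error rate.
      NewRelic::Agent.notice_error(e, expected: true)
      nil
    ensure
      ActiveRecord::Base.connection.execute("SET statement_timeout = 0")
    end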
Drop the final dependency on the Postgres test_parser extension.
We also have to remove references to test_parser from the migration
where it was first defined, otherwise replaying all migrations from the
beginning would fail. This normally isn't done outside of testing.
After this, it should be possible to use a vanilla install of Postgres
with Danbooru. It's still recommended to use Danbooru's Docker image for
Postgres (https://ghcr.io/danbooru/postgres), as other Postgres extensions
may be necessary in the future.
Restructure the Dockerfile and the CSS/JS files so that we only rebuild
the CSS and JS when they change, not on every commit.
Previously, rebuilding the Docker image took several minutes after
every commit, even when the JS/CSS files didn't change. This also made
pulling images slower.
This requires refactoring the CSS and JS to not use embedded Ruby (ERB)
templates, since this made the CSS and JS dependent on the Ruby
codebase, which is why we had to rebuild the assets after every Ruby
change.
Move all the code for defining tag categories from the config file to
TagCategory. It didn't belong in the config because it's not possible to
add new tag categories purely in the config without editing other things
like the CSS.
Also change it so that tag colors are hardcoded in the CSS instead of
generated using ERB. Generating the CSS with ERB meant that the Docker
build had to recompile it on every commit, even when nothing had
changed, because the CSS depended on Ruby code that we couldn't
guarantee was unchanged between commits.
Try to optimize certain types of common slow searches:
* Searches for mutually-exclusive tags (e.g. `1girl multiple_girls`,
`touhou solo -1girl -1boy`)
* Relatively large tags that are heavily skewed towards old posts
(e.g. lucky_star, haruhi_suzumiya_no_yuuutsu, inazuma_eleven_(series),
imageboard_desourced).
* Mid-sized tags in the <30k post range that Postgres thinks are big
enough to warrant a post id index scan, but for which a tag index scan
is faster.
The general pattern is Postgres not using the tag index because it
thinks scanning down the post id index would be faster, but it's
actually much slower because it degrades to a full table scan. This
usually happens when Postgres thinks a tag is larger or more common than
it really is. Here we try to force Postgres into using the tag index
when we know the search is small.
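One way to pin the planner down is to materialize the tag lookup before
ordering, sketched below (an illustration of the technique, not
necessarily the query Danbooru generates):

    # Hypothetical sketch: resolve the tag condition through the tag
    # (GIN) index inside a materialized CTE, so the planner can't switch
    # to walking down the post id index, then order the small result set.
    tags = ["1girl", "multiple_girls"]
    Post.find_by_sql([<<~SQL, tags])
      WITH candidates AS MATERIALIZED (
        SELECT id FROM posts WHERE string_to_array(tag_string, ' ') @> ARRAY[?]
      )
      SELECT posts.* FROM posts
      JOIN candidates ON candidates.id = posts.id
      ORDER BY posts.id DESC LIMIT 20
    SQL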
One case that is still slow is `2girls -multiple_girls`. This returns no
results, but we can't know that without searching all of `2girls`. The
general case is searching for `A -B` where A is a subset of B and A and B
are both large tags.
Hopefully fixes #581, #654, #743, #1020, #1039, #1421, #2207, #4070,
#4337, #4896, and various other issues raised over the years regarding
slow searches.
When a search is performed, we cache the post count so we don't have to
calculate it again every time the user switches pages. However, if the
count timed out, it wasn't cached, so we redid the slow count on every
page load. This usually happens on multi-tag searches that return a lot
of results, `1girl solo` for example.
This changes it so that the count is cached even when it times out. This
will speed up large multi-tag searches.
This also changes it so that the count is cached for a fixed 5 minutes.
Previously the expiry varied with the size of the count, but this
probably made little difference.
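A minimal sketch of the caching behavior, assuming a Rails.cache-backed
helper (the key and method names are illustrative):

    # Hypothetical sketch: cache the count for a fixed 5 minutes, and
    # cache nil when the count times out so the slow count isn't
    # repeated on every page load.
    def cached_post_count(tags)
      Rails.cache.fetch("post-count:#{tags}", expires_in: 5.minutes) do
        Post.tag_match(tags).count
      rescue ActiveRecord::QueryCanceled
        nil # Rails caches nil block results by default, so the timeout is cached too.
      end
    end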
Change the wiki_pages tsvector_update_trigger to use
`pg_catalog.english` instead of `public.danbooru`. This changes how wiki
page text is parsed for full-text search to use the standard English
parser instead of test_parser. This is to prepare for dropping
test_parser. Using test_parser here was wrong anyway because it meant
that punctuation wasn't removed from words when indexing wiki pages for
full-text search.
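Sketched as raw SQL inside a migration (the trigger and column names
here are assumptions):

    # Hypothetical sketch: point the trigger at the standard English
    # configuration instead of the test_parser-based one.
    execute <<~SQL
      DROP TRIGGER trigger_wiki_pages_on_update ON wiki_pages;
      CREATE TRIGGER trigger_wiki_pages_on_update
        BEFORE INSERT OR UPDATE ON wiki_pages FOR EACH ROW
        EXECUTE PROCEDURE tsvector_update_trigger(body_index, 'pg_catalog.english', body, title);
    SQL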
Use the `string_to_array(tag_string, ' ')` index instead of the
`tag_index` for tag searches. The string_to_array index lets us treat
the tag_string as an array for searching purposes. This lets us get rid
of the tag_index column and the test_parser dependency in the future.
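For illustration, the index and the containment query it serves might
look like this (names assumed):

    # Hypothetical sketch: a GIN index over the tag string split into an
    # array, plus an array-containment query that uses it.
    add_index :posts, "string_to_array(tag_string, ' ')",
              using: :gin, name: "index_posts_on_string_to_array_tag_string"

    # "Does this post have all of these tags?" becomes array containment:
    Post.where("string_to_array(tag_string, ' ') @> ARRAY[?]", ["touhou", "solo"])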
Stop updating the fav_string attribute on posts. The column still exists
on the table, but is no longer used or updated.
Like the pool_string in 7d503f08, the fav_string was used in the past to
facilitate `fav:X` searches. Posts had a hidden fav_string column that
contained a list of every user who favorited the post. These were
treated like fake hidden tags on the post so that a search for `fav:X`
was treated like a tag search.
The fav_string attribute has been unused for search purposes for a while
now. It was only kept because of technicalities that required
departitioning the favorites table first (340e1008e) before it could be
removed. Basically, removing favorites with `@favorite.destroy` was
slow because Rails always deletes objects by ID, but we didn't have an
index on favorites.id, and we couldn't easily add one until the
favorites table was departitioned.
Fixes #4652. See https://github.com/danbooru/danbooru/issues/4652#issuecomment-754993802
for more discussion of issues caused by the fav_string (in short: write
amplification, post table bloat, and favorite inconsistency problems).
Optimize counting the number of posts returned by fav:<name> and
pool:<name> searches. Use cached counts to avoid slow count(*) queries
for users with lots of favorites.
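A rough sketch of the shortcut, assuming cached counter attributes on
users and pools (the attribute and method names are illustrative):

    # Hypothetical sketch: answer fav:<name> and pool:<name> counts from
    # cached counters instead of running count(*) over the whole table.
    def fast_count(query)
      case query
      when /\Afav:(\w+)\z/  then User.find_by(name: $1)&.favorite_count
      when /\Apool:(\w+)\z/ then Pool.find_by(name: $1)&.post_count
      else Post.tag_match(query).count
      end
    end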
Include the favorites table in the nightly database dumps in BigQuery.
Previously we couldn't do this because we didn't have an index on
the favorite ID, which we needed to iterate across the table efficiently.
Note that this doesn't include private favorites. Note also that if a
user switches their favorites from private to public, then their
favorites will begin to appear in these dumps.
Order https://danbooru.donmai.us/favorites.json by favorite ID instead
of by post ID. This way you can get a feed of recently added favorites.
Previously this wasn't possible because we didn't have an index on
favorite ID.
Merge the 100 favorite subtables into a single table.
Previously the favorites table was partitioned by user id into 100
subtables to try to make searching by user id faster. This wasn't
really necessary, and was probably slower than simply creating an index
on (favorites.user_id, favorites.id) to satisfy ordfav searches. B-tree
indexes are logarithmic, so splitting an index into 100 pieces doesn't
make it 100 times faster to search; it just removes a layer or two from
the tree.
This also adds a uniqueness index on (user_id, post_id) to prevent
duplicate favorites. Previously we had to check for duplicates at the
application layer, which required careful locking to do it correctly.
Finally, this adds an index on favorites.id, which was surprisingly
missing before. This made ordering and deleting favorites by id really
slow because it degraded to a sequential scan.
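In migration form, the new indexes look roughly like this (index names
and exact options are assumptions):

    # Hypothetical sketch of the indexes on the merged favorites table:
    add_index :favorites, [:user_id, :post_id], unique: true  # no duplicate favorites
    add_index :favorites, [:user_id, :id]                     # ordfav:<name> searches
    add_index :favorites, :id                                 # order/delete by id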
Add fix script to remove duplicate favorites. When a user has duplicate
favorites on the same post, the earliest favorite will be kept and the
rest will be removed.
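A sketch of what such a fix script might do (model and column names
assumed; the real script may differ):

    # Hypothetical sketch: for each (user_id, post_id) pair with more
    # than one favorite, keep the earliest row and destroy the rest.
    Favorite.group(:user_id, :post_id).having("count(*) > 1")
            .pluck(:user_id, :post_id).each do |user_id, post_id|
      Favorite.where(user_id: user_id, post_id: post_id)
              .order(:id).offset(1).each(&:destroy)
    end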
Let all users have unlimited favorites. Formerly the limit was 10k
favorites for regular members, 20k for Gold, and unlimited for Platinum.
Limiting favorites doesn't make sense since upvotes are unlimited.
Stop using the pool_string attribute on posts:
* Stop updating it when adding or removing posts from pools.
* Stop returning pool_string in the /posts.json API.
* Stop including the `data-pools` attribute on thumbnails.
The pool_string attribute was used in the past to facilitate pool:X
searches. Posts had a hidden pool_string attribute that contained a list
of every pool the post belonged to. These pools were treated like fake
hidden tags on the post and a search for `pool:X` was treated like a tag
search.
The pool_string hasn't been used for this purpose in a long time, and
was only maintained for API compatibility. Getting rid of it eliminates
a bunch of legacy cruft around adding and removing posts from pools.
If you need to see which pools a post belongs to, do this:
* https://danbooru.donmai.us/pools.json?search[post_ids_include_any]=318550
The `data-pools` attribute on thumbnails was used by some people to add
custom borders to pooled posts with custom CSS. This will no longer
work. This was already broken because it included things like collection
pools and deleted pools, which you probably didn't want. Use a
userscript to add this attribute back to thumbnails if you need it.
Start storing the duration of animations and videos in the `duration`
field on the media_assets table. This had to wait until 3d30bfd69 was
deployed, which had to wait until Postgres was upgraded in order to add
the duration column to the media_assets table without downtime.
Also add a fix script to backfill the duration on existing posts. Usage:
TAGS=animated ./script/fixes/079_fix_duration.rb
Fix certain animated PNGs returning NaN as the duration because the
frame rate was reported as "0/0" by FFmpeg. This happens when the
animation has zero delay between frames, which nominally means an
infinitely fast frame rate, but in practice browsers cap it at around
10 FPS. The exact frame rate browsers use is unspecified and
implementation-defined.
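A sketch of the guard (assuming the frame rate arrives as a
"numerator/denominator" string from ffprobe; names illustrative):

    # Hypothetical sketch: parse an FFmpeg frame rate like "30000/1001",
    # treating a "0/0" rate (zero-delay APNG frames) as unknown instead
    # of letting 0.0 / 0.0 produce NaN.
    def parse_frame_rate(raw)
      num, den = raw.split("/").map(&:to_f)
      den = 1.0 if den.nil? # plain "30" style rates
      return nil if den.zero? || num.zero?
      num / den
    end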
Update the Postgres client binaries (psql et al.) to version 14.0. This
is so they match the server version, and so that pg_amcheck is
available, which was introduced in 14.0.
This requires updating the base image to Ubuntu 21.04 at the same time
because the Postgres repo doesn't support version 14.0 on Ubuntu 20.10.
* Skip Nijie tests because they fail a lot due to Nijie rate limiting us.
* Skip ArtStation downloads tests because they sometimes return different file sizes.
* Fix random duplicate favgroup errors because favgroup names weren't random enough.