Fix bug where jobs had the opposite of the intended priority. Populating saved searches had the
highest priority, while processing uploads had the lowest priority.
Caused by Delayed::Job and GoodJob having opposite interpretations of job priorities. In
Delayed::Job, lower numbers have higher priority, while in GoodJob, higher numbers have higher
priority. This was missed when migrating from Delayed::Job to GoodJob.
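A hedged sketch of the fix, assuming ActiveJob's `queue_with_priority` is used to declare the
values (the job names and numbers here are illustrative, not the actual Danbooru values):

```ruby
# Under Delayed::Job, lower numbers ran first, so uploads got a small number and
# saved searches a large one. Under GoodJob, higher numbers run first, so the same
# values produced the opposite ordering. The fix is to flip the values so upload
# processing comes out on top again.

class ProcessUploadJob < ApplicationJob
  queue_with_priority 10    # highest priority under GoodJob's higher-runs-first rule
end

class PopulateSavedSearchJob < ApplicationJob
  queue_with_priority -10   # lowest priority, so it can't starve upload processing
end
```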
* Don't delete the user's favorites unless private favorites are enabled. The general rule is that
public account activity is kept and private account activity is deleted.
* Delete the user's API keys, forum topic visits, private favgroups, downvotes, and upvotes (if privacy is enabled).
* Reset all of the user's account settings to default. This means custom CSS is deleted, which it wasn't before.
* Delete everything but the user's name and password asynchronously.
* Don't log the current user out if it's the owner deleting another user's account.
* Fix #5067 (Mod actions sometimes not created for user deletions) by wrapping the deletion process in a transaction, as sketched below.
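A minimal sketch of the transactional part, assuming a service object shaped roughly like the
user deletion code; the class and helper names below (DeleteUserJob, ModAction.log!, the
is_deleted flag) are assumptions, not the actual implementation:

```ruby
class UserDeletion
  attr_reader :user, :deleter

  def initialize(user, deleter)
    @user, @deleter = user, deleter
  end

  def delete!
    # Log the mod action in the same transaction as the synchronous part of the
    # deletion, so the mod action can't be lost partway through (issue #5067).
    ApplicationRecord.transaction do
      user.update!(is_deleted: true)      # hypothetical flag
      ModAction.log!(:user_delete, user)  # hypothetical helper
    end

    # Everything except the user's name and password is scrubbed asynchronously.
    DeleteUserJob.perform_later(user)
  end
end
```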
Fix StatementInvalid exception when uploading https://files.catbox.moe/vxoe2p.mp4.
This was a result of multiple bugs:
* First, generating thumbnails for the video failed. This was because
the video uses the AV1 codec, which FFmpeg failed to decode. It failed
because our version of FFmpeg was built without the `--enable-libdav1d`
flag, so it uses the builtin AV1 decoder, which apparently can't
handle this particular video (it spews a bunch of errors about "Failed
to get pixel format" and "missing sequence header" and "failed to get
reference frame").
* Because generating the thumbnails failed, an exception was raised. We
tried to save the error message in the upload_media_assets.error
field. However, this also failed because the error message was 77kb
long (it contained the entire output of the ffmpeg command), but the
`upload_media_assets` table had a btree index on the `error` column,
which meant the maximum length of the error column was limited to
~2.7kb. This led to a StatementInvalid exception being raised.
* Because the StatementInvalid exception was raised while we were trying
to set the upload media asset's status to `failed`, the upload was
left stuck in the `processing` state rather than being set to the
`failed` state.
* Because the upload was stuck in the `processing` state, the upload
page would hang forever waiting for the upload to complete.
The fixes are to:
* Build FFmpeg with `--enable-libdav1d` to use libdav1d for decoding AV1
videos instead of the builtin AV1 decoder.
* Remove the index on the `upload_media_assets.error` column so that
setting overly long error messages won't fail.
* Catch unexpected exceptions in ProcessUploadMediaAssetJob so we can
mark uploads as failed, even if `process_upload!` itself fails because
it raises an unexpected exception inside its own exception handler.
* Check that the video is playable with `MediaFile::Video#is_corrupt?` before
allowing it to be uploaded. This way we can return a better error
message if we can't generate thumbnails because the video isn't
playable. This requires decoding the entire video, so it means uploads
may take several seconds longer for long videos. It's also a security
risk in case ffmpeg has any bugs.
* Define `MediaAsset#preview!` as raising an exception on error, so
it's clear that generating thumbnails can fail. Define `MediaAsset#preview`
as returning nil on error for when we don't care about the cause of
the error.
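A hedged sketch of the bang/non-bang pair (the method bodies are illustrative; the real
MediaAsset code goes through the media file and storage layers):

```ruby
class MediaAsset < ApplicationRecord
  # Generates a thumbnail, raising on failure so callers that need to react to the
  # failure (like the upload processing job) can't silently ignore it.
  def preview!(width, height)
    media_file.preview!(width, height)
  end

  # Returns nil on failure, for callers that only want a thumbnail if one happens
  # to be available and don't care why it couldn't be generated.
  def preview(width, height)
    preview!(width, height)
  rescue StandardError
    nil
  end
end
```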
Log the following information in email headers:
* X-Danbooru-User: the user's name and ID.
* X-Danbooru-IP: the user's IP.
* X-Danbooru-Session: the user's session ID.
* X-Danbooru-URL: the page that triggered the email.
* X-Danbooru-Job-Id: the ID of the background job that sent the email.
* X-Danbooru-Enqueued-At: when the email was queued as a background job.
* X-Danbooru-Dmail: for Dmail notifications, the link to the Dmail.
* X-Request-Id: the request ID of the HTTP request that triggered the email.
Also log an event in the APM whenever we send an email.
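A hedged sketch of how the headers might be attached in a mailer (the mailer name, value
formats, and URL helper are assumptions; ActionMailer's `headers[]=` is the only API relied on):

```ruby
class UserMailer < ApplicationMailer
  def dmail_notice(dmail)
    headers["X-Danbooru-User"] = "#{dmail.to.name} (#{dmail.to.id})"
    headers["X-Danbooru-Dmail"] = dmail_url(dmail)   # hypothetical URL helper
    headers["X-Request-Id"] = Current.request_id     # hypothetical request store

    mail(to: dmail.to.email, subject: "You received a new message")
  end
end
```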
Remove the `CurrentUser.ip_addr` global variable and replace it with
`request.remote_ip`. Before, we had to track the current user's IP in a
global variable so that when we edited a post, for example, we could pass
the user's IP down to the model and save it in the post_versions table.
Now that we no longer save IPs in version tables, we don't need a global
variable to access the current user's IP outside of controllers.
Make the upload page automatically detect when a source URL has multiple images
and let the user choose which images to post.
For example, when uploading a Twitter or Pixiv post with more than one image, we
direct the user to a page showing a thumbnail for each image and letting
them choose which ones to post.
This is similar to the batch upload page, except we actually download each image
in the background, instead of just hotlinking or proxying the thumbnails through
our servers. This avoids various problems with proxying and makes new features
possible, like showing which images in the batch have already been posted.
* Fix `UploadService is not a class` error.
* Update the list of available job classes (remove UploadPreprocessorDelayedStartJob and UploadServiceDelayedStartJob; add ProcessUploadJob).
Rework the upload process so that files are saved to Danbooru first
before the user starts tagging the upload.
The main user-visible change is that you have to select the file first
before you can start tagging it. Saving the file first lets us fix a
number of problems:
* We can check for dupes before the user tags the upload.
* We can perform dupe checks and show preview images for users not using the bookmarklet.
* We can show preview images without having to proxy images through Danbooru.
* We can show previews of videos and ugoira files.
* We can reliably show the filesize and resolution of the image.
* We can let the user save files to upload later.
* We can get rid of a lot of spaghetti code related to preprocessing
uploads. This was the cause of most of the weird "md5 confirmation doesn't match md5" errors.
(Not all of these are implemented yet.)
Internally, uploading is now a two-step process: first we create an upload
object, then we create a post from the upload. This is how it works:
* The user goes to /uploads/new and chooses a file or pastes a URL into
the file upload component.
* The file upload component calls `POST /uploads` to create an upload.
* `POST /uploads` immediately returns a new upload object in the `pending` state.
* Danbooru starts processing the upload in a background job (downloading,
resizing, and transferring the image to the image servers).
* The file upload component polls `/uploads/$id.json`, checking the
upload `status` until it returns `completed` or `error`.
* When the upload status is `completed`, the user is redirected to /uploads/$id.
* On the /uploads/$id page, the user can tag the upload and submit it.
* The upload form calls `POST /posts` to create a new post from the upload.
* The user is redirected to the new post.
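A hedged sketch of the same flow driven through the JSON API from a script (parameter names
and authentication are assumptions; only the endpoints and statuses described above are taken
as given):

```ruby
require "net/http"
require "json"

BASE = "https://danbooru.donmai.us"

# 1. Create the upload from a source URL; it comes back in the `pending` state.
res = Net::HTTP.post_form(URI("#{BASE}/uploads.json"), "upload[source]" => "https://example.com/image.png")
upload = JSON.parse(res.body)

# 2. Poll the upload until the background job finishes downloading and processing it.
until %w[completed error].include?(upload["status"])
  sleep 1
  upload = JSON.parse(Net::HTTP.get(URI("#{BASE}/uploads/#{upload["id"]}.json")))
end

# 3. Create a post from the completed upload; this is where tagging happens.
Net::HTTP.post_form(URI("#{BASE}/posts.json"),
  "upload_id" => upload["id"],   # assumed parameter name
  "post[tag_string]" => "tagme",
  "post[rating]" => "s")
```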
This is the data model:
* An upload represents a set of files uploaded to Danbooru by a user.
Uploaded files don't have to belong to a post. An upload has an
uploader, a status (pending, processing, completed, or error), a
source (unless uploading from a file), and a list of media assets
(image or video files).
* There is a many-to-many relationship between uploads and media assets.
An upload can have many media assets, and a media asset can belong to
multiple uploads. Uploads are joined to media assets through an
upload_media_assets table.
An upload could potentially have multiple media assets if it's a Pixiv
or Twitter gallery. This is not yet implemented (at the moment all
uploads have one media asset).
A media asset can belong to multiple uploads if multiple people try
to upload the same file, or if the same user tries to upload the same
file more than once.
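A hedged sketch of the associations described above (the enum values are illustrative; the
real models carry more state):

```ruby
class Upload < ApplicationRecord
  belongs_to :uploader, class_name: "User"
  has_many :upload_media_assets
  has_many :media_assets, through: :upload_media_assets

  # pending -> processing -> completed | error
  enum status: { pending: 0, processing: 100, completed: 200, error: 300 }
end

class UploadMediaAsset < ApplicationRecord
  belongs_to :upload
  belongs_to :media_asset
end

class MediaAsset < ApplicationRecord
  has_many :upload_media_assets
  has_many :uploads, through: :upload_media_assets
end
```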
New features:
* On the upload page, you can press Ctrl+V to paste a URL and immediately upload it.
* You can save files for upload later. Your saved files are at /uploads.
Fixes:
* Improved error messages when uploading invalid files, bad URLs, and
when forgetting the rating.
Fix the /jobs listing to show retried jobs. GoodJob::ActiveJobJob has a
default scope that filters out retried jobs; remove it in the index
controller so we can see retried jobs.
Also fix mail delivery jobs not showing up in the dropdown in the /jobs
search form.
Add ability to search jobs on the /jobs page by job type or by status.
Fixes #2577 (Search filters for delayed jobs). This wasn't possible
before with DelayedJob because it stored the job data in a YAML string,
which made it difficult to search jobs by type. GoodJob stores job data
in a JSON object, which is easier to search in Postgres.
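A hedged sketch of what searching the jsonb job data looks like (the `serialized_params` and
`finished_at` columns are GoodJob's; the filtering in the actual controller is more general):

```ruby
# By job type: the ActiveJob class name lives inside the serialized job data.
GoodJob::ActiveJobJob.where("serialized_params ->> 'job_class' = ?", "ProcessUploadJob")

# By status: for example, jobs that haven't finished yet.
GoodJob::ActiveJobJob.where(finished_at: nil)
```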
Switch the ActiveJob backend from DelayedJob to GoodJob. Differences:
* The job worker is run with `bin/good_job start` instead of `bin/delayed_job`.
* Jobs have an 8 hour timeout instead of a 4 hour timeout.
* Jobs don't automatically retry on failure.
* Finished jobs are preserved and pruned after 7 days.
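A hedged sketch of the corresponding GoodJob configuration (the option names are GoodJob's
from around this period; the values follow the list above, and the 8-hour timeout is handled
elsewhere):

```ruby
# config/initializers/good_job.rb (sketch)
Rails.application.configure do
  config.good_job.execution_mode = :external         # worker runs via `bin/good_job start`
  config.good_job.retry_on_unhandled_error = false   # jobs don't automatically retry on failure
  config.good_job.preserve_job_records = true        # keep finished jobs...
  config.good_job.cleanup_preserved_jobs_before_seconds_ago = 7.days.to_i  # ...and prune after 7 days
end
```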
Lower the priority of the populate saved search job. This is so that
large numbers of saved searches don't overwhelm the job queue and
prevent higher priority jobs from running.
Don't delete replaced files after 30 days. There are only about 30k
replacements in total, so the cost of keeping replaced files is
negligible. It was also wrong because the media asset wasn't destroyed
too, so there were active media assets with missing files.
Stop updating the fav_string attribute on posts. The column still exists
on the table, but is no longer used or updated.
Like the pool_string in 7d503f08, the fav_string was used in the past to
facilitate `fav:X` searches. Posts had a hidden fav_string column that
contained a list of every user who favorited the post. These were
treated like fake hidden tags on the post so that a search for `fav:X`
was treated like a tag search.
The fav_string attribute has been unused for search purposes for a while
now. It was only kept because of technicalities that required
departitioning the favorites table first (340e1008e) before it could be
removed. Basically, removing favorites with `@favorite.destroy` was
slow because Rails always deletes objects by ID, but we didn't have an
index on favorites.id, and we couldn't easily add one until the
favorites table was departitioned.
Fixes #4652. See https://github.com/danbooru/danbooru/issues/4652#issuecomment-754993802
for more discussion of issues caused by the fav_string (in short: write
amplification, post table bloat, and favorite inconsistency problems).
Break the hourly/daily/weekly/monthly maintenance tasks down into
individual delayed jobs. This way if one task fails, it won't prevent
other tasks from running. Also, jobs can be run in parallel, and can be
individually retried if they fail.
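A hedged sketch of the split (the job names are illustrative; the real task list differs):

```ruby
class DailyMaintenance
  # Each task is its own ActiveJob, so one failure doesn't block the rest and a
  # failed task can be retried on its own.
  JOBS = [PruneUploadsJob, PruneBansJob, PruneExpiredPostsJob]  # hypothetical task list

  def run!
    JOBS.each(&:perform_later)
  end
end
```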
Change the way BURs are processed. Before, we spawned a background job
for each line of the BUR, then processed each job sequentially. Now, we
process the entire BUR sequentially in a single background job.
This means that:
* BURs are truly sequential now. Before, certain things like removing
aliases weren't actually performed in a background job, so they were
performed out of order, before everything else in the BUR.
* Before, if an alias or implication line failed, then subsequent alias
or implication lines would still be processed. This was because each
alias or implication line was queued as a separate job, so a failure
of one job didn't block another. Now, if any alias or implication
fails, the entire BUR will fail and stop processing after that line.
This may be good or bad, depending on whether we actually need the BUR
to be processed in order or not.
* Before, BURs were processed inside a database transaction (except for the
actual updating of posts). Now they're not. This is because we can't
afford to hold transactions open while processing long-running aliases
or implications. This means that if a BUR fails in the middle of being
initially approved, it will be left in a half-complete state. Before,
it would be rolled back and left in a pending state with no changes
performed.
* Before, only one BUR at a time could be processed. If multiple BURs
were approved at the same time, then they would queue up and be
processed one at a time. Now, multiple BURs can be processed at the
same time. This may be undesirable when processing large BURs, or BURs
that must be approved in a specific order.
* Before, large tag category changes could time out. This was because
they weren't actually performed in a background job. Now they are, so
they shouldn't time out.
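A hedged sketch of the single-job approach (the class and method names are assumptions about
the BUR processor):

```ruby
class ProcessBulkUpdateRequestJob < ApplicationJob
  def perform(bulk_update_request)
    # Commands run strictly in script order inside one job. If any command raises,
    # processing stops there and later lines are left unprocessed.
    bulk_update_request.script.lines.each do |line|
      bulk_update_request.process_command!(line)  # hypothetical
    end
  rescue StandardError => e
    bulk_update_request.update!(status: "failed")  # assumed status value
    raise e
  end
end
```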
* Add README files to several directories in app/ giving a brief
overview of some parts of Danbooru's architecture.
* Add documentation for files in config/.
Replace the old IQDB API client with a new client for the new forked
version of IQDB at https://github.com/danbooru/iqdb.
Changes:
* The /iqdb_queries endpoint now returns `hash` and `signature` fields.
The `signature` is the full decoded Haar signature, while the `hash`
is an encoded version of the signature.
* The /iqdb_queries endpoint no longer returns `width` and `height`
fields in the response (these were always 128x128).
* We no longer need the IQDBs frontend server; now we talk to the IQDB instance directly.
* We no longer send add/remove image commands to IQDB through AWS SQS;
now we send them to IQDB directly. They are sent in a delayed job so
that if IQDB is down, uploading images is still possible; the add
image commands will just get queued up.
* Fix a bug where regenerating an image's thumbnails didn't regenerate
IQDB, because IQDB silently ignored add image commands when the image
already existed in the database.
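A hedged sketch of the delayed add-image path (the client and job names are assumptions about
the new IQDB client):

```ruby
class IqdbAddPostJob < ApplicationJob
  def perform(post)
    # Talks to the forked IQDB instance directly over HTTP. If IQDB is down, the
    # job simply stays queued, so uploads aren't blocked.
    IqdbClient.new.add_post(post)  # hypothetical client API
  end
end

# Enqueued after an upload finishes, and again when thumbnails are regenerated
# (IQDB now re-adds images it has already seen instead of silently ignoring them).
IqdbAddPostJob.perform_later(post)
```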
* Export daily public database dumps to BigQuery and Google Cloud Storage.
* Only data visible to anonymous users is exported. Some tables have
null or missing fields because of this.
* The bans table is excluded because some bans have an expires_at
timestamp set beyond year 9999, which BigQuery doesn't support.
* The favorites table is excluded because it's too slow to dump (it
doesn't have an id index, which is needed by find_each).
* Version tables are excluded because dumping them every day is inefficient; streaming inserts should be used instead.
Links:
* https://console.cloud.google.com/bigquery?project=danbooru1
* https://console.cloud.google.com/storage/browser/danbooru_public
* https://storage.googleapis.com/danbooru_public/data/posts.json
Regenerate posts asynchronously using a delayed job.
Regenerating a post can be slow because it involves downloading the
original file, regenerating the thumbnails, and redistributing the new
thumbnails back to the image servers. It's better to run this in the
background, especially if a user is trying to regenerate posts in bulk.
The downside is there's no notification to the user when the regeneration
is complete. You have to check the mod actions log to see when it's finished.
Don't log mod actions when aliases, implications, or mass updates are
processed.
Originally aliases and implications were logged because they could be
approved outside of a BUR. Mass updates could also be performed by mods
without making a forum request. This is no longer the case.
They were also logged for debugging purposes. This is no longer needed.
This generated a lot of spam in the mod action logs when a large BUR was
approved.
Remove the error status from aliases and implications. Aliases and
implications normally shouldn't fail because they're validated
beforehand. If they do, just let the delayed job itself record the
failure.
Also disable the delayed job from retrying if the alias/implication
somehow fails.
Remove the pending status from tag aliases and implications.
Previously aliases would be created first in the pending state then
changed to active when the alias was later processed in a delayed job.
This meant that BURs weren't processed completely sequentially; first
all the aliases in a BUR would be created in one go, then later they
would be processed and set to active sequentially.
This was problematic in complex BURs that tried to reverse or swap
around aliases, since new pending aliases could be created before old
conflicting aliases were removed.
Bug: if a BUR contained a mass update followed by an alias, then the
alias would become active before the mass update, which could cause
the mass update to return incorrect results if both the alias and mass
update touched the same tags.
This happened because all aliases and implications in the BUR were set
to a queued state before the mass update was processed, but putting an
alias in the queued state effectively made it active.
The fix is to remove the queued state. The queued state was only used as a
debugging tool to monitor the state of BURs as they were being processed.
* Add a `rename A -> B` command for bulk update requests.
* Change mass updates to only retag the posts, not to move saved
searches or blacklists.
A tag rename does the same thing an alias does, except it doesn't
create a permanent alias. More precisely, a tag rename:
* Moves the wiki.
* Moves the artist entry.
* Moves saved searches.
* Moves blacklists.
* Merges the wikis, if both tags have wiki pages.
* Merges the artist entries, if both tags have artist pages.
* Fixes links in wiki pages to point to the new tag.
* Retags the posts.
Make PostQueryBuilder apply aliases earlier, immediately after parsing
the search.
On the post index page there are multiple places where we need to apply
aliases:
* When running the search with PostQueryBuilder#build.
* When calculating the search count with PostQueryBuilder#fast_count.
* When calculating the related tags for the sidebar.
* When tracking missed searches and popular searches for Reportbooru.
* When looking up wiki excerpts.
Applying aliases after parsing ensures we only have to apply aliases
once for all of these things.
We also normalize the order of tags in searches and strip repeated tags.
This is so that we have consistent cache keys for fast_count.
* Fixes searches for aliased tags being counted as missed searches (fixes #4433).
* Fixes wiki excerpts not showing up when searching for aliased tags.
When doing a tag search, we have to be careful about which user we're
running the search as because the results depend on the current user.
Specifically, things like private favorites, private favorite groups,
post votes, saved searches, and flagger names depend on the user's
permissions, and whether non-safe or deleted posts are filtered out
depends on whether the user has safe mode on or the hide deleted posts
setting enabled.
* Refactor internal searches to explicitly state whether they're
running as the system user (DanbooruBot) or as the current user.
* Explicitly pass in the current user to PostQueryBuilder instead of
implicitly relying on the CurrentUser global.
* Get rid of CurrentUser.admin_mode? (used to ignore the hide deleted
post setting) and CurrentUser.without_safe_mode (used to ignore safe
mode).
* Change the /counts/posts.json endpoint to ignore safe mode and the
hide deleted posts settings when counting posts.
* Fix searches not correctly overriding the hide deleted posts setting
when multiple status: metatags were used (e.g. `status:banned status:active`).
* Fix fast_count not respecting the hide deleted posts setting when the
status:banned metatag was used.
* Make scan_query, parse_query, normalize_query into instance methods
instead of class methods. This is to a) clean up the API and b)
prepare for moving certain tag utility methods into PostQueryBuilder.
* Fix a few cases where a caller used scan_query when they should have
used split_query or parse_tag_edit.
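A hedged sketch of the explicit-user style (the constructor signature is an assumption, and
`User.system` stands in for DanbooruBot):

```ruby
# A search run on behalf of the viewer: their permissions, safe mode, and
# "hide deleted posts" setting all affect the results.
PostQueryBuilder.new("fav:someuser status:deleted", CurrentUser.user).build

# An internal search run as the system user (DanbooruBot), e.g. for maintenance
# tasks: nothing is filtered out based on a particular viewer's settings.
PostQueryBuilder.new("age:<1d", User.system).build
```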
Fix exception when submitting an upload and an in-progress preprocessed
upload already exists. In this case we forgot to pass the upload params
when calling UploadService#delayed_start.
Remove code for updating forum topics when an alias or implication is
approved or rejected. This code was only used when approving single
alias or implication requests. This is no longer used now that all
alias/implication requests are done through BURs.