danbooru

Author	SHA1	Message	Date
evazion	2c33539be7	uploads: allow searching uploads and media assets by metatag. Allow searching the /uploads and /media_assets pages by the following metatags: * id: * md5: * width: * height: * duration: * mpixels: * ratio: * filesize: * filetype: * date: * age: * status:<processing\|active\|deleted\|expunged\|failed> (for /media_assets) * status:<pending\|processing\|active\|failed> (for /uploads) * is:<filetype>, is:<status> * exif: Examples: * https://betabooru.donmai.us/media_assets?search[ai_tags_match]=filetype:png * https://betabooru.donmai.us/uploads?search[ai_tags_match]=filetype:png Note that in /uploads search, the id:, date:, and age: metatags refer to the upload media asset, not the upload itself. Note also that uploads may contain multiple assets, so for example searching uploads by `filetype:png` will return all uploads containing at least one PNG file, even if they contain other non-PNG files.	2022-12-07 01:02:19 -06:00
evazion	d7d3427488	Fix #5363 : Inconsistent order of files from zip uploads. Upload files in natural order rather than archive order when uploading archive files. Before files were listed in the same order they appeared in the zip file. This could be in non-alphabetical order, or even with files from different directories interleaved between each other. Now files are uploaded in natural order, which is alphabetical order but with numbers sorted properly, so that `file-9.jpg` appears before `file-10.jpg`.	2022-12-02 18:04:45 -06:00
evazion	ba1cf14c7e	uploads: mark uploads as failed if they're stuck processing for more than 4 hours.	2022-11-20 23:41:07 -06:00
evazion	2deae38a4e	uploads: allow uploading .zip, .rar., and .7z files from disk. Allow uploading .zip, .rar, and .7z files from disk. The archive will be extracted and the images inside will be uploaded. This only works for archive files uploaded from disk, not from a source URL. Post source URLs will look something like this: "file://foo.zip/1.jpg", "file://foo.zip/2.jpg", etc. Sometimes artists uses Shift JIS or other encodings instead of UTF-8 for filenames. In these cases we just assume the filename is UTF-8 and replace invalid characters with '?', so filenames might be wrong in some cases. There are various protections to prevent uploading malicious archive files: * Archives with more than 100 files aren't allowed. * Archives that decompress to more than 100MB aren't allowed. * Archives with filenames containing '..' components aren't allowed (e.g. '../../../../../etc/passwd'). * Archives with filenames containing absolute paths aren't allowed (e.g. '/etc/passwd'). * Archives containing symlinks aren't allowed (e.g. 'foo -> /etc/passwd'). * Archive types other than .zip, .rar, and .7z aren't allowed (e.g. .tar.gz, .cpio). * File permissions, owners, and other metadata are ignored. Partial fix for #5340: Add support for extracting archive attachments from certain sources	2022-11-16 16:47:37 -06:00
evazion	88ac91f5f3	search: refactor to pass in the current user explicitly.	2022-09-22 04:31:21 -05:00
evazion	a229a6f5c4	models: remove ignored_columns declarations. These columns have been removed from the underlying database.	2022-09-20 23:09:32 -05:00
evazion	d4da8499ce	models: stop saving IP addresses in version tables. Mark various `creator_ip_addr` and `updater_ip_addr` columns as ignored and stop updating them in preparation for dropping them.	2022-09-18 03:49:17 -05:00
evazion	2350362183	uploads: add ability to search your uploads by AI tags. Add ability to search your unposted uploads using AI tags. Like with media assets, only basic tags are supported (no metatags) and complex multi-tag searches will probably be slow. The default AI tag confidence threshold is 50%. There's a hidden search[min_score] URL param that lets you change this.	2022-07-06 02:01:09 -05:00
evazion	af8ef8b277	uploads: address "Failed to replace upload_media_assets..." error Sometimes uploads fail with this error: Failed to replace upload_media_assets because one or more of the new records could not be saved. Change it so that media assets are saved individually, so that if saving any of them fails we get a better error message.	2022-05-17 18:28:13 -05:00
evazion	d9d3c1dfe4	sources: rename Sources::Strategies to Source::Extractor. Rename Sources::Strategies to Source::Extractor. A Source::Extractor represents a thing that extracts information from a given URL.	2022-03-24 03:49:44 -05:00
evazion	03560bafc6	uploads: add limit to prevent users from submitting too many uploads at once. Add a limit so that users can't upload more if they already have more than 250 images queued for upload. For example, if you upload a Pixiv post that has 200 images, then you'll have 200 queued images for upload. This will go down as the images are processed. If you exceed the limit, then trying to create new uploads will return an error. This is to prevent single users from overwhelming the site by uploading too many images at once, thereby preventing other users from uploading because the job queue is backed up and can't process new uploads by other users until existing uploads are finished.	2022-02-28 23:10:12 -06:00
evazion	26f4cf1ebd	sources: factor out Source::URL::Skeb.	2022-02-25 02:06:57 -06:00
evazion	7d49ab6130	Add Danbooru::URL class. Introduce a Danbooru::URL class for dealing with URLs. This is a wrapper around Addressable::URI that adds some additional helper methods. Most significantly, the `parse` method only allows valid http/https URLs, and it returns nil instead of raising an exception when the URL is invalid.	2022-02-22 00:17:53 -06:00
evazion	202dfe5d87	uploads: allow uploading multiple files from your computer at once. Allow uploading multiple files from your computer at once. The maximum limit is 100 files at once. There is still a 50MB size limit that applies to the whole upload. This limit is at the Nginx level. The upload widget no longer shows a thumbnail preview of the uploaded file. This is because there isn't room for it in a multi-file upload, and because the next page will show a preview anyway after the files are uploaded. Direct file uploads are processed synchronously, so they may be slow. API change: the `POST /uploads` endpoint now expects the param to be `upload[files][]`, not `upload[file]`.	2022-02-19 00:00:56 -06:00
evazion	093a808a36	Fix #4986 : Add ability to filter images in /media_assets and /uploads depending on if they have become posts	2022-02-18 03:39:08 -06:00
evazion	6b56b6a122	uploads: fix error when source doesn't have any images. Fix an error when trying to upload a source that doesn't have any images, for example a Twitter post with no images.	2022-02-15 18:55:12 -06:00
evazion	02edb52569	uploads: enable multi-file uploads when uploading from source. Make the upload page automatically detect when a source URL has multiple images and let the user choose which images to post. For example, when uploading a Twitter or Pixiv post with more than one image, we direct the user to a page showing a thumbnail for each image and letting them choose which ones to post. This is similar to the batch upload page, except we actually download each image in the background, instead of just hotlinking or proxying the thumbnails through our servers. This avoids various problems with proxying and makes new features possible, like showing which images in the batch have already been posted.	2022-02-14 16:13:55 -06:00
evazion	bdf83d1ffd	uploads: refactor /uploads/:id page for multi-file uploads.	2022-02-14 00:41:08 -06:00
evazion	eb032d54c1	uploads: set upload_media_asset.status to active. Fix the status being set to pending instead of active for new upload media assets.	2022-02-14 00:40:40 -06:00
evazion	04d242c60c	uploads: save filename, image URL, page URL for uploads. * Save the filename for files uploaded from disk. This could be used in the future to extract source data if the filename is from a known site. * Save both the image URL and the page URL for files uploaded from source. This is needed for multi-file uploads. The image URL is the URL of the file actually downloaded from the source. This can be different from the URL given by the user, if the user tried to upload a sample URL and we automatically changed it to the original URL. The page URL is the URL of the page containing the image. We don't always know this, for example if someone uploads a Twitter image without the bookmarklet, then we can't find the page URL. * Add a fix script to backfill URLs for existing uploads. For file uploads, the filename will be set to "unknown.jpg". For source uploads, we fetch the source data again to get the image and page URLs. This may fail for uploads that have been deleted from the source since uploading.	2022-02-12 15:22:41 -06:00
evazion	9a23970ab1	uploads: fix media_asset_count.	2022-02-12 15:22:24 -06:00
evazion	44c9c7f1ac	uploads: removed unused /uploads/preprocess route.	2022-02-11 03:15:12 -06:00
evazion	70d38d9e0b	uploads: add columns needed for multi-file uploads. * uploads.media_asset_count - the number of media assets attached to this upload. * upload_media_assets.status - the status of each media asset attached to this upload (processing, active, failed) * upload_media_assets.source_url - the source of each media asset attached to this upload * upload_media_assets.error - the error message if uploading the media asset failed	2022-02-10 12:06:57 -06:00
evazion	1a61e329ba	uploads: add column for error messages. Change it so uploads store errors in an `error` column instead of in the `status` field.	2022-02-07 15:44:39 -06:00
evazion	7c63ac8dbd	uploads: drop unused columns.	2022-02-04 02:19:30 -06:00
evazion	2dfec29da7	uploads: mark old columns as ignored. Mark old columns as ignored in preparation for dropping them. Make the rating and tag_string nullable so they don't have to be set when creating uploads and can be ignored too.	2022-02-03 14:07:09 -06:00
evazion	11b7bcac91	uploads: fix broken tests. * Fix broken upload tests. * Fix uploads to return an error if both a file and a source are given at the same time, or if neither are given. Also fix the error message in this case so that it doesn't include "base" at the start of the string. * Fix uploads to percent-encode any Unicode characters in the source URL. * Add a max filesize validation to media assets.	2022-01-29 05:14:49 -06:00
evazion	21dcf53dcb	uploads: show similar images for disk uploads. Fix the upload page so that it shows similar images (IQDB matches) for files uploaded from your computer. Before this only worked for files uploaded from a source.	2022-01-28 21:07:06 -06:00
evazion	abdab7a0a8	uploads: rework upload process. Rework the upload process so that files are saved to Danbooru first before the user starts tagging the upload. The main user-visible change is that you have to select the file first before you can start tagging it. Saving the file first lets us fix a number of problems: * We can check for dupes before the user tags the upload. * We can perform dupe checks and show preview images for users not using the bookmarklet. * We can show preview images without having to proxy images through Danbooru. * We can show previews of videos and ugoira files. * We can reliably show the filesize and resolution of the image. * We can let the user save files to upload later. * We can get rid of a lot of spaghetti code related to preprocessing uploads. This was the cause of most weird "md5 confirmation doesn't match md5" errors. (Not all of these are implemented yet.) Internally, uploading is now a two-step process: first we create an upload object, then we create a post from the upload. This is how it works: * The user goes to /uploads/new and chooses a file or pastes an URL into the file upload component. * The file upload component calls `POST /uploads` to create an upload. * `POST /uploads` immediately returns a new upload object in the `pending` state. * Danbooru starts processing the upload in a background job (downloading, resizing, and transferring the image to the image servers). * The file upload component polls `/uploads/$id.json`, checking the upload `status` until it returns `completed` or `error`. * When the upload status is `completed`, the user is redirected to /uploads/$id. * On the /uploads/$id page, the user can tag the upload and submit it. * The upload form calls `POST /posts` to create a new post from the upload. * The user is redirected to the new post. This is the data model: * An upload represents a set of files uploaded to Danbooru by a user. Uploaded files don't have to belong to a post. An upload has an uploader, a status (pending, processing, completed, or error), a source (unless uploading from a file), and a list of media assets (image or video files). * There is a has-and-belongs-to-many relationship between uploads and media assets. An upload can have many media assets, and a media asset can belong to multiple uploads. Uploads are joined to media assets through a upload_media_assets table. An upload could potentially have multiple media assets if it's a Pixiv or Twitter gallery. This is not yet implemented (at the moment all uploads have one media asset). A media asset can belong to multiple uploads if multiple people try to upload the same file, or if the same user tries to upload the same file more than once. New features: * On the upload page, you can press Ctrl+V to paste an URL and immediately upload it. * You can save files for upload later. Your saved files are at /uploads. Fixes: * Improved error messages when uploading invalid files, bad URLs, and when forgetting the rating.	2022-01-28 04:13:22 -06:00
evazion	f11c46b4f8	uploads: stop pruning uploads.	2022-01-28 04:13:22 -06:00
evazion	18c08688df	Merge pull request #4947 from nonamethanks/fix-duration-twitter Fix duration check for uploads from twitter	2021-12-29 22:33:59 -06:00
nonamethanks	15bd5f73b3	Fix duration check for uploads from twitter Some twitter videos near the max duration had some stray milliseconds that made the check fail. For example https://twitter.com/kivo_some_18/status/1152167154059321344?s=20 (nsfw) has 140.053333 duration.	2021-12-29 14:25:14 +01:00
evazion	a7dc05ce63	Enable frozen string literals. Make all string literals immutable by default.	2021-12-14 21:33:27 -06:00
evazion	d258790199	uploads: don't delete files of abandoned uploads. Just leave them. They don't take up that much space and they may be used in the future if someone else tries to upload the same file.	2021-10-24 04:35:13 -05:00
evazion	bc506ed1b8	uploads: refactor to simplify ugoira-handling and replacements: * Make it so replacing a post doesn't generate a dummy upload as a side effect. * Make it so you can't replace a post with itself (the post should be regenerated instead). * Refactor uploads and replacements to save the ugoira frame data when the MediaAsset is created, not when the post is created. This way it's possible to view the ugoira before the post is created. * Make `download_file!` in the Pixiv source strategy return a MediaFile with the ugoira frame data already attached to it, instead of returning it in the `data` field then passing it around separately in the `context` field of the upload.	2021-10-18 05:18:46 -05:00
evazion	1d034a3223	media assets: move more file-handling logic into MediaAsset. Move more of the file-handling logic from UploadService and StorageManager into MediaAsset. This is part of refactoring posts and uploads to allow multiple images per post.	2021-10-18 00:10:29 -05:00
evazion	79fdfa86ae	Fix various rubocop warnings.	2021-09-27 00:46:13 -05:00
evazion	ee5cd8330d	uploads: fix exception when pruning expired uploads. Hourly pruning of expired uploads was failing because of nil deference errors in `media_asset.destroy!`. There are various cases where an upload doesn't have a media asset, for example when the source url fails to download or when the upload is of an invalid filetype.	2021-09-22 13:24:27 -05:00
evazion	3d660953d4	Add MediaMetadata model. Add a model for storing image and video metadata for uploaded files. Metadata is extracted using ExifTool. You will need to install ExifTool after this commit. ExifTool 12.22 is the minimum required version because we use the `--binary` option, which was added in this release. The MediaMetadata model is separate from the MediaAsset model because some files contain tons of metadata, and most of it is non-essential. The MediaAsset model represents an uploaded file and contains essential metadata, like the file's size and type, while the MediaMetadata model represents all the other non-essential metadata associated with a file. Metadata is stored as a JSON column in the database. ExifTool returns all the file's metadata, not just the EXIF metadata. EXIF is one of several types of image metadata, hence why we call it MediaMetadata instead of EXIFMetadata.	2021-09-08 05:00:54 -05:00
evazion	07e23204b6	rubocop: fix various Rubocop warnings.	2021-06-17 04:17:53 -05:00
evazion	4deb8aeea2	uploads: disallow uploading new Flash files. Flash is dead. It's no longer supported by browsers, it's not well-supported by emulators, and only two Flash posts were uploaded in the last year anyway. Old Flash files will continue to exist, but new Flash uploads will no longer be allowed.	2021-03-31 20:47:35 -05:00
evazion	520b72948f	Fix #4695 : Raise max video length to match Twitter's (2:20).	2021-02-04 00:01:03 -06:00
evazion	94e125709c	users: add Restricted user level. Add a Restricted user level. Restricted users are level 10, below Members. New users start out as Restricted if they sign up from a proxy or an IP recently used by another user. Restricted users can't update or edit any public content on the site until they verify their email address, at which point they're promoted to Member. Restricted users are only allowed to do personal actions like keep favorites, keep favgroups and saved searches, mark dmails as read or deleted, or mark forum posts as read. The restricted state already existed before, the only change here is that now it's an actual user level instead of a hidden state. Before it was based on two hidden flags on the user, the `requires_verification` flag (set when a user signs up from a proxy, etc), and the `is_verified` flag (set after the user verifies their email). Making it a user level means that now the Restricted status will be shown publicly. Introducing a new level below Member means that we have to change every `is_member?` check to `!is_anonymous` for every place where we used `is_member?` to check that the current user is logged in.	2021-01-07 17:10:29 -06:00
evazion	ee4516f5fe	searchable: refactor searchable_includes. Pass searchable associations directly to search_attributes instead of defining them separately in searchable_includes.	2020-12-16 23:57:07 -06:00
evazion	e771c0fca8	searchable: don't automatically include id, created_at, updated_at. Don't make search methods on models call super in order to search certain default attributes (id, created_at, updated_at). Simplifies some magic.	2020-12-16 23:57:07 -06:00
evazion	8d87b1a0c0	models: fix deprecated `errors[:base] << "message"` calls. Replace the idiom `errors[:base] << "message"` with `errors.add(:base, "message")`. The former is deprecated in Rails 6.1.	2020-12-13 04:10:48 -06:00
BrokenEagle	c4009efccd	Convert models to use new search includes mechanism	2020-07-27 19:29:18 +00:00
evazion	ed79b623cc	Fix #4544 : Show limited view of other user's uploads on the upload index. * Show completed uploads to other users. * Don't show failed or incomplete uploads to other users. * Don't show tags to other users. * Delete completed uploads after 1 hour. * Delete incomplete uploads after 1 day. * Delete failed uploads after 3 days.	2020-07-13 19:25:30 -05:00
evazion	8b5ffb4c43	uploads: allow admins to upload videos more than 2 minutes long. At some point the ability for admins to bypass the video length restriction got lost. ref: https://danbooru.donmai.us/forum_topics/14647	2020-06-09 03:08:06 -05:00
evazion	cf88411dce	uploads: fix /uploads listing search not working. Upload#search was declared as an instance method instead of a class method.	2020-05-24 00:29:19 -05:00

1 2 3 4 5

243 Commits