Fix random VCR failures in Pixiv tests. Sometimes tests randomly fail because the PHPSESSID they use in their HTTP requests to Pixiv is different than the one that was originally recorded by VCR. This causes VCR to complain that the requests don't match. This is caused by the PHPSESSID being globally cached in Memcache. Depending on the order the tests run in (which is random), one set of tests can use a PHPSESSID that was recorded for a /different/ set of tests. Improve Pixiv URL matching. * Allow URLs that are missing the http:// part. These are sometimes seen in artist entries. * Ignore URLs from random Pixiv domains such as dic.pixiv.net, blog.pixiv.net, etc. These are also sometimes in artist entries. Improve normalize_for_artist_finder! URL matching. * Normalize www.pixiv.net/stacc/username URLs. * Correctly normalize URLs that are missing the illust ID part on the end (i.e. http://i2.pixiv.net/img04/img/syounen_no_uta/). These are common in artist entries. Match URLs strictly when normalizing for artist entries. Only normalize Pixiv URLs that strictly match a known format. Pass any unrecognized URLs through without attempting to normalize them, just to be safe. Normalize URLs when saving artist entries.
69 lines
1.8 KiB
Ruby
69 lines
1.8 KiB
Ruby
# encoding: UTF-8
|
|
|
|
module Sources
|
|
class Site
|
|
attr_reader :url, :strategy
|
|
delegate :get, :referer_url, :site_name, :artist_name, :profile_url, :image_url, :tags, :artist_record, :unique_id, :page_count, :file_url, :ugoira_frame_data, :to => :strategy
|
|
|
|
def self.strategies
|
|
[Strategies::Pixiv, Strategies::NicoSeiga, Strategies::DeviantArt, Strategies::Nijie]
|
|
end
|
|
|
|
def initialize(url)
|
|
@url = url
|
|
|
|
Site.strategies.each do |strategy|
|
|
if strategy.url_match?(url)
|
|
@strategy = strategy.new(url)
|
|
break
|
|
end
|
|
end
|
|
end
|
|
|
|
def normalized_for_artist_finder?
|
|
available? && strategy.normalized_for_artist_finder?
|
|
end
|
|
|
|
def normalize_for_artist_finder!
|
|
if available? && strategy.normalizable_for_artist_finder?
|
|
strategy.normalize_for_artist_finder!
|
|
else
|
|
url
|
|
end
|
|
rescue
|
|
url
|
|
end
|
|
|
|
def translated_tags
|
|
untranslated_tags = tags
|
|
untranslated_tags = untranslated_tags.map(&:first)
|
|
untranslated_tags = untranslated_tags.map do |tag|
|
|
if tag =~ /\A(\S+?)_?\d+users入り\Z/
|
|
$1
|
|
else
|
|
tag
|
|
end
|
|
end
|
|
WikiPage.other_names_match(untranslated_tags).map{|wiki_page| [wiki_page.title, wiki_page.category_name]}
|
|
end
|
|
|
|
def to_json
|
|
return {
|
|
:artist_name => artist_name,
|
|
:profile_url => profile_url,
|
|
:image_url => image_url,
|
|
:tags => tags,
|
|
:translated_tags => translated_tags,
|
|
:danbooru_name => artist_record.try(:first).try(:name),
|
|
:danbooru_id => artist_record.try(:first).try(:id),
|
|
:unique_id => unique_id,
|
|
:page_count => page_count
|
|
}.to_json
|
|
end
|
|
|
|
def available?
|
|
strategy.present?
|
|
end
|
|
end
|
|
end
|