Commit Graph

102 Commits

Author SHA1 Message Date
DebaucheryLibrarian e5b1e9ea25 Removed error response body from item fetch to minimize spamming image contents. 2024-09-11 05:16:59 +02:00
DebaucheryLibrarian e642203d6c Fixed invalid URL breaking scrape, prefixing reddit URLs with origin. 2024-09-11 05:16:59 +02:00
DebaucheryLibrarian 96a041001b Stripping query for URL matching. 2024-09-11 05:16:59 +02:00
DebaucheryLibrarian d1ac6d0068 Fixed RedGIFs gallery support. 2024-09-11 05:16:59 +02:00
DebaucheryLibrarian afaa428fec Fixed YAML index file parser failing on duplicate keys. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian b7dc3f5fd1 Storing pHash. Fixed RedGIFs user agent. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 7d633c31b4 Using Imgur API rate limit feedback to prevent exceeding it. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian bed4fe288f Added predata step. Using RedGifs temporary API key. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 529f84d9c2 Fixed RedGIFs module content type. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 350a2dc1dc Fixed username in save message. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian de50d609f3 Added support for RedGifs and Reddit albums. Improved command line logger. Added rate limiters for reddit and host URLs. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian bb06fe9763 Cleanup 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 6e3bfdd175 Added support for various tube and social media sites via youtube-dl. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 45123bc630 Added hash comparison to duplicate avoidance. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 7de0c51c16 Fetching and storing comments for self posts. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 9140b58c70 Removed post.user variable. 2024-09-11 05:16:58 +02:00
DebaucheryLibrarian 5c7dca19e5 Added dynamic dividers to patterns. Fixed PornHub module. Updated README. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 443f0fc028 Completed Erome module. Added content URL list fetching. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian b9a7e4b83a Added support for fetching content directly from host. Improved pattern interpolation. Refactored content modules. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 8b85ec219d Removed debug log. Updated README to mention imgur API key requirement. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 416a3254fb Removed HTML scraper from imgur module, no longer support due to login restriction. Preserving 'preview' property in index file. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian c21ca0df0c Fixed gfycat fetching. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 60973a0e70 Added post score to index and variables. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 8d75cc170d Added basic PornHub support. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 9b0ef4da91 Added after/before-index alias for after/before-indexed 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian a817a07250 Always returning null for profile details when no profile was found, not undefined. Prevents YAML index write from failing. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 246ed1576f Defaulting to jpeg if reddit image extension is not present. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian db9f6ed60c Added optional URL query to all external hosts in link dissection. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 82b7f38336 Removed duplicate post link from imgur logs 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian e48c00b529 Added option to index (specified) ignored posts. Saving index through save module, and it now notifies the user that an index has been written. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian c30006f361 Added byte truncate script to repo. Fixed yargs not enforcing choices. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 12e17c7c3c Calculating filename component limit in bytes rather than characters, preventing 'filename too long' errors when e.g. emojis are used 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian fab6d0aa1c Reintroduced imgur API as a fallback method. 2024-09-11 05:16:57 +02:00
DebaucheryLibrarian 0377053d5a Allowing for -mobile gfycat links, simplified URL patterns 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 91550406fe Accounting for actualid_d.jpg imgur pattern. Added debug argument to print full stack trace on error. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian e950a9ac54 No longer relying on imgur API for albums. Removed config for imgur API. Fixed ?#0 URLs not being detected for imgur. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian dbc4c45601 No longer using imgur API for individual images. Only saving EXIF data to JPEGs. Always using global exiftool instance. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian b05ae06b00 Improved missing gfycat handling. Added 'preview' marker to index. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian a68493d0fa Don't attempt to save index when there are no posts or profile details (complained about directory not existing) 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 18acb3a878 Added fetch concurrency. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 91577a00b1 Added support for file with host IDs to ignore. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 352b2a66d7 Allow usernames and post IDs to be read from file. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 00de0d43f5 Switched from setTimeout to crontab for watch-mode. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian c4afd398dc Added timestamp to watch mode status log. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 438ba05260 Fixed profile image checking for indexed description instead of image. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian e087e47069 No longer redownload profile image and description when indexed, unless --redownload-profile is specified. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian fcb85f57c8 Added watch-mode. Waiting for profile detail write to finalize before new watch cycle and capture details in index file. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian dcf7fdd274 Fixed extension getting cut off at filename limit. Cleaned up save module. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian 99c7d143f7 Fixed index file for single post fetching. 2024-09-11 05:16:56 +02:00
DebaucheryLibrarian cdd5ed36bf Added redownload option to ignore index file. 2024-09-11 05:16:56 +02:00