Commit Graph

33 Commits

Author SHA1 Message Date
ThePendulum 8a978cb803 Storing pHash. Fixed RedGIFs user agent. 2023-03-12 22:39:00 +01:00
ThePendulum c69ac9876a Added predata step. Using RedGifs temporary API key. 2023-02-27 02:37:08 +01:00
ThePendulum 66e5bfa50a Added support for RedGifs and Reddit albums. Improved command line logger. Added rate limiters for reddit and host URLs. 2021-12-17 02:04:25 +01:00
ThePendulum e175ce38c8 Fetching and storing comments for self posts. 2019-11-05 01:58:00 +01:00
Niels Simenon db6317e560 Removed HTML scraper from imgur module, no longer supported due to login restriction. Preserving 'preview' property in index file. 2019-10-31 05:22:07 +01:00
ThePendulum 73504a6d32 Added post score to index and variables. 2018-07-08 18:48:14 +02:00
ThePendulum d9ce4dd056 Added option to index (specified) ignored posts. Saving index through save module, and it now notifies the user that an index has been written. 2018-07-07 02:17:41 +02:00
ThePendulum c1f26440fd Added fetch concurrency. 2018-07-02 03:45:20 +02:00
ThePendulum b4356e8e11 Added support for file with host IDs to ignore. 2018-07-02 02:33:34 +02:00
ThePendulum 1b1323dc3d No longer redownload profile image and description when indexed, unless --redownload-profile is specified. 2018-07-01 03:26:26 +02:00
ThePendulum c639503667 Fixed index file for single post fetching. 2018-07-01 01:07:32 +02:00
ThePendulum 240146e554 Added redownload option to ignore index file. 2018-07-01 00:22:17 +02:00
ThePendulum 84f7757712 Added date and indexed oldest/latest limit arguments. 2018-07-01 00:07:12 +02:00
ThePendulum a859ac573c Accounting for indexed posts. 2018-06-30 21:47:43 +02:00
ThePendulum 74e36a6826 Using YAML rather than TSV for index files. Improves both readability and reindexability. 2018-06-30 03:33:30 +02:00
ThePendulum 1c4ec06f68 Extracting indexed posts from fetched posts to allow for index rewrite flexibility. 2018-06-20 02:06:59 +02:00
ThePendulum 33ef3d2c19 Reading index file and ignoring already indexed content. 2018-06-17 03:39:12 +02:00
ThePendulum 7cf1a99915 Building user posts object after fetching user to ensure user fetched posts and directly fetched posts are added to the same user key. Refactor to make better use of functions. Moved profile detail saving call to content fetch. No longer attempting and failing to save profile details for deleted users (directory would not exist). 2018-06-17 01:11:10 +02:00
ThePendulum aef4dd02c7 Fetching and curating posts per user without merge. 2018-06-10 02:48:49 +02:00
ThePendulum e056acfbd3 Refactored post retrieval so limit is applied per-user and ignores directly requested posts, and to start utilizing async/await. 2018-05-28 01:42:46 +02:00
ThePendulum edf77e259b Refactored post fetching. Cleaner code that will now fetch archived posts even if the user no longer exists on reddit. Expanded gfycat URL detection. 2018-05-22 22:55:50 +02:00
ThePendulum 068ffbdbd8 Added include and exclude source arguments. Improved stream fetch failure handling and improved logging details. 2018-05-05 17:08:40 +02:00
ThePendulum 3794fc3776 Added support for m.imgur.com links. Changed curate ignore log message details. 2018-05-05 03:18:05 +02:00
ThePendulum dd3be88122 Added preview property. Removed debugging intercepts. Logging post permalink on failed info fetch. 2018-05-05 01:22:23 +02:00
ThePendulum ca3bdd717d Added archive support, and the IP archive. 2018-05-05 00:51:58 +02:00
ThePendulum 05b5bee7ea Added support for fetching individual posts. Improved use of yargs, --help now available. Refactored main app flow. 2018-04-29 02:02:34 +02:00
ThePendulum 33bc13ebfb Added ignore argument. 2018-04-28 00:03:31 +02:00
ThePendulum e752667be7 Added support for muxing streams, specifically for reddit-hosted videos (now also supported). 2018-04-26 03:26:39 +02:00
ThePendulum f2052004df Using metadata configuration rather than item description for EXIF save. Fixed dissect duplicate detection failing for empty host. 2018-04-26 00:34:08 +02:00
ThePendulum 0b9fba7af2 Ignoring empty interpolated metadata. Added URL query wildcard to imgur and reddit images. Using host ID rather than URL to improve duplicate detection. 2018-04-26 00:22:56 +02:00
ThePendulum 8bec57e906 Changed ignoreDuplicates to avoidDuplicates. 2018-04-23 14:58:23 +02:00
ThePendulum 8aca700269 Dealing with users without profile. 2018-04-23 14:57:07 +02:00
ThePendulum 4551a1f5d2 Moved source into dedicated directory. 2018-04-23 02:36:07 +02:00