Commit Graph

100 Commits

Author SHA1 Message Date
ThePendulum 532a4b679b Using batch insert for profiles to prevent errors on large inserts. 2020-05-21 03:44:44 +02:00
ThePendulum b1b7cd6d50 Fixed Whale Member posters and photos. 2020-05-20 02:23:45 +02:00
ThePendulum b6691e1991 Added release type distinction to REST API. 2020-05-20 01:38:58 +02:00
ThePendulum 057362d011 Added basic release and actor API. 2020-05-20 01:11:32 +02:00
ThePendulum 11eb66f834 Switched to tabs. Adding missing actor entries when scraping actors, with batch ID. 2020-05-14 04:26:05 +02:00
ThePendulum 93d4f0ff1a Added WIP media module. Returning releases from release search database function. Fixed page loop in update module. 2020-03-29 04:00:46 +02:00
ThePendulum d765543b30 Improved update runner. Improved HTTP module API, added default user agent. Added PornCZ and Czechav logos. 2020-03-21 02:48:24 +01:00
ThePendulum 4b310e9dfa Added configurable proxy to HTTP module (also used by qu). Added network and site URL to search documents. 2020-03-19 01:54:25 +01:00
ThePendulum 0f09fd53eb Refactoring deep scrape. Added tag posters. 2020-03-16 04:10:52 +01:00
ThePendulum 37e188a0df Added site aliases. Migrated various scrapers to qu. Added BAM Visions base. 2020-03-12 00:15:25 +01:00
ThePendulum d97b1ab894 Split up profile scrape runner. Fixed wrong search document date key. Added search update CLI. 2020-03-10 23:46:55 +01:00
ThePendulum 5c55750c0c Fixed qu issues. Fixed media issues. Simplified and expanded date component in search query. 2020-03-10 00:17:57 +01:00
ThePendulum 638757b6e4 Added rudimentary movie relations. 2020-03-09 05:06:37 +01:00
ThePendulum db01599569 Added profile scraper with releases to Hush. Added qtexts to q to return text nodes individually. Including network in profile site. 2020-03-06 02:49:55 +01:00
ThePendulum fd6e90e74c Added tour layout scraper to Hush, enabling Interracial POVs, POV Pornstars and See Him Fuck. 2020-03-05 20:31:11 +01:00
ThePendulum f10e4af29b Allowing scrapers to force channel allocation attempt. Added Hush Pass subsite handling to Hussie Pass scraper. 2020-03-05 03:44:27 +01:00
ThePendulum 956afa6ae7 Added Hussie Pass scraper. 2020-03-05 02:47:52 +01:00
ThePendulum 6c3cba1b87 Added actor photos to Brazzers scene scrape. Added no-video poster to Score. Not flattening actor avatar fallbacks. 2020-03-04 17:21:40 +01:00
ThePendulum 15af3e91e0 Coalescing shoot ID in search. Added stop words for common TLDs. Sorting tags in search results. 2020-03-02 04:15:47 +01:00
ThePendulum e79a6b33fb Added 'newly added' filter. Handling paywalled videos in Private scraper. Added shoot ID to search. 2020-03-02 03:41:41 +01:00
ThePendulum 8dd5925af6 Improved search engine query and added stop words. Added 'secondary' property to tag aliases, for tag aliases to be included in searches and alias lists. 2020-02-29 22:47:48 +01:00
ThePendulum b03775fa07 Using generic slugify for MindGeek channel. 2020-02-29 05:00:50 +01:00
ThePendulum a828fee476 Handling NULL actors and tags in search table query. Added limit parameter to home URL, default to 30. 2020-02-29 03:22:51 +01:00
ThePendulum f1f33080f6 Ignoring undefined video entropy. 2020-02-28 03:56:58 +01:00
ThePendulum 3dc8547431 Added fake data and Markov experiments. 2020-02-27 05:44:24 +01:00
ThePendulum 3c30e9107a Using dedicated releases search table for ts vector documents. 2020-02-26 22:33:15 +01:00
ThePendulum 915eb75719 Refactored Vixen scraper, using API endpoint and added actor profile and releases scraper. Release scraper will return base release when present and 'deep' argument is false. 2020-02-22 23:25:10 +01:00
ThePendulum e5c6ccd252 Scraping upcoming Vixen scenes. Fetching release media groups sequentially to prevent collisions. 2020-02-22 04:37:48 +01:00
ThePendulum 349a5a506e Queueing and batching media HTTP requests for improved reliability. 2020-02-22 03:22:30 +01:00
ThePendulum 7ac5a8e08c Catching media failures per batch. Refined teaser logging. 2020-02-20 22:27:00 +01:00
ThePendulum 377970f874 Added parent-child relations to network, showing parent network in sidebar. Added Burning Angel using Gamma API. 2020-02-20 02:35:23 +01:00
ThePendulum 97f5e49187 Refactored media module. Returning 320p and 720p videos from MindGeek as teasers instead of trailers. 2020-02-19 04:41:53 +01:00
ThePendulum 40bf476ea6 Fixed Porn Pros scraper. Added various Score site logos. 2020-02-18 16:00:36 +01:00
ThePendulum dd6a1d9bfd Added Vivid network. Added ASMR Fantasy to Adult Time. Storing deep URL in database. Added href to header links. 2020-02-11 04:58:18 +01:00
ThePendulum 139f0ce7cb Allowing release scrapers to return actor details. Added True Amateurs. 2020-02-09 23:25:54 +01:00
ThePendulum 0f513266a0 Added Black for Wife to JayRock. Switched parameters field to JSON type. 2020-02-09 19:41:39 +01:00
ThePendulum 1546e0836c Split Girlsway from Adult Time. Added Fantasy Massage. Using Gamma scraper for Pure Taboo. Added photo path parameter to Gamma scraper. 2020-02-08 02:49:39 +01:00
ThePendulum 5ba308f07a Added Adult Time. Adding context to logger. 2020-02-07 19:53:16 +01:00
ThePendulum d4801bb240 Returning window.document instead of element as document from q. Fixed actor collisions when scrapers return same scene multiple times. Scraping all Score actor release pages. Fixed 21Sextury and PureTaboo photo scraping. 2020-02-05 23:57:55 +01:00
ThePendulum f921bb4ae9 Generating and using URL slugs for releases, improver slugify module. Added 'extract' parameter to MindGeek scraper to get scenes not associate with a channel (see Digital Playground). Added various high res logos. 2020-02-04 03:12:09 +01:00
ThePendulum a671190fff Adapted Score scraper for Score Classics. 2020-02-03 02:04:47 +01:00
ThePendulum a45bebddac Adapter Score scraper for Score Videos. 2020-02-03 00:39:43 +01:00
ThePendulum a97c6defca Added teaser support. Added Score network with scraper for Scoreland. Improved q. Added assets. 2020-02-02 05:14:58 +01:00
ThePendulum 94bf207397 Added Wicked network. Merged Evil Angel, XEmpire and Wicked into generic Gamma scraper. 2020-02-01 01:15:40 +01:00
ThePendulum 345103d759 Fixed ;. 2020-01-27 01:54:42 +01:00
ThePendulum eca65f6b4d Inspecting performance. 2020-01-27 00:41:04 +00:00
ThePendulum f8175f6054 Added generic Gamma photo and actor scraper for XEmpire, 21Sextury, Blowpass and Evil Angel. 2020-01-22 22:25:58 +01:00
ThePendulum 75c53d338a Added Porn Pros sites and scraper. 2020-01-14 21:45:30 +01:00
ThePendulum 859cb7e1f3 Added support for Family Strokes. 2020-01-13 23:45:09 +01:00
ThePendulum 4b36de2f55 Fixed Evil Angel upcoming and actor association issues. Moving from console.log to logger. 2020-01-10 02:43:04 +01:00