Commit Graph

100 Commits

Author SHA1 Message Date
532a4b679b Using batch insert for profiles to prevent errors on large inserts. 2020-05-21 03:44:44 +02:00
b1b7cd6d50 Fixed Whale Member posters and photos. 2020-05-20 02:23:45 +02:00
b6691e1991 Added release type distinction to REST API. 2020-05-20 01:38:58 +02:00
057362d011 Added basic release and actor API. 2020-05-20 01:11:32 +02:00
11eb66f834 Switched to tabs. Adding missing actor entries when scraping actors, with batch ID. 2020-05-14 04:26:05 +02:00
93d4f0ff1a Added WIP media module. Returning releases from release search database function. Fixed page loop in update module. 2020-03-29 04:00:46 +02:00
d765543b30 Improved update runner. Improved HTTP module API, added default user agent. Added PornCZ and Czechav logos. 2020-03-21 02:48:24 +01:00
4b310e9dfa Added configurable proxy to HTTP module (also used by qu). Added network and site URL to search documents. 2020-03-19 01:54:25 +01:00
0f09fd53eb Refactoring deep scrape. Added tag posters. 2020-03-16 04:10:52 +01:00
37e188a0df Added site aliases. Migrated various scrapers to qu. Added BAM Visions base. 2020-03-12 00:15:25 +01:00
d97b1ab894 Split up profile scrape runner. Fixed wrong search document date key. Added search update CLI. 2020-03-10 23:46:55 +01:00
5c55750c0c Fixed qu issues. Fixed media issues. Simplified and expanded date component in search query. 2020-03-10 00:17:57 +01:00
638757b6e4 Added rudimentary movie relations. 2020-03-09 05:06:37 +01:00
db01599569 Added profile scraper with releases to Hush. Added qtexts to q to return text nodes individually. Including network in profile site. 2020-03-06 02:49:55 +01:00
fd6e90e74c Added tour layout scraper to Hush, enabling Interracial POVs, POV Pornstars and See Him Fuck. 2020-03-05 20:31:11 +01:00
f10e4af29b Allowing scrapers to force channel allocation attempt. Added Hush Pass subsite handling to Hussie Pass scraper. 2020-03-05 03:44:27 +01:00
956afa6ae7 Added Hussie Pass scraper. 2020-03-05 02:47:52 +01:00
6c3cba1b87 Added actor photos to Brazzers scene scrape. Added no-video poster to Score. Not flattening actor avatar fallbacks. 2020-03-04 17:21:40 +01:00
15af3e91e0 Coalescing shoot ID in search. Added stop words for common TLDs. Sorting tags in search results. 2020-03-02 04:15:47 +01:00
e79a6b33fb Added 'newly added' filter. Handling paywalled videos in Private scraper. Added shoot ID to search. 2020-03-02 03:41:41 +01:00
8dd5925af6 Improved search engine query and added stop words. Added 'secondary' property to tag aliases, for tag aliases to be included in searches and alias lists. 2020-02-29 22:47:48 +01:00
b03775fa07 Using generic slugify for MindGeek channel. 2020-02-29 05:00:50 +01:00
a828fee476 Handling NULL actors and tags in search table query. Added limit parameter to home URL, default to 30. 2020-02-29 03:22:51 +01:00
f1f33080f6 Ignoring undefined video entropy. 2020-02-28 03:56:58 +01:00
3dc8547431 Added fake data and Markov experiments. 2020-02-27 05:44:24 +01:00
3c30e9107a Using dedicated releases search table for ts vector documents. 2020-02-26 22:33:15 +01:00
915eb75719 Refactored Vixen scraper, using API endpoint and added actor profile and releases scraper. Release scraper will return base release when present and 'deep' argument is false. 2020-02-22 23:25:10 +01:00
e5c6ccd252 Scraping upcoming Vixen scenes. Fetching release media groups sequentially to prevent collisions. 2020-02-22 04:37:48 +01:00
349a5a506e Queueing and batching media HTTP requests for improved reliability. 2020-02-22 03:22:30 +01:00
7ac5a8e08c Catching media failures per batch. Refined teaser logging. 2020-02-20 22:27:00 +01:00
377970f874 Added parent-child relations to network, showing parent network in sidebar. Added Burning Angel using Gamma API. 2020-02-20 02:35:23 +01:00
97f5e49187 Refactored media module. Returning 320p and 720p videos from MindGeek as teasers instead of trailers. 2020-02-19 04:41:53 +01:00
40bf476ea6 Fixed Porn Pros scraper. Added various Score site logos. 2020-02-18 16:00:36 +01:00
dd6a1d9bfd Added Vivid network. Added ASMR Fantasy to Adult Time. Storing deep URL in database. Added href to header links. 2020-02-11 04:58:18 +01:00
139f0ce7cb Allowing release scrapers to return actor details. Added True Amateurs. 2020-02-09 23:25:54 +01:00
0f513266a0 Added Black for Wife to JayRock. Switched parameters field to JSON type. 2020-02-09 19:41:39 +01:00
1546e0836c Split Girlsway from Adult Time. Added Fantasy Massage. Using Gamma scraper for Pure Taboo. Added photo path parameter to Gamma scraper. 2020-02-08 02:49:39 +01:00
5ba308f07a Added Adult Time. Adding context to logger. 2020-02-07 19:53:16 +01:00
d4801bb240 Returning window.document instead of element as document from q. Fixed actor collisions when scrapers return same scene multiple times. Scraping all Score actor release pages. Fixed 21Sextury and PureTaboo photo scraping. 2020-02-05 23:57:55 +01:00
f921bb4ae9 Generating and using URL slugs for releases, improver slugify module. Added 'extract' parameter to MindGeek scraper to get scenes not associate with a channel (see Digital Playground). Added various high res logos. 2020-02-04 03:12:09 +01:00
a671190fff Adapted Score scraper for Score Classics. 2020-02-03 02:04:47 +01:00
a45bebddac Adapter Score scraper for Score Videos. 2020-02-03 00:39:43 +01:00
a97c6defca Added teaser support. Added Score network with scraper for Scoreland. Improved q. Added assets. 2020-02-02 05:14:58 +01:00
94bf207397 Added Wicked network. Merged Evil Angel, XEmpire and Wicked into generic Gamma scraper. 2020-02-01 01:15:40 +01:00
345103d759 Fixed ;. 2020-01-27 01:54:42 +01:00
eca65f6b4d Inspecting performance. 2020-01-27 00:41:04 +00:00
f8175f6054 Added generic Gamma photo and actor scraper for XEmpire, 21Sextury, Blowpass and Evil Angel. 2020-01-22 22:25:58 +01:00
75c53d338a Added Porn Pros sites and scraper. 2020-01-14 21:45:30 +01:00
859cb7e1f3 Added support for Family Strokes. 2020-01-13 23:45:09 +01:00
4b36de2f55 Fixed Evil Angel upcoming and actor association issues. Moving from console.log to logger. 2020-01-10 02:43:04 +01:00