DebaucheryLibrarian
|
153f28c494
|
Added last login column to users table.
|
2022-04-04 00:23:37 +02:00 |
DebaucheryLibrarian
|
25e0575c2b
|
Fixed description query in Dogfart scraper.
|
2022-04-03 23:31:36 +02:00 |
DebaucheryLibrarian
|
5cbf122d6f
|
Scraping Dogfart scenes from native sites.
|
2022-04-03 23:00:05 +02:00 |
DebaucheryLibrarian
|
762b3984a3
|
Ignore join links for trailers in Dogfart scraper.
|
2022-04-03 01:29:16 +02:00 |
DebaucheryLibrarian
|
9be80e2be9
|
Returning unextracted scenes from Kelly Madison / Teen Fidelity scraper. Fixed Dogfart profile scraper to use extract scenes.
|
2022-04-03 00:53:27 +02:00 |
DebaucheryLibrarian
|
574c117ab0
|
Refactored Dogfart scraper to use qu and return unextracted scenes.
|
2022-04-03 00:49:39 +02:00 |
DebaucheryLibrarian
|
5e499c3685
|
Added chunking to media duplicate queries to prevent overloading parameters. Added DP Diva to Perv City (coming soon).
|
2022-04-02 00:32:23 +02:00 |
DebaucheryLibrarian
|
5352186319
|
Insex not fetching video when not required.
|
2022-03-31 23:01:54 +02:00 |
DebaucheryLibrarian
|
39813d4461
|
Updated Insex scraper.
|
2022-03-31 22:46:54 +02:00 |
DebaucheryLibrarian
|
a19a77e165
|
Optionalized qualities.
|
2022-03-31 14:34:10 +02:00 |
DebaucheryLibrarian
|
18b219850e
|
Storing scene qualities. Updated Perv City scraper.
|
2022-03-31 14:11:13 +02:00 |
DebaucheryLibrarian
|
33a327a04b
|
Merge branch 'master' into experimental
|
2022-03-30 23:00:29 +02:00 |
DebaucheryLibrarian
|
61123fdb6a
|
Added Accept-Language header to MindGeek requests, seems to help with acquiring sessions.
|
2022-03-30 01:17:54 +02:00 |
DebaucheryLibrarian
|
15c9af8057
|
Added dedicated movie photo table, renamed scene photo function.
|
2022-03-27 23:42:03 +02:00 |
DebaucheryLibrarian
|
e93e8ace5c
|
Added deep scene force parameter to MindGeek scraper.
|
2022-03-27 00:27:26 +01:00 |
DebaucheryLibrarian
|
0dad5b0d68
|
Added series removal utils.
|
2022-03-26 17:56:22 +01:00 |
DebaucheryLibrarian
|
fd8170f223
|
Added series.
|
2022-03-26 17:40:20 +01:00 |
DebaucheryLibrarian
|
5ff076cac3
|
Added DP Star Sex Challenges to Digital Playground.
|
2022-03-09 23:26:48 +01:00 |
DebaucheryLibrarian
|
c6e977f842
|
Added movie support to MindGeek scraper.
|
2022-03-04 23:32:09 +01:00 |
DebaucheryLibrarian
|
5deba6b90f
|
Passing resolved parameters into before-scrape methods.
|
2022-02-25 22:13:41 +01:00 |
DebaucheryLibrarian
|
17dfeac1af
|
Renamed LegalPorno to AnalVids, added missing studios.
|
2022-02-24 23:28:24 +01:00 |
DebaucheryLibrarian
|
582269cfaa
|
Added Radical metadata layout scraper and the Got Filled and Inserted channels.
|
2022-02-24 22:33:00 +01:00 |
DebaucheryLibrarian
|
690d2bb3ed
|
Added MOFOS to default rate limit config, added light logos.
|
2022-02-20 16:52:29 +01:00 |
DebaucheryLibrarian
|
8386230f33
|
Added missing Team Skeet sites.
|
2022-02-14 01:27:56 +01:00 |
DebaucheryLibrarian
|
5918364cf5
|
Preventing unnecessary deep scrapes in Team Skeet scraper.
|
2022-02-13 01:34:39 +01:00 |
DebaucheryLibrarian
|
d9f0db6e3c
|
Fixed reading screen ID from non-existent item in Bang scraper.
|
2022-02-12 17:19:47 +01:00 |
DebaucheryLibrarian
|
5da1acc38d
|
Added media resize.
|
2022-02-12 17:16:20 +01:00 |
DebaucheryLibrarian
|
c7c62e220d
|
Removed console log from Bang scraper.
|
2022-02-12 16:22:41 +01:00 |
DebaucheryLibrarian
|
cde760c1ea
|
Preventing Bang! scraper from redundant deep scraping when base release is available.
|
2022-02-12 16:22:00 +01:00 |
DebaucheryLibrarian
|
c7710afdbf
|
Fixed Bang! scraped failing when scene has no photos.
|
2022-02-12 03:26:40 +01:00 |
DebaucheryLibrarian
|
496c29e569
|
Configured Reality Kings to fetch session from RK scene overview.
|
2022-02-11 22:14:44 +01:00 |
DebaucheryLibrarian
|
c4f0b48932
|
Fixed cover dimensions on movie tile to show lazy image, showing missing cover icon.
|
2022-02-09 00:19:54 +01:00 |
DebaucheryLibrarian
|
f7bbab08bd
|
Added Porn Pros main channel and working URLs for Flexible Positions, Public Violations and Squirt Disgrace. Added bisexual tag photos.
|
2022-02-08 23:59:44 +01:00 |
DebaucheryLibrarian
|
9b7879bff7
|
Removed channel ID from Gamma update query to prevent duplicate scenes. Added sexuality tags with covers to overview.
|
2022-02-08 00:19:34 +01:00 |
DebaucheryLibrarian
|
1dfa034332
|
Handling 'page not found' with 200 OK in Gamma scraper.
|
2022-02-07 22:29:09 +01:00 |
DebaucheryLibrarian
|
3d349c413f
|
Fixed Adult Time scraper.
|
2022-02-07 22:16:43 +01:00 |
DebaucheryLibrarian
|
fffe9080f3
|
Added stand-alone Spizoo sites.
|
2022-02-07 21:58:27 +01:00 |
DebaucheryLibrarian
|
4bcf7ef45b
|
Storing associations if existing media are found.
|
2022-02-07 01:57:09 +01:00 |
DebaucheryLibrarian
|
ceaf5a3217
|
Not storing shallow movie data when argument is disabled.
|
2022-02-05 17:14:34 +01:00 |
DebaucheryLibrarian
|
3d80e9d77b
|
Added movie URL file support. Fixed Gamma movie API scraper movie URL.
|
2022-02-03 00:04:48 +01:00 |
DebaucheryLibrarian
|
f1a7f2f905
|
Added Gamma movie API scraper.
|
2022-02-02 23:51:15 +01:00 |
DebaucheryLibrarian
|
10b8abf706
|
Updated Zero Tolerance channel to Zero Tolerance Films.
|
2022-02-02 22:25:04 +01:00 |
DebaucheryLibrarian
|
b9f3eb85f9
|
Ignoring Gamma master categories in update scrape as these seem to contain irrelevant tags.
|
2022-02-01 21:18:54 +01:00 |
DebaucheryLibrarian
|
815d56d334
|
Preventing media associations when use-reference is unavailable.
|
2022-01-30 22:15:07 +01:00 |
DebaucheryLibrarian
|
22864105ac
|
Writing media associations to disk for debugging.
|
2022-01-30 17:05:18 +01:00 |
DebaucheryLibrarian
|
52218c30f8
|
Not fetching mobile Gamma page when photos are not expected.
|
2022-01-29 16:38:36 +01:00 |
DebaucheryLibrarian
|
5c5a11fdca
|
Added Bang Bros members page scraper (not for full scenes).
|
2022-01-29 01:21:41 +01:00 |
DebaucheryLibrarian
|
90bc2f3724
|
Changed release media association existence check, added log for details.
|
2022-01-28 04:05:29 +01:00 |
DebaucheryLibrarian
|
67d0a9e0e0
|
Fixed scene entity tag association.
|
2022-01-26 01:47:34 +01:00 |
DebaucheryLibrarian
|
452f725332
|
Switched poster URL preference in Gamma scraper to prevent cropped image on XEmpire.
|
2022-01-25 15:34:15 +01:00 |
DebaucheryLibrarian
|
7ceec1c739
|
Added indexes to scene tag and movie scene tables improve performance. Removed scene count from channel overview, too expensive.
|
2022-01-24 00:15:11 +01:00 |
DebaucheryLibrarian
|
cd8f690ac6
|
Preferring network over channel in deep URL channel matching.
|
2022-01-23 02:46:42 +01:00 |
DebaucheryLibrarian
|
c557f0f1f0
|
Improved Gamma image sources to prevent cropped images.
|
2022-01-22 23:20:08 +01:00 |
DebaucheryLibrarian
|
20da2d1cf6
|
Reusing batch ID for movies to preserve new-flag.
|
2022-01-20 00:54:10 +01:00 |
DebaucheryLibrarian
|
372db86927
|
Disabled MindGeek session bundling to analyze Too Many Requests errors.
|
2022-01-16 22:24:47 +01:00 |
DebaucheryLibrarian
|
57d46f5842
|
Fixed latest query in Gamma scraper.
|
2022-01-16 17:17:45 +01:00 |
DebaucheryLibrarian
|
bbd788699e
|
Fixed GraphQL query for scenes REST API. Added entry ID to scene search document.
|
2022-01-12 17:50:23 +01:00 |
DebaucheryLibrarian
|
506971b44b
|
Chunked duplicate check to prevent postgres stack depth errors.
|
2022-01-10 02:17:17 +01:00 |
DebaucheryLibrarian
|
baebdbc0bb
|
Adding comment listing sister sites for non-exclusive Gamma API scenes.
|
2022-01-07 01:07:41 +01:00 |
DebaucheryLibrarian
|
6760c11881
|
Disabled why-is-node-running to assess OOM.
|
2021-12-31 01:45:03 +01:00 |
DebaucheryLibrarian
|
3b958a02ef
|
Added memory profiler configuration options.
|
2021-12-27 22:36:01 +01:00 |
DebaucheryLibrarian
|
299f257a01
|
Logging memory usage on media fetch.
|
2021-12-27 17:29:23 +01:00 |
DebaucheryLibrarian
|
3bfe1e8fdb
|
Storing scenes before fetching movies.
|
2021-12-20 03:29:14 +01:00 |
DebaucheryLibrarian
|
7d992f4129
|
Added triggers to memory dump.
|
2021-12-20 02:28:59 +01:00 |
DebaucheryLibrarian
|
280032937f
|
Added centralized movie page fetcher. Added memory heap dump.
|
2021-12-20 02:22:10 +01:00 |
DebaucheryLibrarian
|
dc187a9a3a
|
Added execute method to qu, removed runScripts from Gamma's fetchMovie to observe effect on memory usage.
|
2021-12-19 23:03:44 +01:00 |
DebaucheryLibrarian
|
1c0c30be11
|
Improved and re-enabled why-is-node-running logger.
|
2021-12-18 23:44:51 +01:00 |
DebaucheryLibrarian
|
8e4be06675
|
Closing JSDOM window after deep scrapes in an attempt to save memory.
|
2021-12-18 23:22:35 +01:00 |
DebaucheryLibrarian
|
65e2b72c6a
|
Fixed Gamma movie scraper failing when data object is not available.
|
2021-12-13 01:20:02 +01:00 |
DebaucheryLibrarian
|
5b5688438c
|
Fixed Gamma movie scene query. Added tag photos.
|
2021-12-11 22:46:55 +01:00 |
DebaucheryLibrarian
|
ec4e7bc12a
|
Updated Jules Jordan scraper for Sperm Swallowers and The Ass Factory.
|
2021-12-05 23:59:17 +01:00 |
DebaucheryLibrarian
|
26d15c0a6f
|
Updated Jules Jordan scraper for Manuel Ferrara and Girl Girl.
|
2021-12-05 23:43:23 +01:00 |
DebaucheryLibrarian
|
e88554666c
|
Reduced media concurrency to assess effect on memory. Moved qu context removal to deep scrape runner. Updated movie graphql queries.
|
2021-12-05 02:54:55 +01:00 |
DebaucheryLibrarian
|
abd063a578
|
Removed window map from http module to prevent memory pile-up with update scraping.
|
2021-12-04 00:32:28 +01:00 |
DebaucheryLibrarian
|
1918dd4ad5
|
Removing query methods from XEmpire release.
|
2021-12-02 00:45:22 +01:00 |
DebaucheryLibrarian
|
61daf5f037
|
Added debug log for jsdom window closing.
|
2021-12-01 23:44:25 +01:00 |
DebaucheryLibrarian
|
60c151ff6a
|
Reduced deep scrape concurrency to 1.
|
2021-12-01 23:39:09 +01:00 |
DebaucheryLibrarian
|
e41f9fa937
|
Added waitImmediate to deep scrape, reduced concurrency.
|
2021-12-01 23:30:10 +01:00 |
DebaucheryLibrarian
|
e29cbc9fea
|
Closing JSDOM window after deep scrape in an attempt to save memory. Reduced deep scrape concurrency to 5.
|
2021-12-01 17:26:13 +01:00 |
DebaucheryLibrarian
|
b93a5715cb
|
Updated Jules Jordan scraper for new update page layout.
|
2021-12-01 00:00:24 +01:00 |
DebaucheryLibrarian
|
87094a9498
|
Replaced cheerio with qu in Gamma scraper.
|
2021-11-29 02:49:07 +01:00 |
DebaucheryLibrarian
|
f04eb72891
|
Added upcoming scraper to PurgatoryX.
|
2021-11-28 01:20:39 +01:00 |
DebaucheryLibrarian
|
9d7183ac69
|
Added PurgatoryX scraper.
|
2021-11-27 23:55:16 +01:00 |
DebaucheryLibrarian
|
20d0d860d3
|
Fixed MindGeek scraper trying to acquire session from mindgeek.com
|
2021-11-22 02:51:52 +01:00 |
DebaucheryLibrarian
|
6b4aa64d74
|
Improved MindGeek scraper session check to prevent crash when network session isn't available yet.
|
2021-11-22 02:44:03 +01:00 |
DebaucheryLibrarian
|
e81864ebde
|
Removed verbose http queued output.
|
2021-11-21 00:47:32 +01:00 |
DebaucheryLibrarian
|
0305a22a24
|
Removed stray console log from media module.
|
2021-11-21 00:43:35 +01:00 |
DebaucheryLibrarian
|
2be3ea9bbb
|
Explicitly adding headers to http module result.
|
2021-11-21 00:41:12 +01:00 |
DebaucheryLibrarian
|
98c103c625
|
Fixed uuid import. Changed fs rmdir to rm.
|
2021-11-21 00:31:09 +01:00 |
DebaucheryLibrarian
|
f0b7678444
|
Fixed upsert failing on empty insert array due breaking Knex API change.
|
2021-11-21 00:19:10 +01:00 |
DebaucheryLibrarian
|
26539b74a5
|
Updated dependencies. Added periodic memory logger.
|
2021-11-20 23:59:48 +01:00 |
DebaucheryLibrarian
|
ccb99e278c
|
Added periodic memory logger.
|
2021-11-20 23:59:15 +01:00 |
DebaucheryLibrarian
|
29b8c5e38e
|
Including unextracted scenes in date determination.
|
2021-10-28 02:10:30 +02:00 |
DebaucheryLibrarian
|
0864154a0e
|
Added unextracted property to keep paginating when extracting scenes.
|
2021-10-28 01:59:53 +02:00 |
DebaucheryLibrarian
|
a22c4d5679
|
Added beforeNetwork hook, used by MindGeek. Added Filthy Kings to Gamma.
|
2021-10-27 17:19:23 +02:00 |
DebaucheryLibrarian
|
100a35b4e8
|
Added before scene fetch method to prevent e.g. unnecessary session requests, moved scraper assignment to entity lookup. Removed channel URL hostname matching..
|
2021-10-26 23:42:32 +02:00 |
DebaucheryLibrarian
|
6c5d4389fe
|
Not parsing HTML with jsdom when using http module directly to save memory. Added loading ellipsis to release grid pages.
|
2021-10-25 02:06:24 +02:00 |
DebaucheryLibrarian
|
0e4fd12d70
|
Added more loggers to http module.
|
2021-10-20 01:46:56 +02:00 |
DebaucheryLibrarian
|
972b15e72d
|
Added memory profiling.
|
2021-10-17 23:56:49 +02:00 |
DebaucheryLibrarian
|
49f891ba44
|
Ignoring 1-second scene duration from MindGeek API.
|
2021-10-17 19:59:05 +02:00 |