ThePendulum 6f508ad2da 0.7.0 2018-04-18 04:04:42 +02:00

reddit-post-dump

Installation

reddit-post-dump requires a reasonably recent version of Node.js. Before use, dependencies must be installed as follows:

npm install

Usage

node app.js --user={username}

Optional parameters

  • --users={user1,user2}: You may fetch posts from multiple users by either supplying a comma-separated list of usernames (no spaces) with --users, or using multiple individual --user arguments
  • --limit={number}: Maximum number of posts per user to fetch content from
  • --sort={method}: How posts should be sorted when fetched. This affects the $postIndex variable and, in combination with --limit, determines which posts will be included

Examples

  • node app.js --user=ThePendulum
  • node app.js --user=ThePendulum --limit=10 --sort=top
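
As a rough illustration of how repeated --user arguments and a comma-separated --users argument could end up in a single list, consider the sketch below. collectUsers is a hypothetical helper written for this README and is not taken from app.js.

```javascript
// Hypothetical helper, not the parsing code used by app.js: gathers
// usernames from repeated --user arguments and a comma-separated --users.
function collectUsers(argv) {
  const users = [];

  for (const arg of argv) {
    if (arg.startsWith('--user=')) {
      users.push(arg.slice('--user='.length));
    } else if (arg.startsWith('--users=')) {
      users.push(...arg.slice('--users='.length).split(','));
    }
  }

  // Drop empty entries and duplicates while keeping the original order.
  return [...new Set(users.filter(Boolean))];
}
```

It could be called as collectUsers(process.argv.slice(2)); arguments such as --limit and --sort are simply ignored by this sketch.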

Configuration

The default configuration aims to be sensible, and the application may be used without any further tweaking. However, a multitude of options make this utility particularly powerful.

To change the configuration, please refer to config/default.js. I recommend not editing this file directly, but instead making a copy named config/local.js: the default configuration might be overwritten by updates, and it remains a useful reference when recovering from configuration errors. The structure of config/local.js must match the structure of the default configuration, but it only needs to contain the properties you wish to override. If preferred, you may instead use JSON in config/local.json.
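
The override behaviour described above resembles a deep merge of the local file over the defaults. The sketch below shows the idea only; mergeConfig, the section key and the value 200 are made up for illustration, and the real layout is whatever config/default.js defines.

```javascript
// Hypothetical illustration of overriding defaults; the real key layout is
// defined in config/default.js and may differ from this example.
function mergeConfig(defaults, overrides) {
  const merged = { ...defaults };

  for (const [key, value] of Object.entries(overrides)) {
    const base = defaults[key];
    const valueIsObject = value !== null && typeof value === 'object' && !Array.isArray(value);
    const baseIsObject = base !== null && typeof base === 'object' && !Array.isArray(base);

    // Nested objects are merged recursively; everything else is replaced.
    merged[key] = valueIsObject && baseIsObject ? mergeConfig(base, value) : value;
  }

  return merged;
}

// Assumed example data: only titleLength is overridden locally.
const exampleDefaults = { section: { dateFormat: 'YYYYMMDD', titleLength: 200 } };
const exampleLocal = { section: { titleLength: 100 } };
const exampleMerged = mergeConfig(exampleDefaults, exampleLocal);
```

Here exampleMerged keeps dateFormat from the defaults and takes titleLength from the local override.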

Patterns

Path patterns dictate where and how a file will be saved. Various variables and options are available, and you may use subdirectories separated by /.

Variables

Post

  • $postId: The ID of the reddit post
  • $postTitle: The title of the reddit post
  • $postUser: The user that submitted the post, almost always equivalent to the --user command line argument
  • $postDate: The submission date of the reddit post, formatted by the dateFormat configuration described below
  • $postIndex: The index of the post according to the sort method

Album

  • $albumId: The ID of the media host album
  • $albumTitle: The title of the media host album
  • $albumDescription: The description of the media host album
  • $albumDate: The submission date of the media host album, formatted by the dateFormat configuration described below

Item (individual image, video or text)

  • $itemId: The ID of the individual image or video
  • $itemTitle: The title of the individual image or video
  • $itemDescription: The description of the individual image or video
  • $itemDate: The submission date of the individual image or video, formatted by the dateFormat configuration described below
  • $itemIndex: The index of the individual image or video in an album, offset by the indexOffset configuration described below
  • $ext: The extension of the medium. Must typically be included, but may be omitted for self (text) posts on Unix systems
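
To give a feel for how such variables might be filled in, here is a minimal sketch of pattern interpolation. interpolatePattern is a hypothetical function and not the contents of interpolate.js; whether $ext carries its leading dot is also an assumption made for the example.

```javascript
// Hypothetical sketch, not the contents of interpolate.js: replaces each
// $variable in a pattern with the matching value and leaves unknown
// variables untouched.
function interpolatePattern(pattern, values) {
  return pattern.replace(/\$([A-Za-z]+)/g, (match, name) => (
    Object.prototype.hasOwnProperty.call(values, name) ? String(values[name]) : match
  ));
}

// Assumed example values; here $ext is supplied with its leading dot.
const examplePath = interpolatePattern('$postUser/$postDate - $postTitle/$itemIndex$ext', {
  postUser: 'ThePendulum',
  postDate: '20180418',
  postTitle: 'Example',
  itemIndex: 1,
  ext: '.jpg',
});
```

With these values, examplePath is ThePendulum/20180418 - Example/1.jpg, so each / in the pattern becomes a directory level.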

dateFormat

Affects the representation of $postDate, $albumDate and $itemDate and defaults to YYYYMMDD. See this documentation for an overview of all available tokens.
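
For a sense of what the default format produces, the sketch below handles only the YYYY, MM and DD tokens. formatDate is a hypothetical stand-in rather than the formatter the application uses, and evaluating dates in UTC is an assumption.

```javascript
// Minimal sketch covering only the tokens of the default format.
// Assumption: dates are evaluated in UTC; the application may differ.
function formatDate(date, format) {
  const parts = {
    YYYY: String(date.getUTCFullYear()),
    MM: String(date.getUTCMonth() + 1).padStart(2, '0'),
    DD: String(date.getUTCDate()).padStart(2, '0'),
  };

  return format.replace(/YYYY|MM|DD/g, (token) => parts[token]);
}
```

For example, formatDate(new Date(Date.UTC(2018, 3, 18)), 'YYYYMMDD') yields 20180418, and the format YYYY-MM-DD would yield 2018-04-18 for the same date.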

titleLength

Titles can sometimes be longer than you prefer your filenames to be, or even overflow the operating system's limit (255 bytes for Linux). This property cuts off titles at a fixed number of characters.
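
A character-based cut-off could look like the sketch below. limitTitle is a hypothetical helper and not necessarily how the application truncates titles.

```javascript
// Hypothetical helper: keeps at most titleLength characters of a title.
function limitTitle(title, titleLength) {
  // Array.from splits on code points, so emoji and other characters outside
  // the Basic Multilingual Plane are not cut in half.
  return Array.from(title).slice(0, titleLength).join('');
}
```

Note that characters and bytes are not the same thing: Buffer.byteLength('é', 'utf8') is 2, so a title with many non-ASCII characters can exceed a byte limit even when its character count is below it.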

indexOffset

Arrays start at 0, but so as not to tire myself out debating the matter, you may offset the index by any numerical value you like. Affects the $itemIndex variable for album items.
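
In other words, the value of $itemIndex can be thought of as the array index plus the offset. A small sketch, with itemIndexes as a hypothetical helper name:

```javascript
// Hypothetical helper: computes the $itemIndex value for each album item
// by adding indexOffset to the zero-based array index.
function itemIndexes(items, indexOffset) {
  return items.map((item, index) => index + indexOffset);
}
```

An offset of 1 numbers a three-item album 1, 2, 3, while an offset of 0 keeps 0, 1, 2.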

slashSubstitute

The patterns represent Unix file paths, and a / therefore indicates a new directory. You may freely use directories in your paths, but titles or descriptions may contain a / that is not supposed to create a new directory. All instances of / in a variable value will be replaced with the configured slash substitute.
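
The substitution can be pictured as follows. substituteSlashes is a hypothetical helper and the # substitute is just an example value, not necessarily the default.

```javascript
// Hypothetical helper: replaces every / in a variable value so that it
// cannot be mistaken for a directory separator in the final path.
function substituteSlashes(value, slashSubstitute) {
  return String(value).split('/').join(slashSubstitute);
}
```

With # as the substitute, the title AC/DC live 1/2 becomes AC#DC live 1#2, while any / written in the pattern itself still creates a directory.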