Getting Started with Socrata DataSync

DataSync is an executable Java application which serves as a general solution to automate publishing data on the Socrata platform. It can be used through a easy-to-use graphical interface or as a command-line tool (‘headless mode’). Whether you are a non-technical user, developer, or ETL specialist DataSync makes data publishing simple and reliable. DataSync takes a CSV or TSV file on a local machine or networked hard drive and publishes it to a Socrata dataset so that the Socrata dataset stays up-to-date. DataSync jobs can be integrated into an ETL process, scheduled using a tool such as the Windows Task Scheduler or Cron, or used to perform updates or deletes in batches. DataSync works on any platform that runs Java version 8 or higher (i.e. Windows, Mac, and Linux). This simple, yet powerful publishing tool lets you easily update Socrata datasets programmatically and automatically (scheduled), without writing a single line of code.

Download DataSync

Download the latest release of the DataSync .jar file from here (requires Java 8 or higher): https://github.com/socrata/datasync/releases/latest.

Save the JAR file you downloaded in a permanent location on your computer or server. Double-click the JAR file you just downloaded to open the program in GUI mode.

NOTE: DataSync is under active development and new versions are released regularly. You will be notified of major version updates of DataSync when they become available; you will not be notified of minor version updates however. If you have feature suggestions or you find bugs in DataSync please submit them to the GitHub issue tracker.

DataSync source code is available on GitHub.

Prerequisites

  • A computer/server running Java 8 or higher
  • A Socrata account with a Publisher role or Owner rights to at least one dataset
  • An App Token. If you have not yet created one read how to obtain an App token

Continue to either the quick-start guide or the fuller guide for setting up a standard job.