Turn websites into useful data

We help people extract information from websites with a simple point-and-click toolkit.

Dataflow Kit is an open source web scraping framework written in Go.


Data Extraction and Delivery Process

Open a web page

Behind the scenes, a headless Chrome browser renders JavaScript-driven web pages properly.

Click to select data

  • Optionally apply the trim, UPPER, lower, or Capitalize filters, or build a regular expression.
  • Choose a paginator type: "Next" link, "Infinite scroll", or "Load more" button.
  • Follow links to process detail pages.
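
To give a rough idea of what the text filters above do to a selected value, here is a minimal sketch in Go. It is not Dataflow Kit's actual implementation: the function names applyFilter and extractFirst and the lowercase filter names are assumptions made for illustration, and Capitalize is assumed to uppercase only the first character.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
	"unicode"
)

// applyFilter applies one named text filter to s.
// The filter names are hypothetical; they only mirror the checkbox labels.
func applyFilter(s, name string) string {
	switch name {
	case "trim":
		return strings.TrimSpace(s)
	case "upper":
		return strings.ToUpper(s)
	case "lower":
		return strings.ToLower(s)
	case "capitalize":
		// Assumption: uppercase the first rune and leave the rest unchanged.
		r := []rune(s)
		if len(r) == 0 {
			return s
		}
		r[0] = unicode.ToUpper(r[0])
		return string(r)
	default:
		// Unknown filters leave the value untouched.
		return s
	}
}

// extractFirst returns the first match of pattern in s, or "" if none.
func extractFirst(s, pattern string) (string, error) {
	re, err := regexp.Compile(pattern)
	if err != nil {
		return "", err
	}
	return re.FindString(s), nil
}

func main() {
	raw := "  price: 42 USD  "
	cleaned := applyFilter(applyFilter(raw, "trim"), "capitalize")
	fmt.Println(cleaned) // Price: 42 USD

	num, err := extractFirst(cleaned, `\d+`)
	if err != nil {
		fmt.Println("bad pattern:", err)
		return
	}
	fmt.Println(num) // 42
}
```

Filters are applied one after another here, so trimming first and capitalizing second yields a clean value before the regular expression runs.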

Download results

  • Launch the crawler to follow links and extract content from the specified pages.
  • Select one of the available formats: CSV, Excel, JSON / JSON Lines, or XML.
  • Download parsed data.
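
As an illustration of how two of the formats listed above differ, the following Go sketch writes the same records as CSV and as JSON Lines using only the standard library. The Record type, its fields, and the helper names toCSV and toJSONLines are hypothetical and are not taken from Dataflow Kit's code.

```go
package main

import (
	"bytes"
	"encoding/csv"
	"encoding/json"
	"fmt"
)

// Record is a hypothetical scraped row used only for this example.
type Record struct {
	Title string `json:"title"`
	Price string `json:"price"`
}

// toCSV renders a header row followed by one line per record.
func toCSV(records []Record) (string, error) {
	var buf bytes.Buffer
	w := csv.NewWriter(&buf)
	if err := w.Write([]string{"title", "price"}); err != nil {
		return "", err
	}
	for _, r := range records {
		if err := w.Write([]string{r.Title, r.Price}); err != nil {
			return "", err
		}
	}
	w.Flush()
	return buf.String(), w.Error()
}

// toJSONLines renders one JSON object per line.
func toJSONLines(records []Record) (string, error) {
	var buf bytes.Buffer
	enc := json.NewEncoder(&buf) // Encode writes a newline after each value.
	for _, r := range records {
		if err := enc.Encode(r); err != nil {
			return "", err
		}
	}
	return buf.String(), nil
}

func main() {
	records := []Record{
		{Title: "Book", Price: "9.99"},
		{Title: "Pen, blue", Price: "1.50"},
	}

	csvOut, err := toCSV(records)
	if err != nil {
		fmt.Println("csv error:", err)
		return
	}
	fmt.Print(csvOut)

	jsonl, err := toJSONLines(records)
	if err != nil {
		fmt.Println("json error:", err)
		return
	}
	fmt.Print(jsonl)
}
```

CSV suits spreadsheets, while JSON Lines keeps each record as a self-contained object that can be processed line by line.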

Open source

Dataflow Kit is open source, and we welcome all contributors who are interested in collaborating.

You can help with issues, coding features, releasing the project, scripting, tests, benchmarking, documentation, updating samples, or sharing information about Dataflow Kit.

Please star the DFK GitHub repository.