▲ 11 ▼ Show: Dataflow Kit - extract structured data from web sites. Web sites scraping. github.com godoc.org goreportcard.com posted by slotix 2293 days ago ▲ kenny 2293 days ago ▼ This looks really interesting, thanks for posting. What have you used it for so far? Contributing ▲ slotix 2291 days ago ▼ Thanks @kenny for your feedback. One of our customers uses dfk for aggregating data about physicians extracted from different web sites. The volume of scraped data is about several millions of pages per web site. DFK is able to extract data from Java script driven pages and from websites behind login form. Contributing ▲ Tim Donell 2289 days ago ▼ Can you use it without docker? Contributing ▲ slotix 2289 days ago ▼ Yes please find more information here https://github.com/slotix/dataflowkit#manual-way Contributing Register to comment or vote on this story
This looks really interesting, thanks for posting. What have you used it for so far?
Thanks @kenny for your feedback. One of our customers uses dfk for aggregating data about physicians extracted from different web sites.
The volume of scraped data is about several millions of pages per web site.
DFK is able to extract data from Java script driven pages and from websites behind login form.
Can you use it without docker?
Yes please find more information here https://github.com/slotix/dataflowkit#manual-way