
Tried OCR for scraping and gave up because it was too slow and inaccurate.

OCR works well in certain scenarios where the UI is fixed, such as desktop applications, but it's still fragile, much like CSS and XPath selectors.

In fact, OCR is often far slower and less accurate than CSS/XPath selectors.

It has its niches, but I think it's suboptimal for web automation/scraping.


