Most of the webpages now-a-days contain lists of things, or chunks of html patterns that repeat a lot.
For example:
1. Facebook status messages on homepages.
2. Digg/Hacker News
3. StackOverflow homepage
Is there a Java library for detecting such lists. It will involve some amount of pattern matching and intelligence. Thanks. Perhaps you could use regexes?
以上就是Java library for detecting list items on webpages的详细内容,更多请关注web前端其它相关文章!