TagSoup : Java Glossary

*0-9ABCDEFGHIJKLMNOPQRSTUVWXYZ (all)

TagSoup
TagSoup is a SAX-compliant parser written in Java that, instead of parsing well-formed or valid XML (extensible Markup Language), parses HTML (Hypertext Markup Language) as it is found in the wild: poor, nasty and brutish, though quite often far from short. TagSoup is designed for people who have to process this stuff using some semblance of a rational application design. By providing a SAX (Simple API for XML) interface, it allows standard XML tools to be applied to even the worst HTML. TagSoup also includes a command-line processor that reads HTML files and can generate either clean HTML or well-formed XML that is a close approximation to XHTML (extensible Hypertext Markup Language).

This page is posted
on the web at:

http://mindprod.com/jgloss/tagsoup.html

Optional Replicator mirror
of mindprod.com
on local hard disk J:

J:\mindprod\jgloss\tagsoup.html
Canadian Mind Products
Please the feedback from other visitors, or your own feedback about the site.
Contact Roedy. Please feel free to link to this page without explicit permission.

IP:[65.110.21.43]
Your face IP:[18.232.35.62]
You are visitor number