TagSoup - SAX-compliant parser in Java