The Tidy website cited in the article (http://www.w3.org/People/Raggett/tidy/) has a Java Jar. It is really excellent for quickly importing HTML into an XML processing framework. -rod g