TidyMarkup
TidyMarkup is a Data Format that uses the TagSoup to tidy up HTML. It can be used to parse ugly HTML and return it as pretty wellformed HTML.
Camel eats our own -dog food- soap
TidyMarkup only supports the unmarshal operation as we really don't want to turn well formed HTML into ugly HTML
Example
An example where the consumer provides some HTML
from("file://site/inbox").unmarshal().tidyMarkup().to("file://site/blogs");
Requirements
TidyMarkup is provided in the camel-tagsoup.jar so if you are using maven you can just depend on this artifactId camel-tagsoup
.