http://xml.resource.org/index.html
  Input file: /Users/markdavis/Documents/workspace/cldr/docs/rfc
  Output result: File
  Text: plaintext
  then Web page: HTML

http://tools.ietf.org/tools/idnits/
http://tools.ietf.org/tools/bap/abnf.cgi

submit source
http://datatracker.ietf.org/submit/

Fix after Eclipse format
  Find:    >\s+([.,:;])
  Replace: >$1
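
The find/replace pair above removes whitespace that the Eclipse formatter leaves between a closing ">" and a following punctuation mark. A minimal sketch of the same substitution outside Eclipse, assuming Python is available; the function name and the sample string are hypothetical, and Python writes the group reference as \1 where Eclipse uses $1:

```python
import re

# Same pattern as the Eclipse find field: a '>' followed by whitespace
# (including line breaks) and then one of . , : ;
PATTERN = re.compile(r">\s+([.,:;])")


def fix_punctuation_after_tag(text: str) -> str:
    """Drop whitespace between a closing '>' and the punctuation after it."""
    # Eclipse's replacement ">$1" is spelled r">\1" in Python's re module.
    return PATTERN.sub(r">\1", text)


if __name__ == "__main__":
    # Hypothetical sample of formatter output, not taken from the real draft.
    sample = '<xref target="A"/>\n      , and <xref target="B"/> .'
    print(fix_punctuation_after_tag(sample))
```

Text where ">" is followed by whitespace and an ordinary word is left alone, since the pattern only matches when the next non-space character is one of the four punctuation marks.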