commit | 4d223f1903629328eb105f64d3893a063aefbe72 | [log] [tgz] |
---|---|---|
author | jmesserly@google.com <jmesserly@google.com> | Wed Aug 14 20:59:46 2013 +0000 |
committer | jmesserly@google.com <jmesserly@google.com> | Wed Aug 14 20:59:46 2013 +0000 |
tree | 87953f72a49c4fc7ce7cc5631653ccf4ce332859 |
move html5lib code into dart svn repo also gets the tests running Changes to scripts: * pkg/pkg.gyp: add 'third_party' to list of folders to search * tools/publish_pkg.py: update copyright year * tools/publish_all_pkgs.py: also include pkg/third_party Changes to html5lib: * pubspec.yaml -- removed versions * README.md -- removed some historical notes * added html5lib.status * test/browser -- rename browser_tests to browser_test so test framework finds it * test/parser_test.dart, test/parser_feature_test.dart -- moved dart:io test into parser_test so parser_feature_test can work in browser * test/support.dart -- now finds the data folder relative to entry point script * test/support.dart -- rename "pathos" import to "path" Not changed: * ./test/run.sh still works for testing, in addition to ./tools/test.dart R=dgrove@google.com, ricow@google.com, sigmund@google.com Review URL: https://codereview.chromium.org//22375011 git-svn-id: https://dart.googlecode.com/svn/branches/bleeding_edge/dart/pkg/third_party/html5lib@26152 260f80e4-7a28-3924-810f-c04153c831b5
This is a pure Dart html5 parser. It‘s a port of html5lib from Python. Since it’s 100% Dart you can use it safely from a script or server side app.
Eventually the parse tree API will be compatible with dart:html, so the same code will work on the client and the server.
Add this to your pubspec.yaml
(or create it):
dependencies: html5lib: any
Then run the Pub Package Manager (comes with the Dart SDK):
pub install
Parsing HTML is easy!
import 'package:html5lib/parser.dart' show parse; import 'package:html5lib/dom.dart'; main() { var document = parse( '<body>Hello world! <a href="www.html5rocks.com">HTML5 rocks!'); print(document.outerHtml); }
You can pass a String or list of bytes to parse
. There's also parseFragment
for parsing a document fragment, and HtmlParser
if you want more low level control.
# From Dart SVN checkout ./tools/build.py -m release ./tools/test.py -m release html5lib ./tools/test.py -m release -r drt html5lib