<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Read HTML with Java &#8211; Then 7 Fun Things to do to It</title>
	<atom:link href="http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/</link>
	<description>Go ahead, have another</description>
	<lastBuildDate>Sat, 31 Jul 2010 20:42:38 -0400</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: MikeNereson</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-627</link>
		<dc:creator>MikeNereson</dc:creator>
		<pubDate>Tue, 13 Jul 2010 12:14:09 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-627</guid>
		<description>&lt;a href=&#039;#comment-626&#039; rel=&quot;nofollow&quot;&gt;@Yaniv&lt;/a&gt; - HtmlParser&#039;s way of working with the DOM is through the NodeList. There may not be any effective way of getting the full HTML from the Parser. The Parser documentation states

&lt;em&gt;The Parser provides access to the contents of the page, via a NodeIterator, a NodeList or a NodeVisitor.

Typical usage of the parser is:&lt;/em&gt;

 &lt;code&gt;Parser parser = new Parser (&quot;http://whatever&quot;);
 NodeList list = parser.parse (null);
 // do something with your list of nodes.&lt;/code&gt;

So I don&#039;t think there is any public method to access the HTML. You might have to write something yourself to access the Parser&#039;s underlying model. Good luck.&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;627&#039;,&#039;MikeNereson&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;627&#039;,&#039;MikeNereson&#039;,&#039;&lt;a href=\&#039;#comment-626\&#039; rel=\&quot;nofollow\&quot;&gt;@Yaniv&lt;\/a&gt; - HtmlParser\&#039;s way of working with the DOM is through the NodeList. There may not be any effective way of getting the full HTML from the Parser. The Parser documentation states\r\n\r\n&lt;em&gt;The Parser provides access to the contents of the page, via a NodeIterator, a NodeList or a NodeVisitor.\r\n\r\nTypical usage of the parser is:&lt;\/em&gt;\r\n\r\n &lt;code&gt;Parser parser = new Parser (\&quot;http:\/\/whatever\&quot;);\r\n NodeList list = parser.parse (null);\r\n \/\/ do something with your list of nodes.&lt;\/code&gt;\r\n\r\nSo I don\&#039;t think there is any public method to access the HTML. You might have to write something yourself to access the Parser\&#039;s underlying model. Good luck.&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p><a href='#comment-626' rel="nofollow">@Yaniv</a> &#8211; HtmlParser&#8217;s way of working with the DOM is through the NodeList. There may not be any effective way of getting the full HTML from the Parser. The Parser documentation states</p>
<p><em>The Parser provides access to the contents of the page, via a NodeIterator, a NodeList or a NodeVisitor.</p>
<p>Typical usage of the parser is:</em></p>
<p> <code>Parser parser = new Parser ("http://whatever");<br />
 NodeList list = parser.parse (null);<br />
 // do something with your list of nodes.</code></p>
<p>So I don&#8217;t think there is any public method to access the HTML. You might have to write something yourself to access the Parser&#8217;s underlying model. Good luck.
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('627','MikeNereson'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('627','MikeNereson','&lt;a href=\'#comment-626\' rel=\&quot;nofollow\&quot;&gt;@Yaniv&lt;\/a&gt; - HtmlParser\'s way of working with the DOM is through the NodeList. There may not be any effective way of getting the full HTML from the Parser. The Parser documentation states\r\n\r\n&lt;em&gt;The Parser provides access to the contents of the page, via a NodeIterator, a NodeList or a NodeVisitor.\r\n\r\nTypical usage of the parser is:&lt;\/em&gt;\r\n\r\n &lt;code&gt;Parser parser = new Parser (\&quot;http:\/\/whatever\&quot;);\r\n NodeList list = parser.parse (null);\r\n \/\/ do something with your list of nodes.&lt;\/code&gt;\r\n\r\nSo I don\'t think there is any public method to access the HTML. You might have to write something yourself to access the Parser\'s underlying model. Good luck.'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yaniv</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-626</link>
		<dc:creator>Yaniv</dc:creator>
		<pubDate>Tue, 13 Jul 2010 09:13:02 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-626</guid>
		<description>Thank you for this great article it&#039;s exactly what I needed. I have one question though: after changing the links href, how can I get the new html (the whole page html) from the parser?

Thanks.&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;626&#039;,&#039;Yaniv&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;626&#039;,&#039;Yaniv&#039;,&#039;Thank you for this great article it\&#039;s exactly what I needed. I have one question though: after changing the links href, how can I get the new html (the whole page html) from the parser?\r\n\r\nThanks.&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p>Thank you for this great article it&#8217;s exactly what I needed. I have one question though: after changing the links href, how can I get the new html (the whole page html) from the parser?</p>
<p>Thanks.
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('626','Yaniv'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('626','Yaniv','Thank you for this great article it\'s exactly what I needed. I have one question though: after changing the links href, how can I get the new html (the whole page html) from the parser?\r\n\r\nThanks.'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
	<item>
		<title>By: MikeNereson</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-604</link>
		<dc:creator>MikeNereson</dc:creator>
		<pubDate>Wed, 31 Mar 2010 19:02:03 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-604</guid>
		<description>&lt;a href=&#039;#comment-603&#039; rel=&quot;nofollow&quot;&gt;@S. Metzger&lt;/a&gt; - : ) Thanks for reading and thanks for your comments.&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;604&#039;,&#039;MikeNereson&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;604&#039;,&#039;MikeNereson&#039;,&#039;&lt;a href=\&#039;#comment-603\&#039; rel=\&quot;nofollow\&quot;&gt;@S. Metzger&lt;\/a&gt; - : ) Thanks for reading and thanks for your comments.&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p><a href='#comment-603' rel="nofollow">@S. Metzger</a> &#8211; : ) Thanks for reading and thanks for your comments.
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('604','MikeNereson'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('604','MikeNereson','&lt;a href=\'#comment-603\' rel=\&quot;nofollow\&quot;&gt;@S. Metzger&lt;\/a&gt; - : ) Thanks for reading and thanks for your comments.'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
	<item>
		<title>By: S. Metzger</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-603</link>
		<dc:creator>S. Metzger</dc:creator>
		<pubDate>Wed, 31 Mar 2010 17:50:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-603</guid>
		<description>As a Java newcomer, i found this a very interesting read! Thanks for writing this article.&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;603&#039;,&#039;S. Metzger&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;603&#039;,&#039;S. Metzger&#039;,&#039;As a Java newcomer, i found this a very interesting read! Thanks for writing this article.&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p>As a Java newcomer, i found this a very interesting read! Thanks for writing this article.
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('603','S. Metzger'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('603','S. Metzger','As a Java newcomer, i found this a very interesting read! Thanks for writing this article.'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sathya Narayanan K</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-583</link>
		<dc:creator>Sathya Narayanan K</dc:creator>
		<pubDate>Fri, 08 Jan 2010 08:06:00 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-583</guid>
		<description>Hi,

Thanks for posting this blog on reading a html file.
I have few clarification reg the code. please can you tell me  how to declare the &quot;htmlBody&quot; variable , as it is not declared in this blog.

Thanks in Advance,
Sathya narayanan K.&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;583&#039;,&#039;Sathya Narayanan K&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;583&#039;,&#039;Sathya Narayanan K&#039;,&#039;Hi,\r\n\r\nThanks for posting this blog on reading a html file.\r\nI have few clarification reg the code. please can you tell me  how to declare the \&quot;htmlBody\&quot; variable , as it is not declared in this blog.\r\n\r\nThanks in Advance,\r\nSathya narayanan K.&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>Thanks for posting this blog on reading a html file.<br />
I have few clarification reg the code. please can you tell me  how to declare the &#8220;htmlBody&#8221; variable , as it is not declared in this blog.</p>
<p>Thanks in Advance,<br />
Sathya narayanan K.
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('583','Sathya Narayanan K'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('583','Sathya Narayanan K','Hi,\r\n\r\nThanks for posting this blog on reading a html file.\r\nI have few clarification reg the code. please can you tell me  how to declare the \&quot;htmlBody\&quot; variable , as it is not declared in this blog.\r\n\r\nThanks in Advance,\r\nSathya narayanan K.'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
	<item>
		<title>By: Adil</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-237</link>
		<dc:creator>Adil</dc:creator>
		<pubDate>Mon, 02 Nov 2009 07:57:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-237</guid>
		<description>Hi
 I am having problem in extracting  tag which is used for Css(stylesheet) i want to extract  tag and after extracting i need its src attribute so i can have its source kindly  help me out thanks in Advance

Regards,
Adil Badshah&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;237&#039;,&#039;Adil&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;237&#039;,&#039;Adil&#039;,&#039;Hi\r\n I am having problem in extracting  tag which is used for Css(stylesheet) i want to extract  tag and after extracting i need its src attribute so i can have its source kindly  help me out thanks in Advance\r\n\r\nRegards,\r\nAdil Badshah&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p>Hi<br />
 I am having problem in extracting  tag which is used for Css(stylesheet) i want to extract  tag and after extracting i need its src attribute so i can have its source kindly  help me out thanks in Advance</p>
<p>Regards,<br />
Adil Badshah
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('237','Adil'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('237','Adil','Hi\r\n I am having problem in extracting  tag which is used for Css(stylesheet) i want to extract  tag and after extracting i need its src attribute so i can have its source kindly  help me out thanks in Advance\r\n\r\nRegards,\r\nAdil Badshah'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
	<item>
		<title>By: MikeNereson</title>
		<link>http://blog.codehangover.com/read-html-with-java-then-7-fun-things-to-do-to-it/#comment-101</link>
		<dc:creator>MikeNereson</dc:creator>
		<pubDate>Fri, 21 Aug 2009 20:43:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.codehangover.com/?p=214#comment-101</guid>
		<description>Another fun thing to do is to print the HTML to PDF. I used iText for this on a project a couple of years ago &lt;a href=&quot;www.lowagie.com/itext&quot; rel=&quot;nofollow&quot;&gt;www.lowagie.com/itext&lt;/a&gt;&lt;div class=&quot;comment-remix-meta&quot;&gt;&lt;a href=&quot;#&quot; class=&quot;replyto&quot; onclick=&quot;replyto(&#039;101&#039;,&#039;MikeNereson&#039;); return false;&quot;&gt;Reply&lt;/a&gt;  - &lt;a href=&quot;#&quot; class=&quot;quote&quot; onclick=&quot;quote(&#039;101&#039;,&#039;MikeNereson&#039;,&#039;Another fun thing to do is to print the HTML to PDF. I used iText for this on a project a couple of years ago &lt;a href=\&quot;www.lowagie.com\/itext\&quot; rel=\&quot;nofollow\&quot;&gt;www.lowagie.com\/itext&lt;\/a&gt;&#039;); return false;&quot;&gt;Quote&lt;/a&gt;&lt;/div&gt;</description>
		<content:encoded><![CDATA[<p>Another fun thing to do is to print the HTML to PDF. I used iText for this on a project a couple of years ago <a href="www.lowagie.com/itext" rel="nofollow">http://www.lowagie.com/itext</a>
<div class="comment-remix-meta"><a href="#" class="replyto" onclick="replyto('101','MikeNereson'); return false;">Reply</a>  &#8211; <a href="#" class="quote" onclick="quote('101','MikeNereson','Another fun thing to do is to print the HTML to PDF. I used iText for this on a project a couple of years ago &lt;a href=\&quot;www.lowagie.com\/itext\&quot; rel=\&quot;nofollow\&quot;&gt;www.lowagie.com\/itext&lt;\/a&gt;'); return false;">Quote</a></div>
]]></content:encoded>
	</item>
</channel>
</rss>
