Sunday, 20 May 2012  
SIGSEGV in boost_html in file element.hpp
#29
SIGSEGV in boost_html in file element.hpp 3 Years, 12 Months ago Karma: 0
Hi!

I get a segmentation fault when parsing the contents of some sites (not all) with boost_html.

The program stops in the file element.hpp, in the method read, at line 273:
ptr_element_t newElement = factory(elementName, this);

My fragment of code:

stringstream ost;
ost << s << endl ;
html.read(ost);

The variable s is a string containing the content of the site, downloaded with curl.

Could you tell me what the problem is?

P.S. Very good code/library (boost_html)
irok84
Fresh Boarder
Posts: 3
 
#30
Re:SIGSEGV in boost_html in file element.hpp 3 Years, 12 Months ago Karma: 0
I'm actually in the process of doing some work on the boost.html library.

FYI, one of the things that will change is the namespace, which will become boost.dom.

Can you email me the links to the sites that are killing the library? I've tested the code on random sites, but I'm interested in sites that will kill the parser.
softwareace
Administrator
Posts: 23
Andreas Haberstroh
Ethernet (n) Device used to catch the Etherbunny
 
#31
Re:SIGSEGV in boost_html in file element.hpp 3 Years, 12 Months ago Karma: 0
Example output from my program:


Reading: google.pl
Reading: images.google.pl/imghp?hl=pl&tab=wi
Reading: video.google.pl/?hl=pl&tab=wv
Reading: groups.google.pl/grphp?hl=pl&tab=wg
Reading: google.pl/dirhp?hl=pl&tab=wd
Reading: mail.google.com/mail/?hl=pl&tab=wm
Segmentation fault


Another run:


Reading: www.wp.pl
Segmentation fault
irok84
Fresh Boarder
Posts: 3
 
#32
Re:SIGSEGV in boost_html in file element.hpp 3 Years, 12 Months ago Karma: 0
This is a good problem!
And I might have a solution pretty quickly.
softwareace
Administrator
Posts: 23
 
#33
Re:SIGSEGV in boost_html in file element.hpp 3 Years, 11 Months ago Karma: 0
OK. I'm waiting for the solution.
irok84
Fresh Boarder
Posts: 3
 
#34
Re:SIGSEGV in boost_html in file element.hpp 3 Years, 11 Months ago Karma: 0
I've been in the process of refactoring a lot of the code, primarily the parser that reads the HTML/XML files. This problem is directly related to how tags are opened and closed.

The code chokes on the '<' and '>' characters in the JavaScript portions of a page. I think the new code will be ready to download in a few days. If you find more examples of crashes, please post or email me the links, so I have a wider range to test against.
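
For anyone wondering what "choking on '<' and '>' in the JavaScript portions" looks like in practice, here is a minimal C++ sketch of one common way to avoid it: once a script tag has been opened, everything up to the closing script tag is kept as raw text instead of being scanned for tags. This is only an illustration and is not the boost_html code or its planned fix; the names Token, matches_at, find_ci and tokenize are hypothetical and exist only for this example. It also ignores details a real parser has to handle, such as '>' inside attribute values, comments and CDATA sections.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical token type for this illustration; not part of boost_html.
struct Token {
    enum Kind { Tag, Text } kind;
    std::string value;
};

// True if `html` contains the lower-case `word` at `pos`, ignoring case.
bool matches_at(const std::string& html, std::size_t pos, const std::string& word) {
    if (pos + word.size() > html.size()) return false;
    for (std::size_t i = 0; i < word.size(); ++i) {
        if (std::tolower(static_cast<unsigned char>(html[pos + i])) != word[i]) return false;
    }
    return true;
}

// Case-insensitive search for the lower-case `word`, starting at `from`.
std::size_t find_ci(const std::string& html, const std::string& word, std::size_t from) {
    for (std::size_t i = from; i + word.size() <= html.size(); ++i) {
        if (matches_at(html, i, word)) return i;
    }
    return std::string::npos;
}

// Splits `html` into tag and text tokens. The body of a script element is
// emitted as one Text token, so '<' and '>' inside JavaScript never start
// or end a tag.
std::vector<Token> tokenize(const std::string& html) {
    std::vector<Token> out;
    std::size_t pos = 0;
    while (pos < html.size()) {
        if (html[pos] != '<') {
            // Plain text runs up to the next '<' (or the end of input).
            std::size_t next = html.find('<', pos);
            if (next == std::string::npos) next = html.size();
            out.push_back(Token{Token::Text, html.substr(pos, next - pos)});
            pos = next;
            continue;
        }
        std::size_t close = html.find('>', pos);
        if (close == std::string::npos) {
            // Unterminated tag: keep the rest as text instead of reading past the end.
            out.push_back(Token{Token::Text, html.substr(pos)});
            break;
        }
        out.push_back(Token{Token::Tag, html.substr(pos, close - pos + 1)});
        const bool opens_script = matches_at(html, pos, "<script");
        pos = close + 1;
        if (opens_script) {
            // Raw-text mode: skip ahead to the closing script tag, if there is one.
            std::size_t end = find_ci(html, "</script", pos);
            if (end == std::string::npos) end = html.size();
            if (end > pos) out.push_back(Token{Token::Text, html.substr(pos, end - pos)});
            pos = end;
        }
        assert(pos <= html.size());
    }
    return out;
}
```

Calling tokenize on a page fragment such as `<script>if (a < b) x();</script>` yields the opening tag, the whole script body as a single text token, and the closing tag. A tokenizer without the raw-text step would instead treat `< b) x();</script>` as the start of a tag, which is the kind of mismatch between opened and closed tags described above.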
softwareace
Administrator
Posts: 23