Home arrow Forums
Monday, 06 October 2008  
Latest Forums
FireBoard
Welcome, Guest
Please Login or Register.    Lost Password?
Re:SIGSEGV in boost_html in file element.hpp (0 viewing) 
Warning: Invalid argument supplied for foreach() in /home/ibusy.com/public_html/components/com_fireboard/template/default/fb_pathway.php on line 140
Go to bottom Post Reply Favoured: 0
TOPIC: Re:SIGSEGV in boost_html in file element.hpp
#29
irok84 (User)
Posts: 3
graphgraph
User Offline Click here to see the profile of this user
Birthdate:
SIGSEGV in boost_html in file element.hpp 4 Months, 2 Weeks ago Karma: 0  
Hi!

I have segmentation fault when i parsing contents of some sites (not all) by boost_html.

Program stoped in file element.hpp in method read on line 273:
ptr_element_t newElement = factory(elementName, this);

My fragment of code:

stringstream ost;
ost << s << endl ;
html.read(ost);

Variable s is string contains content of site downloaded by curl.

Please answer what is the problem.

P.S. Very good code/library (boost_html)
 
Report to moderator   Logged Logged  
  The administrator has disabled public write access.
#30
softwareace (Admin)
Admin
Posts: 15
graph
User Offline Click here to see the profile of this user
Birthdate:
Re:SIGSEGV in boost_html in file element.hpp 4 Months, 2 Weeks ago Karma: 0  
i'm actually in the process of doing some work in the boost.html library.

one of the things that will change will be the namespace (fyi) to boost.dom

Can you email me the links to the sites that are killing the library? i've tested the code on random sites, but am interested in sites that will kill the parser.
 
Report to moderator   Logged Logged  
 
Andreas Haberstroh
Ethernet (n) Device used to catch the Etherbunny
  The administrator has disabled public write access.
#31
irok84 (User)
Posts: 3
graphgraph
User Offline Click here to see the profile of this user
Birthdate:
Re:SIGSEGV in boost_html in file element.hpp 4 Months, 2 Weeks ago Karma: 0  
Example output from my program:

QUOTE:

Reading: http://google.pl
Reading: http://images.google.pl/imghp?hl=pl&tab=wi
Reading: http://video.google.pl/?hl=pl&tab=wv
Reading: http://groups.google.pl/grphp?hl=pl&tab=wg
Reading: http://google.pl/dirhp?hl=pl&tab=wd
Reading: http://mail.google.com/mail/?hl=pl&tab=wm
Segmentation fault


other:

QUOTE:

Reading: http://www.wp.pl
Segmentation fault
 
Report to moderator   Logged Logged  
  The administrator has disabled public write access.
#32
softwareace (Admin)
Admin
Posts: 15
graph
User Offline Click here to see the profile of this user
Birthdate:
Re:SIGSEGV in boost_html in file element.hpp 4 Months, 2 Weeks ago Karma: 0  
This is a good problem!
And, I might have a solution pretty quick.
 
Report to moderator   Logged Logged  
 
Andreas Haberstroh
Ethernet (n) Device used to catch the Etherbunny
  The administrator has disabled public write access.
#33
irok84 (User)
Posts: 3
graphgraph
User Offline Click here to see the profile of this user
Birthdate:
Re:SIGSEGV in boost_html in file element.hpp 4 Months, 1 Week ago Karma: 0  
Ok. I'm waiting for solution
 
Report to moderator   Logged Logged  
  The administrator has disabled public write access.
#34
softwareace (Admin)
Admin
Posts: 15
graph
User Offline Click here to see the profile of this user
Birthdate:
Re:SIGSEGV in boost_html in file element.hpp 4 Months, 1 Week ago Karma: 0  
I've been in the process of refactoring alot of the code, primarily the parser that reads the HTML/XML files. This problem is directly related to tag opening and tag closing.

The code chokes on the '<' and '>' in the javascript portions. I think in a few days the new code will be ready to download. If you find more examples of crashes, please post or email me the links, so I have a wider range to test against.
 
Report to moderator   Logged Logged  
 
Andreas Haberstroh
Ethernet (n) Device used to catch the Etherbunny
  The administrator has disabled public write access.
Go to top Post Reply
Powered by FireBoardget the latest posts directly to your desktop
Top