[redland-dev] [Redland Language Bindings 0000525]: NTriplesParser aborts while opening large file

Mantis Bug Tracker mantis-bug-sender at librdf.org
Tue Jan 15 04:25:47 EST 2013


The following issue has been SUBMITTED. 
====================================================================== 
http://bugs.librdf.org/mantis/view.php?id=525 
====================================================================== 
Reported By:                packi
Assigned To:                
====================================================================== 
Project:                    Redland Language Bindings
Issue ID:                   525
Category:                   
Reproducibility:            always
Severity:                   crash
Priority:                   normal
Status:                     new
Binding Language (java, perl, php, python, ruby, tcl): python 
====================================================================== 
Date Submitted:             2013-01-15 09:25
Last Modified:              2013-01-15 09:25
====================================================================== 
Summary:                    NTriplesParser aborts while opening large file
Description: 
I was trying to import a largish file (2.6 GB) using the NTripletsParser:

(temp3)packi at overlord:~/Downloads$ python
Python 2.7.3 (default, Sep 26 2012, 21:53:58) 
[GCC 4.7.2] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import RDF
>>> parser = RDF.NTriplesParser()
>>> for statement in
parser.parse_as_stream('file:///home/packi/Downloads/mappingbased_properties_en.nt'):
...     print repr(statement)
... 
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python2.7/dist-packages/RDF.py", line 1737, in parse_as_stream
    uri._reduri, base_uri._reduri)
RDF.RedlandError: "failed to open file
'/home/packi/Downloads/mappingbased_properties_en.nt' - Value too large for
defined data type"
>>>

The code works using a smaller file. The strace shows that the open syscall
fails with EOVERFLOW:

open("/home/packi/Downloads/mappingbased_properties_en.nt", O_RDONLY) = -1
EOVERFLOW (Value too large for defined data type)


Steps to Reproduce: 
Download the dataset from
http://downloads.dbpedia.org/3.8/en/mappingbased_properties_en.nt.bz2 and unzip
it.
Run the attached python file.

====================================================================== 

Issue History 
Date Modified    Username       Field                    Change               
====================================================================== 
2013-01-15 09:25 packi          New Issue                                    
2013-01-15 09:25 packi          File Added: test.py                          
======================================================================



More information about the redland-dev mailing list