Archived
1
0

24 Commits

Author SHA1 Message Date
Nout van Deijck
b5c83125f7 Added extra request for chemspider link retreived from Wikipedia 2014-04-23 12:27:53 +02:00
Bas Vb
f926f86d7d Small fix because the cleaned up items were not send back 2014-04-23 12:14:20 +02:00
Nout van Deijck
6dd03c293a Added check for already visited redirects of compounds 2014-04-23 12:08:33 +02:00
Bas Vb
cb299df96f Added log statements 2014-04-23 11:46:43 +02:00
Bas Vb
fd5faf22e4 Added empty reliability and condition to prevent errors for now 2014-04-23 11:12:58 +02:00
Bas Vb
1c518af5a6 Remove per attribute getfunctions 2014-04-23 11:06:59 +02:00
Bas Vb
b0146cdce8 Added regular expressions to clean up temperature data 2014-04-22 09:46:19 +02:00
Bas Vb
be63315ca2 regex 2014-04-16 17:01:35 +02:00
Bas Vb
6f82b117c9 new function to clean up the datapoints 2014-04-16 16:23:33 +02:00
Bas Vb
74aa446f40 minor edits (comments etc.) 2014-04-16 15:27:36 +02:00
Bas Vb
34c3a8b4d6 remove empty data points 2014-04-16 15:22:47 +02:00
Bas Vb
ce3105f3c1 went to a general loop over all values, this way getting all elements from the Wikipedia infobox (except for those with a colspan, because these mess up) 2014-04-16 14:56:32 +02:00
Bas Vb
f1280dd66d get value not list from xpath 2014-04-16 13:23:50 +02:00
Bas Vb
d99548e3b6 Added density, molar entropy and heat capacity 2014-04-16 11:14:02 +02:00
Bas Vb
d778050f36 Able to parse the weblinks to other databases, one example done 2014-04-16 10:37:57 +02:00
Bas Vb
cd1637b0fe Both Boiling point and melting point are now parsed from chemical Wikipedia pages, there's one error about different types of attributes in the Result-items, this needs to be fixed by cleaning up the retrieved data. 2014-04-16 00:50:50 +02:00
Bas Vb
1ca3593ae1 Parse is runnable now. 2014-04-16 00:35:19 +02:00
Bas Vb
f9799c30d8 Parse is runnable now. 2014-04-08 14:59:09 +02:00
Jip J. Dekker
4b0c4acf96 Updated the wikipedia parser as an rightful subclass of Parser 2014-04-08 11:40:30 +02:00
Bas Vb
f3807c3018 Fixed the errors, but still not able to run/test the parse() function 2014-04-06 20:28:03 +02:00
Bas Vb
add4a13a4d Trying to make a start with the WikipediaParser, but I can't find out with the Scrapy website (or another way) what the structure of the file should be, and how I can test/run the crawling on a page. 2014-04-06 18:02:09 +02:00
Nout van Deijck
81a93c44bb added author 2014-04-03 12:19:17 +02:00
Bas Vb
60c409da3d New file and branch for the Wikipedia parser 2014-04-03 12:05:06 +02:00
Bas Vb
b4ff4a3c3b New file and branch for the Wikipedia parser 2014-04-03 12:00:27 +02:00