Archived
1
0

192 Commits

Author SHA1 Message Date
RTB
b46c7a309d if synonym name matched in search instead of primary name, emit primary name as synonym 2014-05-17 14:21:11 +02:00
RTB
afc1106838 NIST now logs an error if chemical name is not found 2014-05-17 14:11:03 +02:00
RTB
56ee6b1ad3 added ignore list 2014-05-17 14:09:10 +02:00
Rob tB
98f58ea4e2 added scraping for generic info except for synonyms 2014-05-15 14:29:28 +02:00
Rob tB
50c79e3b1f conditions in name (split by ' at ') are now moved to condition field for individual value page and aggregate data table 2014-05-14 13:44:43 +02:00
Jip J. Dekker
c7ad35239e Merge pull request #4 from Recondor/feature/wikipediafixes
Feature/wikipediafixes
2014-05-13 21:37:53 +02:00
Bas Vb
b54568bab0 Small fixes 2014-05-13 16:18:32 +02:00
Jip J. Dekker
afaa0d903f Merge pull request #3 from Recondor/feature/chemspider-parser-fixes
Feature/chemspider-parser-fixes
2014-05-09 23:06:43 +02:00
RTB
7e984b60d8 added uncertainty to results from scraping individual data points urls 2014-05-09 14:24:08 +02:00
RTB
775a920b9b NIST scraper now handles urls with individual data points 2014-05-09 13:00:22 +02:00
RTB
5e067fd572 altered scraping of aggregate data to test for and request url to individual data points 2014-05-09 12:36:54 +02:00
Jip J. Dekker
f193aac24a Fixed Duplicate Pipeline + rename 2014-05-08 15:45:42 +02:00
Rob tB
74dddace88 removed logging of Result objects in debug messages because pointless 2014-05-08 15:42:53 +02:00
Jip J. Dekker
2e654255c5 Added documentation to the pipeline. 2014-05-08 15:35:18 +02:00
Rob tB
f6fa5e8adf fixed scraping of unit for two kinds of tables 2014-05-08 15:22:48 +02:00
Jip J. Dekker
2fcec009bb Added an Pipeline to deal with attribute selection 2014-05-08 15:20:48 +02:00
Rob tB
151f1988a1 added function to scrape table for Antoine equation parameters 2014-05-08 14:54:30 +02:00
RTB
7abb491d3f added function for most generic tables 2014-05-07 22:08:43 +02:00
RTB
10dd74e026 added function to scrape transition tables 2014-05-07 21:58:52 +02:00
RTB
85595ecf35 created function to start scraping the aggregate data table 2014-05-07 18:12:08 +02:00
RTB
95e24f9c44 added code to recognize various table formats 2014-05-07 17:09:42 +02:00
RTB
9c80f291b6 search NIST exclusively for phase change data 2014-05-07 13:27:22 +02:00
Rob tB
930eb6cad5 NIST now scrapes the symbol table for later use 2014-05-04 21:20:46 +02:00
Rob tB
0cec4bd2d8 new_compound_request now returns a Request with a searh URL 2014-05-04 21:06:39 +02:00
Bas Vb
03e652d454 Wikipediaparser now works on chemboxes as well 2014-05-01 16:05:37 +02:00
Rob tB
e1e507f745 added several required imports 2014-05-01 15:30:28 +02:00
Bas Vb
f8d390d3e6 Starting with fixing the wikiparser 2014-05-01 15:04:11 +02:00
Rob tB
95e9675605 created stub for NIST parser 2014-05-01 14:57:09 +02:00
RTB
ca0a22ae7b Added test for empty values on properties in ChemSpider ExtendedCompoundInfo API 2014-05-01 12:14:52 +02:00
RTB
73753a6294 chemspider source now handles vague search requests 2014-05-01 12:04:44 +02:00
RTB
971552110a Added test for empty values on properties in ACD/Labs tab 2014-04-29 16:30:09 +02:00
Jip J. Dekker
d523d4edcd Spelling errors 2014-04-23 22:58:04 +02:00
Jip J. Dekker
c5bffffeda Delayed refractor from developing branch 2014-04-23 22:55:28 +02:00
Jip J. Dekker
964e0b8ade Merge branch 'develop' into feature/Wikipedia 2014-04-23 22:53:28 +02:00
Nout van Deijck
9cbdf57238 fixed comments 2014-04-23 16:24:27 +02:00
Nout van Deijck
150fc5bea7 added comments 2014-04-23 16:17:23 +02:00
Nout van Deijck
9cefd336e0 Cleaning up code and added log messages 2014-04-23 16:02:37 +02:00
Jip J. Dekker
90f03734a6 Refractored classname 2014-04-23 15:57:10 +02:00
Jip J. Dekker
e18e4b4b26 Resolved all references to the old folder 2014-04-23 15:55:38 +02:00
Jip J. Dekker
1e24453a11 Renamed filename of basic source class 2014-04-23 15:51:03 +02:00
Nout van Deijck
507006889b Fixed problem with strange urls, now adds all external identifiers as requests 2014-04-23 15:49:23 +02:00
Jip J. Dekker
662ee8f490 Renamed folder 2014-04-23 15:49:03 +02:00
Bas Vb
62475d965d Cleaning up code 2014-04-23 15:24:57 +02:00
Nout van Deijck
3e1b33164e Some comments and trying different for loop for adding requests 2014-04-23 13:48:44 +02:00
Nout van Deijck
1ced65e2b6 Parser now adds extra requests for every identifier to an external source that is in the Wikipedia chembox 2014-04-23 13:18:50 +02:00
Nout van Deijck
b5c83125f7 Added extra request for chemspider link retreived from Wikipedia 2014-04-23 12:27:53 +02:00
Bas Vb
f926f86d7d Small fix because the cleaned up items were not send back 2014-04-23 12:14:20 +02:00
Nout van Deijck
6dd03c293a Added check for already visited redirects of compounds 2014-04-23 12:08:33 +02:00
Bas Vb
cb299df96f Added log statements 2014-04-23 11:46:43 +02:00
Bas Vb
fd5faf22e4 Added empty reliability and condition to prevent errors for now 2014-04-23 11:12:58 +02:00