Archived
1
0

288 Commits

Author SHA1 Message Date
RTB
a4a21f2578 changed default reliability from empty string to Unknown as per UML design 2014-04-18 13:19:05 +02:00
RTB
9389af99ba removed manual Requests for wikipedia URLs as wikipedia parser handles those through synonyms 2014-04-18 13:17:24 +02:00
RTB
ae21fa7c67 chemspider now scrapes for reference data on synonyms 2014-04-18 13:16:22 +02:00
RTB
119d48890d fixed conditional for emitting synonyms, it compiles again 2014-04-18 12:14:54 +02:00
Jip J. Dekker
000bc556ec Merge branch 'develop' into feature/parser-selection
Conflicts:
	fourmi.py
2014-04-17 23:44:00 +02:00
RTB
04751b6670 chemspider parser now only emits synonyms labeled as 'English' 2014-04-17 22:47:43 +02:00
RTB
ce5eeb56a6 added scraping of synonym language 2014-04-17 22:37:37 +02:00
RTB
4f2c046c9c rewrote parse_synonyms and new_synonym to use an internal dictionary structure 2014-04-17 22:06:45 +02:00
RTB
2e95d35283 modified parse_synonyms and new_synonym to include a Selector for future edits 2014-04-17 21:30:53 +02:00
Bas Vb
be63315ca2 regex 2014-04-16 17:01:35 +02:00
Jip J. Dekker
bd7fb38497 Merge branch 'feature/chemspider-parser' of code.giphouse.nl:giphouse/descartes-2 into feature/chemspider-parser 2014-04-16 17:01:09 +02:00
Jip J. Dekker
3aaed985fe Merge branch 'develop' into feature/chemspider-parser 2014-04-16 17:00:59 +02:00
Jip J. Dekker
873231439c Merge branch 'develop' into feature/Wikipedia 2014-04-16 16:59:25 +02:00
Jip J. Dekker
d603e388e6 Merge branch 'hotfix/1-searchable' into develop 2014-04-16 16:58:53 +02:00
Jip J. Dekker
8cf307127a Merge branch 'hotfix/1-searchable' 2014-04-16 16:57:43 +02:00
Jip J. Dekker
ab2a3fdc08 typo! 2014-04-16 16:57:27 +02:00
Jip J. Dekker
f0d10902b5 Searchable can't be a list! 2014-04-16 16:57:08 +02:00
Jip J. Dekker
efacc08a3d Merge branch 'develop' into feature/Wikipedia
Conflicts:
	Fourmi.py
2014-04-16 16:49:03 +02:00
Bas Vb
6f82b117c9 new function to clean up the datapoints 2014-04-16 16:23:33 +02:00
Rob tB
9a78e186bc chemspider parser now grabs data from ExtendedCompoundInfo() of chemspider API (no units) 2014-04-16 16:22:47 +02:00
Bas Vb
74aa446f40 minor edits (comments etc.) 2014-04-16 15:27:36 +02:00
Rob tB
caf7d3df4e fixed ExtendedCompoundInfo url to have csid parameter instead of query 2014-04-16 15:27:10 +02:00
Bas Vb
34c3a8b4d6 remove empty data points 2014-04-16 15:22:47 +02:00
Rob tB
2d314aee6a created stub to parse ExtendedCompoundInfo from ChemSpider MassSpec API 2014-04-16 15:21:33 +02:00
Rob tB
7fc980befe chemspider should now only generate new Requests for wikipedia links from 'expert confirmed' synonyms 2014-04-16 15:02:37 +02:00
Bas Vb
ce3105f3c1 went to a general loop over all values, this way getting all elements from the Wikipedia infobox (except for those with a colspan, because these mess up) 2014-04-16 14:56:32 +02:00
Rob tB
87282fc572 new properties in parse_properties now use dictionary syntax 2014-04-16 14:26:27 +02:00
Rob tB
93a6f098a9 log messages are now DEBUG instead of WARNING 2014-04-16 13:28:59 +02:00
Bas Vb
f1280dd66d get value not list from xpath 2014-04-16 13:23:50 +02:00
Rob tB
c1b5f810cb unused Result properties are now empty string instead of None 2014-04-16 11:53:59 +02:00
Jip J. Dekker
92a74de9e0 Added the include and exclude options. 2014-04-16 11:17:48 +02:00
Bas Vb
d99548e3b6 Added density, molar entropy and heat capacity 2014-04-16 11:14:02 +02:00
Jip J. Dekker
e0e64bd65a Implemented source exclusion 2014-04-16 11:03:59 +02:00
Jip J. Dekker
d823c105e6 Implemented source inclusion 2014-04-16 10:48:29 +02:00
Bas Vb
d778050f36 Able to parse the weblinks to other databases, one example done 2014-04-16 10:37:57 +02:00
Jip J. Dekker
7b57d86178 Removed redundant source loader 2014-04-16 10:36:46 +02:00
Jip J. Dekker
9dcb150356 Merge branch 'develop' into feature/chemspider-parser 2014-04-16 10:24:52 +02:00
Jip J. Dekker
a06bf643f1 Made sourceloader a class and implemented the listing of all sources 2014-04-16 10:14:29 +02:00
Jip J. Dekker
8b7cfac2de Added an new command to the CLI, implementation will follow. 2014-04-16 09:33:07 +02:00
Bas Vb
cd1637b0fe Both Boiling point and melting point are now parsed from chemical Wikipedia pages, there's one error about different types of attributes in the Result-items, this needs to be fixed by cleaning up the retrieved data. 2014-04-16 00:50:50 +02:00
Bas Vb
1ca3593ae1 Parse is runnable now. 2014-04-16 00:35:19 +02:00
Jip J. Dekker
6799a1a956 Merge branch 'release/v0.1.0' into develop 1-searchable 2014-04-15 19:49:07 +02:00
Jip J. Dekker
2d5e39de81 Merge branch 'release/v0.1.0' v0.1.0 2014-04-15 19:48:55 +02:00
Jip J. Dekker
972e5da0d2 Removed debug code and typos. 2014-04-15 19:48:27 +02:00
Jip J. Dekker
d770f79a7a Bumped version number 2014-04-15 19:46:10 +02:00
Jip J. Dekker
878d8e5efb Merge branch 'feature/CLI' into develop 2014-04-15 19:44:41 +02:00
Jip J. Dekker
61ca2520e3 Added feed export functionality 2014-04-15 19:40:54 +02:00
Jip J. Dekker
e65d3a6898 Added the options for the Feed exports 2014-04-15 18:57:51 +02:00
RTB
8e46762a9e fix: if no experimental data, return predicted acd/labs data instead of None 2014-04-15 18:56:38 +02:00
Jip J. Dekker
ffb3861034 Search for single compound, filename should be lowercase 2014-04-15 18:49:30 +02:00