RTB
|
b46c7a309d
|
if synonym name matched in search instead of primary name, emit primary name as synonym
|
2014-05-17 14:21:11 +02:00 |
|
RTB
|
afc1106838
|
NIST now logs an error if chemical name is not found
|
2014-05-17 14:11:03 +02:00 |
|
RTB
|
56ee6b1ad3
|
added ignore list
|
2014-05-17 14:09:10 +02:00 |
|
Rob tB
|
98f58ea4e2
|
added scraping for generic info except for synonyms
|
2014-05-15 14:29:28 +02:00 |
|
Rob tB
|
50c79e3b1f
|
conditions in name (split by ' at ') are now moved to condition field for individual value page and aggregate data table
|
2014-05-14 13:44:43 +02:00 |
|
RTB
|
7e984b60d8
|
added uncertainty to results from scraping individual data points urls
|
2014-05-09 14:24:08 +02:00 |
|
RTB
|
775a920b9b
|
NIST scraper now handles urls with individual data points
|
2014-05-09 13:00:22 +02:00 |
|
RTB
|
5e067fd572
|
altered scraping of aggregate data to test for and request url to individual data points
|
2014-05-09 12:36:54 +02:00 |
|
Rob tB
|
74dddace88
|
removed logging of Result objects in debug messages because pointless
|
2014-05-08 15:42:53 +02:00 |
|
Rob tB
|
f6fa5e8adf
|
fixed scraping of unit for two kinds of tables
|
2014-05-08 15:22:48 +02:00 |
|
Rob tB
|
151f1988a1
|
added function to scrape table for Antoine equation parameters
|
2014-05-08 14:54:30 +02:00 |
|
RTB
|
7abb491d3f
|
added function for most generic tables
|
2014-05-07 22:08:43 +02:00 |
|
RTB
|
10dd74e026
|
added function to scrape transition tables
|
2014-05-07 21:58:52 +02:00 |
|
RTB
|
85595ecf35
|
created function to start scraping the aggregate data table
|
2014-05-07 18:12:08 +02:00 |
|
RTB
|
95e24f9c44
|
added code to recognize various table formats
|
2014-05-07 17:09:42 +02:00 |
|
RTB
|
9c80f291b6
|
search NIST exclusively for phase change data
|
2014-05-07 13:27:22 +02:00 |
|
Rob tB
|
930eb6cad5
|
NIST now scrapes the symbol table for later use
|
2014-05-04 21:20:46 +02:00 |
|
Rob tB
|
0cec4bd2d8
|
new_compound_request now returns a Request with a searh URL
|
2014-05-04 21:06:39 +02:00 |
|
Rob tB
|
e1e507f745
|
added several required imports
|
2014-05-01 15:30:28 +02:00 |
|
Rob tB
|
95e9675605
|
created stub for NIST parser
|
2014-05-01 14:57:09 +02:00 |
|
Jip J. Dekker
|
d523d4edcd
|
Spelling errors
|
2014-04-23 22:58:04 +02:00 |
|
Jip J. Dekker
|
c5bffffeda
|
Delayed refractor from developing branch
|
2014-04-23 22:55:28 +02:00 |
|
Jip J. Dekker
|
964e0b8ade
|
Merge branch 'develop' into feature/Wikipedia
|
2014-04-23 22:53:28 +02:00 |
|
Nout van Deijck
|
9cbdf57238
|
fixed comments
|
2014-04-23 16:24:27 +02:00 |
|
Nout van Deijck
|
150fc5bea7
|
added comments
|
2014-04-23 16:17:23 +02:00 |
|
Nout van Deijck
|
9cefd336e0
|
Cleaning up code and added log messages
|
2014-04-23 16:02:37 +02:00 |
|
Jip J. Dekker
|
90f03734a6
|
Refractored classname
|
2014-04-23 15:57:10 +02:00 |
|
Jip J. Dekker
|
e18e4b4b26
|
Resolved all references to the old folder
|
2014-04-23 15:55:38 +02:00 |
|
Jip J. Dekker
|
1e24453a11
|
Renamed filename of basic source class
|
2014-04-23 15:51:03 +02:00 |
|
Nout van Deijck
|
507006889b
|
Fixed problem with strange urls, now adds all external identifiers as requests
|
2014-04-23 15:49:23 +02:00 |
|
Jip J. Dekker
|
662ee8f490
|
Renamed folder
|
2014-04-23 15:49:03 +02:00 |
|
Bas Vb
|
62475d965d
|
Cleaning up code
|
2014-04-23 15:24:57 +02:00 |
|
Nout van Deijck
|
3e1b33164e
|
Some comments and trying different for loop for adding requests
|
2014-04-23 13:48:44 +02:00 |
|
Nout van Deijck
|
1ced65e2b6
|
Parser now adds extra requests for every identifier to an external source that is in the Wikipedia chembox
|
2014-04-23 13:18:50 +02:00 |
|
Nout van Deijck
|
b5c83125f7
|
Added extra request for chemspider link retreived from Wikipedia
|
2014-04-23 12:27:53 +02:00 |
|
Bas Vb
|
f926f86d7d
|
Small fix because the cleaned up items were not send back
|
2014-04-23 12:14:20 +02:00 |
|
Nout van Deijck
|
6dd03c293a
|
Added check for already visited redirects of compounds
|
2014-04-23 12:08:33 +02:00 |
|
Bas Vb
|
cb299df96f
|
Added log statements
|
2014-04-23 11:46:43 +02:00 |
|
Bas Vb
|
fd5faf22e4
|
Added empty reliability and condition to prevent errors for now
|
2014-04-23 11:12:58 +02:00 |
|
Bas Vb
|
1c518af5a6
|
Remove per attribute getfunctions
|
2014-04-23 11:06:59 +02:00 |
|
Jip J. Dekker
|
595af7aa32
|
PEP-8 and fixed a bug in set_spider
|
2014-04-22 19:03:29 +02:00 |
|
Jip J. Dekker
|
ba7bed0250
|
Disabled name mangling for the spider reference in the parsers
|
2014-04-22 18:55:14 +02:00 |
|
Jip J. Dekker
|
648b23e466
|
PEP-8 standards for a lot of things
|
2014-04-22 18:54:10 +02:00 |
|
Jip J. Dekker
|
0da2d74e2c
|
PEP-8 indentation for multi-line statements
|
2014-04-22 18:46:49 +02:00 |
|
Jip J. Dekker
|
7a1e99605b
|
Uniform TODO tags, indentation faults.
|
2014-04-22 18:40:14 +02:00 |
|
Bas Vb
|
b0146cdce8
|
Added regular expressions to clean up temperature data
|
2014-04-22 09:46:19 +02:00 |
|
RTB
|
63fb9f4733
|
added comment to parse_searchrequest and added optional todo for extract()[0] usage
|
2014-04-18 17:33:00 +02:00 |
|
RTB
|
3c5dbc44dc
|
added comments for chemspider parse_extendedinfo
|
2014-04-18 17:14:19 +02:00 |
|
RTB
|
2ac6d1711d
|
added comments for chemspider new_synonym
|
2014-04-18 17:11:04 +02:00 |
|
RTB
|
3862bfb7d8
|
added comments for ChemSpider class, parse_properties, and parse_synonyms
|
2014-04-18 16:54:30 +02:00 |
|