Jip J. Dekker
|
92a74de9e0
|
Added the include and exclude options.
|
2014-04-16 11:17:48 +02:00 |
|
Jip J. Dekker
|
e0e64bd65a
|
Implemented source exclusion
|
2014-04-16 11:03:59 +02:00 |
|
Jip J. Dekker
|
d823c105e6
|
Implemented source inclusion
|
2014-04-16 10:48:29 +02:00 |
|
Jip J. Dekker
|
7b57d86178
|
Removed redundant source loader
|
2014-04-16 10:36:46 +02:00 |
|
Jip J. Dekker
|
a06bf643f1
|
Made sourceloader a class and implemented the listing of all sources
|
2014-04-16 10:14:29 +02:00 |
|
Jip J. Dekker
|
8b7cfac2de
|
Added an new command to the CLI, implementation will follow.
|
2014-04-16 09:33:07 +02:00 |
|
Jip J. Dekker
|
6799a1a956
|
Merge branch 'release/v0.1.0' into develop
1-searchable
|
2014-04-15 19:49:07 +02:00 |
|
Jip J. Dekker
|
972e5da0d2
|
Removed debug code and typos.
|
2014-04-15 19:48:27 +02:00 |
|
Jip J. Dekker
|
d770f79a7a
|
Bumped version number
|
2014-04-15 19:46:10 +02:00 |
|
Jip J. Dekker
|
878d8e5efb
|
Merge branch 'feature/CLI' into develop
|
2014-04-15 19:44:41 +02:00 |
|
Jip J. Dekker
|
61ca2520e3
|
Added feed export functionality
|
2014-04-15 19:40:54 +02:00 |
|
Jip J. Dekker
|
e65d3a6898
|
Added the options for the Feed exports
|
2014-04-15 18:57:51 +02:00 |
|
Jip J. Dekker
|
ffb3861034
|
Search for single compound, filename should be lowercase
|
2014-04-15 18:49:30 +02:00 |
|
Jip J. Dekker
|
a4dd6e1835
|
Made logging work
|
2014-04-14 21:31:20 +02:00 |
|
Jip J. Dekker
|
2ad33080c6
|
First setup of the CLI, decided on a structure
|
2014-04-14 20:45:07 +02:00 |
|
Jip J. Dekker
|
ee01e697d3
|
Added Docopt as an CLI framework
|
2014-04-14 20:21:41 +02:00 |
|
Jip J. Dekker
|
debbc5e62a
|
Merge branch 'hotfix/none-requests' into develop
|
2014-04-08 11:44:42 +02:00 |
|
Jip J. Dekker
|
622dd4ad00
|
Small fix to ensure unique classes and load all parsers
|
2014-04-08 11:43:32 +02:00 |
|
Jip J. Dekker
|
da17a149c0
|
Spider is now able to handle none-request from parsers while handling new
compounds
|
2014-04-08 11:42:43 +02:00 |
|
Jip J. Dekker
|
3a074467e6
|
Merge branch 'hotfix/No_TABs' into develop
|
2014-04-02 14:22:13 +02:00 |
|
Jip J. Dekker
|
9805bb5adb
|
Merge branch 'hotfix/No_TABs'
|
2014-04-02 14:21:34 +02:00 |
|
Jip J. Dekker
|
f6981057df
|
Changed everything to spaces
|
2014-04-02 14:20:05 +02:00 |
|
Jip J. Dekker
|
595f0253e2
|
Merge branch 'release/v0.0.1' into develop
|
2014-04-01 21:44:31 +02:00 |
|
Jip J. Dekker
|
254e8db3aa
|
Merge branch 'release/v0.0.1'
v0.0.1
|
2014-04-01 21:44:08 +02:00 |
|
Jip J. Dekker
|
c9e09f8ab9
|
Added an version message
|
2014-04-01 21:42:54 +02:00 |
|
Jip J. Dekker
|
2e8017c590
|
Merge branch 'feature/parsing-scheme' into develop
|
2014-04-01 21:40:26 +02:00 |
|
Jip J. Dekker
|
7bc160f676
|
The spider is now able to start using the synonym generator
|
2014-04-01 21:38:11 +02:00 |
|
Jip J. Dekker
|
cd421cc2fb
|
Replaced literal for testing with a variable fix.
|
2014-04-01 21:24:04 +02:00 |
|
Jip J. Dekker
|
0bf2d102c6
|
Fixed parser importation, so it doesn't import imported classes.
|
2014-04-01 21:21:30 +02:00 |
|
Jip J. Dekker
|
683f8c09d4
|
Quick fix, python errors
|
2014-04-01 21:12:54 +02:00 |
|
Jip J. Dekker
|
f93dc2d160
|
Added an structure to get requests for all websites for a new synonym
|
2014-04-01 21:07:36 +02:00 |
|
Jip J. Dekker
|
e39ed3b681
|
Added a way for parsers to access the spider.
|
2014-04-01 20:56:32 +02:00 |
|
Jip J. Dekker
|
4d9e5307bf
|
Written an loader for all parsers in the parser directory.
|
2014-03-31 00:48:45 +02:00 |
|
Jip J. Dekker
|
0cc1b23353
|
Added the functionality to add parsers and automatically use them.
|
2014-03-30 23:37:42 +02:00 |
|
Jip J. Dekker
|
6e2df64fe4
|
Merge branch 'hotfix/spider-import-error' into develop
|
2014-03-30 23:08:14 +02:00 |
|
Jip J. Dekker
|
a6d3d4a716
|
Merge branch 'hotfix/spider-import-error'
spider-import-error
|
2014-03-30 23:07:52 +02:00 |
|
Jip J. Dekker
|
14c27458fc
|
Fixed an import error
|
2014-03-30 23:07:28 +02:00 |
|
Jip J. Dekker
|
e0556bbf16
|
Merge branch 'release/basic-scraper-structure'
basic-scraper-structure
|
2014-03-30 22:16:13 +02:00 |
|
Jip J. Dekker
|
e210ce8558
|
Merge branch 'develop', remote-tracking branch 'origin/develop' into develop
|
2014-03-30 22:08:21 +02:00 |
|
Jip J. Dekker
|
6bbee865c4
|
Merge branch 'feature/basic-structure' into develop
|
2014-03-28 14:46:43 +01:00 |
|
Jip J. Dekker
|
1e730e77ce
|
Merge branch 'feature/basic-structure' of code.giphouse.nl:giphouse/descartes-2 into feature/basic-structure
|
2014-03-28 14:44:29 +01:00 |
|
Jip J. Dekker
|
32cedecf2e
|
Added an basic parser class to extend, next step implementing the global function
|
2014-03-28 14:44:17 +01:00 |
|
Jip J. Dekker
|
325febe834
|
Added an basic parser class to extend, next step implementing the global function
|
2014-03-28 14:43:22 +01:00 |
|
Jip J. Dekker
|
d91706d6e5
|
The script should stop sometime, added a stopping signal
|
2014-03-28 14:14:39 +01:00 |
|
Jip J. Dekker
|
87d1041517
|
Made all Python files PEP-8 Compatible
|
2014-03-28 14:11:36 +01:00 |
|
Jip J. Dekker
|
5b17627504
|
The parsers however could use their own folder
|
2014-03-27 13:23:03 +01:00 |
|
Jip J. Dekker
|
8e9314e753
|
One spider should have it's own folder
|
2014-03-27 13:18:55 +01:00 |
|
Jip J. Dekker
|
bdcf359da7
|
Logical fixes to have some "working" case
|
2014-03-27 13:12:27 +01:00 |
|
Jip J. Dekker
|
8175e02f6c
|
New Structure, splitting on parsers instead of Spiders
|
2014-03-27 13:08:46 +01:00 |
|
Jip J. Dekker
|
306a37db1a
|
A better structure which is able to start multiple spiders.
|
2014-03-22 15:48:08 +01:00 |
|