Laura Stewart
|
63b95c3519
new imports, web feeder skeleton
|
8 年之前 |
Laura Stewart
|
d6518c38d5
Refactoring
|
8 年之前 |
Laura Stewart
|
09c4aac3ab
Moving functions (and tests) to feeder
|
8 年之前 |
Laura Stewart
|
13dfaea9da
Separating items related to gathering text samples from analysis.
|
8 年之前 |
Laura Stewart
|
746b507c85
Struct class test
|
8 年之前 |
Laura Stewart
|
6e7107b490
Being sure not to pass pyenchant empty strings.
|
8 年之前 |
Laura Stewart
|
ed6622470c
Removing print which was used for debutting purposes.
|
8 年之前 |
Laura Stewart
|
887beb0096
Source of error was PyEnchant not liking null strings. Have made this unecessary with a more graceful exit in a fork of PyEnchant, but fixing it in this code here instead/for now.
|
8 年之前 |
Laura Stewart
|
c983203fb5
Removed empty findMostUsedWords function (ended up integrating this into findRepeatWords as paramater).
|
8 年之前 |
Laura Stewart
|
a0e1cf554f
Unecessary multi-step test.
|
8 年之前 |
Laura Stewart
|
61cd2d11d4
Changing data object to a pseudo-struct
|
8 年之前 |
Laura Stewart
|
3e8829ba47
Starting to refactor into class structure which will be necessary for statistical comparisons. Currently awkward.
|
8 年之前 |
Laura Stewart
|
6c83630c37
Function to return list of frequencies for repeat words
|
8 年之前 |
Laura Stewart
|
611dad60b8
findRepeatWords also can specify a max number of words returned.
|
8 年之前 |
Laura Stewart
|
fb8d7dac43
FindRepeatWords function and test.
|
8 年之前 |
Laura Stewart
|
b970ed2f78
Regenerated frequency list allowing ', adjusted test to reflect new word rank.
|
8 年之前 |
Laura Stewart
|
48ed5c0f52
HowCommonIs test now functional
|
8 年之前 |
Laura Stewart
|
5973d9d06e
HowCommonIs funct now functional
|
8 年之前 |
Laura Stewart
|
19b808ee00
Word frequency funct
|
8 年之前 |
Laura Stewart
|
fb87a68666
Allow words with '
|
8 年之前 |
Laura Stewart
|
6e0b7b04e1
ngram parser test
|
8 年之前 |
Laura Stewart
|
f6035672a6
Modified mock data to test words with '
|
8 年之前 |
Laura Stewart
|
e09d0ba45f
Working on parser test with parser refactor.
|
8 年之前 |
Laura Stewart
|
1aa97ed010
Refactoring
|
8 年之前 |
Laura Stewart
|
c81f265230
Frequency list generated from data parsing script
|
8 年之前 |
Laura Stewart
|
efddd97896
ngram data mock file
|
8 年之前 |
Laura Stewart
|
810052be38
ngram parser test
|
8 年之前 |
Laura Stewart
|
d0540d64dc
Data parser for google n-gram formatted data.
|
8 年之前 |
Laura Stewart
|
27d306a8d7
Add Google frequency word list using only books published since 965
|
8 年之前 |
Laura Stewart
|
63f381087b
Unit test for lexical density
|
8 年之前 |