NMF optimization & documentation #2361

Merged 201 commits on Jan 31, 2019
343e46f
Implement first version of the algorithm
anotherbugmaster Mar 29, 2018
3171be3
Fix variable names
anotherbugmaster Mar 30, 2018
bd325bc
Add support for streaming corpora
anotherbugmaster Apr 2, 2018
19b3ba4
Add benchmark
anotherbugmaster Apr 2, 2018
9e52399
Fix bugs, introduce batches, add images to the benchmark notebook
anotherbugmaster Apr 15, 2018
c54fc92
Update notebook
anotherbugmaster Apr 22, 2018
6dc9d3e
Improve model
anotherbugmaster Apr 22, 2018
0554b7b
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Apr 22, 2018
5f4b3d3
Add show topics, change API
anotherbugmaster Apr 23, 2018
52fc956
Add more LDA-like API
anotherbugmaster Apr 23, 2018
ddebcf0
Fix logger name
anotherbugmaster Apr 23, 2018
6d0a1b3
Add more LDA API
anotherbugmaster Apr 23, 2018
cf430fc
Remove redundant method
anotherbugmaster Apr 23, 2018
df5a6e9
Remove commented out lines
anotherbugmaster Apr 23, 2018
25080b4
Fix flakes
anotherbugmaster Apr 23, 2018
83b1a6b
Cythonize
anotherbugmaster May 2, 2018
7f27f52
Dramatically improve performance
anotherbugmaster May 22, 2018
405e12f
Add parameters, improve accuracy and speed
anotherbugmaster Jun 2, 2018
7b45b23
Remove redundant W copying
anotherbugmaster Jun 5, 2018
a154a6e
Fix random seed again
anotherbugmaster Jun 5, 2018
e82628d
Optimize E/M step
anotherbugmaster Jun 12, 2018
1ca33f8
Add an eval_every option, use softmax for normalization
anotherbugmaster Jun 13, 2018
f19e6ce
Fixes
anotherbugmaster Jun 13, 2018
583cb15
Improve notebook examples a bit
anotherbugmaster Jun 13, 2018
fe0ab0a
Fix eval_every
anotherbugmaster Jun 13, 2018
8e647a1
Return outliers
anotherbugmaster Jun 16, 2018
89cc803
Optimizations
anotherbugmaster Jun 16, 2018
bbd3099
Experimenting with loss
anotherbugmaster Jun 16, 2018
f71ad89
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Aug 14, 2018
936e629
Fix PEP8
anotherbugmaster Aug 14, 2018
1c3a064
Return nmf import
anotherbugmaster Aug 14, 2018
ce4b7ee
Revert "Return nmf import"
anotherbugmaster Aug 20, 2018
f8de1d9
Fix
anotherbugmaster Aug 27, 2018
df9b8c7
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Aug 27, 2018
d159779
Fix minimum_probability & info -> debug logs
anotherbugmaster Aug 27, 2018
3dcdedc
Compute metrics
anotherbugmaster Aug 27, 2018
f11f2e2
Count error on-the-fly
anotherbugmaster Aug 28, 2018
8216541
Speed optimizations, changed error functions
anotherbugmaster Aug 28, 2018
ee3a7c7
Beat LDA
anotherbugmaster Aug 28, 2018
a3315f2
Outperform sklearn in speed (WTF)
anotherbugmaster Aug 28, 2018
3a03ff9
Remove redundant arg
anotherbugmaster Aug 28, 2018
70619e1
Add Olivetti faces
anotherbugmaster Aug 28, 2018
8c47ce0
Remove redundant code
anotherbugmaster Aug 28, 2018
e291664
Add Topics
anotherbugmaster Aug 28, 2018
3302b92
Make it pretty
anotherbugmaster Aug 28, 2018
5616bd6
Fix wrapper
anotherbugmaster Aug 28, 2018
ed8f29f
Save corpus & dict, minor fixes
anotherbugmaster Aug 30, 2018
2117c90
Add RandomCorpus
anotherbugmaster Aug 31, 2018
950115d
Dense -> sparse
anotherbugmaster Aug 31, 2018
54993c6
First doc2dense
anotherbugmaster Aug 31, 2018
572dc6c
Fix csc again
anotherbugmaster Aug 31, 2018
d40d89f
Fix len
anotherbugmaster Aug 31, 2018
7a3ef47
Experimenting
anotherbugmaster Sep 12, 2018
f94de09
Revert "Experimenting"
anotherbugmaster Sep 12, 2018
9ed2167
Fix evaluation
anotherbugmaster Sep 12, 2018
ad9443f
Sparse speedup
anotherbugmaster Sep 23, 2018
1a04660
Improve performance
anotherbugmaster Sep 25, 2018
87981bf
Divide A and B again
anotherbugmaster Sep 25, 2018
0b314c7
Fix A and B computation bug
anotherbugmaster Sep 25, 2018
b024dd6
Sparsify W init
anotherbugmaster Sep 25, 2018
35d5406
Experimenting
anotherbugmaster Sep 25, 2018
74acb37
New norm
anotherbugmaster Sep 25, 2018
8b28675
Sparse threshold -> sparse coefficient
anotherbugmaster Sep 25, 2018
588ef6a
Optimize residuals computation
anotherbugmaster Sep 26, 2018
8f84758
Fix residuals bug
anotherbugmaster Sep 26, 2018
8a67c44
W speedup
anotherbugmaster Sep 26, 2018
560f2bf
Experiment
anotherbugmaster Sep 26, 2018
cac2590
Revert changes a bit
anotherbugmaster Sep 26, 2018
060ab28
Fix corpus
anotherbugmaster Sep 26, 2018
cde937f
Fix init error
anotherbugmaster Sep 26, 2018
66b753f
Merge branch 'online_nmf' of github.com:anotherbugmaster/gensim into …
anotherbugmaster Sep 26, 2018
18dbb6b
Resolve conflict
anotherbugmaster Sep 26, 2018
4b49d26
Fix corpus iteration issue
anotherbugmaster Sep 26, 2018
9c6cbc6
Switch to numpy algos
anotherbugmaster Oct 7, 2018
b23d016
Merge upstream
anotherbugmaster Oct 7, 2018
74ba37d
Train on wikipedia
anotherbugmaster Oct 7, 2018
c943264
Sparse coef -> density. More stable way to sparsify W matrix
anotherbugmaster Oct 9, 2018
a489807
Merge branch 'online_nmf' of github.com:anotherbugmaster/gensim into …
anotherbugmaster Oct 9, 2018
a95e345
Return old sparse algo
anotherbugmaster Oct 9, 2018
0f90484
Max
anotherbugmaster Oct 9, 2018
6ae43e4
Optimizations
anotherbugmaster Oct 10, 2018
335170b
Fix A and B computation
anotherbugmaster Oct 10, 2018
4cc8f1b
Fix A and B normalization
anotherbugmaster Oct 10, 2018
5c6fe60
Add random_state
anotherbugmaster Oct 23, 2018
dd459a2
Infer id2word
anotherbugmaster Oct 23, 2018
5121d85
Fix tests
anotherbugmaster Nov 6, 2018
5f4018a
Document __init__
anotherbugmaster Nov 14, 2018
dbd8474
Document whole nmf
anotherbugmaster Nov 14, 2018
5904f10
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Nov 14, 2018
cd4b9b0
Remove unnecessary comments
anotherbugmaster Nov 14, 2018
53a02a9
Add tutorial notebook
anotherbugmaster Nov 14, 2018
937e340
Document __init__
anotherbugmaster Nov 20, 2018
26a87bd
Fix flake version
anotherbugmaster Nov 28, 2018
261c13a
Fix flake warning
anotherbugmaster Nov 28, 2018
0147afc
Remove comments, reverse parallelization order
anotherbugmaster Nov 28, 2018
1ece3c1
Add NMF's cython extension to setup.py
anotherbugmaster Nov 28, 2018
e6409fa
Fix imports, add solve_r function
anotherbugmaster Nov 28, 2018
0743624
Remove comments
anotherbugmaster Nov 28, 2018
fd8088b
Add docstrings
anotherbugmaster Nov 28, 2018
e4ba0de
Common corpus and common dictionary
anotherbugmaster Nov 28, 2018
8537eef
Remove redundant test
anotherbugmaster Nov 28, 2018
d2e8385
Add signature flag
anotherbugmaster Nov 28, 2018
b72bf39
Add files to manifest
anotherbugmaster Nov 28, 2018
ed080a3
Fix flake8
anotherbugmaster Nov 29, 2018
67f6e75
Fix atol value
anotherbugmaster Nov 29, 2018
ee4373d
Implement top topics
anotherbugmaster Nov 29, 2018
d01c88c
Add rst files
anotherbugmaster Dec 10, 2018
8111080
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Dec 11, 2018
3de3646
Fix appveyor issue
anotherbugmaster Dec 11, 2018
183ea2d
Fix cython error
anotherbugmaster Dec 11, 2018
d2ac199
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Dec 12, 2018
2d664c6
Fix fmax/fmin not being on win-python27
anotherbugmaster Dec 12, 2018
c9a3577
Add word transformation test
anotherbugmaster Dec 12, 2018
fd0de20
Improve readability of residuals computation
anotherbugmaster Dec 21, 2018
fa384f2
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Dec 21, 2018
a811c67
Fix tests
anotherbugmaster Dec 21, 2018
d063a4f
A few fixes
anotherbugmaster Dec 21, 2018
b8f5d79
Blank line at the end of each docstring
anotherbugmaster Dec 21, 2018
361d160
Add blank line
anotherbugmaster Dec 21, 2018
e214582
Add the paper reference
anotherbugmaster Dec 21, 2018
9527f39
Fix long line
anotherbugmaster Dec 21, 2018
e1e1168
Add log_perplexity
anotherbugmaster Dec 30, 2018
3bf5be3
Merge remote-tracking branch 'remotes/upstream/develop' into online_nmf
anotherbugmaster Jan 7, 2019
d1c6e3e
Add NMF and LDA comparison table
anotherbugmaster Jan 9, 2019
7927b6b
Change the sign of log perplexity
anotherbugmaster Jan 9, 2019
1c6517e
Add Sklearn NMF comparison
anotherbugmaster Jan 9, 2019
278fb05
Merge sklearn and tm tables
anotherbugmaster Jan 9, 2019
a330327
Add F1
anotherbugmaster Jan 10, 2019
7ba9b84
Remove _solve_r
anotherbugmaster Jan 10, 2019
a14bfd3
Merge tutorial and benchmark
anotherbugmaster Jan 10, 2019
d28aef3
Indentation's back
anotherbugmaster Jan 10, 2019
83ec0f6
Optimize optimizers
anotherbugmaster Jan 10, 2019
d25332f
Remove unnecessary pic
anotherbugmaster Jan 10, 2019
0e711d9
Optimize memory consumption
anotherbugmaster Jan 10, 2019
cc3085c
Add docstring
anotherbugmaster Jan 10, 2019
b090b6b
Optimize get_topic_words
anotherbugmaster Jan 10, 2019
e05a1c6
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Jan 10, 2019
ba8ce1c
Fix tests
anotherbugmaster Jan 10, 2019
6d78f83
Fix flake8
anotherbugmaster Jan 10, 2019
b16c1dd
Add missing test
anotherbugmaster Jan 11, 2019
7c1e240
Code review fixes
anotherbugmaster Jan 11, 2019
667ae99
n_tokens -> num_tokens
anotherbugmaster Jan 11, 2019
251d5f9
[skip ci] Add explicit normalize parameter
anotherbugmaster Jan 11, 2019
7a3f358
[skip ci] Add explicit normalize parameter[2]
anotherbugmaster Jan 11, 2019
c663f33
[skip ci] Update tutorial notebook
anotherbugmaster Jan 11, 2019
8e15cd4
[skip ci] [WIP] Update wikipedia notebook
anotherbugmaster Jan 11, 2019
b16e108
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Jan 11, 2019
3c76171
Merge branch 'online_nmf' of github.com:anotherbugmaster/gensim into …
anotherbugmaster Jan 15, 2019
4941745
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Jan 15, 2019
c4d6ebd
Add more description and metrics
anotherbugmaster Jan 15, 2019
3b1195d
[skip ci] Fix log_probabiliy
anotherbugmaster Jan 15, 2019
5edec1b
Multiple format fixes in notebook, outputs cleared til tomorrow
anotherbugmaster Jan 15, 2019
33ce1a3
Merge remote-tracking branch 'upstream/develop' into online_nmf
menshikh-iv Jan 16, 2019
1806bf6
Train on full corpus
anotherbugmaster Jan 16, 2019
3b9b8ea
Merge branch 'online_nmf' of github.com:anotherbugmaster/gensim into …
anotherbugmaster Jan 16, 2019
3f1af1d
[skip ci] Remove disclaimer
anotherbugmaster Jan 16, 2019
38143a9
Add RAM usage stats
anotherbugmaster Jan 16, 2019
72a02db
Native 20-newsgroups and additional text
anotherbugmaster Jan 16, 2019
7cf80e1
Truncate outputs
anotherbugmaster Jan 17, 2019
72178c0
Merge remote-tracking branch 'upstream/develop' into online_nmf
anotherbugmaster Jan 17, 2019
467a2ad
Fix last cell formatting
anotherbugmaster Jan 17, 2019
e34b939
[skip ci] Change model hyperparameters back
anotherbugmaster Jan 17, 2019
08b74c4
Merge from upstream
anotherbugmaster Jan 19, 2019
a270557
[skip ci] Add module docstring
anotherbugmaster Jan 28, 2019
6e5b288
Merge remote-tracking branch 'upstream/develop' into nmf_docs
anotherbugmaster Jan 28, 2019
96bb9c9
Merge branch 'online_nmf' of github.com:anotherbugmaster/gensim into …
anotherbugmaster Jan 28, 2019
24c999f
[skip ci] Massive speedups
anotherbugmaster Jan 29, 2019
28b6fa5
Merge branch 'nmf_docs' into nmf_speedups
anotherbugmaster Jan 29, 2019
69aba02
Checkout nmf_wikipedia from develop
anotherbugmaster Jan 29, 2019
d7b29b0
Fix tests
anotherbugmaster Jan 29, 2019
76d4483
Fix corpus description
anotherbugmaster Jan 29, 2019
effa895
Add components permutation to coordinate descent
anotherbugmaster Jan 29, 2019
bfbdc78
Fix tests
anotherbugmaster Jan 29, 2019
f6a0a28
Fix dictionary highlight
anotherbugmaster Jan 29, 2019
377d3b6
Fix tests again
anotherbugmaster Jan 29, 2019
83ea5ad
Remove r, it's not used for the time being
anotherbugmaster Jan 30, 2019
f07cafc
Deprecate use_r
anotherbugmaster Jan 30, 2019
273bd04
[skip ci] Rearrange params
anotherbugmaster Jan 30, 2019
d7a94d5
[skip ci] Add disclaimer about `r`
anotherbugmaster Jan 30, 2019
83bc01b
Fix `normalize` and `minimum_probability` docstring
anotherbugmaster Jan 30, 2019
be65d34
Remove unused params
anotherbugmaster Jan 30, 2019
001cc1b
Add csc support
anotherbugmaster Jan 30, 2019
b3ea8ba
Add examples to the docstring
anotherbugmaster Jan 30, 2019
4ae9626
Update tutorial notebook
anotherbugmaster Jan 30, 2019
7458fa5
[skip ci] Update tutorial again
anotherbugmaster Jan 30, 2019
e477cab
[skip ci] Merge remote-tracking branch 'upstream/develop' into nmf_sp…
menshikh-iv Jan 30, 2019
e873434
[skip ci] fix PEP
menshikh-iv Jan 30, 2019
fd16b38
cast explicitly permutations to int32
menshikh-iv Jan 30, 2019
f116b69
[skip ci] Fix a typo
anotherbugmaster Jan 30, 2019
6095a96
Merge remote-tracking branch 'origin/nmf_speedups' into nmf_speedups
anotherbugmaster Jan 30, 2019
52f80fc
[skip ci] Remove clip and fix error count in update
anotherbugmaster Jan 30, 2019
b144de7
[skip ci] Fix error computation
anotherbugmaster Jan 30, 2019
abf3239
[skip ci] Fix error counting again
anotherbugmaster Jan 30, 2019
0948c85
[skip ci] Remove redundant imports
anotherbugmaster Jan 30, 2019
7e2782e
Fix grouper for csc matrices
anotherbugmaster Jan 30, 2019
e213d7e
Fix module docstring
anotherbugmaster Jan 30, 2019
4bb6f9f
Fix training corpus description
anotherbugmaster Jan 30, 2019
700bc36
Fix pep8
anotherbugmaster Jan 30, 2019
0fc38c4
Fix flake8 for real
anotherbugmaster Jan 30, 2019
f77873e
Normalize, sparsity and dictionary fixes
anotherbugmaster Jan 30, 2019
db62b49
Updated module docstring in the notebook
anotherbugmaster Jan 31, 2019