Training Docker Image and Language Model Creation #3153

gokgozf · 2020-07-14T00:47:51Z

I really like the idea of separating the training, and build images though i have a doubt.

In my opinion being able to easily generate a new language model and being able to test is a great opportunity for training image which i believe is neglected in the image by not adding the following statements and some dependency installations

If it is aligned with your expectations as well, i can provide quick pull request on that

# Allow Python printing utf-8
ENV PYTHONIOENCODING UTF-8

# Build KenLM in /DeepSpeech/native_client/kenlm folder
WORKDIR /DeepSpeech/native_client
RUN rm -rf kenlm && \
	git clone https://github.com/kpu/kenlm && \
	cd kenlm && \
	git checkout 87e85e66c99ceff1fab2500a7c60c01da7315eec && \
	mkdir -p build && \
	cd build && \
	cmake .. && \
	make -j $(nproc)

The text was updated successfully, but these errors were encountered:

DanBmh · 2020-07-14T18:52:18Z

What version are you using?
I already added the KenLM building part some time ago to the Dockerfile.train.tmpl file.

lissyx · 2020-07-15T09:45:48Z

by not adding the following statements and some dependency installations

As @DanBmh said, we have that now. Can you please be clear in your wording? I'm not a big fan of mind reading, so "some dependency" is not really helpful.

lissyx · 2020-07-21T14:41:52Z

So @gokgozf can you elaborate explicitely on what is needed ? The only part of code you pasted is already there...

DanBmh · 2020-07-21T17:23:01Z

@lissyx Does scorer packaging still work? I've seen that the py file was replaced by another script with extra installation steps, but didn't test it yet.

lissyx · 2020-07-21T17:34:21Z

@lissyx Does scorer packaging still work?

In the dockerfile? it's possible we don't take care of that yet

lissyx · 2020-07-27T13:47:10Z

Please @gokgozf ? Can you elaborate on what you miss?

kdavis-mozilla · 2020-07-27T14:25:15Z

@gokgozf In order to help us help you, could you elaborate on what's missing?

lissyx · 2020-09-09T08:15:19Z

Without more information and no feedback, I'm closing this bug. Please reopen / send PR if you need to.

lissyx added the waiting-on-reporter Waitiing on more informations from reporter label Jul 21, 2020

lissyx closed this as completed Sep 9, 2020

lissyx removed the waiting-on-reporter Waitiing on more informations from reporter label Sep 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training Docker Image and Language Model Creation #3153

Training Docker Image and Language Model Creation #3153

gokgozf commented Jul 14, 2020

DanBmh commented Jul 14, 2020

lissyx commented Jul 15, 2020

lissyx commented Jul 21, 2020

DanBmh commented Jul 21, 2020

lissyx commented Jul 21, 2020

lissyx commented Jul 27, 2020

kdavis-mozilla commented Jul 27, 2020

lissyx commented Sep 9, 2020

Training Docker Image and Language Model Creation #3153

Training Docker Image and Language Model Creation #3153

Comments

gokgozf commented Jul 14, 2020

DanBmh commented Jul 14, 2020

lissyx commented Jul 15, 2020

lissyx commented Jul 21, 2020

DanBmh commented Jul 21, 2020

lissyx commented Jul 21, 2020

lissyx commented Jul 27, 2020

kdavis-mozilla commented Jul 27, 2020

lissyx commented Sep 9, 2020