Issue 499233003: Binding media stream audio track to speech recognition [renderer]

burnik

Please review my CL. :-)

6 years, 4 months ago (2014-08-25 14:14:42 UTC) #1

henrika (OOO until Aug 14)

This is a very large CL adding about 400 lines of code. Any design doc ...

6 years, 4 months ago (2014-08-25 14:24:48 UTC) #2

no longer working on chromium

I would like to see an updated version of CL which uses sync socket + ...

6 years, 4 months ago (2014-08-25 14:38:09 UTC) #3

tommi (sloooow) - chröme

First review round. Neatly written code. https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_recognition_audio_source_provider.cc File content/renderer/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_recognition_audio_source_provider.cc#newcode22 content/renderer/speech_recognition_audio_source_provider.cc:22: ) put on ...

6 years, 4 months ago (2014-08-25 14:38:46 UTC) #4

henrika (OOO until Aug 14)

Agree, looks really good! Just some nits for now; see that Tommi added some comments ...

6 years, 4 months ago (2014-08-25 14:46:00 UTC) #5

burnik

SyncSocket implementation. Updating design doc & working on unit test. https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_recognition_audio_source_provider.cc File content/renderer/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_recognition_audio_source_provider.cc#newcode22 ...

6 years, 3 months ago (2014-08-29 09:18:16 UTC) #6

SyncSocket implementation. Updating design doc & working on unit test.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:22: )
On 2014/08/25 14:38:40, tommi wrote:
> put on previous line?

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:25:
output_params_(params),
I think so. AudioConverter marks them const. I'm only reading them here.
On 2014/08/25 14:38:45, tommi wrote:
> should output_params_ be const?

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:31:
DCHECK(shared_memory_.Map(memory_length));
On 2014/08/25 14:38:45, tommi wrote:
> I think this is a bug... DCHECK()ed code isn't included in release builds, so
> Map() will never run.

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:42:
SpeechRecognitionAudioSourceProvider::~SpeechRecognitionAudioSourceProvider() {
On 2014/08/25 14:38:45, tommi wrote:
> missing thread check for dtor

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:57:
DCHECK(input_params.IsValid());
I believe previous state of input_params_ does not matter. We overwrite it (or
write it for the first time) when this event occurs.
On 2014/08/25 14:38:45, tommi wrote:
> would it make sense to assert that input_params_ (member variable) has not
been
> set when we get here?

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:68:
DCHECK_EQ(0,output_params_.frames_per_buffer() * input_params_.sample_rate() %
On 2014/08/25 14:38:45, tommi wrote:
> space after ,

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:71:
input_params_.sample_rate() / output_params_.sample_rate();
On 2014/08/25 14:38:44, tommi wrote:
> 4 spaces

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:80: void
SpeechRecognitionAudioSourceProvider::OnReadyStateChanged(
On 2014/08/25 14:38:45, tommi wrote:
> thread check here?

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:83: track_stopped_
= true;
On 2014/08/25 14:38:40, tommi wrote:
> would it make sense to add
> 
> else
>   DCHECK(!track_stopped_);
> 
> ?

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:86: void
SpeechRecognitionAudioSourceProvider::OnData(
capturer thread. Done.
On 2014/08/25 14:38:45, tommi wrote:
> on which thread does this function run?

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:91: 
On 2014/08/25 14:38:45, tommi wrote:
> remove empty line

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:94: if
(fifo_->frames() + number_of_frames > fifo_->max_frames()) {
On 2014/08/25 14:38:45, tommi wrote:
> it's not clear to me if fifo_ needs protection etc.  I think it would be good
if
> every method would have a thread check so that the threading model can be
easily
> understood and verified.

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:103:
DCHECK(capture_thread_checker_.CalledOnValidThread());
On 2014/08/25 14:38:40, tommi wrote:
> ah... this should be at the top of the function.  As is, you don't run this
> check if you get into the if() statement above.

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:126: // make sure
the previous output buffer was consumed by the client
This was only intended for protecting unconsumed_audio_buffers_. That's because
the NotifyAudioBusConsumed() was called on the main render thread, and OnData on
the capture thread. This event model has been removed in the next patchset.
On 2014/08/25 14:38:45, tommi wrote:
> why is it safe to touch the above member variables without the lock but not
the
> ones below? (some documentation would be good to have to explain - I may have
> missed it too, so just point me to it if that's the case)

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:141:
on_data_callback_.Run();
Locks removed from the design.
On 2014/08/25 14:38:45, tommi wrote:
> if there's a way to avoid holding the lock when firing this callback, then
that
> would be good.  I think that on_data_callback_ is const and won't ever change
> throughout the lifetime of |this|, so we likely do not need to hold the lock
> here.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.h (right):

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:51: //
MediaStreamAudioSink implementation.
On 2014/08/25 14:38:46, tommi wrote:
> Do these implementations need to be a part of the public interface of
> SpeechRecognitionAudioSourceProvider?
> If not (i.e. if they are an implementation detail of
> SpeechRecognitionAudioSourceProvider), then I'd like to make these protected
or
> private.

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:68: virtual void
NotifyAudioBusConsumed();
Removed due to sync_socket.
On 2014/08/25 14:38:46, tommi wrote:
> OVERRIDE?

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:71: const int
kNumberOfBuffersInFifo = 2;
On 2014/08/25 14:38:45, tommi wrote:
> static?

Done.

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:80:
base::SharedMemory shared_memory_;
On 2014/08/25 14:46:00, henrika wrote:
> Could you add more information about what these members do?

Done.

tommi (sloooow) - chröme

https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_recognition_audio_source_provider.cc File content/renderer/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/20001/content/renderer/speech_recognition_audio_source_provider.cc#newcode25 content/renderer/speech_recognition_audio_source_provider.cc:25: output_params_(params), On 2014/08/29 09:18:16, burnik wrote: > I think ...

6 years, 3 months ago (2014-08-29 11:25:32 UTC) #7

no longer working on chromium

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket.h File base/native_sync_socket.h (right): https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket.h#newcode18 base/native_sync_socket.h:18: class NativeSyncSocket { You and I should have a ...

6 years, 3 months ago (2014-08-29 12:23:07 UTC) #8

henrika (OOO until Aug 14)

Seems like you have a large set of comments and I don't want to add ...

6 years, 3 months ago (2014-08-29 12:28:55 UTC) #9

burnik

All good comments. Some discusssion is required to resolve a few questions. https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket.h File base/native_sync_socket.h ...

6 years, 3 months ago (2014-08-29 13:26:18 UTC) #10

All good comments. Some discusssion is required to resolve a few questions.

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket.h
File base/native_sync_socket.h (right):

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket....
base/native_sync_socket.h:18: class NativeSyncSocket {
I agree. SyncSocket should have a descriptor which is cross-platform to avoid
these checks (as it's done in multiple places where code relies on using the
|SyncSocket|). Also should provide a method to prepare it for transit.
On 2014/08/29 11:25:30, tommi wrote:
> This class seems to rely on SyncSocket but doesn't provide any non-static
> functionality (i.e. it appears to be more of a namespace).  Why wouldn't we
> simply add these methods to SyncSocket instead of introducing new files for
> these very basic helpers?

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket....
base/native_sync_socket.h:18: class NativeSyncSocket {
Agreed. Socket should be more cross-platform friendly.
On 2014/08/29 12:23:06, xians1 wrote:
> You and I should have a discussion with Tommi on if we should have this new
> class in base/ or not.

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket....
base/native_sync_socket.h:23: static bool PrepareForeignSocketDescriptor(
Agreed. I think name should be something like |PreparePeerSocketDescriptor|. 

On 2014/08/29 11:25:30, tommi wrote:
> I know that this method name comes from elsewhere, but if we're moving it into
> base/ I think we should come up with a more descriptive name that makes it
clear
> that handover from one process to another is taking place.

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket....
base/native_sync_socket.h:23: static bool PrepareForeignSocketDescriptor(
On 2014/08/29 11:25:30, tommi wrote:
> implementation should be in the cc file

Acknowledged.

https://codereview.chromium.org/499233003/diff/60001/base/native_sync_socket....
base/native_sync_socket.h:49: static int Unwrap(const Descriptor& descriptor) {
Good point. When impl moves to .cc it will make more sense.
On 2014/08/29 11:25:30, tommi wrote:
> use base::SyncSocket::Handle here as well as the return value.
> 
> The implementation should be in the .cc file and once you've fixed the return
> type, the #if defined() check can be inside the function and we can avoid
having
> multiple declarations of the function.
> I also think that this could cause compiler warnings (which could break the
> build) at some higher levels since although Handle might be typedefed as an
int,
> it's still a specific type.  Calling code should ideally not have to do the
#if
> checks.

https://codereview.chromium.org/499233003/diff/60001/content/common/speech_re...
File content/common/speech_recognition_messages.h (right):

https://codereview.chromium.org/499233003/diff/60001/content/common/speech_re...
content/common/speech_recognition_messages.h:130:
IPC_MESSAGE_ROUTED5(SpeechRecognitionMsg_AudioTrackReady,
I think it would be ok if we had |base::SyncSocket::SharedDescriptor|. It would
make more sense and indicate that this descriptor is to be used by the peer
process.
On 2014/08/29 11:25:30, tommi wrote:
> is this approach the same as what we do elsewhere where we use SyncSocket? (I
> ask since NativeSyncSocket is a new class but we've been using SyncSocket for
a
> while)

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:32: DLOG(ERROR) <<
"Could not map the shared memory";
Done. Although I will revisit this when I do more work on errors. 
On 2014/08/29 11:25:31, tommi wrote:
> Would this be a serious enough of an error to just do this?
> 
> CHECK(shared_memory_.Map(memory_length));
> 
> (maybe you could check other places where we call SharedMemory::Map in the
> renderer process)

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:42: if
(audio_converter_.get() && attached_converter_)
Adding the converter on construction will trigger problems with |ProvideInput()|
since we won't have enough data on the FIFO. Therefore I would have to Zero()
out the bus delivering empty data to the browser. I will check if RemoveInput is
necessary or not.

On 2014/08/29 12:23:06, xians1 wrote:
> you are complicating things, you should just add the input to the converter
when
> you construct the converter and you probably don't even need to call
> RemoveInput() in the destructor since the object is going away.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:50: bool
SpeechRecognitionAudioSourceProvider::IsAllowedAudioTrack(
It is static. Done.
On 2014/08/29 11:25:31, tommi wrote:
> this method is static, right?  If so, there should be a comment above
indicating
> that:
> 
> // static
> bool SpeechRecognitionAudioSourceProvider::IsAllowedAudioTrack(
> 
> If it is not static, then it's missing a thread check.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:53:
MediaStreamAudioSource* native_source =
Here I check if it's WebAudio.
On 2014/08/29 12:23:06, xians1 wrote:
> you need to check if the track is local or not, if it is a remote track, it
> might not have MediaStreamAudioSource

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:54: static_cast
<MediaStreamAudioSource*>(track.source().extraData());
On 2014/08/29 11:25:30, tommi wrote:
> no space after static_cast

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:55:
StreamDeviceInfo device_info = native_source->device_info();
On 2014/08/29 11:25:30, tommi wrote:
> no need to create a new StreamDeviceInfo instance.  if you need a variable,
just
> use a const reference:
> const StreamDeviceInfo& device_info = native_source->device_info();
> 
> or, just call device_info() inline in expression you return:
> 
> return native_source->device_info().device.type ==
>        content::MEDIA_DEVICE_AUDIO_CAPTURE;

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:56: return
(device_info.device.type == content::MEDIA_DEVICE_AUDIO_CAPTURE);
This is implemented as a response to a suggestion from an e-mail thread saying
WebAudio should not be allowed yet, although it is supported (e.g. if we return
true here, we can feed a WebAudio track). Policy might change or even be moved
from here.
On 2014/08/29 11:25:31, tommi wrote:
> this seems to me to be a 'supported' check rather than an 'allowed' check. 
> "Allowed" usually refers to policy or permission checks.  If this function is
> simply supposed to answer the question whether the implementation supports
this
> type of track, I'd prefer to name it IsAudioTrackSupported() or possibly
> IsTrackTypeSupported().

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:74: new
media::AudioConverter(input_params, output_params_, false));
Same comment regarding the way |ProvideInput()| is called.
On 2014/08/29 12:23:06, xians1 wrote:
> Call AddInput here.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:123:
audio_converter_->AddInput(this);
I might try this out and see what happens.
On 2014/08/29 12:23:06, xians1 wrote:
> I guess these code is workaround to fix AudioConverter calling ProvideInput
> twice at the beginning, I don't really think it is a good idea, instead,
trying
> changing the code a few lines above to:
> if (fifo_->frames() <= fifo_buffer_size_)
>   return;
> 
> And add a comment to explain why you do this.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:133: size_t
bytes_received = socket_.ReceiveWithTimeout(&peer_buffer_index,
On 2014/08/29 12:23:06, xians1 wrote:
> you have remove this ReceiveWithTimeout call, this OnData() is called on the
> real time audio thread, you can't block it. we need to figure out some other
way
> to do the synchronization.

Acknowledged.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:136: if
(bytes_received == 0)
Good point. Done. 
And I'm still considering ways to handle these dangerous situations which could
potentially fill up the FIFO.
On 2014/08/29 11:25:30, tommi wrote:
> should there be a NOTREACHED() in the body of this if statement?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:147: // The send
can fail if the user changes his input audio device
As far as I've tested.
On 2014/08/29 12:23:06, xians1 wrote:
> is it true?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:159: // Consume
queued input frames by passing them to |audio_converter_|
On 2014/08/29 12:23:06, xians1 wrote:
> nit, empty line.

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.h (right):

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:46: // determines
the policy on what types of tracks are allowed
On 2014/08/29 11:25:31, tommi wrote:
> nit: empty line above this comment for readability. ultra nit: Comments start
> with a capital letter and end with a period.

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:67: static const int
kNumberOfBuffersInFifo = 2;
On 2014/08/29 12:23:07, xians1 wrote:
> move it to the implementation.

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:111: // We attach
the resampler once we have enough data in FIFO and not before.
Done.
On 2014/08/29 12:23:07, xians1 wrote:
> confusing comment, can it make it more clear on what this flag is used for?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
File content/renderer/speech_recognition_dispatcher.cc (right):

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:72: 
On 2014/08/29 11:25:31, tommi wrote:
> remove this empty line

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:75:
SpeechRecognitionAudioSourceProvider::IsAllowedAudioTrack(audio_track);
My intention is to move away as much logic as possible from the dispatcher.
Makes it much more easier to test. I agree that this static method should belong
perhaps to another class - such as |SpeechRecognitionAudioTrackPolicy| so I can
inject the policy rather than having it hardcoded.

On 2014/08/29 11:25:31, tommi wrote:
> would it make sense to have the IsAllowedAudioTrack() (or whatever it might
get
> renamed to) implemented in this file?  (in an anonymous namespace at the top
of
> this file)
> 
> This seems to be the only code that needs that functionality.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:76: audio_track_ =
audio_track;
Method needs refactoring.
On 2014/08/29 11:25:31, tommi wrote:
> first DCHECK that audio_track_ isn't valid?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:83: audio_track_set_ = false;
Method needs refactoring.
On 2014/08/29 11:25:31, tommi wrote:
> reset/clear audio_track_ as well and set is_allowed_audio_track_ to false?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:93: // destroy any previous
instance not to starve it waiting on chunk ACKs
On 2014/08/29 12:23:07, xians1 wrote:
> Destroy

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:96: if (audio_track_set_ &&
!is_allowed_audio_track_) {
The JS |webkitSpeechRecognition.audioTrack| is not a mandatory field and can
therefore be null. That indicates audio_track_set_ == false and falls back to
default implementation on the browser (using AudioInputController). However if
we set |webkitSpeechRecognition.audioTrack| to a WebAudio MediaStreamTrack, then
it is set (the reference holds in JS). We should then tell the JS dev that this
is not allowed but still fall back so SR session starts. That's why I need two
members.

However - I think I can get rid of audio_track_set_ and use a pointer for
audio_track_ instead.

BTW - This is a good topic of discussion regarding how this should behave
towards the user in these corner cases.

On 2014/08/29 11:25:31, tommi wrote:
> do you really need is_allowed_audio_track_?  If the track isn't
> allowed/supported, then shouldn't we simply not set audio_track_set_ to true?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:101:
WebSpeechRecognizerClient::NotAllowedError);
Good for discussion.
On 2014/08/29 12:23:07, xians1 wrote:
> probably you should fail the start call in such case.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:118:
msg_params.using_audio_track = (audio_track_set_ && is_allowed_audio_track_);
Same as comment above. The JS API behaviour is not set in stone. More discussion
should resolve this as well.

On 2014/08/29 11:25:31, tommi wrote:
> looks like you always test these variables together..
> 
> actually, I'm wondering if you need either of them.  What if you change
attach()
> so that it does not assign to audio_track_ if the track isn't
allowed/supported
> and then simply check audio_track_.isNull() here (and elsewhere where you
> currently check these two variables)?
> 
> detach() would of course have to call audio_track_.reset() then (which I think
> it should anyway).

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:198:
audio_source_provider_.reset();
This message is received from the browser. Sockets should resolve the rest.
Although, I should inspect this event further.
On 2014/08/29 11:25:31, tommi wrote:
> I'm assuming that the browser side will be aware of the now 'stopped or
aborted'
> state.  Is that correct?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:219:
audio_source_provider_.reset();
This effectively kills of the sockets so the audio thread on the browser stops.
On 2014/08/29 11:25:31, tommi wrote:
> same question here since this feels functionally comparable to stop() or
abort()
> minus the message sent to the browser.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:263: // TODO(burnik): Log and
DCHECK(!audio_source_provider_).
It's still a bit fuzzy because we can only have one active session and I'm not
sure what multiple sessions would do. Tests will show.
On 2014/08/29 11:25:31, tommi wrote:
> can you do this now?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:265:
audio_source_provider_.reset();
I would have to dig in deeper to check if this would be needed. I will probably
make these methods more robust. Tests will show.
On 2014/08/29 11:25:31, tommi wrote:
> should this be done in detach() also?

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:269: audio_track_, params,
memory, socket, length));
On 2014/08/29 11:25:31, tommi wrote:
> indent

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
File content/renderer/speech_recognition_dispatcher.h (right):

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.h:49: const
blink::WebMediaStreamTrack&,
On 2014/08/29 11:25:31, tommi wrote:
> fix indent here and below

Done.

https://codereview.chromium.org/499233003/diff/60001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.h:96:
scoped_refptr<base::MessageLoopProxy> render_loop_;
Don't need it anymore.
On 2014/08/29 12:23:07, xians1 wrote:
> why do you need this?

burnik

burnik@chromium.org changed reviewers: + phoglund@chromium.org

6 years, 3 months ago (2014-09-12 11:53:45 UTC) #11

burnik

Took some time, but finally - unit test included. Also updated design doc. Hopefully it's ...

6 years, 3 months ago (2014-09-12 12:09:12 UTC) #12

henrika (OOO until Aug 14)

Looks impressive and a bit too big for me to swallow in one bite. Starting ...

6 years, 3 months ago (2014-09-12 12:27:47 UTC) #13

no longer working on chromium

Nice work in general. I haven't really looked at the unittest yet, it might be ...

6 years, 3 months ago (2014-09-15 08:31:29 UTC) #14

burnik

burnik@chromium.org changed reviewers: + jamesr@chromium.org, kenrb@chromium.org - phoglund@chromium.org

6 years, 3 months ago (2014-09-15 14:56:02 UTC) #15

burnik

Cleaned up a bit and simplified. Hoping to get more feedback from other reviewers. Special ...

6 years, 3 months ago (2014-09-15 15:00:07 UTC) #16

Cleaned up a bit and simplified. Hoping to get more feedback from other
reviewers. Special focus on the unit test.

+kenrb for IPC
+jamesr for Content

https://codereview.chromium.org/499233003/diff/80001/content/common/speech_re...
File content/common/speech_recognition_messages.h (right):

https://codereview.chromium.org/499233003/diff/80001/content/common/speech_re...
content/common/speech_recognition_messages.h:21: #include
"base/file_descriptor_posix.h"
Legacy from before SyncSocket::TransitDescriptor. Removed.
Will add IPC reviewers.
On 2014/09/15 08:31:27, xians1 wrote:
> ?? why do you need this?
> 
> Also, shouldn't this IPC msg be done on a separate CL?

https://codereview.chromium.org/499233003/diff/80001/content/content_tests.gypi
File content/content_tests.gypi (right):

https://codereview.chromium.org/499233003/diff/80001/content/content_tests.gy...
content/content_tests.gypi:684:
'renderer/speech_recognition_audio_source_provider_unittest.cc',
On 2014/09/15 08:31:27, xians1 wrote:
> alphabet order

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:10: #include
"base/time/time.h"
Alphabetic order of what?
On 2014/09/15 08:31:28, xians1 wrote:
> nit, alphabet order

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:37:
peer_buffer_index_ = &(buffer->params.size);
It has been used, on the browser process. Was init to 0 upon alloc and share.
Makes sense to me to alloc and init in the same place.
On 2014/09/15 08:31:29, xians1 wrote:
> I think it is a bit wrong, the shared_memory_ has not been used before, why
> should you read the value there?
> Simply, you can initialize peer_buffer_index_ to 0 here.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:53: bool
SpeechRecognitionAudioSourceProvider::IsAllowedAudioTrack(
"Supported" would indicate there is a technical barrier to supporting. Here it's
actually a policy because of the dreaded *abuse* SR could experience.
On 2014/09/15 08:31:29, xians1 wrote:
> IsAudioTrackSupported() seems a more suitable name here.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:55:
DCHECK(track.source().type() == blink::WebMediaStreamSource::TypeAudio);
True, no checks were done elsewhere. Done.
On 2014/09/15 08:31:28, xians1 wrote:
> you can't put DCHECK here, this method is trigger by JS, and developer can do
> whatever they want.
> Just return false if it is not TypeAudio

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:58:
DCHECK(native_source);
On 2014/09/15 08:31:28, xians1 wrote:
> Same here, return false if native_source does not exist.

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:73:
fifo_buffer_size_ = output_params_.frames_per_buffer() *
Floored. Integer division.
On 2014/09/15 08:31:28, xians1 wrote:
> how is this cast?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:98: if
(track_stopped_) return;
Done. However, clang-format proposes this way.
On 2014/09/15 08:31:27, xians1 wrote:
> new line.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:99: if (state ==
blink::WebMediaStreamSource::ReadyStateEnded) {
On 2014/09/15 08:31:28, xians1 wrote:
> add an empty line before the second if (

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:101:
MediaStreamAudioSink::RemoveFromAudioTrack(this, track_);
Are you sure? Will the MediaStreamAudioSink remove the track on it's own? Can
you point me to that code, please?
This is paired with the dtor of the class.
On 2014/09/15 08:31:28, xians1 wrote:
> Remove this line of code.
> track_ has already been ended, you should not call into the track_ any more.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:102:
NotifyErrorState(ErrorState::TRACK_STOPPED);
Agreed. It's here for now as I refactor.
On 2014/09/15 08:31:29, xians1 wrote:
> hmm, track ended state is not an error, ErrorState should not include
> TRACK_STOPPED at all.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:115:
NotifyErrorState(ErrorState::AUDIO_FIFO_OVERFLOW);
Logged via DLOG(ERROR).
Client can destroy the audio source provider and potentially end the session
early.
On 2014/09/15 08:31:28, xians1 wrote:
> Log it.
> Also, could you please explain what the client supposes to do when getting a
> AUDIO_FIFO_OVERFLOW callback?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:125: if
(fifo_->frames() < fifo_buffer_size_) return;
On 2014/09/15 08:31:28, xians1 wrote:
> empty line for the return

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:160:
audio_bus->Zero();
Yes. The else happens when we attach to the converter in |OnSetFormat|.
Otherwise wouldn't be removing the |attached_converter_|.
On 2014/09/15 08:31:29, xians1 wrote:
> do you know if the else case can happen here?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:161: return 1.0;
On 2014/09/15 08:31:28, xians1 wrote:
> empty line before the return.

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.h (right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:32: //
WebRtcLocalAudioTrack and stores the capture data to a FIFO.
Comment expanded.

On 2014/09/12 12:27:46, henrika wrote:
> I would say, "and stores the captured data in a FIFO". And "When there is
enough
> data in the FIFO..." and perhaps also explain what is meant by enough.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:40: // Used for
notifying the renderer client there is an issue with
Removed enum.
On 2014/09/12 12:27:46, henrika wrote:
> "...if/when there is an issue"

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:43: // Indicates a
notification send failed. Recoverable.
Ditto.
On 2014/09/12 12:27:47, henrika wrote:
> "Indicates that sending a notification failed"

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:45: // Indicates
client hasn't consumed last buffer. Recoverable.
Ditto.
On 2014/09/12 12:27:47, henrika wrote:
> "indicates client" feels wrong; can you rewrite? What about: "Indicates that a
> client..."?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:48:
AUDIO_FIFO_OVERFLOW,
Logged instead. Only stop propagated via callback.
On 2014/09/15 08:31:29, xians1 wrote:
> I am not sure if there is any value to most of these error codes here? what
the
> client is supposed to do when getting errors like SEND_FAILED, BUFFER_SYNC_LAG
> and AUIDO_FIFO_OVERFLOW?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:49: // Indicates the
audio track has stopped. Provider can then be destroyed.
Ditto.
On 2014/09/12 12:27:46, henrika wrote:
> "..that the audio track..."

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:63: // Determines
the policy on what types of tracks are allowed.
Good point. We determine. Implementation enforces. And is used outside via the
client.
On 2014/09/12 12:27:47, henrika wrote:
> Is this correct. How can a method which returns true or false determine
> anything? It is a plain getter, right?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:67: //
MediaStreamAudioSink implementation.
It's |content| as well. But added.
On 2014/09/12 12:27:46, henrika wrote:
> No namespace here?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:77: // so it has
been under the protection of |lock_|.
Comment deprecated.
On 2014/09/12 12:27:46, henrika wrote:
> "so it has been under..." sounds odd to me. Do you mean "..and a lock is
> therefore added to make the method thread safe"?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:81: // Notifies
client there is an issue with delivering frames.
Removed from design.
On 2014/09/12 12:27:46, henrika wrote:
> "Notifies client there is" does not sound correct. I would say "Notifies the
> client when there is an issue..."

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:82: // TODO(burnik):
Runs on capture thread. Should run on main renderer thread!
TODO was for before landing. Removed from design.
On 2014/09/12 12:27:47, henrika wrote:
> This TODO needs a corresponding crbug.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:87: // consume it on
the |output_bus_|. Size of the buffer is depends on the
On 2014/09/12 12:27:46, henrika wrote:
> "is depends"??

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:104:
base::SyncSocket* socket_;
Client owns it (renderer - dispatcher). This is for dependency injection as
well.
On 2014/09/15 08:31:29, xians1 wrote:
> why the socket_ is raw pointer? who owns it?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:124: // Whether the
track has been stopped on the input.
On 2014/09/12 12:27:47, henrika wrote:
> "stopped on the input"??

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:127: // Local
counter of audio buffers for synchronization on consumed buffers.
Looks excessive actually. Removed.
On 2014/09/12 12:27:47, henrika wrote:
> Isn't "of consumed" better?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.h:133: // Callback
notifying an error has occured.
Removed from design. Replaced by OnStoppedCB and commented.
On 2014/09/12 12:27:47, henrika wrote:
> .."notifying that an...", or "Callback which is activated when..."

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider_unittest.cc
(right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider_unittest.cc:29: //
Buffer to be shared between two fake sockets.
On 2014/09/12 12:09:12, burnik wrote:
> 'fake' is interchangeable with 'mock' regarding sockets.

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider_unittest.cc:31: uint8
data[100000];
On 2014/09/15 08:31:29, xians1 wrote:
> noooooo, you can't allocate 100000 bytes in stack like this, change it the
code
> to use heap.

This is allocated on and owned by the FakeSpeechRecognizer.
Size is reduced to 8.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider_unittest.cc:442: //
(3) Miraculasly recovered from the socket failure.
On 2014/09/12 12:09:12, burnik wrote:
> * Miraculously

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider_unittest.cc:453: //
First round of input has to have one additional buffer
On 2014/09/12 12:09:12, burnik wrote:
> This comment is deprecated.

Done.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_dispatcher.cc (right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_dispatcher.cc:299: void
SpeechRecognitionDispatcher::OnAudioTrackError(
Refactored stub - just used for detecting a stop. Refactoring further in next
iteration.
On 2014/09/12 12:09:12, burnik wrote:
> Clearly not useful yet. Consider it a stub.

burnik

On 2014/09/15 17:22:10, jamesr wrote: > Could this go in content/renderer/media/ instead? Done. Next patchset ...

6 years, 3 months ago (2014-09-16 09:04:16 UTC) #19

no longer working on chromium

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_recognition_audio_source_provider.cc File content/renderer/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_recognition_audio_source_provider.cc#newcode9 content/renderer/speech_recognition_audio_source_provider.cc:9: #include "base/threading/thread_restrictions.h" why do you have this base/threading/thread_restrictions.h in ...

6 years, 3 months ago (2014-09-16 12:44:07 UTC) #20

burnik

Refactoring unit test and source provider, moved to media

6 years, 3 months ago (2014-09-16 19:06:46 UTC) #23

burnik

Comments addressed. A few questions still open. https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_recognition_audio_source_provider.cc File content/renderer/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_recognition_audio_source_provider.cc#newcode9 content/renderer/speech_recognition_audio_source_provider.cc:9: #include "base/threading/thread_restrictions.h" ...

6 years, 3 months ago (2014-09-16 19:10:25 UTC) #24

burnik

burnik@chromium.org changed reviewers: + jochen@chromium.org - gshires@chromium.org

6 years, 3 months ago (2014-09-17 15:52:47 UTC) #25

no longer working on chromium

Some more comments about the production code, I will try to take a look at ...

6 years, 3 months ago (2014-09-17 15:55:20 UTC) #27

Some more comments about the production code, I will try to take a look at the
unittest later.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:73:
fifo_buffer_size_ = output_params_.frames_per_buffer() *
On 2014/09/16 19:10:22, burnik wrote:
> Input and output params are of media::AudioParameters type.
> All members here are int. Integer division omits decimals.
> Added DCHECK(output_params_.IsValid()); to next patchset which will check if
> output sample rate is 0.
> In production - input will be 44100 with 441 frames and output will be 16000
> with 1600 frames. 
> Also, DCHECKS which follow check if we have enough buffer.
> 

The example you are taking is just what it is on your machine, the input sample
rate can be any of the hardware sample rates, from 8k up to 192k

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
content/renderer/speech_recognition_audio_source_provider.cc:78:
DCHECK_GE(fifo_buffer_size_, output_params_.frames_per_buffer());
On 2014/09/16 19:10:23, burnik wrote:
> Do we ever have mic input which is 8K? What should be the size of the FIFO
then
> in a general case?
> On 2014/09/16 12:44:06, xians1 wrote:
> > these two DCHECKs are not always right, for example, what if
> > input_params_.sample_rate() is 8K and output_params_.sample_rate() is 16K.
> > Some more thought is needed here.
> 

Yes, 8K as sample rate does exist on all platforms.

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
content/renderer/speech_recognition_audio_source_provider.cc:146: // The send
usually fails if the user changes his input audio device.
On 2014/09/16 19:10:22, burnik wrote:
> As far as I've tested, yes.
> On 2014/09/16 12:44:06, xians1 wrote:
> > is this comment true?
> 

Interesting, to double check, if you changed the input device used by
getUserMedia when connecting to recognition, it will trigger this code?

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
content/renderer/speech_recognition_audio_source_provider.cc:164:
audio_bus->Zero();
On 2014/09/16 19:10:23, burnik wrote:
> Yes. Shown many times while testing. This occurs when we attach the sink via
> *audio_converter_->AddInput(this)*. 
> This is related to the story of the now deprecated |attached_converter_| if
you
> look back to one of the earliest patchsets.
> On 2014/09/16 12:44:06, xians1 wrote:
> > can the else case be possible at all?
> 

Got it, thanks.

https://codereview.chromium.org/499233003/diff/160001/content/common/speech_r...
File content/common/speech_recognition_messages.h (right):

https://codereview.chromium.org/499233003/diff/160001/content/common/speech_r...
content/common/speech_recognition_messages.h:124:
IPC_MESSAGE_ROUTED4(SpeechRecognitionMsg_AudioTrackReady, int /* request_id */,
nit, one line for each param if all params do not fit in one line.

https://codereview.chromium.org/499233003/diff/160001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/160001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.cc:49: if
(audio_converter_.get()) audio_converter_->RemoveInput(this);
nit, new line

https://codereview.chromium.org/499233003/diff/160001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.cc:142:
DLOG(WARNING) << "Buffer synchronization lag";
hmm, thinking about these code a bit, it seems risky to me. Is it possible that
something wrong with the socket, and the indexes gets out of sync and they will
never get recovered because of this check?

https://codereview.chromium.org/499233003/diff/160001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider.h (right):

https://codereview.chromium.org/499233003/diff/160001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:83:
base::SyncSocket* socket_;
Please see my previous comments about this socket_, you are leaking it.

burnik

Fixed leaking SyncSocket - appears in next patchest. Anyone else care to take a look ...

6 years, 3 months ago (2014-09-18 09:19:35 UTC) #28

burnik

xians: Wanted to check with you if this is what you meant by calculating the ...

6 years, 3 months ago (2014-09-18 19:09:22 UTC) #29

xians: Wanted to check with you if this is what you meant by calculating the
size of the FIFO.

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:73:
fifo_buffer_size_ = output_params_.frames_per_buffer() *
On 2014/09/17 15:55:19, xians1 wrote:
> On 2014/09/16 19:10:22, burnik wrote:
> > Input and output params are of media::AudioParameters type.
> > All members here are int. Integer division omits decimals.
> > Added DCHECK(output_params_.IsValid()); to next patchset which will check if
> > output sample rate is 0.
> > In production - input will be 44100 with 441 frames and output will be 16000
> > with 1600 frames. 
> > Also, DCHECKS which follow check if we have enough buffer.
> > 
> 
> The example you are taking is just what it is on your machine, the input
sample
> rate can be any of the hardware sample rates, from 8k up to 192k

Ok, Agreed. 

So if I do it this way:

fifo_buffer_size_ =
      std::ceil(output_params_.frames_per_buffer() *
                static_cast<double>(input_params_.sample_rate()) /
                output_params_.sample_rate());

I've tested, and it would work properly for these:

================================
 in.sr  in.fpb  out.sr  out.fpb
--------------------------------
  8000      80   16000     1600
  8000     800   16000     1600
 16000     160   16000     1600
 16000    1600   16000     1600
 32000     320   16000     1600
 32000    3200   16000     1600
 44100     441   16000     1600
 44100    4410   16000     1600
 48000     480   16000     1600
 48000    4800   16000     1600
 96000     960   16000     1600
 96000    9600   16000     1600
 11025     111*  16000     1600
 11025    1103*  16000     1600
 22050     221*  16000     1600
 22050    2205   16000     1600
 88200     882   16000     1600
 88200    8820   16000     1600
 176400   1764   16000     1600
 176400  17640   16000     1600
 192000   1920   16000     1600
 192000  19200   16000     1600
================================

* These starred are always rounded up, right?

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
content/renderer/speech_recognition_audio_source_provider.cc:78:
DCHECK_GE(fifo_buffer_size_, output_params_.frames_per_buffer());
True, removed the second DCHECK. It doesn't really make sense anyway since the
FIFO is used for buffering the input.
On 2014/09/17 15:55:19, xians1 wrote:
> On 2014/09/16 19:10:23, burnik wrote:
> > Do we ever have mic input which is 8K? What should be the size of the FIFO
> then
> > in a general case?
> > On 2014/09/16 12:44:06, xians1 wrote:
> > > these two DCHECKs are not always right, for example, what if
> > > input_params_.sample_rate() is 8K and output_params_.sample_rate() is 16K.
> > > Some more thought is needed here.
> > 
> 
> Yes, 8K as sample rate does exist on all platforms.

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
content/renderer/speech_recognition_audio_source_provider.cc:146: // The send
usually fails if the user changes his input audio device.
If I change input device from System Sound on linux, this occurs.
On 2014/09/17 15:55:19, xians1 wrote:
> On 2014/09/16 19:10:22, burnik wrote:
> > As far as I've tested, yes.
> > On 2014/09/16 12:44:06, xians1 wrote:
> > > is this comment true?
> > 
> 
> Interesting, to double check, if you changed the input device used by
> getUserMedia when connecting to recognition, it will trigger this code?

https://codereview.chromium.org/499233003/diff/100001/content/renderer/speech...
content/renderer/speech_recognition_audio_source_provider.cc:164:
audio_bus->Zero();
On 2014/09/17 15:55:19, xians1 wrote:
> On 2014/09/16 19:10:23, burnik wrote:
> > Yes. Shown many times while testing. This occurs when we attach the sink via
> > *audio_converter_->AddInput(this)*. 
> > This is related to the story of the now deprecated |attached_converter_| if
> you
> > look back to one of the earliest patchsets.
> > On 2014/09/16 12:44:06, xians1 wrote:
> > > can the else case be possible at all?
> > 
> 
> Got it, thanks.

Acknowledged.

no longer working on chromium

Hi Kristijan, did you forget to upload a new version? https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_recognition_audio_source_provider.cc File content/renderer/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_recognition_audio_source_provider.cc#newcode73 ...

6 years, 3 months ago (2014-09-19 08:58:56 UTC) #30

Hi Kristijan, did you forget to upload a new version?

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
File content/renderer/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
content/renderer/speech_recognition_audio_source_provider.cc:73:
fifo_buffer_size_ = output_params_.frames_per_buffer() *
On 2014/09/18 19:09:21, burnik wrote:
> On 2014/09/17 15:55:19, xians1 wrote:
> > On 2014/09/16 19:10:22, burnik wrote:
> > > Input and output params are of media::AudioParameters type.
> > > All members here are int. Integer division omits decimals.
> > > Added DCHECK(output_params_.IsValid()); to next patchset which will check
if
> > > output sample rate is 0.
> > > In production - input will be 44100 with 441 frames and output will be
16000
> > > with 1600 frames. 
> > > Also, DCHECKS which follow check if we have enough buffer.
> > > 
> > 
> > The example you are taking is just what it is on your machine, the input
> sample
> > rate can be any of the hardware sample rates, from 8k up to 192k
> 
> Ok, Agreed. 
> 
> So if I do it this way:
> 
> fifo_buffer_size_ =
>       std::ceil(output_params_.frames_per_buffer() *
>                 static_cast<double>(input_params_.sample_rate()) /
>                 output_params_.sample_rate());
> 
> I've tested, and it would work properly for these:
> 
> ================================
>  in.sr  in.fpb  out.sr  out.fpb
> --------------------------------
>   8000      80   16000     1600
>   8000     800   16000     1600
>  16000     160   16000     1600
>  16000    1600   16000     1600
>  32000     320   16000     1600
>  32000    3200   16000     1600
>  44100     441   16000     1600
>  44100    4410   16000     1600
>  48000     480   16000     1600
>  48000    4800   16000     1600
>  96000     960   16000     1600
>  96000    9600   16000     1600
>  11025     111*  16000     1600
>  11025    1103*  16000     1600
>  22050     221*  16000     1600
>  22050    2205   16000     1600
>  88200     882   16000     1600
>  88200    8820   16000     1600
>  176400   1764   16000     1600
>  176400  17640   16000     1600
>  192000   1920   16000     1600
>  192000  19200   16000     1600
> ================================
> 
> * These starred are always rounded up, right?
> 

I think this looks correct.

burnik

On 2014/09/19 08:58:56, xians1 wrote: > Hi Kristijan, did you forget to upload a new ...

6 years, 3 months ago (2014-09-19 09:24:09 UTC) #31

On 2014/09/19 08:58:56, xians1 wrote:
> Hi Kristijan, did you forget to upload a new version?
> 
>
https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
> File content/renderer/speech_recognition_audio_source_provider.cc (right):
> 
>
https://codereview.chromium.org/499233003/diff/80001/content/renderer/speech_...
> content/renderer/speech_recognition_audio_source_provider.cc:73:
> fifo_buffer_size_ = output_params_.frames_per_buffer() *
> On 2014/09/18 19:09:21, burnik wrote:
> > On 2014/09/17 15:55:19, xians1 wrote:
> > > On 2014/09/16 19:10:22, burnik wrote:
> > > > Input and output params are of media::AudioParameters type.
> > > > All members here are int. Integer division omits decimals.
> > > > Added DCHECK(output_params_.IsValid()); to next patchset which will
check
> if
> > > > output sample rate is 0.
> > > > In production - input will be 44100 with 441 frames and output will be
> 16000
> > > > with 1600 frames. 
> > > > Also, DCHECKS which follow check if we have enough buffer.
> > > > 
> > > 
> > > The example you are taking is just what it is on your machine, the input
> > sample
> > > rate can be any of the hardware sample rates, from 8k up to 192k
> > 
> > Ok, Agreed. 
> > 
> > So if I do it this way:
> > 
> > fifo_buffer_size_ =
> >       std::ceil(output_params_.frames_per_buffer() *
> >                 static_cast<double>(input_params_.sample_rate()) /
> >                 output_params_.sample_rate());
> > 
> > I've tested, and it would work properly for these:
> > 
> > ================================
> >  in.sr  in.fpb  out.sr  out.fpb
> > --------------------------------
> >   8000      80   16000     1600
> >   8000     800   16000     1600
> >  16000     160   16000     1600
> >  16000    1600   16000     1600
> >  32000     320   16000     1600
> >  32000    3200   16000     1600
> >  44100     441   16000     1600
> >  44100    4410   16000     1600
> >  48000     480   16000     1600
> >  48000    4800   16000     1600
> >  96000     960   16000     1600
> >  96000    9600   16000     1600
> >  11025     111*  16000     1600
> >  11025    1103*  16000     1600
> >  22050     221*  16000     1600
> >  22050    2205   16000     1600
> >  88200     882   16000     1600
> >  88200    8820   16000     1600
> >  176400   1764   16000     1600
> >  176400  17640   16000     1600
> >  192000   1920   16000     1600
> >  192000  19200   16000     1600
> > ================================
> > 
> > * These starred are always rounded up, right?
> > 
> 
> I think this looks correct.

xians: Did not forget. Still having some changes in prep and will upload
shortly.

burnik

As I'm refactoring for the next patchset, I would like to get more opinions on ...

6 years, 3 months ago (2014-09-22 07:43:12 UTC) #33

henrika (OOO until Aug 14)

Don't qualify as expert reviewer on the unit test. Added generic comments. https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider_unittest.cc File content/renderer/media/speech_recognition_audio_source_provider_unittest.cc ...

6 years, 3 months ago (2014-09-22 08:02:19 UTC) #34

burnik

Addresed comments. Waiting for other reviewers before next patchset. https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider_unittest.cc File content/renderer/media/speech_recognition_audio_source_provider_unittest.cc (right): https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider_unittest.cc#newcode84 content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:84: ...

6 years, 3 months ago (2014-09-22 09:17:37 UTC) #35

Addresed comments. Waiting for other reviewers before next patchset.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider_unittest.cc
(right):

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:84:
////////////////////////////////////////////////////////////////////////////////
On 2014/09/22 08:02:19, henrika wrote:
> Please remove these non-standard separators.

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:86:
class FakeSpeechRecognizer {
On 2014/09/22 08:02:19, henrika wrote:
> This looks like a very complex helper which is now without any comments. To
me,
> it seems like a risk to add such a complex class to a test since the test
might
> fail due to this special implementation instead of the actual code.

This is the mock consumer. Unit tests focus on the class being tested (the
producer here) and should show how that class is to be used as far as I know.
I've added more comments to explain what the class does. 

I don't really see how this unit test could fail because of this helper class.
The consumer code is covered in a different test.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:167:
////////////////////////////////////////////////////////////////////////////////
On 2014/09/22 08:02:19, henrika wrote:
> remove

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:197:
// Initializes the producer and consumer with specified audio parameters.
On 2014/09/22 08:02:18, henrika wrote:
> Can you elaborate on what a producer and consumer is in this test.

Yes. It's explained on lines 228 - 238.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:262:
// Capturer.
On 2014/09/22 08:02:18, henrika wrote:
> These comments does not add much. Please explain what they do in the test or
> remove.

All these comments are now removed.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:294:
base::TimeDelta::FromMilliseconds(0), 1, false,
On 2014/09/22 08:02:18, henrika wrote:
> FromMilliseconds(0)?

Yes, no delay is required in the unit test.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:355:
TEST_F(SpeechRecognitionAudioSourceProviderTest, CheckIsSupportedAudioTrack) {
On 2014/09/22 08:02:18, henrika wrote:
> Could you make the name more clear? CheckIsSupported is not clear to me.

Added comment above test.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:382:
TEST_F(SpeechRecognitionAudioSourceProviderTest, RecognizerNotifiedOnSocket) {
On 2014/09/22 08:02:18, henrika wrote:
> Please add some lines of comments above each test explaining some more about
> what you test.

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:400:
TEST_F(SpeechRecognitionAudioSourceProviderTest, AudioDataIsResampledOnSink) {
On 2014/09/22 08:02:19, henrika wrote:
> Lots of hardcoded values in this test. Makes it difficult to understand what
you
> test. Should it only work for these settings?

Added more comments.
I don't test that the resampler does a good job, since that is covered in it's
own unit test.
I use these typical values here just to make sure that the producer also does
resampling when it delivers frames to the consumer.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:444:
// (1) Start with no problems on the socket.
On 2014/09/22 08:02:18, henrika wrote:
> Remove (1) and (2)

Done.

no longer working on chromium

Great work, it is getting closer. https://codereview.chromium.org/499233003/diff/180001/content/common/speech_recognition_messages.h File content/common/speech_recognition_messages.h (right): https://codereview.chromium.org/499233003/diff/180001/content/common/speech_recognition_messages.h#newcode10 content/common/speech_recognition_messages.h:10: #include "base/process/process_handle.h" On ...

6 years, 3 months ago (2014-09-23 10:09:14 UTC) #36

henrika (OOO until Aug 14)

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider_unittest.cc File content/renderer/media/speech_recognition_audio_source_provider_unittest.cc (right): https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider_unittest.cc#newcode86 content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:86: class FakeSpeechRecognizer { I am not saying it will ...

6 years, 3 months ago (2014-09-23 10:45:33 UTC) #37

burnik

New patchset is out. :-) https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider.h File content/renderer/media/speech_recognition_audio_source_provider.h (right): https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/speech_recognition_audio_source_provider.h#newcode5 content/renderer/media/speech_recognition_audio_source_provider.h:5: #ifndef CONTENT_RENDERER_SPEECH_RECOGNITION_AUDIO_SOURCE_PROVIDER_H_ On 2014/09/23 ...

6 years, 3 months ago (2014-09-23 12:39:21 UTC) #39

New patchset is out. :-)

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider.h (right):

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:5: #ifndef
CONTENT_RENDERER_SPEECH_RECOGNITION_AUDIO_SOURCE_PROVIDER_H_
On 2014/09/23 10:09:12, xians1 wrote:
> update this macro, you have moved the code to media

Done. It is
CONTENT_RENDERER_MEDIA_SPEECH_RECOGNITION_AUDIO_SOURCE_PROVIDER_H_ 
in next patchset.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:23: class
AudioParameters;
On 2014/09/23 10:09:13, xians1 wrote:
> remove this forward declaration.

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:84:
scoped_ptr<base::SyncSocket> socket_;
On 2014/09/23 10:09:12, xians1 wrote:
> You should use base::CancelableSyncSocket socket_ here, check how
> audio_device_thread.cc is done.

Ok. Let's say I pass in a CancelableSyncSocket from the dispatcher (renderer
client).
The SyncSocket is just owned here, but is injected as a pointer from the
renderer client.
I think then it's ok to have a SyncSocket here since I'm using only |Send()|?

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:98: // Params
of the source audio. Can change when |OnSetFormat| occurs.
On 2014/09/23 10:09:12, xians1 wrote:
> nit, s/|OnSetFormat|/OnSetFormat()/g

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider_unittest.cc
(right):

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:67:
for (size_t i = 0; i < length; i++, buffer_->length++)
Changed to prefixed increment.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:76:
for (size_t i = buffer_->start; i < buffer_->length; i++, buffer_->start++)
Changed to prefixed increment.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:84:
////////////////////////////////////////////////////////////////////////////////
On 2014/09/23 10:09:13, xians1 wrote:
> On 2014/09/22 09:17:36, burnik wrote:
> > On 2014/09/22 08:02:19, henrika wrote:
> > > Please remove these non-standard separators.
> > 
> > Done.
> 
> Not done yet.

Yes, done for next patchset as advertised.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:86:
class FakeSpeechRecognizer {
On 2014/09/23 10:45:33, henrika wrote:
> I am not saying it will fail but that is a large helper without any comments.
> Hence, it is not clear to me why the test must contain all this code instead
of
> using existing (possibly mock) instead.

Ok. I'll revisit existing unit tests to see if anything can be reused. 
This could possibly happen after I add new stuff to other tests to check the
consumer.
Until then, I think it is a fairly simple helper showing how the consumer should
behave.
Also, I pointed out which methods are just mocks and helpers for testing.
Generally, any code duplication should be avoided, in that sense I do agree.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:137:
(*buffer_index_)++;
On 2014/09/23 10:09:13, xians1 wrote:
> ++(*buffer_index_)

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:158:
MockSyncSocket* foreign_socket_;
On 2014/09/23 10:09:13, xians1 wrote:
> why this is a raw pointer?

It is owned by the recognizer and destroyed there.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
File content/renderer/speech_recognition_dispatcher.cc (right):

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.cc:80:
WebSpeechRecognizerClient::NotAllowedError);
On 2014/09/23 10:09:13, xians1 wrote:
> return here, since we are failing the start()

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.cc:84: // Destroy any previous
instance not to starve it waiting on chunk ACKs.
On 2014/09/23 10:09:13, xians1 wrote:
> Please add more comment to explain why we stop the audio_source_provider_ when
a
> new session is started.

Done.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.cc:177:
recognizer_client_->didReceiveNoMatch(GetHandleFromID(request_id),
On 2014/09/23 10:09:13, xians1 wrote:
> what will happen when getting a didReceiveNoMatch callback?

This is an API protocol thing as far as I know, not a local problem.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.cc:255: audio_track_, params,
memory, socket.release(),
On 2014/09/23 10:09:13, xians1 wrote:
> you don't need to create a socket at all, just pass the handle to
> SpeechRecognitionAudioSourceProvider.

That would be true if the unit test did not have a fake socket. It's impossible
to inject a socket via handle, I would have to add a method or an overloaded
constructor which would indicate a specialized member for tests - yucky :-|.
Maybe the constructor should just indicate a scoped_ptr so we know the ownership
is being transferred.

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
File content/renderer/speech_recognition_dispatcher.h (right):

https://codereview.chromium.org/499233003/diff/180001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.h:16: #include
"content/renderer/media/speech_recognition_audio_source_provider.h"
On 2014/09/23 10:09:13, xians1 wrote:
> remove, because you forward declare it below.

Done.

jamesr

Please ping once you have an OWNER for content/renderer/media approve this. https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech_recognition_dispatcher.h File content/renderer/speech_recognition_dispatcher.h (right): ...

6 years, 3 months ago (2014-09-23 23:31:40 UTC) #40

burnik

jamesr: Quick follow up on comments. https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech_recognition_dispatcher.h File content/renderer/speech_recognition_dispatcher.h (right): https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech_recognition_dispatcher.h#newcode64 content/renderer/speech_recognition_dispatcher.h:64: // Called by ...

6 years, 3 months ago (2014-09-24 09:04:38 UTC) #41

tommi (sloooow) - chröme

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/speech_recognition_audio_source_provider.cc File content/renderer/media/speech_recognition_audio_source_provider.cc (right): https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/speech_recognition_audio_source_provider.cc#newcode178 content/renderer/media/speech_recognition_audio_source_provider.cc:178: return 1.0; document what this means? https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/speech_recognition_audio_source_provider.h File content/renderer/media/speech_recognition_audio_source_provider.h ...

6 years, 3 months ago (2014-09-24 09:52:00 UTC) #42

burnik

Good review round. There are a few open questions for jamesr, tommi and Shijing. Please ...

6 years, 3 months ago (2014-09-24 11:54:23 UTC) #43

Good review round. 
There are a few open questions for jamesr, tommi and Shijing. 
Please reply before I push the next patchset.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider.cc (right):

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.cc:178: return
1.0;
On 2014/09/24 09:51:59, tommi wrote:
> document what this means?

// Return volume greater than zero to indicate we have more data.
SGTU?

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider.h (right):

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:27: //
SpeechRecognitionAudioSourceProvider works as an audio sink to the
On 2014/09/24 09:51:59, tommi wrote:
> Is 'source provider' a good name?  The reason I'm wondering is because usually
> things are chained together like: src->track->sink, so if this is an audio
sink,
> it feels odd to have it be called a "source provider".  That being said, I
> haven't looked at the implementation so perhaps this will make sense to me in
a
> bit.

Did it make sense in the impl? My reasoning that this audio data is not being
consumed here on the render, rather on the browser.

Do you think I should rename it to SpeechRecognitionAudioSink?

Shijing, your thoughts?

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:40:
SpeechRecognitionAudioSourceProvider(const blink::WebMediaStreamTrack& track,
On 2014/09/24 09:51:59, tommi wrote:
> can you document these parameters? particularly how ownership is passed around
> etc.

Added a TODO for next round.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider.h:43:
base::SyncSocket* socket,
On 2014/09/24 09:51:59, tommi wrote:
> actually, if ownership is being passed here, please use scoped_ptr<> here (and
> .Pass() on the caller side).
Done. I suppose I should initialize in constructor like this as well:

: socket_(socket.Pass())


Right?

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_source_provider_unittest.cc
(right):

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:29:
SharedBuffer() : start(0), length(0) {}
On 2014/09/24 09:52:00, tommi wrote:
> nit: what about also initializing data?
> 
> SharedBuffer() : data(), start(0), length(0) {}

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:41:
in_failure_mode_(false) { }
On 2014/09/24 09:51:59, tommi wrote:
> nit: {}

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:66:
uint8* b = static_cast<uint8*>(const_cast<void*>(buffer));
On 2014/09/24 09:51:59, tommi wrote:
> is this safe (if it is, please add a comment)?  What if someone passes truly
> read-only data like e.g. Send("foo", 4)?

Would this be safe?

const uint8* b = static_cast<const uint8*>(buffer);

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:68:
buffer_->data[buffer_->start + buffer_->length] = b[i];
On 2014/09/24 09:51:59, tommi wrote:
> hmm... I don't see why you need to cast away the constness... where do you
write
> to |b|?

Acknowledged.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:75:
uint8* b = static_cast<uint8*>(const_cast<void*>(buffer));
On 2014/09/24 09:51:59, tommi wrote:
> buffer isn't const, so no need for the const_cast

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:110:
buffer_index_ = &(buffer->params.size);
On 2014/09/24 09:51:59, tommi wrote:
> what about just having a member variable of type media::AudioInputBuffer*
> instead?
> this feels like a bit more magic since it's just an int pointer into space,
> whereas the AudioInputBuffer will point to a more better defined type.

I think AudioInputBuffer is a terrible name for this purpose. It is used on both
ends and that's the reason of proposing AudioSharedBuffer which would mitigate
this magic.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:148:
uint32 buffer_index() { return *buffer_index_; }
On 2014/09/24 09:51:59, tommi wrote:
> isn't this returning 'size' rather than a buffer index?

Again. I'm actually using 'size' to count the buffers. See what I mean with
AudioSharedBuffer?

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:372:
const int kAudioParams[kNumAudioParamTuples][2] = {
On 2014/09/24 09:52:00, tommi wrote:
> add 24000?

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:400:
for (uint32 i = 0; i < kSourceDataLength; ++i)
On 2014/09/24 09:51:59, tommi wrote:
> {}

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:407:
int16 sink_data[kSinkDataLength];
On 2014/09/24 09:51:59, tommi wrote:
> nit: = {0};

Done.
Does this array init-to-zero on stack work for all platforms and compilers we
use?

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:417:
for (uint32 i = 0; i < kNumFramesToTest; ++i)
On 2014/09/24 09:51:59, tommi wrote:
> {}

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:419:
ASSERT_EQ(0, sink_data[i * kOutputChannels + c]);
On 2014/09/24 09:51:59, tommi wrote:
> EXPECT_EQ?

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:434:
for (uint32 i = 0; i < kNumFramesToTest; ++i)
On 2014/09/24 09:51:59, tommi wrote:
> {}

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_source_provider_unittest.cc:436:
ASSERT_EQ(kExpectedData[i], sink_data[i * kOutputChannels + c]);
On 2014/09/24 09:51:59, tommi wrote:
> EXPECT_EQ?

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech...
File content/renderer/speech_recognition_dispatcher.cc (right):

https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.cc:33: next_id_(1) { }
On 2014/09/24 09:52:00, tommi wrote:
> {}

Done.

https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech...
content/renderer/speech_recognition_dispatcher.cc:39:
audio_source_provider_.reset();
On 2014/09/24 09:52:00, tommi wrote:
> on which thread does this execute?

Same as constructor would be my best guess.
Actually looks like dead code - either legacy or a future thing. 
This is never called according to code search.

no longer working on chromium

https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/speech_recognition_audio_source_provider.h File content/renderer/media/speech_recognition_audio_source_provider.h (right): https://codereview.chromium.org/499233003/diff/200001/content/renderer/media/speech_recognition_audio_source_provider.h#newcode27 content/renderer/media/speech_recognition_audio_source_provider.h:27: // SpeechRecognitionAudioSourceProvider works as an audio sink to the ...

6 years, 3 months ago (2014-09-24 13:40:47 UTC) #44

jamesr

On 2014/09/24 09:04:38, burnik wrote: > https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech_recognition_dispatcher.h#newcode68 > content/renderer/speech_recognition_dispatcher.h:68: // Called By > |audio_source_provider_|. > ...

6 years, 3 months ago (2014-09-24 18:40:25 UTC) #45

burnik

On 2014/09/24 18:40:25, jamesr wrote: > On 2014/09/24 09:04:38, burnik wrote: > > > https://codereview.chromium.org/499233003/diff/200001/content/renderer/speech_recognition_dispatcher.h#newcode68 ...

6 years, 2 months ago (2014-09-25 09:50:51 UTC) #46

burnik

Patchset 10. Renamed the new class and unit test. Nits addressed. Any owners feel like ...

6 years, 2 months ago (2014-09-25 12:56:24 UTC) #47

burnik

https://codereview.chromium.org/499233003/diff/220001/content/renderer/speech_recognition_dispatcher.cc File content/renderer/speech_recognition_dispatcher.cc (right): https://codereview.chromium.org/499233003/diff/220001/content/renderer/speech_recognition_dispatcher.cc#newcode79 content/renderer/speech_recognition_dispatcher.cc:79: WebString("Provided audioTrack is not supported. Ignoring track."), Removed "Ignoring ...

6 years, 2 months ago (2014-09-25 13:15:21 UTC) #48

no longer working on chromium

Some nits, lgtm after you address them. https://codereview.chromium.org/499233003/diff/240001/content/common/speech_recognition_messages.h File content/common/speech_recognition_messages.h (right): https://codereview.chromium.org/499233003/diff/240001/content/common/speech_recognition_messages.h#newcode68 content/common/speech_recognition_messages.h:68: // Wheter ...

6 years, 2 months ago (2014-09-29 09:28:57 UTC) #49

burnik

Nits addressed. Waiting for remaining owners to stamp. https://codereview.chromium.org/499233003/diff/240001/content/common/speech_recognition_messages.h File content/common/speech_recognition_messages.h (right): https://codereview.chromium.org/499233003/diff/240001/content/common/speech_recognition_messages.h#newcode68 content/common/speech_recognition_messages.h:68: // ...

6 years, 2 months ago (2014-09-29 10:24:32 UTC) #50

burnik

Addressed remaining comment from previous patchset. https://codereview.chromium.org/499233003/diff/240001/content/renderer/speech_recognition_dispatcher.cc File content/renderer/speech_recognition_dispatcher.cc (right): https://codereview.chromium.org/499233003/diff/240001/content/renderer/speech_recognition_dispatcher.cc#newcode290 content/renderer/speech_recognition_dispatcher.cc:290: void SpeechRecognitionDispatcher::ResetAudioSourceProvider() { ...

6 years, 2 months ago (2014-09-29 10:38:12 UTC) #51

henrika (OOO until Aug 14)

LGTM w/ nits ;-) https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/speech_recognition_audio_sink.cc File content/renderer/media/speech_recognition_audio_sink.cc (right): https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/speech_recognition_audio_sink.cc#newcode36 content/renderer/media/speech_recognition_audio_sink.cc:36: // Buffer index for sync ...

6 years, 2 months ago (2014-09-29 10:38:41 UTC) #52

burnik

Updated CL description and comments. https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/speech_recognition_audio_sink.cc File content/renderer/media/speech_recognition_audio_sink.cc (right): https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/speech_recognition_audio_sink.cc#newcode36 content/renderer/media/speech_recognition_audio_sink.cc:36: // Buffer index for ...

6 years, 2 months ago (2014-09-29 12:07:31 UTC) #54

Updated CL description and comments.

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_sink.cc (right):

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.cc:36: // Buffer index for
sync with client is |params.size| on the shared memory.
On 2014/09/29 10:38:41, henrika wrote:
> Odd language here as well. Not clear to me what you mean. Can you rewrite? ..
is
> ... on the ... 
Done.
// Peer's buffer index is accessed via |params.size| in shared memory.

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.cc:42: // Client must
manage his own counter and reset it.
s/client/peer/g

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.cc:72: // Purposely only
support tracks from an audio device. Dissallow WebAudio.
On 2014/09/29 10:38:41, henrika wrote:
> Dissallow WebAudio? Does it mean that it is not supported or what?

No. Just dissallowed as documented (abuse mitigation).

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.cc:83: // We need detach
the thread here because it will be a new capture thread
On 2014/09/29 10:38:41, henrika wrote:
> nit, 'need to' or must detach perhaps.

Detach the thread here because it will be a new capture thread ...

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.cc:95: static const int
kNumberOfBuffersInFifo = 2;
On 2014/09/29 10:38:41, henrika wrote:
> How was 2 chosen. What would happen if it was 20 instead?

The constant 2 here is a minimum integer number of buffers which can allow
delays.
This in fact means that if we need to accumulate 100 ms of data before
releasing, we can actually store 200ms at any moment. so this is a 100%
overhead.

If it were 20, we would use up more heap than we need - e.g. for desired 100 ms
for output, we could accumulate 2s of audio input. That would amount to 1900%
overhead.
This amount of delay would indicate a serious issue with performance on the
browser process and it wouldn't make sense to do anything other than stop
gathering audio input since it isn't going anywhere. For such a large delay, the
remote API would also disconnect the upstream connection, killing the SR
session. This case is handled in the OnData callback.

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_sink.h (right):

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.h:32: //
WebSpeechRecognizer increments the shared buffer index to synchronize.
On 2014/09/29 10:38:41, henrika wrote:
> Can you clarify? What is synchronized and how does it work?

The buffer indices are synchronized.
Detailed in design doc. http://goo.gl/9Ot3PC

https://codereview.chromium.org/499233003/diff/260001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.h:48: /* Socket ownership
is passed to here. */
On 2014/09/29 10:38:41, henrika wrote:
> 'to here' sounds odd. Can you rewrite?

Done:
*Socket ownership is transferred.*

jochen (gone - plz use gerrit)

lgtm with nits https://codereview.chromium.org/499233003/diff/280001/content/renderer/media/speech_recognition_audio_sink.h File content/renderer/media/speech_recognition_audio_sink.h (right): https://codereview.chromium.org/499233003/diff/280001/content/renderer/media/speech_recognition_audio_sink.h#newcode39 content/renderer/media/speech_recognition_audio_sink.h:39: SpeechRecognitionAudioSink(/* ExtraData reference is copied from ...

6 years, 2 months ago (2014-09-30 08:23:59 UTC) #55

burnik

One open questions for tommi/xians/jochen before next patchset. https://codereview.chromium.org/499233003/diff/280001/content/renderer/media/speech_recognition_audio_sink.h File content/renderer/media/speech_recognition_audio_sink.h (right): https://codereview.chromium.org/499233003/diff/280001/content/renderer/media/speech_recognition_audio_sink.h#newcode39 content/renderer/media/speech_recognition_audio_sink.h:39: SpeechRecognitionAudioSink(/* ...

6 years, 2 months ago (2014-09-30 09:57:47 UTC) #56

burnik

Quick follow up on the previous question. https://codereview.chromium.org/499233003/diff/280001/content/renderer/media/speech_recognition_audio_sink.h File content/renderer/media/speech_recognition_audio_sink.h (right): https://codereview.chromium.org/499233003/diff/280001/content/renderer/media/speech_recognition_audio_sink.h#newcode39 content/renderer/media/speech_recognition_audio_sink.h:39: SpeechRecognitionAudioSink(/* ExtraData ...

6 years, 2 months ago (2014-09-30 10:16:35 UTC) #57

jochen (gone - plz use gerrit)

I think that the parameter types make the ownership clear enough.

6 years, 2 months ago (2014-10-06 07:13:04 UTC) #58

tommi (sloooow) - chröme

agree with jochen re ownership documentation. The cl looks good but I'd like to chat ...

6 years, 2 months ago (2014-10-06 20:30:33 UTC) #59

burnik

How about it? :) https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/speech_recognition_audio_sink.cc File content/renderer/media/speech_recognition_audio_sink.cc (right): https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/speech_recognition_audio_sink.cc#newcode40 content/renderer/media/speech_recognition_audio_sink.cc:40: peer_buffer_index_ = &(buffer->params.size); On 2014/10/06 ...

6 years, 2 months ago (2014-10-07 09:01:09 UTC) #60

no longer working on chromium

https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/speech_recognition_audio_sink.cc File content/renderer/media/speech_recognition_audio_sink.cc (right): https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/speech_recognition_audio_sink.cc#newcode40 content/renderer/media/speech_recognition_audio_sink.cc:40: peer_buffer_index_ = &(buffer->params.size); On 2014/10/07 09:01:09, burnik wrote: > ...

6 years, 2 months ago (2014-10-07 09:46:19 UTC) #61

burnik

Removed the peer_buffer_index_ , using audio_input_buffer()->params.size. Also a bit of refactoring. https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/speech_recognition_audio_sink.cc File content/renderer/media/speech_recognition_audio_sink.cc (right): ...

6 years, 2 months ago (2014-10-07 15:05:48 UTC) #62

Removed the peer_buffer_index_ , using audio_input_buffer()->params.size.

Also a bit of refactoring.

https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/...
File content/renderer/media/speech_recognition_audio_sink.cc (right):

https://codereview.chromium.org/499233003/diff/300001/content/renderer/media/...
content/renderer/media/speech_recognition_audio_sink.cc:40: peer_buffer_index_ =
&(buffer->params.size);
On 2014/10/07 09:46:18, xians1 wrote:
> On 2014/10/07 09:01:09, burnik wrote:
> > On 2014/10/06 20:30:33, tommi wrote:
> > > instead of peer_buffer_index_ I'd rather want to use a pointer to the
shared
> > > memory.
> > > 
> > > You actually already have a pointer to that via shared_memory_, so instead
> of
> > > having multiple pointers to the same thing, what about having a private
> method
> > > like this:
> > > 
> > > media::AudioInputBuffer* audio_input_buffer() {
> > >   <dcheck correct calling thread>
> > >   <dcheck shared_memory_ is valid>
> > >   return static_cast<media::AudioInputBuffer*>(shared_memory_.memory());
> > > }
> > > 
> > > and then wherever you currently use peer_buffer_index_, use
> > > |audio_input_buffer()->params.size| instead.
> > 
> > What about:
> > 
> > http://crrev.com/526113002
> > 
> > struct AudioSharedBuffer {
> >   uint32 buffer_index;
> >   int8 audio[1];
> > };
> > 
> > media::AudioSharedBuffer* audio_shared_buffer() {
> >   <dcheck correct calling thread>
> >   <dcheck shared_memory_ is valid>
> >   return static_cast<media::AudioSharedBuffer*>(shared_memory_.memory());
> > }
> > 
> > and then use:
> > |audio_shared_buffer()->buffer_index|
> > 
> > ?
> > 
> > Sounds much cleaner than params.size. Also usable in the browser CL and I
> > believe in a few other places.
> > 
> 
> I don't think you should make this CL more complicated given that the
reviewers
> on that thread are not convinced by the approach.
> 
> I will suggest you do what Tommi proposed.

Done.

no longer working on chromium

https://codereview.chromium.org/499233003/diff/320001/content/renderer/media/speech_recognition_audio_sink.h File content/renderer/media/speech_recognition_audio_sink.h (right): https://codereview.chromium.org/499233003/diff/320001/content/renderer/media/speech_recognition_audio_sink.h#newcode65 content/renderer/media/speech_recognition_audio_sink.h:65: media::AudioInputBuffer* audio_input_buffer() const; nit, I am afraid hacker_style is ...

6 years, 2 months ago (2014-10-07 15:21:12 UTC) #63

burnik

https://codereview.chromium.org/499233003/diff/320001/content/renderer/media/speech_recognition_audio_sink.h File content/renderer/media/speech_recognition_audio_sink.h (right): https://codereview.chromium.org/499233003/diff/320001/content/renderer/media/speech_recognition_audio_sink.h#newcode65 content/renderer/media/speech_recognition_audio_sink.h:65: media::AudioInputBuffer* audio_input_buffer() const; On 2014/10/07 15:21:12, xians1 wrote: > ...

6 years, 2 months ago (2014-10-07 15:27:26 UTC) #64

jamesr

https://codereview.chromium.org/499233003/diff/360001/content/renderer/speech_recognition_dispatcher.h File content/renderer/speech_recognition_dispatcher.h (right): https://codereview.chromium.org/499233003/diff/360001/content/renderer/speech_recognition_dispatcher.h#newcode52 content/renderer/speech_recognition_dispatcher.h:52: blink::WebSpeechRecognizerClient*) override; not lgtm, you should never use override ...

6 years, 2 months ago (2014-10-08 19:14:11 UTC) #68

burnik

Good point, can't say why anyone had put it there earlier... https://codereview.chromium.org/499233003/diff/360001/content/renderer/speech_recognition_dispatcher.h File content/renderer/speech_recognition_dispatcher.h (right): ...

6 years, 2 months ago (2014-10-09 07:35:08 UTC) #70

burnik

Good point, can't say why anyone had put it there earlier...

6 years, 2 months ago (2014-10-09 07:35:14 UTC) #71

no longer working on chromium

https://codereview.chromium.org/499233003/diff/400001/content/content_renderer.gypi File content/content_renderer.gypi (right): https://codereview.chromium.org/499233003/diff/400001/content/content_renderer.gypi#newcode629 content/content_renderer.gypi:629: 'renderer/media/speech_recognition_audio_sink.cc', have you excluded these classed when enable_webrtc == ...

6 years, 2 months ago (2014-10-09 12:19:03 UTC) #73

burnik

Nits done. https://codereview.chromium.org/499233003/diff/400001/content/content_renderer.gypi File content/content_renderer.gypi (right): https://codereview.chromium.org/499233003/diff/400001/content/content_renderer.gypi#newcode629 content/content_renderer.gypi:629: 'renderer/media/speech_recognition_audio_sink.cc', On 2014/10/09 12:19:02, xians1 wrote: > ...

6 years, 2 months ago (2014-10-09 13:13:04 UTC) #75

jamesr

content/renderer lgtm https://codereview.chromium.org/499233003/diff/420001/content/renderer/speech_recognition_dispatcher.cc File content/renderer/speech_recognition_dispatcher.cc (right): https://codereview.chromium.org/499233003/diff/420001/content/renderer/speech_recognition_dispatcher.cc#newcode10 content/renderer/speech_recognition_dispatcher.cc:10: #if defined(ENABLE_WEBRTC) put the #ifdef block below ...

6 years, 2 months ago (2014-10-09 17:10:29 UTC) #77

burnik

Good point on the comment. Thx. :) https://codereview.chromium.org/499233003/diff/420001/content/renderer/speech_recognition_dispatcher.cc File content/renderer/speech_recognition_dispatcher.cc (right): https://codereview.chromium.org/499233003/diff/420001/content/renderer/speech_recognition_dispatcher.cc#newcode10 content/renderer/speech_recognition_dispatcher.cc:10: #if defined(ENABLE_WEBRTC) ...

6 years, 2 months ago (2014-10-09 20:03:35 UTC) #81

commit-bot: I haz the power

CQ is trying da patch. Follow status at https://chromium-cq-status.appspot.com/patch-status/499233003/480001

6 years, 2 months ago (2014-10-09 20:47:25 UTC) #83

commit-bot: I haz the power

Patchset 21 (id:??) landed as https://crrev.com/2eeb46675144086a100ce6497f4b8ac5e56fcdb1 Cr-Commit-Position: refs/heads/master@{#298981}

6 years, 2 months ago (2014-10-09 21:31:35 UTC) #85

burnik

burnik@chromium.org changed reviewers: - henrika@chromium.org, jamesr@chromium.org, jochen@chromium.org, kenrb@chromium.org, tommi@chromium.org

6 years, 2 months ago (2014-10-10 09:59:14 UTC) #88

Please review the fix.

Patchset #22 (id:670001) has been deleted

Issue 499233003: Binding media stream audio track to speech recognition [renderer] (Closed)

Description

Patch Set 1 #

Patch Set 2 : style fix #

Patch Set 3 : SyncSocket implementation + refactoring #

Patch Set 4 : Platform checks removed from dispatcher #

Patch Set 5 : Add unit test and refactor #

Patch Set 6 : Refactoring on callbacks and error states #

Patch Set 7 : Refactoring unit test and source provider, moved to media #

Patch Set 8 : SyncSocket leak and FIFO fixes. Test 8-192KHz for input. #

Patch Set 9 : Refactoring, error states, more comments. #

Patch Set 10 : s/SpeechRecognitionAudioSourceProvider/SpeechRecognitionAudioSink/ #

Patch Set 11 : Change error type for unsupported tracks #

Patch Set 12 : Nits, comments, refactoring, rebasing. #

Patch Set 13 : Comments + bugfix #

Patch Set 14 : Unit test nits + ctor comments #

Patch Set 15 : Remove peer_buffer_index_ #

Patch Set 16 : s/audio_input_buffer/GetAudioInputBuffer/ #

Patch Set 17 : Add ENABLE_WEBRTC flag checks #

Patch Set 18 : Remove override for blink impl #

Patch Set 19 : Rebase on master - merge fix #

Patch Set 20 : s/OnSharedAudioBusReady/OnAudioReceiverReady/ + nits #

Patch Set 21 : Nits done. Preland checks. #

Messages