Speech v3 - MSSDK - Speech to Text Custom Apr 18, 2013 01:34 PM
Hi,
Has anyone else tried using the new custom S2T functionality of the version 3 SDK? I just downloaded the new .dll file and included it in my project, but I don't see a SpeechToTextCustom method anywhere in the RequestFactory class. Has the SDK been updated to include this new method? If so, where can I find it?
I find that AT&T's documentation lacks quite a bit of information. It doesn't go into very much detail about what the properties actually mean or do. Does anyone know of better documentation somewhere?
Thanks,
Nick
'Re: Speech v3 - MSSDK - Speech to Text Custom' Apr 19, 2013 08:59 AM
Hi Brett,
Thanks for your reply. Is there an open source version of the SDK? I think that would make community contributions much easier, as well as writing custom methods that call the API.
'Re: Speech v3 - MSSDK - Speech to Text Custom' Apr 19, 2013 11:33 AM
Some additional thoughts on the documentation portion of this thread. Remember that the AT&T Developer API is, at its core, a publicly exposed set of RESTful web services. The SDKs are primarily platform-specific wrappers around these platform-neutral web services. Therefore, you can always find fairly detailed information on any API in the base documentation (not necessarily specific to any particular platform or SDK). For Speech v3, it's here: http://developer.att.com/developer/basicTemplate.jsp?passedItemId=13100102&api=Speech&version=3&method=&provider=.
Since this discussion is about the MS SDK, which is built on top of the .NET Framework, you can use an IL disassembler (whichever one you prefer) to look into the disassembled source code. This is an open window into the existing SDK code. Furthermore, the RequestFactory class, which I'm just holding up as an example for this particular case, is a public class. Therefore, if you ever find yourself not wanting to wait for the next SDK release that promises to contain a particular feature, you can implement your own subclass and/or write .NET extension methods (http://msdn.microsoft.com/en-us/library/vstudio/bb383977.aspx).
'Re: Speech v3 - MSSDK - Speech to Text Custom' Jun 12, 2013 01:32 PM
Hi David,
I too have tried to use speechToTextCustom (Speech v3), using a simple HTTPS POST via Ajax, and I get an HTTP 502 error. My request headers seem OK. They look something like this.
'Re: Speech v3 - MSSDK - Speech to Text Custom' Jun 12, 2013 02:22 PM
Hi David,
Thanks -- I'll look to see if Titanium httpClient has problems with posting over HTTPS. I was assuming the request line showing the endpoint was hidden from the network sniffer because of HTTPS. When I switch to plain HTTP I see the endpoint (but get a 503 response). Note the Request Line....
Anyway, yes, I've raised a ticket. I've been using the plain SpeechToText method with no issues using the Titanium module (it's just that now I want multi-language support and grammarFile support).
'Re: Speech v3 - MSSDK - Speech to Text Custom' Jun 12, 2013 02:24 PM
Hi Eduardo,
Another thing to check is whether you are truly handling the audio as binary data. It could be a character-encoding issue, so you need to make sure you read and write the audio data without any character encoding applied (i.e., it should be treated as binary). I believe you would want the readAsBinaryString and sendAsBinary methods.
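To illustrate the encoding point, here is a minimal sketch in Python (chosen only for illustration; it is not Titanium or SDK code, and the bytes are made up rather than real audio). It shows that pushing binary data through a text decoder and back does not round-trip, while keeping it as raw bytes does.

```python
# Illustrative only: made-up bytes standing in for an audio payload.
audio = bytes([0x52, 0x49, 0x46, 0x46, 0x80, 0xFF, 0x00, 0xC3, 0x28])

# Problem case: the payload is treated as text somewhere along the way.
# Bytes that are not valid UTF-8 are replaced with U+FFFD, so the
# original byte values cannot be recovered.
as_text = audio.decode("utf-8", errors="replace")
corrupted = as_text.encode("utf-8")

# Safe case: the payload stays a byte sequence from read to request body.
intact = bytes(audio)

print(corrupted == audio)  # False: the text round trip changed the bytes
print(intact == audio)     # True: raw bytes are unchanged
```

If the bytes that reach the server differ from the bytes in the file, the service has no way to decode the audio, which is one plausible cause of a failed request.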
'Re: Speech v3 - MSSDK - Speech to Text Custom' Jun 12, 2013 02:54 PM
Thanks for the tip Brett!
Also, re: the service endpoint: after turning on the SSL Proxying feature in the network sniffer (Charles), the endpoint does indeed show up.
Will look into encoding. I believe I do encode to base64 prior to sending. I have also tried adding a request header "Content-Transfer-Encoding: binary" -- to no avail...
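On the base64 point, a small sketch in Python (illustrative only, with made-up bytes rather than real audio) of how a base64 body differs from a raw binary body. It assumes, following the suggestion earlier in the thread, that the service expects the raw audio bytes in the request body; that is worth confirming against the API documentation.

```python
import base64

# Illustrative only: 1024 made-up bytes standing in for an audio payload.
raw = bytes(range(256)) * 4

# base64 turns every 3 input bytes into 4 ASCII characters.
encoded = base64.b64encode(raw)

print(len(raw))                          # 1024
print(len(encoded))                      # 1368
print(encoded == raw)                    # False: the body is no longer the audio itself
print(base64.b64decode(encoded) == raw)  # True: only a decoding step restores it
```

A header such as Content-Transfer-Encoding only labels the body; it does not convert it. If the server does not base64-decode the body, it would receive ASCII text where it expects audio.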