Archive for the 'Audio' Category

Published by alax on 01 Jan 2006

Coctail Party Effect

A review of the cocktail party effect

Published by alax on 27 Dec 2005

Speech Codecs Input/Output Summary

The table below shows codecs features:

Codec Frame Length Input Output
AMR GSM 06.90 20 ms 8 KHz, 16 bit, Mono Bitrates: 4750, 5150, 5900, 6700, 7400, 7950, 10200, 12200; a discontinuous transmission mode (DTX) available with 1750 bps non-speech frames
G.726 8 ms 8 KHz, 16 bit, Mono (a-law, μ-law native support possible) Bitrates: 16000, 24000, 32000, 40000

Published by alax on 25 Dec 2005

More Speech Codex

By the way, packaging more speech codecs into DMO interface is expected to come soon. With a certain luck it would go even farther to video coding…

To appear:

  • AMR-WB G.722.2
  • G.711
  • G.722
  • G.722.1
  • G.723.1
  • G.726
  • G.728
  • G.729
  • GSM 06.10

Already:

I would also need to make a summary table on input/output formats, bitrates etc.

Chances are that the following are also to be put into DMO:

  • Echo Canceller

Published by alax on 24 Dec 2005

AMR GSM 06.90 DMO

The mix of the technologies works very well.

  • MS VC++.NET 2003, ATL, WTL - for development environment
  • MS DirectShow of DirectX 9 - for multimedia infrastructure
  • Intel Integrated Performance Primitives 5.0 - for standard code implementation base

All together - AMR GSM 06.90 Speech Codec DirectX Media Object (DMO) as shown below:

Alax.Info AMR Objects in GraphEdit
Runtime requirements include:

  • MS CRT runtime msvcr71.dll redistributable
  • Intel IPP 5 redistributables, including:
    • at the very least - libguide40.dll, ippcore.dll, ipps.dll, ippsc.dll
    • one or more per processor type sets, eg. - ippspx.dll, ippscpx.dll

Published by alax on 21 Dec 2005

AMR Speech DMO

So, DirectX Media Objects (DMO) appeared to be a nice interface to extend DirectShow functionality. I could quite easily create encoder/decoder pair of DMOs for AMR speech coding (GSM 06.90). Reasonable ease and development speed. I will probably even prepare an informational version and put here on website. It is likely to contain forced silence intevals because it’s a licensed stuff and can’t be distributed freely.

Published by alax on 19 Dec 2005

DirectX Media Objects

How could it happen that an expreienced DirectShow guy only recently discovered DirectX Media Objects (DMO)? And the idea is really nice. Definitely got to check this at work with some audio processing.

Published by alax on 13 Dec 2005

3D Audio

Undoubtedly from the very beginning, the idea of sound source localization is far from being new. However, while it’s brewing up in the head it gives more and more keywords and finally I got a lot of theoretical and practical findings in this area. Still the Q is if there is any practical progress, something easy to use to see the effectiveness.

« Prev - Next »