public class DiarizationEngine extends VoiceSdkNativePeer
This class serves as a gateway to the native Voice SDK implementation and allocates resources on the native heap.
To release the allocated memory, invoke AutoCloseable.close() when the instance is no longer needed.
Any method that delegates to a native call may throw VoiceSdkEngineException.
Fields inherited from class VoiceSdkNativePeer: nativeId
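The close() requirement in the class description fits the try-with-resources pattern. The sketch below is only an illustration of that pattern: NativePeerStandIn and CloseDemo are hypothetical names, not part of the Voice SDK, and the stand-in merely records whether close() was called. With the real class, the resource declaration would presumably be a DiarizationEngine constructed from initDataPath.

```java
// Hypothetical stand-in for a class that owns native resources.
// It is NOT part of the Voice SDK; it only records whether close() ran.
final class NativePeerStandIn implements AutoCloseable {
    private boolean released = false;

    boolean isReleased() {
        return released;
    }

    // Placeholder for a method that would delegate to native code.
    int process(short[] samples) {
        if (released) {
            throw new IllegalStateException("peer already closed");
        }
        return samples.length;
    }

    @Override
    public void close() {
        released = true; // a real peer would free native memory here
    }
}

final class CloseDemo {
    // Uses the peer inside try-with-resources so close() runs even if
    // process() throws, then returns the peer so callers can inspect it.
    static NativePeerStandIn useAndClose(short[] samples) {
        NativePeerStandIn peer = new NativePeerStandIn();
        try (NativePeerStandIn p = peer) {
            p.process(samples);
        }
        return peer;
    }
}
```

The point of the pattern is that release happens on every exit path, including when a native call throws, so the native heap allocation does not depend on garbage collection or finalize.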
Constructor and Description |
---|
DiarizationEngine(java.lang.String initDataPath): Sole constructor. |
Modifier and Type | Method and Description |
---|---|
DiarizationResult | getSegmentation(byte[] pcm16Bytes, int sampleRate): Performs diarization on audio bytes provided in PCM16 format. |
DiarizationResult | getSegmentation(byte[] pcm16Bytes, int sampleRate, int numSpeakers): Performs diarization on audio bytes provided in PCM16 format. |
DiarizationResult | getSegmentation(float[] floatSamples, int sampleRate): Performs diarization on audio provided in normalized float format. |
DiarizationResult | getSegmentation(float[] floatSamples, int sampleRate, int numSpeakers): Performs diarization on audio provided in normalized float format. |
DiarizationResult | getSegmentation(short[] pcm16Samples, int sampleRate): Performs diarization on audio provided in PCM16 format. |
DiarizationResult | getSegmentation(short[] pcm16Samples, int sampleRate, int numSpeakers): Performs diarization on audio provided in PCM16 format. |
DiarizationResult | getSegmentation(java.lang.String audioFile): Performs diarization on an audio file. |
DiarizationResult | getSegmentation(java.lang.String audioFile, int numSpeakers): Performs diarization on an audio file. |
protected DiarizationResult | getSegmentation1(byte[] pcm16Bytes, int sampleRate, int numSpeakers) |
protected DiarizationResult | getSegmentation2(short[] pcm16Samples, int sampleRate, int numSpeakers) |
protected DiarizationResult | getSegmentation3(float[] floatSamples, int sampleRate, int numSpeakers) |
protected DiarizationResult | getSegmentation4(java.lang.String audioFile, int numSpeakers) |
protected static long | init(java.lang.String initDataPath) |
protected void | release(): Releases resources on the native layer (indirectly using nativeId). |
Methods inherited from class VoiceSdkNativePeer: close, equals, finalize, hashCode
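The byte[] overloads expect little-endian PCM16 bytes, while the short[] overloads take PCM16 samples directly. When audio arrives in one form and a caller wants the other, the conversion can be sketched with java.nio.ByteBuffer as below. Pcm16Bytes and its two methods are hypothetical helper names introduced for illustration; they are not part of the SDK.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical helper (not part of the Voice SDK) for converting between
// PCM16 samples and their little-endian byte representation.
final class Pcm16Bytes {

    // Packs each 16-bit sample as two bytes, least significant byte first.
    static byte[] toLittleEndianBytes(short[] samples) {
        ByteBuffer buffer = ByteBuffer.allocate(samples.length * 2)
                .order(ByteOrder.LITTLE_ENDIAN);
        for (short sample : samples) {
            buffer.putShort(sample);
        }
        return buffer.array();
    }

    // Reads pairs of bytes back into 16-bit samples.
    static short[] fromLittleEndianBytes(byte[] bytes) {
        if (bytes.length % 2 != 0) {
            throw new IllegalArgumentException(
                    "PCM16 data must contain an even number of bytes");
        }
        short[] samples = new short[bytes.length / 2];
        ByteBuffer.wrap(bytes)
                .order(ByteOrder.LITTLE_ENDIAN)
                .asShortBuffer()
                .get(samples);
        return samples;
    }
}
```

Setting the byte order explicitly matters because ByteBuffer defaults to big-endian, which would swap the two bytes of every sample.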
public DiarizationEngine(java.lang.String initDataPath)
Parameters:
initDataPath - full path to the directory with SDK data
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(java.lang.String audioFile)
Parameters:
audioFile - Path to the audio file
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(java.lang.String audioFile, int numSpeakers)
Parameters:
audioFile - Path to the audio file
numSpeakers - max number of speakers to detect
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(float[] floatSamples, int sampleRate)
Parameters:
floatSamples - Array of float audio samples (in [-1, 1] range)
sampleRate - Sample rate of the audio, Hz
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(float[] floatSamples, int sampleRate, int numSpeakers)
Parameters:
floatSamples - Array of float audio samples (in [-1, 1] range)
sampleRate - Sample rate of the audio, Hz
numSpeakers - max number of speakers to detect
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(short[] pcm16Samples, int sampleRate)
Parameters:
pcm16Samples - Array of PCM16 audio samples
sampleRate - Sample rate of the audio, Hz
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(short[] pcm16Samples, int sampleRate, int numSpeakers)
Parameters:
pcm16Samples - Array of PCM16 audio samples
sampleRate - Sample rate of the audio, Hz
numSpeakers - max number of speakers to detect
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(byte[] pcm16Bytes, int sampleRate)
Parameters:
pcm16Bytes - Array of little-endian PCM16 audio bytes
sampleRate - Sample rate of the audio, Hz
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

public DiarizationResult getSegmentation(byte[] pcm16Bytes, int sampleRate, int numSpeakers)
Parameters:
pcm16Bytes - Array of little-endian PCM16 audio bytes
sampleRate - Sample rate of the audio, Hz
numSpeakers - max number of speakers to detect
Returns:
DiarizationResult
Throws:
VoiceSdkEngineException - wraps native exceptions

protected static long init(java.lang.String initDataPath)

protected DiarizationResult getSegmentation4(java.lang.String audioFile, int numSpeakers)

protected DiarizationResult getSegmentation3(float[] floatSamples, int sampleRate, int numSpeakers)

protected DiarizationResult getSegmentation2(short[] pcm16Samples, int sampleRate, int numSpeakers)

protected DiarizationResult getSegmentation1(byte[] pcm16Bytes, int sampleRate, int numSpeakers)

protected void release()
release in class VoiceSdkNativePeer
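The float[] overloads of getSegmentation expect samples in the [-1, 1] range. If the source audio is PCM16, one common way to produce such samples is to divide each value by 32768. The sketch below assumes that convention; the reference does not state which scale the SDK uses internally, and Pcm16ToFloat is a hypothetical helper name, not an SDK class.

```java
// Hypothetical helper (not part of the Voice SDK) that maps PCM16 samples
// to floats in [-1, 1) by dividing by 32768, a common but assumed convention.
final class Pcm16ToFloat {

    static float[] normalize(short[] samples) {
        float[] normalized = new float[samples.length];
        for (int i = 0; i < samples.length; i++) {
            // Short.MIN_VALUE (-32768) maps to exactly -1.0f;
            // Short.MAX_VALUE (32767) maps to just below 1.0f.
            normalized[i] = samples[i] / 32768.0f;
        }
        return normalized;
    }
}
```

Dividing by 32768 rather than 32767 keeps every output inside the documented range, at the cost of the positive peak never reaching exactly 1.0.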