gid-embedplda-v1 (Gender Identification)
Version Changelog
Plugin Version | Change |
---|---|
v1.0.0 | Initial plugin release, shares models with gdd-embedplda-v1 released with 5.4.0 |
v1.0.1 | Code streamlining and minor bug fixes, released with 5.5.0 |
Description
Gender Identification (GID) plugins attempt to identify the gender of the talker in the audio submitted to the plugin for scoring. The goal is to distinguish between male and female speakers as a quick triage of incoming data, or to serve as a front-end for other plugins that may have a gender-specific workflow.
This plugin is powered by the PLDA Embeddings models originally released with gdd-embedplda-v1.
Domains
- multi-v1
- Generic domain for most close talking conditions with signal-to-noise ratio above 10 dB.
Inputs
Audio file or buffer and an optional identifier.
Outputs
Gender ID plugins report a score for each gender, in the format shown below. As with LID, scores are log-likelihood ratios, where a score greater than the default threshold of "0" is considered to be a detection.
input-audio.wav Female -3.212653
input-audio.wav Male 5.40590000
Functionality (Traits)
The functions of this plugin are defined by its Traits and implemented API messages. A list of these Traits is below, along with the corresponding API messages for each. Click the message name below to go to additional implementation details below.
- GLOBAL_SCORER – Score all submitted audio, returning a single score for the entire audio segment for each gender.
Compatibility
OLIVE 5.4+
Limitations
Labeling Granularity
GID plugins assume that an audio segment contains only a single gender and may be scored as a unit. If a segment contains multiple speakers who may or may not be of a different gender, the entire segment will still be scored as a unit. Ensuring that audio submitted for scoring consists of a single speaker is important for reliable results.
Age
All current Gender ID plugins are trained only on adult male and female speech. Speech from children will likely be confused with female speech, or be otherwise volatile.
Interface
For command line interface use see the appropriate section of the OLIVE CLI User Guide. For API usage see the appropriate section of the OLIVE Application Programming Interface Guide.