Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction

Q: What is the best open-source implementation of "Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction"?

The best maintained implementation is haoqiu-yan/perceptiveagent with 50 stars on GitHub. Confidence: high. Reproducibility: Limited.

Q: How reproducible is "Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction"?

Estimated time to first reproduction: a few days. Risk flags: no CI workflows detected; dependency manifest is missing. Start with haoqiu-yan/perceptiveagent and validate the setup instructions in its README.

Q: Are there pretrained models available for "Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction"?

No paper-linked pretrained models were found. 3 related Hugging Face models were surfaced instead; the first listed is prithivMLmods/Phi-4-Empathetic with 56 downloads.

Q: What framework is used to implement "Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction"?

The primary implementation uses PyTorch.

Published: Jun 1, 2024

Best maintained implementation now

Evidence: Direct

Domain fit: AI-adjacent

Verified repos: 2

Top repo stars: 50

Paper appears method- or tooling-adjacent to AI workflows with partial ecosystem coverage.

Framework: PyTorch

Time to first repro: a few days

2 risk flags

arXiv PDF

Technical details

Canonical key: arxiv-2406.12707

Cache status: Fresh

Generated at: Mar 14, 2026, 6:35 AM

Artifact coverage: direct

HF provider: ok (token)

PWC source used: Yes

LLM status: ready

LLM model: openai/gpt-5.1-20251113

LLM generated: Mar 13, 2026, 5:27 AM

LLM content type: researcher_benchmark_brief

HF policy: hf-relevance-v27

LLM evidence refs: evidencePack.paperSections[id=paper_17], evidencePack.paperSections[id=paper_18], evidencePack.paperSections[id=paper_caption_4], evidencePack.paperSections[id=paper_caption_5], evidencePack.paperSections[id=paper_caption_1], guidance.riskFlags[0], guidance.riskFlags[1], repos[0].fullName, researcherSummary.hardwareNotes[0], researcherSummary.timeToFirstMeaningfulRun, evidencePack.paperSections[id=paper_caption_3], evidencePack.paperSections[id=paper_23], researcherSummary.benchmarkSnapshot[0], paper.title, summary.hasReliableImplementation

Researcher verdict

Recommended implementation path available

implementation baseline

Benchmark trust: thin evidence

This page has evidence-backed benchmark findings and a concrete implementation recommendation anchored on haoqiu-yan/perceptiveagent. Use it as an implementation baseline, then validate benchmark parity before adapting it.

Why this page is still worth reading

A concrete repository path exists via haoqiu-yan/perceptiveagent, so this page can act as a practical starting point.
Reproduction risks are surfaced explicitly, which helps decide whether the paper is worth immediate prototyping.

Benchmark trust

Some benchmark signal exists in the extracted evidence, but it is not structured strongly enough yet for a confident benchmark decision.

Use this page as

Start here when you need the most practical implementation path quickly.

Results & Benchmarks

Freshness tier: hot

Direct + Inferred Evidence

Empathetic Dialogue Perceptible Acoustic Reception Reaction

EXPRESSO

Accuracy

87.9

Source: paper fulltext

Benchmark evidence drill-down

1 finding

Audit each benchmark finding before selecting an implementation path. Evidence refs map to the disclosure section below.

Task	Dataset	Metric	Value	Source	Evidence refs
Empathetic Dialogue Perceptible Acoustic Reception Reaction	EXPRESSO	Accuracy	87.9	paper-derived	No explicit refs
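One way to make the parity check concrete is to compare a reproduced score against the value in the table with an explicit tolerance. The sketch below is illustrative only: the function name and the 1.0-point default tolerance are assumptions, not something the paper or repository specifies.

```python
REPORTED_EXPRESSO_ACCURACY = 87.9  # value listed in the benchmark table


def within_tolerance(reported: float, reproduced: float, tol: float = 1.0) -> bool:
    """Return True if the reproduced score is within `tol` absolute points.

    `tol` is a hypothetical threshold; choose one that matches the
    run-to-run variance you observe.
    """
    return abs(reported - reproduced) <= tol


print(within_tolerance(REPORTED_EXPRESSO_ACCURACY, 87.2))  # True
print(within_tolerance(REPORTED_EXPRESSO_ACCURACY, 84.0))  # False
```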

PerceptiveAgent, the system introduced in Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction, is the primary contribution described in this paper.

Use This Implementation Because…

Confidence: high

haoqiu-yan/perceptiveagent is the strongest maintained implementation based on ranking signals. License is declared (Apache-2.0).

Reproduction Risks

No CI workflows detected
Dependency manifest is missing

Hardware Notes

Expect multi-day setup/compute for meaningful reproduction based on current guidance.

Evidence disclosure

Evidence graph: 4 refs, 4 links.

Utility signals: depth 100/100, grounding 95/100, status high.

Implementation Comparison

Top 3 paths

Compare maintenance quality, reproducibility coverage, and evidence confidence before choosing a reproduction baseline.

haoqiu-yan/perceptiveagent

best maintained

Maintenance: Stale

Confidence: High

Reproducibility: Limited

Official implementation from Papers with Code · Repository link is mentioned in the paper metadata

Stars: 50
Last push: Aug 6, 2024 (585d ago)

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

youngsheen/gpst

alternative

Maintenance: Stale

Confidence: Low

Reproducibility: Limited

Community adoption signal (69 stars)

Stars: 69
Last push: Nov 1, 2024 (498d ago)

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases

Haoqiu-Yan/PerceptiveAgent (same repository as haoqiu-yan/perceptiveagent; GitHub paths are case-insensitive)

alternative

Maintenance: Stale

Confidence: Medium

Reproducibility: Limited

Matched via arXiv identifier search · Strong overlap with paper title keywords

Stars: 50
Last push: Aug 6, 2024 (585d ago)

Risk flags

No push in 12+ months
No CI pipeline detected
No tagged releases
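The "days ago" figures in the comparison can be checked against the page's generation date (Mar 14, 2026). The sketch below does that arithmetic; the 365-day staleness threshold is an assumption standing in for the "No push in 12+ months" flag, not a documented rule.

```python
from datetime import date

GENERATED_ON = date(2026, 3, 14)  # "Generated at" date from the technical details


def days_since(last_push: date, today: date = GENERATED_ON) -> int:
    """Number of whole days between the last push and the reference date."""
    return (today - last_push).days


def is_stale(last_push: date, threshold_days: int = 365) -> bool:
    """Hypothetical staleness rule: no push for more than `threshold_days`."""
    return days_since(last_push) > threshold_days


print(days_since(date(2024, 8, 6)))   # 585 (haoqiu-yan/perceptiveagent)
print(days_since(date(2024, 11, 1)))  # 498 (youngsheen/gpst)
print(is_stale(date(2024, 8, 6)))     # True
```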

Paper summary

AI-generated

AI-generated summary grounded in paper metadata and artifact signals.

PerceptiveAgent implements an empathetic dialogue pipeline that combines a speech captioner trained on TextrolSpeech with an MSMA-Synthesizer vocoder trained on EXPRESSO, LJSpeech, and VCTK to generate affectively aligned acoustic responses. This page includes benchmark evidence for empathetic dialogue acoustic response generation on EXPRESSO. Reproduction guidance focuses on implementation viability and concrete risk controls.

Key contributions

PerceptiveAgent implements an empathetic dialogue pipeline that combines a speech captioner trained on TextrolSpeech with an MSMA-Synthesizer vocoder trained on EXPRESSO, LJSpeech, and VCTK to generate affectively aligned acoustic responses.
The MSMA-Synthesizer is designed to preserve speaking style factors such as emotion, speed, energy, gender, and pitch, and is evaluated using accuracy on EXPRESSO read and conversation subsets and F0 Frame Error.
PerceptiveAgent’s performance is decomposed into cognitive empathy measured with BERTScore on linguistic content and affective empathy measured with accuracy on generated acoustic responses.
The speech captioner shows attribute-dependent performance differences across genders, with notably lower F1-scores for female speech on pitch and speed, indicating potential bias or reduced robustness.
Ablation results for the MSMA-Synthesizer indicate that removing individual style factors such as speed, energy, gender, or pitch sharply reduces accuracy on EXPRESSO read and conversation subsets, which suggests that each factor contributes to the result.
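F0 Frame Error, mentioned above as one of the synthesizer metrics, is commonly defined as the fraction of frames with either a voicing decision error or a gross pitch error (a deviation above 20% of the reference F0). The sketch below follows that common definition; the paper's exact frame alignment and threshold are not stated on this page, so treat the function name, the 0-means-unvoiced convention, and the default tolerance as assumptions.

```python
def f0_frame_error(ref_f0, pred_f0, rel_tol=0.2):
    """Fraction of frames with a voicing error or a gross pitch error.

    Both inputs are equal-length sequences of per-frame F0 values in Hz,
    where 0 marks an unvoiced frame (an assumed convention).
    """
    if len(ref_f0) != len(pred_f0):
        raise ValueError("F0 tracks must have the same number of frames")
    if len(ref_f0) == 0:
        raise ValueError("F0 tracks must not be empty")

    errors = 0
    for ref, pred in zip(ref_f0, pred_f0):
        ref_voiced, pred_voiced = ref > 0, pred > 0
        if ref_voiced != pred_voiced:
            errors += 1  # voicing decision error
        elif ref_voiced and abs(pred - ref) > rel_tol * ref:
            errors += 1  # gross pitch error
    return errors / len(ref_f0)


reference = [0.0, 100.0, 200.0, 150.0]
predicted = [0.0, 105.0, 260.0, 0.0]
print(f0_frame_error(reference, predicted))  # 0.5
```

In the example, frame 3 exceeds the 20% tolerance and frame 4 flips from voiced to unvoiced, so two of four frames count as errors.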

Implementation guidance

Use haoqiu-yan/perceptiveagent first because deterministic ranking and extracted evidence align on implementation viability. Start with the repo setup path, then validate benchmark reproduction before adaptation.

Reproducibility notes

Environment setup may fail or produce inconsistent behavior because there is no dependency manifest or CI to pin library versions.
Vocoder pretraining on EXPRESSO, LJSpeech, and VCTK is compute-intensive and, if undertrained, can degrade downstream empathy metrics.
Misconfiguration of Vicuna or llama_model paths in the capspt_train configuration can prevent the language model component from loading correctly.
Differences in preprocessing or dataset splits for EXPRESSO or TextrolSpeech can lead to benchmark scores that diverge from the reported accuracy and BERTScore.
Ablation experiments may be irreproducible if style conditioning flags are not toggled exactly as in the MSMA-Synthesizer evaluation setup.
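The path misconfiguration risk noted above can be caught before training starts by checking that every configured model path exists. The sketch below assumes the configuration has already been loaded into a nested dict; the layout shown and the helper names are hypothetical, and only the `llama_model` key name comes from the notes above.

```python
from pathlib import Path


def find_key(config, key):
    """Yield every value stored under `key` anywhere in a nested dict/list."""
    if isinstance(config, dict):
        for name, value in config.items():
            if name == key:
                yield value
            yield from find_key(value, key)
    elif isinstance(config, list):
        for item in config:
            yield from find_key(item, key)


def missing_paths(config, key="llama_model"):
    """Return the configured paths under `key` that do not exist on disk."""
    return [str(p) for p in find_key(config, key) if not Path(str(p)).exists()]


# Hypothetical config layout, for illustration only.
example_config = {"model": {"llama_model": "/definitely/not/a/real/path/vicuna"}}
print(missing_paths(example_config))  # ['/definitely/not/a/real/path/vicuna']
```

Running a check like this at startup turns a late model-loading failure into an immediate, readable error.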

Best implementation now

haoqiu-yan/perceptiveagent

Confidence: High

Reproducibility: Limited

Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24)

Stars: 50

Forks: 3

Last push: Aug 6, 2024

License: Apache-2.0

Official implementation from Papers with Code

Repository link is mentioned in the paper metadata

Strong overlap with paper title keywords

Community adoption signal (50 stars)

License ✓

CI –

Deps –

Docker –

Selected haoqiu-yan/perceptiveagent as the strongest maintained implementation for new work.
Repository activity is within the last 24 months.

Reproduction path

Direct

Follow the direct implementation path

1. Start with haoqiu-yan/perceptiveagent and validate setup instructions in README.
2. Reproduce the baseline result with the provided defaults before modifying hyperparameters.
3. Log exact dependency versions and runtime environment for reproducibility.
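Because the repository has no dependency manifest, the logging step can be done with a small standard-library script that records the interpreter, platform, and installed package versions. This is a minimal sketch: the package list is an assumption (the page only states that the implementation uses PyTorch), and `snapshot_environment` is a hypothetical helper name.

```python
import json
import platform
from importlib import metadata


def snapshot_environment(packages):
    """Collect interpreter, platform, and package versions into a dict."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None  # not installed in this environment
    return {
        "python": platform.python_version(),
        "platform": platform.platform(),
        "packages": versions,
    }


# Assumed package list; extend it with whatever the repo actually imports.
print(json.dumps(snapshot_environment(["torch", "numpy"]), indent=2))
```

Saving this JSON next to each run's results makes it possible to trace a later metric difference back to a changed library version.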

Time to first repro: a few days

No CI workflows detected

Dependency manifest is missing

Additional implementations

Official

No additional official repositories detected.

Community

Haoqiu-Yan/PerceptiveAgent
Confidence: Medium

Code for Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction (ACL24)

Stars: 50

Last push: Aug 6, 2024

License: Apache-2.0

Possible but unverified matches (1)

These repositories had low-confidence matching signals and are hidden by default.

youngsheen/gpst

Confidence: Low

Stars: 69

Hugging Face artifacts

No direct paper-linked artifacts were found. Showing strongest curated related artifacts for faster exploration.

Models

prithivMLmods/Phi-4-Empathetic

Curated Related

Downloads: 56

Likes: 12

mradermacher/Phi-4-Empathetic-i1-GGUF

Curated Related

Downloads: 91

Likes: 3

mradermacher/Phi-4-Empathetic-GGUF

Curated Related

Downloads: 65

Likes: 4

Datasets

facebook/empathetic_dialogues

Curated Related

Downloads: 4,474

Likes: 128

Updated: Jan 18, 2024

Estwld/empathetic_dialogues_llm

Curated Related

Downloads: 271

Likes: 33

Updated: Apr 2, 2024