Semin Speech Lang 2016; 37(02): 128-142
DOI: 10.1055/s-0036-1580745
Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.

HomeBank: An Online Repository of Daylong Child-Centered Audio Recordings

Mark VanDam1, Anne S. Warlaumont2, Elika Bergelson3, Alejandrina Cristia4, Melanie Soderstrom5, Paul De Palma6, Brian MacWhinney7
  • 1Department of Speech and Hearing Sciences, Elson S. Floyd College of Medicine, Washington State University, and Spokane Hearing Oral Program of Excellence (HOPE), Spokane, Washington
  • 2Cognitive and Information Sciences, University of California, Merced, California
  • 3Department of Brain and Cognitive Sciences, University of Rochester, Rochester, New York
  • 4Laboratoire de Sciences Cognitives et Psycholinguistique (ENS, EHESS, CNRS), Département d'Etudes Cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
  • 5Department of Psychology, University of Manitoba, Winnipeg, MB, Canada
  • 6Department of Computer Science, School of Engineering and Applied Science, Gonzaga University, Spokane, Washington
  • 7Department of Psychology, Carnegie Mellon University, Pittsburgh, Pennsylvania
Weitere Informationen

Publikationsverlauf

Publikationsdatum:
25.April 2016 (online)

Abstract

HomeBank is introduced here. It is a public, permanent, extensible, online database of daylong audio recorded in naturalistic environments. HomeBank serves two primary purposes. First, it is a repository for raw audio and associated files: one database requires special permissions, and another redacted database allows unrestricted public access. Associated files include metadata such as participant demographics and clinical diagnostics, automated annotations, and human-generated transcriptions and annotations. Many recordings use the child-perspective LENA recorders (LENA Research Foundation, Boulder, Colorado, United States), but various recordings and metadata can be accommodated. The HomeBank database can have both vetted and unvetted recordings, with different levels of accessibility. Additionally, HomeBank is an open repository for processing and analysis tools for HomeBank or similar data sets. HomeBank is flexible for users and contributors, making primary data available to researchers, especially those in child development, linguistics, and audio engineering. HomeBank facilitates researchers' access to large-scale data and tools, linking the acoustic, auditory, and linguistic characteristics of children's environments with a variety of variables including socioeconomic status, family characteristics, language trajectories, and disorders. Automated processing applied to daylong home audio recordings is now becoming widely used in early intervention initiatives, helping parents to provide richer speech input to at-risk children.