LDaCA Newsletter Quarter 4 2023

LDaCA Newsletter - Quarter 4 2023

LDaCA Newsletter | Quarter 4, 2023



Welcome to the second issue of this newsletter. We thank you for your interest in the Language Data Commons of Australia and the Australian Text Analytics Program, and we hope that this newsletter will help to keep you informed about our activities. If you have questions about anything you read here, or if you have feedback for us, you can mail us at ldaca@uq.edu.au (or use one of the alternative contact points listed at the end of this message).


New Team Members

Two people have joined our team in the last quarter, Otis Carmichael and Rosanna Smith and we will let them introduce themselves:

Hi there, my name’s Otis Carmichael, I’m a Waanyi man from Brisbane and a technical designer on the LDaCA team. My initial involvement with the team was to design the team’s logo and other promotional material, however I am now involved in Community Engagement and working on design throughout the website and tools. I’m excited to be on board with the project because I believe that it has potential to really support the domain of language research, by helping to ensure that research data is stored in a more sustainable manner and thus will remain accessible for the language’s community.

Hi, my name's Rosanna Smith and I'm a user design analyst on the LDaCA team, based in Melbourne. My background is in linguistics, having managed a variety of language projects for use in machine learning development and language technology. I’m excited to contribute to a project such as LDaCA that is guided fundamentally by FAIR and CARE principles, as well as the opportunity to apply my previous experience and further promote diversity and accessibility in language data management and research.


Forthcoming Events

Workshop on immigrant language corpora in Australia

When: 9-10 November, 2023

Where: Onsite. ANU, Acton, ACT 2601

Guest speakers: John Hajek, Ingrid Piller, Sophie Loy-Wilson, Adrian Vickers

Australia has long been seen as one of the world’s most multilingual and multicultural societies, with more than 490 languages coming from around 300 ancestries and cultural traditions (ABS, 2021, 2022). For decades, the language and cultural maintenance of various immigrant groups have been under investigation by many scholars, not only in linguistics but also in history, sociology, anthropology, and many other disciplines. This work has amassed a large body of immigrant language data, that provides information about how Australia’s immigration history has contributed to the country today.

The purpose of this workshop is to bring together scholars working with language corpora from across different disciplines. The workshop is being run as part of the Language Data Commons of Australia (LDaCA), which is working to build national research infrastructure for the Humanities and Social Sciences, facilitating sustainable access to and controlled use of digital language corpora for linguists, scholars across the Humanities and Social Sciences, and non-academics.

The workshop would consist of two main parts:

  1. presentations on Australian immigrant language data collected from scholars across disciplines, and

  2. a panel discussion on needs and challenges around managing and archiving immigrant language data in a way that is ethical, legal and culturally sensitive, and how LDaCA can help support that

(Organised by Li Nguyen & Catherine Travis)

2023 Vocabulary Symposium: FAIR Vocabularies For All

When: 14-15 November, 2023

Where: Onsite and online.  The in-person event will be held at ANU Research School of Social Sciences (RSSS).  146 Ellery Crescent, Acton, ACT 2601

Following on from the success of the 2022 Vocabulary Symposium, the ARDC invites you to the 2023 Vocabulary Symposium on November 14 and 15, 2023.  A two day hybrid event bringing together people from domains across disciplines spanning research, government and industry, including those from the ARDC  Thematic Research Data Commons.  The aim is to communicate and promote approaches to using, developing, publishing, and maintaining vocabularies. The Symposium will cover Australian and international initiatives that demonstrate the impact and value of FAIR vocabularies for maximising the re-use value of data within and across domains.

Anyone with an interest in vocabularies is welcome to participate.  Including informatics people, researchers, data managers, infrastructure providers, vocabulary publishers, librarians and policy developers across research, government and industry. The event will be of interest to local and international participants.

The Symposium is free to attend, either in-person or online.

Attendees must register. (There is no financial charge for registration)  


(Led by: ARDC, in partnership with ADA, AHRA, APPF, AURIN, AuScope, AusTraits, CODATA, IMOS, LDaCA, and TERN.)

New tools for corpus linguistics


Presenter:  Prof Monika Bednarek

Where:  Online webinar

When:  9 October 2023  19:00-20:00 AEDT


(Organised by Sydney Corpus Lab & ESRC Centre for Corpus Approaches to Social Science, Lancaster University)

Conferences and Training Events

Between now and the end of the year, team members are participating in a variety of conferences and training events:

  • eResearch Australasia Conference (Brisbane October 16-20): presentations by Peter Sefton, Alex Ip, Sara King, Sam Hames and Simon Musgrave, and a Birds of a Feather session run by Peter Sefton, Robert McLellan and ARDC colleagues.

  • ResBaz events ( Darwin October 25-26, Brisbane  November 21-23): Ben Foley will take part in ResBaz NT, presenting a session titled “Notebook-based data analysis for reproducible research”. The Brisbane event will include contributions from Martin Schweinberger, Sam Hames, Sara King and Simon Musgrave, and a panel discussion with our Graduate Digital Research Fellowship group.

  • Australian Linguistic Society Conference (Sydney November 29 - December 1): Peter Sefton, Li Nguyen and Simon Musgrave will facilitate a workshop on Language Data Management in the 21st Century and Robert McLellan, Ben Foley and Simon Musgrave are running a themed session on Indigenous Data in the Language Data Commons of Australia. (And see also the News item above)

Recent Events

ResBaz Sydney

Dr Chao Sun (Sydney Informatics Hub, University of Sydney) led a session at this event. He gave an overview of  ATAP and then focused on the Juxtorpus tool.

Team Member's Tip

Share this with a friend

LDaCA acknowledges Traditional Owners of Country throughout Australia and recognises the continuing connection to lands, waters and communities. We pay our respects to their Ancestors and their descendants, who continue cultural and spiritual connections to Country.

You are receiving this email because you have provided us with your email address for promotional purposes.

Republishing is encouraged - CC BY text and infographics.

If you have questions about republishing, please contact ldaca@uq.edu.au

©LDaCA - 2023