LDaCA Newsletter Quarter 4 2023

LDaCA Newsletter - Quarter 4 2023

LDaCA Newsletter | Quarter 4, 2023

06/10/23

Welcome

Welcome to the second issue of this newsletter. We thank you for your interest in the Language Data Commons of Australia and the Australian Text Analytics Program, and we hope that this newsletter will help to keep you informed about our activities. If you have questions about anything you read here, or if you have feedback for us, you can mail us at ldaca@uq.edu.au (or use one of the alternative contact points listed at the end of this message).

News

ARDC All Staff Meeting

ARDC held an event in Melbourne last month for all of their staff. Michael Haugh and Robert McLellan were invited to present at the event about working with the ARDC on LDaCA. Their presentation, which was very well received, was part of a showcase of projects in the ARDC's three thematic data commons (Planet, People, HASS and Indigenous).

Michael Haugh and Robert McLellan at the ARDC staff event

Image Credit: ARDC

Event at the Australian Linguistic Society Conference

We will be holding a meeting during the Australian Linguistics Society 2023 Conference to provide an update on the activities of the project. This event will take place at lunchtime on the last day of the conference (Friday December 1), venue TBC.

New Team Members

Two people have joined our team in the last quarter, Otis Carmichael and Rosanna Smith and we will let them introduce themselves:

Hi there, my name’s Otis Carmichael, I’m a Waanyi man from Brisbane and a technical designer on the LDaCA team. My initial involvement with the team was to design the team’s logo and other promotional material, however I am now involved in Community Engagement and working on design throughout the website and tools. I’m excited to be on board with the project because I believe that it has potential to really support the domain of language research, by helping to ensure that research data is stored in a more sustainable manner and thus will remain accessible for the language’s community.

Hi, my name's Rosanna Smith and I'm a user design analyst on the LDaCA team, based in Melbourne. My background is in linguistics, having managed a variety of language projects for use in machine learning development and language technology. I’m excited to contribute to a project such as LDaCA that is guided fundamentally by FAIR and CARE principles, as well as the opportunity to apply my previous experience and further promote diversity and accessibility in language data management and research.

Events

Forthcoming Events

Workshop on immigrant language corpora in Australia

When: 9-10 November, 2023

Where: Onsite. ANU, Acton, ACT 2601

Guest speakers: John Hajek, Ingrid Piller, Sophie Loy-Wilson, Adrian Vickers

Australia has long been seen as one of the world’s most multilingual and multicultural societies, with more than 490 languages coming from around 300 ancestries and cultural traditions (ABS, 2021, 2022). For decades, the language and cultural maintenance of various immigrant groups have been under investigation by many scholars, not only in linguistics but also in history, sociology, anthropology, and many other disciplines. This work has amassed a large body of immigrant language data, that provides information about how Australia’s immigration history has contributed to the country today.

The purpose of this workshop is to bring together scholars working with language corpora from across different disciplines. The workshop is being run as part of the Language Data Commons of Australia (LDaCA), which is working to build national research infrastructure for the Humanities and Social Sciences, facilitating sustainable access to and controlled use of digital language corpora for linguists, scholars across the Humanities and Social Sciences, and non-academics.

The workshop would consist of two main parts:

presentations on Australian immigrant language data collected from scholars across disciplines, and
a panel discussion on needs and challenges around managing and archiving immigrant language data in a way that is ethical, legal and culturally sensitive, and how LDaCA can help support that

(Organised by Li Nguyen & Catherine Travis)

2023 Vocabulary Symposium: FAIR Vocabularies For All

When: 14-15 November, 2023

Where: Onsite and online. The in-person event will be held at ANU Research School of Social Sciences (RSSS). 146 Ellery Crescent, Acton, ACT 2601

Following on from the success of the 2022 Vocabulary Symposium, the ARDC invites you to the 2023 Vocabulary Symposium on November 14 and 15, 2023. A two day hybrid event bringing together people from domains across disciplines spanning research, government and industry, including those from the ARDC Thematic Research Data Commons. The aim is to communicate and promote approaches to using, developing, publishing, and maintaining vocabularies. The Symposium will cover Australian and international initiatives that demonstrate the impact and value of FAIR vocabularies for maximising the re-use value of data within and across domains.

Anyone with an interest in vocabularies is welcome to participate. Including informatics people, researchers, data managers, infrastructure providers, vocabulary publishers, librarians and policy developers across research, government and industry. The event will be of interest to local and international participants.

The Symposium is free to attend, either in-person or online.

Attendees must register. (There is no financial charge for registration)

Registrations

(Led by: ARDC, in partnership with ADA, AHRA, APPF, AURIN, AuScope, AusTraits, CODATA, IMOS, LDaCA, and TERN.)

New tools for corpus linguistics

(Webinar)

Presenter: Prof Monika Bednarek

Where: Online webinar

When: 9 October 2023 19:00-20:00 AEDT

Registration: https://forms.office.com/e/326ZsRdpu7

(Organised by Sydney Corpus Lab & ESRC Centre for Corpus Approaches to Social Science, Lancaster University)

Conferences and Training Events

Between now and the end of the year, team members are participating in a variety of conferences and training events:

eResearch Australasia Conference (Brisbane October 16-20): presentations by Peter Sefton, Alex Ip, Sara King, Sam Hames and Simon Musgrave, and a Birds of a Feather session run by Peter Sefton, Robert McLellan and ARDC colleagues.
ResBaz events ( Darwin October 25-26, Brisbane November 21-23): Ben Foley will take part in ResBaz NT, presenting a session titled “Notebook-based data analysis for reproducible research”. The Brisbane event will include contributions from Martin Schweinberger, Sam Hames, Sara King and Simon Musgrave, and a panel discussion with our Graduate Digital Research Fellowship group.
Australian Linguistic Society Conference (Sydney November 29 - December 1): Peter Sefton, Li Nguyen and Simon Musgrave will facilitate a workshop on Language Data Management in the 21st Century and Robert McLellan, Ben Foley and Simon Musgrave are running a themed session on Indigenous Data in the Language Data Commons of Australia. (And see also the News item above)

Recent Events

Data Migration Workshop

Peter Sefton along with a number of LDaCA colleagues ran our first data migration skills workshop at ANU (Canberra July 20-21). Participants, who included data librarians, archivists, and researchers, were introduced to the LDaCA software stack and data migration methods on day one before teaming up with LDaCA software developers on day two to work through the (meta)data migration process with their own data on day two. We have more data migration workshops planned for the future so stay tuned to all the usual channels for future announcements.

Puliima

Several team members as well as Jenny Fewster from ARDC, attended the Puliima conference in Darwin last month. The conference focuses on Indigenous languages and technology so this was a wonderful opportunity to hear about all the inspiring work that is being done in those areas, as well as to present some of our own work (and to enjoy the climate in Darwin!)

ResBaz Sydney

Dr Chao Sun (Sydney Informatics Hub, University of Sydney) led a session at this event. He gave an overview of ATAP and then focused on the Juxtorpus tool.

Team Member's Tip

Dr Sara King is the Training and Engagement Lead for AARNet and also a member of LDaCA’s engagement team. She is focused on outreach within the research sector, developing communities of interest around training, outreach and skills development in eResearch. She is currently working on creating reusable guidance information for Jupyter Notebooks and other AARNet services to be adapted for Carpentry training workshops. She is passionate about helping others develop the infrastructure and digital literacies required for working in a data-driven world, translating technology so it is accessible to everyone.

Sara’s tip: When applying dates to digital file names, use the International Organization for Standardization (ISO) date format. Using a standard format eliminates confusion (especially around the different way dates are represented in the USA compared to Europe, for example) and makes your files more easily searchable and sortable. The specific format is year first, followed by month then day. Each of these is separated by a hyphen, with numbers less than 10 preceded by a zero. By using this format, you will be able to sort your digital files more easily into chronological order.

For example: 3 June 2022 = 2022-06-03

Big tip: There are no spaces!

Your file names may end up looking something like this:

2020-12-01_participants-list.xml

2022-06-03_interview.doc

2022-07-03_analysis.r

Remember that computer file systems have a specific way of sorting files. If your file names all start with an ISO date, they will automatically sort from oldest to newest using the default alphabetical sort. Any other date-based naming convention would create different results.

Joint office hours - ATAP and ADO

When: fortnightly
Where: zoom

The Joint Office Hour run by ATAP and the Australian Digital Observatory happens every second Tuesday at 2pm AEST. In the next three months, the office hour will occur on: October 10, October 24, November 7, November 21, December 5 and December 19 . Office Hours — Australian Digital Observatory

UPDATE: We are very happy to receive feedback about how we can make future issues more useful to you. If the newsletter was forwarded to you, you can subscribe here.

Share this with a friend

LDaCA acknowledges Traditional Owners of Country throughout Australia and recognises the continuing connection to lands, waters and communities. We pay our respects to their Ancestors and their descendants, who continue cultural and spiritual connections to Country.

You are receiving this email because you have provided us with your email address for promotional purposes.

Republishing is encouraged - CC BY text and infographics.

If you have questions about republishing, please contact ldaca@uq.edu.au

©LDaCA - 2023

LDaCA Newsletter Quarter 4 2023

LDaCA Newsletter | Quarter 4, 2023

Welcome

News

ARDC All Staff Meeting

Event at the Australian Linguistic Society Conference

New Team Members

Events

Forthcoming Events

Recent Events

Data Migration Workshop

Puliima

Team Member's Tip

Joint office hours - ATAP and ADO

About

Resources

News

Contact