Publication:

When does aggregating multiple skills with multi-task learning work? A case study in financial NLP

Date

Date

Date
2023
Conference or Workshop Item
Published version
cris.lastimport.scopus2025-06-21T03:36:05Z
cris.virtual.orcidhttps://orcid.org/0000-0001-5983-2360
cris.virtualsource.orcid0331cda6-e903-4e22-9b44-f89f54f581dc
dc.contributor.institutionUniversity of Zurich
dc.date.accessioned2023-08-02T08:22:21Z
dc.date.available2023-08-02T08:22:21Z
dc.date.issued2023-07-14
dc.description.abstract

Multi-task learning (MTL) aims at achieving a better model by leveraging data and knowledge from multiple tasks. However, MTL does not always work – sometimes negative transfer occurs between tasks, especially when aggregating loosely related skills, leaving it an open question when MTL works. Previous studies show that MTL performance can be improved by algorithmic tricks. However, what tasks and skills should be included is less well explored. In this work, we conduct a case study in Financial NLP where multiple datasets exist for skills relevant to the domain, such as numeric reasoning and sentiment analysis. Due to the task difficulty and data scarcity in the Financial NLP domain, we explore when aggregating such diverse skills from multiple datasets with MTL can work. Our findings suggest that the key to MTL success lies in skill diversity, relatedness between tasks, and choice of aggregation size and shared capacity. Specifically, MTL works well when tasks are diverse but related, and when the size of the task aggregation and the shared capacity of the model are balanced to avoid overwhelming certain tasks.

dc.identifier.othermerlin-id:23709
dc.identifier.scopus2-s2.0-85174385740
dc.identifier.urihttps://www.zora.uzh.ch/handle/20.500.14742/208782
dc.language.isoeng
dc.subject.ddc330 Economics
dc.title

When does aggregating multiple skills with multi-task learning work? A case study in financial NLP

dc.typeconference_item
dcterms.accessRightsinfo:eu-repo/semantics/openAccess
dcterms.bibliographicCitation.journaltitleProceedings of the Annual Meeting of the Association for Computational Linguistics
dcterms.bibliographicCitation.number1
dcterms.bibliographicCitation.originalpublishernameAssociation for Computational Linguistics
dcterms.bibliographicCitation.originalpublisherplaceToronto, Canada
dcterms.bibliographicCitation.pageend7488
dcterms.bibliographicCitation.pagestart7465
dcterms.bibliographicCitation.urlhttps://aclanthology.org/2023.acl-long.412/
dspace.entity.typePublicationen
oairecerif.event.countryCanada
oairecerif.event.endDate2023-07-14
oairecerif.event.placeToronto
oairecerif.event.startDate2023-07-09
uzh.contributor.authorLeippold, Markus
uzh.contributor.authorNi, Jingwei
uzh.contributor.authorJin, Zhijing
uzh.contributor.authorWang, Qian
uzh.contributor.authorSachan, Mrinmaya
uzh.contributor.correspondenceYes
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.contributor.correspondenceNo
uzh.document.availabilitypublished_version
uzh.eprint.datestamp2023-08-02 08:22:21
uzh.eprint.lastmod2025-03-24 04:42:03
uzh.eprint.statusChange2023-08-02 08:22:21
uzh.event.presentationTypepaper
uzh.event.title61st Annual Meeting of the Association for Computational Linguistics (ACL’23)
uzh.event.typeconference
uzh.harvester.ethYes
uzh.harvester.nbNo
uzh.identifier.doi10.5167/uzh-235040
uzh.jdb.eprintsId48195
uzh.oastatus.zoraGreen
uzh.publication.citationLeippold, Markus; Ni, Jingwei; Jin, Zhijing; Wang, Qian; Sachan, Mrinmaya (2023). When does aggregating multiple skills with multi-task learning work? A case study in financial NLP. In: 61st Annual Meeting of the Association for Computational Linguistics (ACL’23), Toronto, Canada, 9 July 2023 - 14 July 2023. Association for Computational Linguistics, 7465-7488.
uzh.publication.freeAccessAtofficialurl
uzh.publication.originalworkoriginal
uzh.publication.publishedStatusfinal
uzh.publication.scopedisciplinebased
uzh.publication.seriesTitleProceedings of the Annual Meeting of the Association for Computational Linguistics
uzh.scopus.impact3
uzh.workflow.chairSubjectDepartment of Banking and Finance
uzh.workflow.chairSubjectoecIBF1
uzh.workflow.doajuzh.workflow.doaj.false
uzh.workflow.eprintid235040
uzh.workflow.fulltextStatuspublic
uzh.workflow.revisions16
uzh.workflow.rightsCheckkeininfo
uzh.workflow.statusarchive
Files

Original bundle

Name:
2023.acl_long.412.pdf
Size:
1.04 MB
Format:
Adobe Portable Document Format
Publication available in collections: