Dependency Parsing for Urdu : Resources, Conversions and Learning

Files
Ehsan_2-hi1dd9gbaw916.pdf (size: 198.36 KB, downloads: 61)
Date
2020
Authors
Ehsan, Toqeer
Butt, Miriam
Open access publication
Open Access Bookpart
Publication type
Contribution to conference proceedings
Publication status
Published
Published in
CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed. and others. Proceedings of the Twelfth Language Resources and Evaluation Conference (LREC 2020). Paris: ELRA, European Language Resources Association, 2020, pp. 5202-5207
Abstract

This paper adds to the available resources for the under-resourced language Urdu by converting different types of existing treebanks for Urdu into a common format that is based on Universal Dependencies. We present comparative results for training two dependency parsers, the MaltParser and a transition-based BiLSTM parser on this new resource. The BiLSTM parser incorporates word embeddings which improve the parsing results significantly. The BiLSTM parser outperforms the MaltParser with a UAS of 89.6 and an LAS of 84.2 with respect to our standardized treebank resource.
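
The two figures reported above are the standard dependency parsing metrics: the unlabeled attachment score (UAS) is the percentage of tokens assigned the correct head, and the labeled attachment score (LAS) additionally requires the correct dependency label. The following is a minimal sketch of that computation, not the evaluation code used in the paper. The function name `attachment_scores`, the token representation, and the example sentence are hypothetical, and conventions such as whether punctuation is scored are not stated in this record.

```python
from typing import List, Tuple

# Each token is (head_index, dependency_label); head 0 denotes the root.
# This representation is an assumption made for illustration only.
Token = Tuple[int, str]


def attachment_scores(gold: List[Token], pred: List[Token]) -> Tuple[float, float]:
    """Return (UAS, LAS) as percentages over all tokens."""
    if len(gold) != len(pred):
        raise ValueError("gold and predicted analyses must have the same length")
    if not gold:
        raise ValueError("cannot score an empty analysis")
    # UAS counts tokens whose predicted head matches the gold head.
    head_hits = sum(1 for (gh, _), (ph, _) in zip(gold, pred) if gh == ph)
    # LAS counts tokens whose head and label both match.
    full_hits = sum(1 for g, p in zip(gold, pred) if g == p)
    n = len(gold)
    return 100.0 * head_hits / n, 100.0 * full_hits / n


# Hypothetical four-token sentence: one wrong head, one wrong label.
gold = [(2, "nsubj"), (0, "root"), (4, "amod"), (2, "obj")]
pred = [(2, "nsubj"), (0, "root"), (2, "amod"), (2, "obl")]
uas, las = attachment_scores(gold, pred)
print(f"UAS={uas:.1f} LAS={las:.1f}")  # prints "UAS=75.0 LAS=50.0"
```

Because every token counted for LAS is also counted for UAS, LAS can never exceed UAS, which is consistent with the reported 84.2 versus 89.6.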

Subject (DDC)
400 Language, Linguistics
Conference
12th Conference on Language Resources and Evaluation (LREC 2020), 11 May 2020 - 16 May 2020, Marseille, France
Cite
ISO 690
EHSAN, Toqeer, Miriam BUTT, 2020. Dependency Parsing for Urdu : Resources, Conversions and Learning. 12th Conference on Language Resources and Evaluation (LREC 2020). Marseille, France, 11 May 2020 - 16 May 2020. In: CALZOLARI, Nicoletta, ed., Frédéric BÉCHET, ed., Philippe BLACHE, ed. and others. Proceedings of the Twelfth Language Resources and Evaluation Conference (LREC 2020). Paris: ELRA, European Language Resources Association, 2020, pp. 5202-5207
BibTeX
@inproceedings{Ehsan2020Depen-59689,
  year={2020},
  title={Dependency Parsing for Urdu : Resources, Conversions and Learning},
  url={https://aclanthology.org/2020.lrec-1.640/},
  publisher={ELRA, European Language Resources Association},
  address={Paris},
  booktitle={Proceedings of the Twelfth Language Resources and Evaluation Conference (LREC 2020)},
  pages={5202--5207},
  editor={Calzolari, Nicoletta and Béchet, Frédéric and Blache, Philippe},
  author={Ehsan, Toqeer and Butt, Miriam}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/59689">
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-01-12T11:42:12Z</dc:date>
    <dcterms:issued>2020</dcterms:issued>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/59689"/>
    <dc:contributor>Butt, Miriam</dc:contributor>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/59689/1/Ehsan_2-hi1dd9gbaw916.pdf"/>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2023-01-12T11:42:12Z</dcterms:available>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:rights>terms-of-use</dc:rights>
    <dc:contributor>Ehsan, Toqeer</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dcterms:title>Dependency Parsing for Urdu : Resources, Conversions and Learning</dcterms:title>
    <dcterms:abstract xml:lang="eng">This paper adds to the available resources for the under-resourced language Urdu by converting different types of existing treebanks for Urdu into a common format that is based on Universal Dependencies. We present comparative results for training two dependency parsers, the MaltParser and a transition-based BiLSTM parser on this new resource. The BiLSTM parser incorporates word embeddings which improve the parsing results significantly. The BiLSTM parser outperforms the MaltParser with a UAS of 89.6 and an LAS of 84.2 with respect to our standardized treebank resource.</dcterms:abstract>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:language>eng</dc:language>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:creator>Ehsan, Toqeer</dc:creator>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/59689/1/Ehsan_2-hi1dd9gbaw916.pdf"/>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:creator>Butt, Miriam</dc:creator>
  </rdf:Description>
</rdf:RDF>
URL check date
2023-01-10
University bibliography
Yes