multiword (contractions) #1

Closed
arademaker opened this issue Nov 24, 2016 · 6 comments

@arademaker
Contributor

http://universaldependencies.org/format.html

The read and write functions still do not handle multiword token lines, i.e. the lines whose ID is a range such as `3-4`.

@arademaker
Contributor Author

arademaker commented Dec 13, 2016

See UniversalDependencies/UD_Portuguese-Bosque#23 (comment) on how to encode fusions and contractions. @fcbr, my idea would be to add one more slot to the sentence class for the list of contractions. What do you think?

@arademaker
Contributor Author

BTW, @fcbr, note that UD_Portuguese is not following the documentation: the multiword lines should contain 10 fields, like all other token lines.
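
One way to check this against a treebank file is to list the multiword token lines (IDs of the form `N-M`) whose tab-separated field count is not 10. This is only a minimal sketch: the function name `check_mwt_fields` is hypothetical and not part of cl-conllu, and it assumes fields are separated by single tab characters.

```shell
# Hypothetical helper (not part of cl-conllu): print every multiword-token
# line that does not have exactly 10 tab-separated fields, together with
# the file name, line number, field count, and the ID range found.
check_mwt_fields() {
    awk -F'\t' '
        $1 ~ /^[0-9]+-[0-9]+$/ && NF != 10 {
            print FILENAME ":" NR ": " NF " fields: " $1
        }
    ' "$@"
}
```

For example, `check_mwt_fields pt-ud-dev.conllu` prints nothing if every multiword line in that file already has 10 fields.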

@arademaker
Contributor Author

@GPPassos, can you and João try to solve this issue next?

@fcbr
Contributor

fcbr commented Jan 2, 2017 via email

@arademaker
Contributor Author

I have simplified the code and tested it. Closed by addd6fd.

@arademaker
Contributor Author

arademaker commented Jan 7, 2017

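The final test was a round trip: read the three files, write them back out as a single file, and compare the result with the plain concatenation of the originals: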

$ sbcl --eval '(ql:quickload :cl-conllu)' --eval '(in-package :cl-conllu)'  --eval '(write-conllu (append (read-conllu "pt-ud-dev.conllu") (read-conllu "pt-ud-test.conllu") (read-conllu "pt-ud-train.conllu")) "lixo.conllu")'
$ cat pt-ud-dev.conllu pt-ud-test.conllu pt-ud-train.conllu > lixo1.conllu
$ diff lixo.conllu lixo1.conllu
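
The round trip above passes when `diff` prints nothing. To run the same comparison from a script, the exit status can be used instead of reading the output. A minimal sketch, where the function name `roundtrip_ok` is hypothetical:

```shell
# Hypothetical helper: report whether a round-tripped file matches the original.
# cmp -s prints nothing and exits 0 only when the two files are byte-identical.
roundtrip_ok() {
    if cmp -s "$1" "$2"; then
        echo "identical"
    else
        echo "differs"
        return 1
    fi
}
```

With the files from the test above, this would be called as `roundtrip_ok lixo.conllu lixo1.conllu`.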


2 participants