Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Transforms 1.0.0a0 refactored language transforms #879

Merged
merged 135 commits into from
Dec 17, 2024
Merged

Conversation

matouma
Copy link
Contributor

@matouma matouma commented Dec 14, 2024

Why are these changes needed?

Added refactored pdf2parquet
Added refactored html2parquet
Added refactored doc_chunk
Added refactored text_encoder
Added refactored docid
Added langId
Added DocQuality
Added hap

Related issue number (if any).

touma-I and others added 30 commits November 13, 2024 17:03
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
…o longer valid as it is based on folder name

Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
matouma and others added 19 commits December 14, 2024 18:01
Signed-off-by: matouma <[email protected]>
Signed-off-by: matouma <[email protected]>
Signed-off-by: matouma <[email protected]>
Signed-off-by: matouma <[email protected]>
Signed-off-by: matouma <[email protected]>
Signed-off-by: matouma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
Signed-off-by: Maroun Touma <[email protected]>
@touma-I touma-I marked this pull request as ready for review December 17, 2024 19:18
Copy link
Collaborator

@touma-I touma-I left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

First iteration used for alpha testing

@touma-I touma-I merged commit 03b7e09 into IBM:dev Dec 17, 2024
137 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants