B4J Library [NLP] Apache Tika - Text extraction

Hamied Abou Hulaikah · Aug 16, 2021

Excellent, go ahead. Today and tomorrow are the era of AI and machine learning.

paragkini · Aug 21, 2021

Getting below error.

B4X:

Compiling generated Java code.    Error
javac 1.8.0_291
src\b4j\example\tika.java:387: error: local variable password is accessed from within inner class; needs to be declared final
                    if (password.length() > 0)
                        ^
Note: Some input files use unchecked or unsafe operations.
Note: Recompile with -Xlint:unchecked for details.
1 error

I copied the Tika folder as well.

Am I missing something or doing something wrong?

DonManfred · Aug 21, 2021

paragkini said:
Getting below error.

Erel said:
Copy the Tika folder to the additional libraries folder. Make sure to keep the Tika folder.

to the ADDITIONAL library Folder! NOT the internal one.

paragkini · Aug 21, 2021

DonManfred said:
to the ADDITIONAL library Folder! NOT the internal one.

Tried this as well. But still getting same error. Could it be because of Java version?

DonManfred · Aug 21, 2021

paragkini said:
Tried this as well. But still getting same error.

Put the additional libraries folder OUTSIDE the Program Files folder, as that is a restricted folder.

Create a folder in Documents and use that path.

paragkini · Aug 21, 2021

I tried putting it in a different folder on the D: drive and referencing it in Configure Paths, but that gave a different error. I then put it in the internal libraries folder as well (just to see if it works), but it still gave the same error. I will try rearranging all the folders again later and will keep you posted.

Erel · Aug 22, 2021

While @DonManfred is correct that you should configure the additional libraries folder to be outside of Program Files, the error is related to the Java version.
Switch to OpenJDK 11.

B4J – RAD development tool for cross platform desktop, server and IoT solutions

B4J is a 100% free programming tool, similar to B4A that generates desktop, server and web applications.

www.b4x.com

I will add a message about it in the first post.
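
For readers wondering what the javac message above means: Java only allows an anonymous inner class to read a local variable that is final (or, from Java 8 source level onward, effectively final). The "needs to be declared final" wording is, as far as I know, what javac reports when the code is compiled at an older source level, which fits the advice to switch JDKs. The sketch below is a minimal illustration only. It is not the generated tika.java code, and the class and method names are hypothetical.

```java
import java.util.function.BooleanSupplier;

public class FinalCaptureDemo {
    // Hypothetical helper, not taken from the generated tika.java.
    static BooleanSupplier hasPassword(String input) {
        // Declared final explicitly, so the inner class may capture it
        // at any source level. Without 'final', older source levels
        // reject the access inside the anonymous class below.
        final String password = (input == null) ? "" : input;
        return new BooleanSupplier() {
            @Override
            public boolean getAsBoolean() {
                // Same kind of access as the line flagged in the error.
                return password.length() > 0;
            }
        };
    }

    public static void main(String[] args) {
        System.out.println(hasPassword("secret").getAsBoolean());
        System.out.println(hasPassword("").getAsBoolean());
    }
}
```

Since the failing file is generated by the B4J compiler, editing it by hand is not a practical fix. Changing the JDK, as suggested above, is the route to take.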

paragkini · Aug 22, 2021

Thanks both.

hanyelmehy · Feb 29, 2024

I get this error for a large file. How can I fix this issue?

B4X:

org.apache.tika.sax.WriteOutContentHandler$WriteLimitReachedException: Your document contained more than 100000 characters, and so your requested limit has been reached. To receive the full text of the document, increase your limit. (Text up to the limit is however available).

Also, you say "Tika will not work with the standalone package (which is the same as B4JPackager11)".
Is there any other way to build a standalone app?

DonManfred · Feb 29, 2024

hanyelmehy said:
How can I fix this issue?

See the code in the b4xlib file (it's a ZIP).

Change the value of this line to a higher limit:

B4X:

    Public BodyContentLengthLimit As Int = 100000

hanyelmehy · Mar 5, 2024

i am not able to use it as standalone on other oc

DonManfred · Mar 7, 2024

hanyelmehy said:
i am not able to use it as standalone on other oc

See Notes in #1

2. Tika will not work with the standalone package (which is the same as B4JPackager11).

hanyelmehy · Mar 8, 2024

DonManfred said:
See Notes in #1

Thank you, I have already seen this. My question is whether we can build a standalone app manually by some other method.
