Apache Solr and Apache Tika Integration Tutorial

This article is a tutorial about Apache Solr and Apache Tika integration. 1. Introduction A Solr index can accept data from many different sources, such as CSV, XML, databases, and common binary files. If the data to be indexed is in a binary format, such as Word, PPT, XLS, or PDF, the Solr Content Extraction Library (the Solr Cell framework) built ...

Apache Solr OpenNLP Tutorial – Part 2

1. Introduction In Part 1, we set up the Apache Solr OpenNLP integration and used its analysis components, tokenizer, and filters to process and analyze the sample data. In this example, we explore another powerful feature provided by the Solr OpenNLP integration: extracting named entities at index time by using an OpenNLP NER (Named Entity Recognition) model. Table Of Contents ...

Apache Solr OpenNLP Tutorial – Part 1

This is an article about Apache Solr OpenNLP. 1. Introduction Natural Language Processing (NLP) is a field that focuses on processing and analyzing human languages with computers. Using NLP in search helps search service providers to better understand what their customers really mean in their searches, and thus to run search queries more efficiently and to ...

Apache Hadoop Knox Tutorial

In this tutorial, we will learn about Apache Knox. Knox provides a REST API gateway for the Apache Hadoop ecosystem. We will go through the basics of Apache Knox in the following sections. 1. Introduction Apache Knox is an open source project under the Apache Software Foundation, similar to most other ...

Big Data Hadoop Tutorial for Beginners

This tutorial is for beginners who want to start learning about Big Data and the Apache Hadoop ecosystem. It introduces the different concepts of Big Data and Apache Hadoop, which will lay the foundation for further learning. Table Of Contents 1. Introduction 2. Big Data? 2.1 Examples of Big Data. 3. Characteristics of Big Data 3.1 ...

Scala Tutorial for Beginners

In this tutorial, we will see how to work with the Scala programming language, which is similar to Java but adds many advancements, as it was designed to overcome the shortcomings of the Java programming language. Wikipedia defines the Scala programming language as follows. Scala (SKAH-lah) is a general-purpose programming language. Scala has full ...

