Introduction
With 3.2 billion internet users [1] and 6.4 billion internet connected devices by 2016 [2], unprecedented amount of data is being generated and process daily and increasing every year. The advent of web 2.0 has fueled the growth and creation of new and more complex types of data which creates a natural demand to analyze new data sources in order to gain knowledge. This new data volume and complexity of the data is being called Big Data, famously characterised by Volume, Variety and Velocity; has created data management and processing challenges due to technological limitations, efficiency or cost to store and process in a timely fashion. The large volume and complex data is unable to be handled and/or processed by most current information systems in a timely manner and the traditional data mining and analytics methods developed for a centralized data systems may not be practical for big data.
The paper provides background and related literature on the Big Data, studies the concept from Relational Database to current NoSQL database which have been fueled by the growth Big Data and importance of managing it. And surveys the Big Data challenges from the perspective of its characteristics Volume, Variety and Velocity and attempts to study how those challenges can be addressed.
This is where NoSQL systems have been created to solve the data management challenges posed by Big Data. NoSQL is not a single system that can solve every single Big Data problem
Abstract- Big data is a hot research topic in today’s world. Data has become an indispensable part of every economy, industry, organization, business function and individual. With the fast growth now-a-days organizations has filled with the collection of millions of data with large number of combinations. This big data challenges over business problems. Big Data is a new term used to identify the datasets that due to their large size and complexity. Big Data mining is the capability of extracting useful information from these large datasets or streams of data, that due to its volume, variability, and velocity. We address broad issues related to big data and/or big data mining, and point out opportunities which help to reshape the subject area of today’s data mining technology toward solving tomorrow’s bigger challenges emerging in accordance with big data.
In order to overcome these limitations, a new database model known as Not Only SQL (NoSQL) database emerged with a set of new features. The main objective of NoSQL is not to discard SQL, but to be used as an alternative database data model for new features [1] [2] [3]. NoSQL database increases the performance of relational databases by a set of new characteristics and advantages. In contrast to relational databases, NoSQL databases introduced an additional feature that provides flexible and horizontal scalability and taking advantage of new clusters. The rise of NoSQL provides cost-effective management of data in modern web applications. With its new features, NoSQL can be used with applications that have a large transaction, and require low-latency access to huge datasets, service availability while
Big Data is garnering great recognition for its data-driven decision making methodology. Right from data acquisition where there is a flood of data available, we need to make effective decisions about usage of data. Privacy, scalability, complexity and timeliness are the problems that hinder the progress of Big Data.
Many social networking and/or big data companies like Facebook, Twitter, Yahoo, Google and Amazon are now known for using NoSQL databases. This is because NoSQL systems are non-relational and do not structure their data in tables or typically manipulate or process the data with SQL. Having less restrictions than a relational database, NoSQL has the ability to better handle huge quantities of data in a more efficient way (Moniruzzaman, “NoSQL Database…”). This paper will dig deeper in the several characteristics of NoSQL database systems that separate them from the relational ones. It will also introduce the different models that make up the system as well and a few examples that are currently being used and becoming popular today.
Therefore, the consecutive sections discussed the definition of big data, tools for analyzing big data, data mining, knowledge discovery, visualization and collaborative
NoSQL is generally interpreted as “Not only SQL” [1]. It is a class of database management systems that are used for non-relational database. Typically NoSQL database does not use two-dimensional table to store data. The four generally categories of NoSQL database are key-values database, column databases, document databases, and graph databases [2]. NoSQL database is an indispensable part of big data. Most company choose NoSQL database because it yields better performance when compared to relation database. Many relational databases have been existing for more than 20 years, while most NoSQL databases have a history of less than 5 years. Because NoSQL databases are so young, they exposes lots of security issues. Many NoSQL databases are still focusing on adding features and improving performance, while strength security mechanism is still a low priority task. There were already two data breaches happened in companies that are using NoSQL databases (MongoHQ in 2013 and LinkedIn in 2012 [3]).
NoSQL was created to remedy the architecture of relational databases, to make the schema more dynamic and ever expanding. With the emergence of cloud computing, unstructured data such as social media posts in need of storage, and Agile development practice, The Document
Big data is an extensive collection of structured and unstructured data. It is a modern day technology which is applied to store, manage and analyze data that are not possible to manage, store and analyze by using the commonly used software or tools. Since all of our daily tasks are overtaken by the modern technologies and all the businesses and organizations are using internet system to operate, the production of data has increased significantly in past
For example, Facebook which is the most popular social networking website recently announced their adoption of a NoSQL based graph data store for efficient storage of user data. In other words, NoSQL has already made its way into the enterprise. However, just like every other widely accepted technology, NoSQL has its own set of advantages and disadvantages. It is important for an enterprise to quantify the pros and cons of a particularly new database technology against the already existing solutions based on their custom requirements. For example, legacy enterprise applications may require extensive community support from their database vendors. Moreover, traditional relational database vendors such as Oracle have already established themselves for providing excellent support. On the other hand, NoSQL has been rapidly growing since the past few years and is consistently evolving in terms of big data handling, data warehousing and lesser complexity. Hence, there is a need to study the current market of data stores based on the most popular NoSQL data stores and how well they fair against the widely accepted traditional database systems. This requires a study of the commonly used NoSQL data stores.
NoSQL databases are a significant departure from the relational model that has dominated the business world for the past few decades. Standing for “Not Only SQL,” these products are all some variation of a non-relational, key-value pair database, and they are becoming very popular with companies that use Big Data and prioritize speed or availability over consistency of data.
The evolution of distributed web based applications and cloud computing have generated the demand to store voluminous of big data in distributed databases efficiently to offer excessive availability and scalability to users. The new type of database resolves many new challenges especially in large-scale and high concurrency applications which are not present in relational database. These new sorts of databases are not relational by using explanations and hence they do not prop up whole SQL performance. As progressively insightful big data is being saved in NoSQL databases, it is essential to preserve higher security measures to ensure safe and trusted communication across the network. In this paper, we describe the security of NoSQL database against intruders which is growing rapidly. This paper also delineates probably the most prominent NoSQL databases and descriptions their security aspects and problems.
NoSQL is able to address the massive traffic loads experienced by database servers at corporations that specialize in data processing like Google, Facebook and Amazon. NoSQL technologies can provide near constant availability, massive user concurrency and lightning fast responses. There are four primary NoSQL database implementation types being used today: document based, wide column (or columnar), key-value and graph. The different properties of SQL and NoSQL databases will be examined and an overview of each NoSQL implementation type along with an example will be given.
There are many fundamental issue areas that need to be addressed in dealing with big data: data acquisition, data storage, data transfer, data management, and data processing. Each of these issues represents a large set of technical research problems and challenges in its own right.
The modern RDBMS advancements are not capable of supporting unstructured information with ideal space necessity. The plan winds up plainly mind-boggling and is henceforth troublesome for designers. The requirement for unstructured information administration is so annoying with conventional RDBMS arrangements (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). Moreover, RDBMS turns out to be an exorbitant answer for creating light-footed web applications with direct information investigation necessities. NoSQL is developing as a proficient possibility in this situation, which connects the issues related with RDBMS innovation. The market development can credit to creative dispatches of NoSQL arrangements, and collective endeavors by NoSQL sellers and clients. The endeavors of organizations, to enhance their market offerings, are creating the request of NoSQL, as a back-end bolster (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). The emergence of agile software development is creating the demand for NoSQL (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). They offer users much more avenues to accept data in many different forms. NoSQL is adaptable as SQL but offers many more uses that can apply to many organizations.
Big data is a term to explain large complex data set, and big data is challenging the traditional data handling method. The big data itself is useless, but after processed and analyzed the big data would generate valuable information. This article would discuss relevant technologies and areas in the big data age.