ANALYSIS NOSQL DATABASE MANAGEMENT DEPENDING ON THE FEATURES AND DIFFERENTIATION OF RDBMS
ZAHRAA MUSTAFA ABDULRAHMAN AL-ANI
JUNE 2015
ANALYSIS NOSQL DATABASE MANAGEMENT DEPENDING ON THE FEATURES AND DIFFERENTIATION OF RDBMS
A THESIS SUBMITTED TO
THE GRADUATE SCHOOL OF NATURAL AND APPLIED
SCIENCES OF
ÇANKAYA UNIVERSITY
BY
ZAHRAA MUSTAFA ABDULRAHMAN AL-ANI
IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE
DEGREE OF
MASTER OF SCIENCE
IN
THE DEPARTMENT OF
MATHEMATICS AND COMPUTER SCIENCEINFORMATION TECHNOLOGY PROGRAM
JUNE 2015
ABSTRACT
ANALYSIS NOSQL DATABASE MANAGEMENT DEPENDING ON THE FEATURES AND DIFFERENTIATION OF RDBMS
AL-ANI, Zahraa Mustafa Abdulrahman
M.Sc., Department of
…show more content…
In this paper, we will review one of the graph database (Neo4j), which the graph database is part of the emerging technology that is called NoSQL and compared it with one of the traditional relational databases (MySQL). MySQL, it is being another name for Relational Databases and it has been used for a long time period until now. However, with the emergence of Big Data there was clearly a need for more flexible databases. Facebook 's Graph Search use Neo4j, a graph database, is an application which clearly displays how relationships need to be modeled in a more efficient and sophisticated manner than using conventional relational models. In this paper, we will make a comparison between MySQL and Neo4j based on the features like ACID, replication, availability and the language that is used in both of them.
Keywords: RDBMS, NoSQL, Relational Databases, MySQL, Neo4j, ACID.
TABLE OF CONTENTS
ABSTRACT …………..…………………………………....................………............
TABLE OF CONTENTS……………………………………………………………….
LIST OF FIGURES …………………………….………...………………………….
CHAPTERS:
1. INTRODUCTION……………………………………………….……………….1
1.1 Motivation………………………………………………………...…………..3
1.2 Aim Of The Study……………………………………………...……………...3
1.3 Scope Of The Study………………………………………………………….. 3
1.4 Thesis question …………………………………………………..……………3
2. LITERATURE REVIEW ……………………………………………………….4
2.1 Data……………………………………………………………………………4
2.2 Database……………………………………………………………………….4
2.3
Abstract- This research documents a comprehensive evaluation of the emerging graph databases along with a benchmark study to compare it to the existing relational model. With the ease of the graphical representation brought in with Neo4j, we saw the opportunity to attempt getting details about the various attributes in the dataset and analyze this data to present a statistical view along with its popular counterpart, MySQL. The ultimate goal of this study is to determine whether a traditional relational database system like MySQL, can be replaced completely in production, by a graph database, such as Neo4j.
Challenges: As Marcos explained: “A relational database wasn’t satisfying our requirements about performance and simplicity, due the complexity of our queries.” To address this, Marcos’ team decided to use Neo4j, a graph database, for which category Neo4j is the market leader.
Many RDBMS programs also provide the tools you need to create end-user applications that interact with the data stored in the database. Of course, the quality of an RDBMS is a direct function of the extent to which it supports the relational database model. Even among “true” RDBMSs, support for the relational database varies among vendors, and there is yet to be a full implementation of the relational model’s potential. Despite this, all RDBMS programs continue to evolve and become more full-featured and powerful than ever
SAP also known as systems applications and products in data processing is one of the better IT companies at the moment. It is a German software company which develops software for both small and large businesses to track customer and business interactions. Their largest software is ERP (Enterprise Planning Resource). The software was given the same name as the company name which can be slightly confusing at times. The company have offices in almost 130 countries and their software operates in 180 countries with 335,000 users spread across. As per task this research paper will challenge the comparison and contrast of NoSQL databases with relational database managing system which will be explained in greater detail further in the
A graph database represents data and relationships between this data using concepts from graph data structures like nodes, edges and properties. Nodes represents the data entities, properties represent information about the nodes and edges which connect two nodes or a node and a property represent the relationship between the connected elements. [1]
STRUCTURE OF DATA: The data structure of a relational database comprises of table structure. Every table is identified by a unique name or label. The data tables are described as the collection of rows and columns. Each row of the table is known as the record and each column is known as the field of the specific data table. All the data sets are well organized and logical linked to each other through definite and unique relationships. A table, therefore can also be defined as the “structured collection of relationships”. The fundamental aim of developing No SQL database systems is to easily and effectively handle vast quantity of data or information in advanced web-scale applications. In order to achieve this purpose, the No SQL systems are designed as the schema-free database systems. There are different modes to define the No SQL databases that typically depend on the requirements of the data that has to be managed. The main No SQL data structures include column database, key-value store database, document store database, graph database and
Answer: NoSQL DB is document based DB and represented in collection without solid structure key-value pairs, documents, graph Db or wide-column stores and has not predefine schema, use Dynamic schema. NoSQL DB is horizontally scalable that will increase amount of DB servers in the pool of
application. Specifically, this report investigates the use of relational database design versus the no-SQL model as the preferred basis of the new application.
NoSQL databases had made for unraveling the Big Data issue by utilizing a distributed system to bring out excellent performance in data storage and retrieval at very large-scale. At this scale, pieces of the system often fail and NoSQL is created to handle these failures (Chow, 2013) (Ron, Shulman-Peleg, & Bronshtein, 2015). Various companies have espouse different sorts of non-relational databases, ordinarily alluded to as
The paper provides background and related literature on the Big Data, studies the concept from Relational Database to current NoSQL database which have been fueled by the growth Big Data and importance of managing it. And surveys the Big Data challenges from the perspective of its characteristics Volume, Variety and Velocity and attempts to study how those challenges can be addressed.
The demands on database technology have been ever expanding since its introduction in the 1960’s. Today traffic on the internet requires that millions upon millions of records be stored and queried each second. Data must be highly available and quickly retrievable. These requirements put together have given rise to new forms of database technologies collectively called “NoSQL” or “Not Only SQL”. NoSQL eschews the strict guidelines that govern the creation and function of traditional relational databases. These guidelines are put aside in order to rise to the new demands of an increasingly interconnected world. The rigorous standards and data definitions of relational databases give way in order to provide the ability to rapidly
The modern RDBMS advancements are not capable of supporting unstructured information with ideal space necessity. The plan winds up plainly mind-boggling and is henceforth troublesome for designers. The requirement for unstructured information administration is so annoying with conventional RDBMS arrangements (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). Moreover, RDBMS turns out to be an exorbitant answer for creating light-footed web applications with direct information investigation necessities. NoSQL is developing as a proficient possibility in this situation, which connects the issues related with RDBMS innovation. The market development can credit to creative dispatches of NoSQL arrangements, and collective endeavors by NoSQL sellers and clients. The endeavors of organizations, to enhance their market offerings, are creating the request of NoSQL, as a back-end bolster (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). The emergence of agile software development is creating the demand for NoSQL (Big data in financial services industry: Market trends, challenges, and prospects 2013 - 2018). They offer users much more avenues to accept data in many different forms. NoSQL is adaptable as SQL but offers many more uses that can apply to many organizations.
NoSQL Databases are being used in the social media applications and big data processing based portals in which huge, heterogeneous and unstructured data formats are handled. NoSQL Databases are used for faster access of records from the big dataset at back-end. The AADHAAR Card implementation in India was done using NoSQL Databases as huge amount of information is associated including Text Data, Images, Thumb Impressions and Iris Detection. Any classical database system cannot handle the dataset of different types (Image, Text, Video, Audio, Video, Thumb Impressions for Pattern Recognition, Iris Sample) simultaneously.
In order to make it easy to analyze the performance of the database. We need to categorize the above databases. We can categorize them due to the content of the database. Dividing them into “bibliographic, document-text, statistical, or multimedia objects. Another way is by their application area, for example, accounting, music compositions, movies, banking, manufacturing, or insurance” [1]. In our project, we divide the database according to the computer software aspect. Mainly including the SQL database (relational database) and NoSQL database (non-relation database). We compare these two databases in below aspects:
Currently, there are two major of database management systems which are used to deal with data, the first one called Relational Database Management System (RDBMS) which is the traditional relational databases, it deals with structured data and have been popular since decades from 1970, while the second one called Not only Structure Query Language databases (NoSQL), they have been dealing with semi-structured and unstructured data; the NoSQL term was introduced for the first time in 1998 by Carlo Strozzi and Eric Evans reintroduced the term NoSQL in early 2009, and now the NoSQL types are gaining their popularity with the development of the internet and the social media. NoSQL are intending to override the cons of RDBMS, such as fixed schemas, JOIN operations and handling the scalability problems. With the appearance of Big Data,