Hadoop Articles
MarkLogic CEO Gary Bloom and his team brought the MarkLogic World Tour 2014 to Wall Street to showcase customer use cases and explain the key new features coming out in MarkLogic 8. The overarching theme for the new release is ease of use, said Bloom.
Posted June 04, 2014
Actian Corporation has announced that the Actian Analytics Platform, an end-to-end analytics platform that runs natively in Hadoop, is addressing the current challenges facing business analysts who want to use SQL on Hadoop.
Posted June 03, 2014
MapR has launched a Hadoop application gallery to make it easier for companies to find solutions within the Hadoop ecosystem. The company also announced a partnership with Syncsort targeted at helping customers move their mission-critical workloads to Hadoop.
Posted June 03, 2014
The database market is heating up again. Recent venture funding and acquisition announcements from key Hadoop and NoSQL startups are drawing attention to the big data space.
Posted May 28, 2014
RainStor 5.5 has been certified to run on Cloudera Enterprise 5. This new certification enables Cloudera customers to run RainStor natively on HDFS, while offering enterprise-grade security features. "RainStor running on Cloudera Enterprise 5 is a significant step forward for customers taking a serious look at Hadoop," said Mark Cusack, chief architect, RainStor.
Posted May 22, 2014
Fast decision-making depends on real-time data movement that allows businesses to gather data from multiple locations into Hadoop as well as conventional data warehouses. Unfortunately, traditional ETL tools use slow data-scraping techniques that put a heavy load on operational systems and cannot meet the low latency required by many businesses.
Posted May 21, 2014
Concurrent CEO Gary Nakamura says the latest release of Cascading will give enterprises the flexibility to build data-oriented applications on Hadoop once, and then run the applications on the platform that best meets their business needs. "What we are providing is a standard way to develop data-centric applications without the risk of having to rewrite those applications when distributions or the providers of the computation engines underneath it change direction one day."
Posted May 13, 2014
New Cloudera certification enables customers to use Tungsten Replicator 3.0 to replicate transactions from operational database systems such as MySQL, MariaDB and Oracle to Cloudera Enterprise 5 in real-time. The Cloudera certification confirms that the data replicated by Tungsten Replicator matches the source and target to ensure data quality, and does not stress either side of the replication stream in the process.
Posted May 12, 2014
DataStax and Databricks are partnering to integrate Cassandra and Spark. "More and more, we see customers in the community wanting to do analytics on data in as real time as possible. That is what this is really about," said Martin Van Ryswyk, executive vice president of engineering, DataStax.
Posted May 12, 2014
The value proposition for the Splice Machine database, according to Monte Zweben, CEO and cofounder of Splice Machine, is that it enables companies to replace traditional RDBMSs when they hit a wall, either from a performance or cost perspective, with a full-featured, transactional SQL database on Hadoop, to power both operational applications and real-time analytics.
Posted May 12, 2014
It's an inevitable fact that every software system will have problems, but an enterprise-grade Hadoop infrastructure puts minimizing and managing these system errors at the forefront. When considering a distribution's dependability, you should evaluate a Hadoop distribution's position in five foundational necessities.
Posted May 08, 2014
MongoDB's Kelly Stirman and Cloudera's Yuri Bukhan recently talked with DBTA about the companies' new partnership and what it will mean for the big data ecosystem in the future. There is a need to demystify big data, they say, so that organizations can understand what technologies are right for their individual needs.
Posted May 08, 2014
Just after data is created, there is high value attached to it. As data begins to age, its value does not diminish, but the nature of that value begins to change. For many enterprises that are dealing with large data volumes, timely data access can be a major issue, especially when customers demand quick response times. Let's examine the challenge of processing data in real time reliably and meeting customers' expectations for quick responses.
Posted April 30, 2014
TEKsystems, an IT staffing solutions provider, says that employers are finding it increasingly difficult to hire business intelligence and security experts.
Posted April 23, 2014
Hadoop distribution companies Cloudera, Hortonworks and MapR have joined a new Big Data Protection Partner Program launched by Dataguise, a provider of data privacy protection and risk management analytics.
Posted April 22, 2014
MapR has added the complete Apache Spark technology stack to the MapR Distribution for Hadoop. Spark is an in-memory processing framework that provides speed, programming ease, and advantages for real-time processing.
Posted April 15, 2014
New offerings for IBM System z are aimed at helping customers with rapid development and deployment of mobile applications as well as the ability to integrate them with core business processes, applications, and data. As part of this effort, IBM is enabling the industry's first commercial Hadoop for Linux on System z - zDoop software - provided through Veristorm, an IBM partner.
Posted April 14, 2014
IBM, which marked the 50th anniversary of the mainframe today, looked to the future by rolling out new mobile, storage, cloud, and Hadoop for Big Data offerings for System z. According to IBM, as it celebrates this landmark occasion, more than 70% of enterprise data resides on a mainframe and 71% of all Fortune 500 companies have their core businesses on a mainframe.
Posted April 08, 2014
Splice Machine is the newest member of the more than 800-member Cloudera Connect Partner Program. According to Splice Machine, its technology enables Cloudera users to tap into real-time updates with transactional integrity and standard ANSI SQL, which the company says are necessary features for organizations that are looking to become real-time, data-driven businesses.
Posted April 07, 2014
InfiniDB has announced the results of a new, independent benchmark from Radiant Advisors that examined the performance of leading open source SQL-on-Hadoop query engines, including InfiniDB for Hadoop 4.0.
Posted April 07, 2014
Teradata has introduced the Teradata Database 15 with a new software product called Teradata QueryGrid that provides virtual compute capability within and beyond the Teradata Unified Data Architecture. The company also announced Teradata Active Enterprise Data Warehouse 6750 platform with new capabilities to support customers' most demanding real-time workloads.
Posted April 07, 2014
About 3 years ago, the AMP (Algorithms, Machines, People) lab was established at U.C. Berkeley to attack the emerging challenges of advanced analytics and machine learning on big data. The resulting Berkeley Data Analytics Stack—particularly the Spark processing engine—has shown rapid uptake and tremendous promise.
Posted April 04, 2014
A new partnership between Hortonworks and LucidWorks, which provides a search development platform leveraging Apache Solr, will enable users throughout an organization to easily access and gain insight from big data sets that were previously available only to developers, analysts and data scientists.
Posted April 03, 2014
The theme for COLLABORATE 14-IOUG Forum is "Become Your Office Superhero," because, while you may look like a mild mannered technical resource in meetings or at your desk, you fight a daily battle to protect your organization's data, improve performance and generate new business opportunities. COLLABORATE is your chance to recharge your superpowers and to take on new skills.
Posted April 02, 2014
Continuent, Inc., a provider of open source database clustering and replication solutions, has announced the availability of Continuent Tungsten Replicator 3.0, an open source replication solution for Hadoop.
Posted April 01, 2014
Cloudera has launched what it describes as the industry's first hands-on Cloudera Certified Professional: Data Scientist (CCP:DS) data science certification. According to Cloudera, it is launching the data science certification program now to address a pressing challenge in the IT industry: job openings for data scientists are currently outpacing the supply of these in-demand workers, a situation that is aggravated by the fact that there has historically not been a clearly established skill set or university degree that an individual could acquire to qualify as a data scientist.
Posted March 28, 2014
The need for better Hadoop security is widely acknowledged. However, the transformative potential of big data is spurring the industry to quickly fill Hadoop's security gaps. To keep pace with these developments, organizations must keep a close watch on the new tools and practices being deployed.
Posted March 27, 2014
Cloud technologies and frameworks have matured in recent years and enterprises are realizing the benefits cloud adoption presents. The future of cloud deployments will involve rapid adoption of new technology frameworks beyond Hadoop, open standards in the area of cloud security, identity, and trust, as well as a universal and simple query language for aggregating data from legacy and emerging data stores.
Posted March 27, 2014
Datameer, which provides a self-service and schema-free big data analytics application for Hadoop, has introduced Datameer 4.0, which enables big data analytics workflow with visual insights at every step of analysis.
Posted March 27, 2014
Today, businesses are ending up with more and more critical dependency on their data infrastructure. If underlying database systems are not available, manufacturing floors cannot operate, stock exchanges cannot trade, retail stores cannot sell, banks cannot serve customers, mobile phone users cannot place calls, stadiums cannot host sports games, gyms cannot verify their subscribers' identity. Here is a look at some of the trends and how they are going to impact data management professionals.
Posted March 26, 2014
Continuing to expand its Asia-Pacific presence, San Jose-based MapR has opened a new Melbourne, Australia office.
Posted March 24, 2014
Cloudera has closed on a new round of funding for $160 million which will be used to further drive the enterprise adoption of and innovation in Hadoop and promote the enterprise data hub (EDH) market; support geographic expansion into Europe and Asia; expand its services and support capabilities; and scale the field and engineering organizations. The funding round was led by T. Rowe Price, and included an investment by Google Ventures and an affiliate of MSD Capital, L.P., the private investment firm for Michael S. Dell and his family.
Posted March 18, 2014
Pivotal has introduced Pivotal HD 2.0 and Pivotal GemFire XD, which along with the HAWQ query engine, form the foundation for the Business Data Lake architecture, a big data application framework for enterprise
Posted March 17, 2014
MapR Technologies, a provider of an enterprise-grade platform for NoSQL and Hadoop, is expanding its Asia Pac presence with a new office in Seoul and a partnership agreement with LG CNS, a global IT service provider that will provide system integration and consulting services across Korea for the MapR Distribution for Hadoop and NoSQL.
Posted February 19, 2014
Splice Machine, provider of a real-time SQL-on-Hadoop database for big data applications, has completed a $15M Series B round of funding, led by InterWest Partners, along with returning Series A investor Mohr Davidow Ventures (MDV). The investment will be used to accelerate product development and expand sales and marketing in preparation for the company's upcoming public beta offering later this quarter.
Posted February 18, 2014
To help secure sensitive data, including regulated and high-risk data like medical, payment, insurance and financial data, Gazzang is providing data encryption and key management support for Pivotal HD, the Hadoop distribution from Pivotal.
Posted February 11, 2014
Today at the Strata conference in Santa Clara, MapR Technologies unveiled the latest MapR Distribution including Hadoop 2.2 with YARN for next-generation resource management. The company also announced availability of the MapR Sandbox for Hadoop, which provides a fully-configured virtual machine installation of the MapR Distribution for Apache Hadoop to allow users to jump-start their Hadoop exploration; and the early access release of the HP Vertica Analytics Platform on MapR.
Posted February 11, 2014
Veristorm is providing a commercial Hadoop distribution for Linux on the mainframe, with drag-and-drop access to mainframe databases and files. The platform, vStorm Enterprise, also enables access and analytics of sensitive mainframe data and other big data sources.
Posted January 21, 2014
Not all Hadoop packages offer a unique distribution of the Hadoop core, but all attempt to offer a differentiated value proposition through additional software utilities, hardware, or cloud packaging. Against that backdrop, Intel's distribution of Hadoop might appear to be an odd duck since Intel is not in the habit of offering software frameworks, and the brand, while ubiquitous, is not associated specifically with Hadoop, databases or big data software. However, given its excellent partnerships across the computer industry, Intel has support from a variety of vendors, including Oracle and SAP, and many of the innovations in its distribution show real promise.
Posted December 18, 2013
While there have always been many database choices, it's only recently that enterprises have been embarking on new journeys with their data strategies. Today's database landscape is increasingly specialized and best of breed, due to the expanding range of new varieties of databases and platforms—led by NoSQL, NewSQL, and Hadoop. This is complicating the already difficult job of bringing all these data types together into a well-integrated, well-architected environment.
Posted December 04, 2013