Hadoop

The Apache Hadoop framework for the processing of data on commodity hardware is at the center of the Big Data picture today. Key solutions and technologies include the Hadoop Distributed File System (HDFS), YARN, MapReduce, Pig, Hive, Security, as well as a growing spectrum of solutions that support Business Intelligence (BI) and Analytics.

Hadoop Articles

Qubole Reveals Cache Feature Called RubiX for Cloud Storage Systems

Qubole unveiled a new feature to its Qubole Data Service (QDS) called auto-caching, a next-generation disk cache for cloud storage systems that works across different data engines.

Posted June 28, 2016

Hortonworks and AtScale Collaborate to Offer Easy Way to Connect to Hadoop

Hortonworks, Inc.,is partnering with AtScale to resell AtScale's technology, providing users with the ability to query data without any data movement from any business intelligence tool.

Posted June 28, 2016

New DgSecure Release Advances Cross-Platform Data Security

Dataguise has announced the availability of Dataguise DgSecure 6.0, the company's data security platform. According to the vendor, DgSecure 6.0 offers a monitoring solution for all data source types, allowing users to quickly understand what, where, and how sensitive data is being detected, protected, and accessed across the enterprise.

Posted June 28, 2016

Attunity Rolls Out New Hadoop Platform

Attunity Ltd. is releasing a new version of Attunity Visibility for Hadoop with enhanced technology to enable comprehensive data usage analytics for large-scale and fast-growing Hadoop Data Lake environments. The new release brings "a very unique, needed solution to the fast and growing market of Hadoop," said Itamar Ankorion, EVP of business development and corporate strategy at Attunity.

Posted June 28, 2016

MapR Increases User Productivity and Hadoop Support with Spyglass Initiative

MapR Technologies is introducing a new initiative that will help support Hadoop deployments and increase user and administrator productivity.

Posted June 28, 2016

Pepperdata Introduces New Solution to Assess the Health of Hadoop Clusters

Pepperdata is unveiling a new tool that will evaluate and assess Hadoop clusters and provide visibility into current cluster conditions.

Posted June 27, 2016

BI Vendors Partner with Teradata to Enhance SQL-on-Hadoop Solution

At the 2016 Hadoop Summit in San Jose, Teradata announced the certification of multiple BI and visualization solutions on the Teradata Distribution of Presto.

Posted June 27, 2016

Trifacta Strengthens Partnership with Hortonworks and Gains Apache Atlas Certification

Trifacta, a provider of data wrangling software, is deepening technical integration with the Hortonworks Data Platform (HDP) and the industry's first certification for Apache Atlas, a data governance and metadata framework for Hadoop.

Posted June 23, 2016

Dealing With Big Data’s Trough of Disillusionment

For those who haven't encountered the term, the "trough of disillusionment" is a standard phase within the Gartner hype cycle. New technologies are expected to pass from a "peak of inflated expectations" through the trough of disillusionment before eventually reaching the "plateau of productivity." Most new technologies are expected to go through this trough, so it's hardly surprising to find big data entering this phase.

Posted June 22, 2016

12 Game-Changing Technologies Fueling the Data-Driven Enterprise

The data manager now sits in the center of a revolution swirling about enterprises. In today's up-and-down global economy, opportunities and threats are coming in from a number of directions. Business leaders recognize that the key to success in hyper-competitive markets is the ability to leverage data to draw insights that predict and provide prescriptive action to stay ahead of markets and customer preferences. For that, they need to keep up with the latest solutions and approaches in data management. Here are 12 of the key technologies turning heads—or potentially opening enterprise wallets—in today's data centers.

Posted June 22, 2016

Koverse 2.0 Offers Organizations a Data Lake In-a-Box to Help Derive Faster Value from Big Data

Announcing a new version of its "data lake in-a-box," Koverse, Inc. has released the Koverse Platform Version 2.0, which provides enhancements for organizations trying to extract value out of their investments in big data and analytics. With this introduction, the company says it is offering enterprises a 30-day guarantee to bring their data into production with a data lake capable of delivering insights for real-world organizational challenges.

Posted June 21, 2016

Hortonworks Upgrades its Global Professional Services Program

Hortonworks, Inc. is enhancing its Global Professional Services (GPS) program to support and enable Hortonworks Connected Data Platforms customers.

Posted June 16, 2016

Talend Now Provides All Products on Amazon Web Services

Talend is "going all-in" with Amazon Web Services (AWS), now providing its entire product line on AWS cloud. The latest release of Talend Integration Cloud extends the company's ability to allow IT organizations to quickly "spin up and spin down" big data and data integration workloads running on Amazon Redshift or Amazon EMR.

Posted June 10, 2016

Cloudera and Microsoft Partner to offer new Open Source Platform Called Livy

Cloudera is collaborating with Microsoft to build a new open source platform that will reduce the burden on application developers leveraging Spark. The two entities, together with other open source contributors, have built a new open source Apache licensed REST-based Spark Service, called Livy, which is still in early alpha development.

Posted June 09, 2016

Progress Unveils New Suite of Solutions to Boost Digital Businesses

Progress is releasing a new package of platforms that will enable enterprises to tap into the full potential of digital business. Progress DigitalFactory is a new cloud-based platform that provides a holistic, extensible solution for businesses to create omni-channel digital experiences.

Posted June 08, 2016

What's Ahead in 2016 - 10 Emerging Companies to Watch in Data

Emerging and newer vendors can offer fresh, innovative ways of dealing with data management and analytics challenges. Here, DBTA looks at the 10 companies whose approaches we think are worth watching.

Posted June 06, 2016

Teradata Democratizes Big Data with Aster Connector for Spark

Teradata has introduced the Teradata Aster Connector for Spark, an integration of Apache Spark analytics with Teradata Aster Analytics. The connector enables pre-built analytics functions from both solutions to be executed from Aster Analytics, enabling anyone who can use Aster Analytics to also run advanced analytics on Spark without the need to learn or know Scala.

Posted June 06, 2016

Couchbase Adds Spark Connector to Fuel Operational Business Insights

NoSQL database technology vendor Couchbase has introduced a new Couchbase Spark Connector. According to Couchbase, the new Spark connector will enable businesses to gain business insights faster, enabling them to deliver better customer experiences through web, mobile and IoT applications.

Posted June 06, 2016

MapR Launches Dedicated Enterprise-Grade Spark Distribution

Today at Spark Summit, MapR Technologies is announcing a new enterprise-grade Apache Spark Distribution. "This is a Spark-focused distribution that combines Apache Spark with the real time, persistent, web-scale data layer of MapR," said Jack Norris, SVP, Data and Applications, MapR. The new Spark Distribution option for the MapR Converged Data Platform enables advanced analytics - including batch processing, machine learning, procedural SQL, and graph computation, and is a production-ready platform for Spark workloads on-premise and in the cloud.

Posted June 06, 2016

OpenText Acquires Recommind to Strengthen its Portfolio of Solutions

Open Text Corporation, a provider of Enterprise Information Management solutions, is entering into a definitive agreement to acquire Recommind, Inc., a leading provider of eDiscovery, and information analytics. The transaction purchase price is approximately $163 million and with this acquisition, Recommind's eDiscovery platform will complement OpenText's own enterprise information management (EIM) solutions.

Posted June 02, 2016

Oracle Announces 5 Key Enhancements to NoSQL Database

Oracle is introducing version 4.0 of its NoSQL database. First introduced in 2011, the Oracle NoSQL Database is a key-value database that evolved from the company's acquisition of BerkeleyDB Java Edition, a mature, high-performance embeddable database. Ashok Joshi, senior director of NoSQL, Berkeley Database, and Database Mobile Server at Oracle, outlined the key enhancements in the new release.

Posted June 01, 2016

Dynatrace Partners with Pivotal on Cloud Foundry

Dynatrace, a digital performance software company, is teaming up with Pivotal, to deploy its application monitoring solutions for the Pivotal Cloud Foundry (PCF) platform. The integration of Dynatrace with Pivotal Cloud Foundry will enable companies to take advantage of this acceleration by collecting analytics for applications running on PCF, allowing them to detect and act on performance shortcomings and optimize end-to-end transaction latencies.

Posted May 26, 2016

Alpine Data Upgrades its Integrated Analytics Platform

Alpine Data is making advancements to its Chorus platform, combining an integrated analytics platform with improvements that will accelerate the delivery of data. Chorus 6 now delivers capabilities that will help business leaders take the reins in assisting organizations in managing processes that connect machine learning to business behavior.

Posted May 26, 2016

Trends To Watch: Data Lakes in Clouds, Behavioral Analytics Goes Mainstream

Thanks to the cloud and other empowering technologies such as Hadoop and Apache Spark, we're at the tipping point for big data. These technologies now provide a path to big data success for companies who otherwise lack the specialized big data skills or heretofore proprietary (and expensive) infrastructure to do it themselves. As 2016 progresses, we'll see the broader market put big data capabilities to work and the benefits of big data will, in turn, spread beyond the privileged few companies that were early big data adopters.

Posted May 25, 2016

EMC Strengthens Archiving Platform for Unstructured and Structured Data

EMC Corp.'s Enterprise Content Division (ECD) is releasing an upgraded version of its EMC InfoArchive platform, enhancing the ability to secure and leverage large amounts of critical data and content.

Posted May 25, 2016

Trifacta Launches New Wrangler Partner Program

Trifacta, a provider of data prep software and tools, is introducing the Wrangler Partner Program, a global program that will support and enable partners to sell, implement, and innovate with Trifacta.

Posted May 24, 2016

Syncsort Updates Platform to Enable Integration of Streaming Data

Syncsort is adding new capabilities to its platform, including native integration with Apache Spark and Apache Kafka. DMX-h v9 allows organizations to access and integrate enterprise-wide data with streams from real-time sources.

Posted May 18, 2016

Latest SnapLogic Release Adds Streaming Data Integration

SnapLogic is unveiling new updates to its SnapLogic Elastic Integration Platform that add the ability to integrate streaming data and power big data analytics in the cloud. The Spring 2016 release adds support for Apache Kafka, Microsoft HDInsight, and Google Cloud Storage, plus multiple enhancements that automate data shaping and management tasks.

Posted May 18, 2016

BI-on-Hadoop Provider AtScale Raises Series B Funding Round

AtScale, Inc., which provides a self-service BI platform for Hadoop, has raised a Series B round of $11 million, bringing its total funding to date to $20 million. According to Bruno Aziza, chief marketing officer of AtScale, its platform is different from others in three key ways, making it applicable to use cases in an array of industries including healthcare, telecommunications, retail, and financial services.

Posted May 17, 2016

Cognitive Search and Powerful Analytics Combine in Sinequa ES Version 10 with Spark at its Core

Sinequa has announced the general availability of Sinequa ES Version 10. Powered by machine learning capabilities, the new version aims to deliver deep analytics of contents and user behavior, and offer information with continually improving relevance to users in their work environments. In order to achieve this advancement into the world of cognitive computing, with this new version, Sinequa has integrated the Spark platform in its distributed architecture and implemented machine learning algorithms on Spark within the core of its product

Posted May 05, 2016

What Oracle’s NoSQL SQL Database Reveals

Say what you will about Oracle, it certainly can't be accused of failing to move with the times. Typically, Oracle comes late to a technology party but arrives dressed to kill.

Posted May 04, 2016

Qubole Offers Open Sourced Version and Partners with Looker

Qubole is announcing two major changes. It is releasing an open sourced version of its StreamX tool and forming a partnership with Looker.

Posted May 02, 2016

Melissa Data Partners with Pentaho on Data Quality Tools for Hadoop

Enabled by a partnership with Pentaho, a Hitachi Group Company, and integration with Pentaho's Big Data Integration and Analytics platform, Melissa Data's data quality tools and services can now be scaled across the Hadoop cluster to cleanse and verify data center records.

Posted April 27, 2016

MapR, Cisco, and SAP Come Together to Create Tools for SAP HANA

Cisco is launching an appliance that includes the MapR Converged Data Platform for SAP HANA, making it easier and faster for users to take advantage of big data. The UCS Integrated Infrastructure for SAP HANA is made easy to deploy, speeds time to market, and will reduce operational expenses along with providing users with the flexibility to choose a scale-up (on-premises) or scale-out (cloud) storage strategy.

Posted April 27, 2016

Cloudera Enterprise 5.7 Boosts Data Processing with Hive-on-Spark Support

Cloudera, provider of a data management and analytics platform built on Apache Hadoop and open source technologies, has announced the general availability of Cloudera Enterprise 5.7. According to the vendor, the new release offers an average 3x improvement for data processing with added support of Hive-on-Spark, and an average 2x improvement for business intelligence analytics with updates to Apache Impala (incubating).

Posted April 26, 2016

Neo4j Boosts its Graph Database

Neo Technology, creator of Neo4j, is releasing an improved version of its signature platform, enhancing its scalability, introducing new language drivers and a host of other developer friendly features.

Posted April 26, 2016

Ground-Breaking Research on New IT Trends Adoption is Presented at COLLABORATE 16

The COLLABORATE 16 conference for Oracle users kicked off with a presentation by Unisphere Research analyst Joe McKendrick who shared insights from a ground-breaking study that examined future trends and technology among 690 members of three major Oracle users groups.

Posted April 25, 2016

Bridging the Data Divide: Getting the Most Value From Data With Integration

The need for data integration has never been more intense than it has been recently. The Internet of Things and its muscular sibling, the Industrial Internet of Things, are now being embraced as a way to better understand the status and working order of products, services, partners, and customers. Mobile technology is ubiquitous, pouring in a treasure trove of geolocation and usage data. Analytics has become the only way to compete, and with it comes a need for terabytes—and gigabytes—worth of data. The organization of 2016, in essence, has become a data machine, with an insatiable appetite for all the data that can be ingested.

Posted April 25, 2016

GridGain Offers New Edition of Flagship Platform

GridGain Systems, provider of enterprise-grade in-memory data fabric solutions based on Apache Ignite, is releasing a new version of its platform. GridGain Professional Edition includes the latest version of Apache Ignite plus LGPL libraries, along with a subscription that includes monthly maintenance releases with bug fixes that have been contributed to the Apache Ignite project but will be included only with the next quarterly Ignite release.

Posted April 20, 2016

Dataguise DgSecure Extends Data Security with Support for AWS EMR/S3

Dataguise, a provider of data security solutions, is making DgSecure available for the detection, monitoring, and protection of sensitive data across Amazon Web Services (AWS) Simple Storage Service (S3) and all Elastic MapReduce (EMR) platforms that use AWS S3.

Posted April 19, 2016

Sumo Logic Offers Platform that Analyzes Metrics and Log Analytics on AWS

Sumo Logic, a provider of cloud-native, machine data analytics services, is unveiling a new platform that natively ingests, indexes, and analyzes structured metrics data, and unstructured log data together in real-time.

Posted April 18, 2016

Hortonworks Augments its Platform and Strengthens its Partnerships

Hortonworks is making several key updates to its platform along with furthering its mission as being a leading innovator of open and connected data solutions by enhancing partnerships with Pivotal and expanding upon established integrations with Syncsort.

Posted April 15, 2016

Databricks' Kavitha Mariappan on Why Spark is So Hot Now

First created as part of a research project at UC Berkeley AMPLab, Spark is an open source project in the big data space, built for sophisticated analytics, speed, and ease of use. It unifies critical data analytics capabilities such as SQL, advanced analytics, and streaming in a single framework. Databricks is a company that was founded by the team that created and continues to lead both the development and training around Apache Spark.

Posted April 14, 2016

Pages
1	2	3	4	5	6	7	8	9	10	11	12	13	14	15	16	17

Newsletters

Hadoop Articles

White Papers

Sponsors