Data Warehousing

Hardware and software that support the efficient consolidation of data from multiple sources in a Data Warehouse for Reporting and Analytics include ETL (Extract, Transform, Load), EAI (Enterprise Application Integration), CDC (Change Data Capture), Data Replication, Data Deduplication, Compression, Big Data technologies such as Hadoop and MapReduce, and Data Warehouse Appliances.



Data Warehousing Articles

As we advance deeper into the digitally roaring 2020s, data executives and professionals are seeing change on a scale never seen before in their careers. A new generation of technologies that often build on previous solutions means new ways of working and ensuring performance for today's increasingly data-driven enterprises. We asked industry leaders for their views on what technology is enhancing enterprises' ability to compete on data.

Posted June 22, 2022

When organizations move their SAP applications to a cloud provider, what they are really doing is placing their mission-critical application into someone else's hands—this exacerbates the need for more cybersecurity monitoring to ensure the provider is handling it with care. Beware, some cloud service providers offer a monitoring service, but the customer also needs to have a process in place to understand what activities are ongoing in the hosted SAP system. 

Posted June 22, 2022

Traceable AI, a leading API security and observability company, is introducing an enhanced API Catalog solution, enabling organizations to overcome their challenges with API discovery and risk assessment.

Posted June 21, 2022

Aerospike, Inc., a real-time data platform provider, is introducing Aerospike SQL Powered by Starburst, enabling customers to run massively parallel, complex SQL queries on petabyte-scale data stored in the Aerospike real-time Data Platform. A hardened solution built on top of the open-source Trino platform, the Aerospike SQL Powered by Starburst solution gives data analysts and data scientists a single point of access to federated data using existing SQL analytic tools such as Tableau, Qlik, Power BI, and others.

Posted June 16, 2022

Snow Software, a provider of technology intelligence, and Anodot, the business monitoring company, are forming a new strategic partnership to help organizations solve the urgent challenge of managing rapidly growing and increasingly complex cloud costs. Snow and Anodot will bridge the disciplines of IT asset management (ITAM) and finance operations (FinOps) to address evolving issues associated with cloud cost management.

Posted June 13, 2022

BMC has announced the launch of the BMC Innovation Labs Preferred Partner Program, a collaborative co-innovation program built for, and with, BMC partners and customers. The program is focused on ideation and experimentation to design and ultimately commercialize modern technology solutions to support customers on their Autonomous Digital Enterprise journey. BMC partners can visit the BMC Innovation Labs Preferred Partner Program website to request consideration for the invitation-only program.

Posted June 13, 2022

Data archiving is an important aspect of data governance and data management. Not only does archiving help to reduce hardware and storage costs, but it is also an important aspect of long-term data retention and a key participant in regulatory compliance efforts. When long-term data retention (anything more than a couple of years) is imposed on your data, archiving it can be the optimal solution.

Posted June 13, 2022

MariaDB Corporation and MindsDB, a provider of in-database machine learning, are partnering on a solution that makes machine learning predictions easy and accessible to cloud database users. By using MindsDB in SkySQL, MariaDB's fully managed cloud database service, data science and data engineering teams can increase their organization's predictive capabilities to plan for and address real-world business issues, according to the vendors.

Posted June 09, 2022

Oracle has announced that its deal to acquire Cerner Corp. will close on June 8, 2022. Cerner is a provider of digital information systems used within hospitals and health systems by medical professionals. Oracle previously announced on June 1 that all required antitrust approvals had been obtained for its proposed acquisition of Cerner, including European Commission clearance. Larry Ellison, Oracle's chairman of the board and chief technology officer, will outline Oracle's strategy to redefine the future of healthcare at a virtual event on June 9, 2022, at 3 p.m. CT.

Posted June 07, 2022

Lacework, the data-driven cloud security company, is introducing new agentless scanning for workloads, providing organizations with comprehensive and frictionless visibility into vulnerability risks across all active hosts, containers, and application language libraries in their environment.

Posted June 07, 2022

Coalesce, the data transformation company, is forming partnerships with Fivetran and Snowflake to make data teams more efficient and productive, delivering faster results than any other data transformation tool on the market. The partnership with Coalesce allows Snowflake customers to automate the data transformation process, cutting down on time and resources spent preparing data to get the most out of Snowflake. Without Coalesce, the process is manual and time consuming, according to the vendors.

Posted June 07, 2022

Cockroach Labs, the company behind CockroachDB, is introducing CockroachDB 22.1, delivering updates across the entire application lifecycle that empower developers and architects to accomplish more, with less effort.

Posted June 03, 2022

Rivery, the SaaS ELT provider, has raised $30 million in a new venture capital funding round that will enable the company to expedite its growth across all teams in New York and at its Tel Aviv HQ, including R&D, Product, and Sales, and to expand in EMEA, where a London office has been launched to focus on the regional market.

Posted June 03, 2022

Data lakes are being widely adopted as the data infrastructure for the analytics and applications that serve business growth and operational needs, because of their scalability and flexibility. Thanks to their "schema-on-read" structure, data lakes are strong at parking petabytes of data and at production delivery. But every coin has two sides. As semantically flexible data stores that often bypass governance efforts, data lakes have also been seen as muddy swamps that are inefficient for data management.

Posted June 02, 2022

Broadcom Inc., a global technology provider that designs, develops, and supplies semiconductor and infrastructure software solutions, announced it is acquiring VMware, Inc., an innovator in enterprise software. Broadcom will acquire all of the outstanding shares of VMware in a cash-and-stock transaction that values VMware at approximately $61 billion, based on the closing price of Broadcom common stock on May 25, 2022. In addition, Broadcom will assume $8 billion of VMware net debt.

Posted June 02, 2022

There are so many new buzzwords lately, including the data lakehouse, data mesh, and data fabric, just to name a few. But what do all these terms mean, and how do they compare to a data warehouse? This presentation covers all of them in detail and explains the pros and cons of each, with suggested use cases so attendees can see what approach will really work best for their big data needs.

Posted June 02, 2022

Microsoft has recently released a powerful new DMV specifically to help with memory issues, sys.dm_os_out_of_memory_events. It is currently available in Azure SQL Database and Azure SQL Managed Instances. This DMV consolidates and simplifies telemetry from SQL Server ring buffers, applies heuristics, and provides a result set. The DMV stores a record for each out-of-memory (OOM) event that occurs within the database, providing details about the OOM root cause, the memory consumption of database engine components at that point in time, potential sources of memory leaks, and more.

Posted June 02, 2022

PlanetScale, the serverless database provider powered by MySQL and Vitess, is offering a number of new innovations that accelerate delivery of "The Future Database," with new Insights providing granular performance visibility, Portals for multi-region deployment, and Connect enabling expansive analytics platform integrations.

Posted June 02, 2022

Data Intensity recently announced its deepened commitment to accelerate and transform Oracle-powered workloads to Oracle Cloud Infrastructure (OCI), combining a strategic partnership with its own migration and lifecycle management portfolio of expert technical and functional support services.

Posted June 01, 2022

The coming decade is going to require a modern data warehouse to meet demanding new requirements for machine learning, data variety, and real-time analytics—while still satisfying the more traditional need for analysis of structured data at scale.

Posted June 01, 2022

Deepnote, an early-stage startup backed by Accel and Index Ventures, is emerging from beta with version 1.0, making its collaborative data science notebooks generally available to data teams worldwide. Since the company's Series A announcement in January 2022, Deepnote has added many features going into the 1.0 launch. Most notable is the addition of Deepnote Workspaces, which empowers data teams to organize and surface data projects, notebooks, and apps in one place.

Posted May 31, 2022

SAP is introducing new innovations that deliver business value for customers in four critical areas: supply chain resilience, sustainability, business process transformation, and no-code application development. The innovations announced will help SAP customers accelerate their transformation journey with cloud-based solutions that provide the end-to-end business process support customers most need, according to the vendor.

Posted May 25, 2022

Push Technology, a provider of real-time data streaming and messaging solutions, is releasing Diffusion 6.8, adding new features that include the Diffusion Gateway Framework, expanded data wrangling calculations and conditionals, and journal logging.

Posted May 20, 2022

Data consumers need data for BI and analytics to make business decisions. But for most organizations, their current data infrastructure isn't keeping up with demand. In a presentation at Data Summit 2022, titled "Building the Open Data Lakehouse," Mark Lyons, senior director, product management, Dremio, explained why more organizations are moving their analytics and BI to an open data lakehouse and how you can build a successful lakehouse strategy.

Posted May 18, 2022

No other subject seems to capture the attention of IT leaders right now like database migrations. If there were an IT theme for 2022, it would be: Enterprises migrate from legacy data warehouses to the cloud. And it is no longer just the "early adopters" but the entire customer base that is looking to make the move to cloud-based systems. Let's examine the three most common problems that hamper the execution of migration projects and what can be done to avert migration disasters.

Posted May 18, 2022

Thomas Hazel, founder/CTO, ChaosSearch, examined the tools and technologies to get more value from data and how to determine which ones are right for your organization in a Data Summit 2022 keynote. By stripping away data engineering complexity and lowering total cost of infrastructure ownership and maintenance, more and more organizations are unlocking the value of analytics at scale.

Posted May 17, 2022

Data is often described as "the new oil," a valuable fuel flowing through organizations. But it is time to stop talking about data as the new oil and concentrate instead on acting on its true importance. This is the view of Doug Laney, author of "Infonomics," who gave the opening keynote talk at Data Summit 2022 in Boston.

Posted May 17, 2022

Around 85% of analytics, big data, and AI projects will fail, despite massive investments of money. This is not news, but it still reflects how powerfully design affects speed, scale, and usage. At Data Summit 2022, Brian O'Neill, founder and principal, Designing for Analytics, presented his session, "Technically Right, Effectively Wrong: How to Avoid Creating the ML or Analytics Application No Customer Wants to Use."

Posted May 17, 2022

The case for increased data automation is clear. "Data teams are spending significant amounts of time on service requests like infrastructure, user provisioning, and incident coordination and communication," said Tina Huang, CTO and founder of Transposit. "Teams today are often manually creating tickets, Slack channels, and Zoom meetings, plus communicating with stakeholders. Data teams must ensure internal customers using data have access to the data they need and real-time updates about interferences with that data." Other tasks ripe for automation include log parsing, correlation, permissions and access, and more.

Posted May 16, 2022

Couchbase has announced version 7.1 of Couchbase Server, a new release that delivers advancements in performance, storage capacity, and workload breadth, including expanded operational analytics support with direct Tableau integration, all while reducing deployment cost. According to Couchbase, with 7.1, enterprise architects and development teams reduce the cost of building and running applications while gaining operational efficiency. "More organizations are experiencing the drawbacks of deploying first-generation cloud architectures, and one of the main disadvantages is the cost of cloud instance sprawl," said Ravi Mayuram, chief technology officer at Couchbase.

Posted May 10, 2022

Domino Data Lab, provider of a leading enterprise MLOps platform, is introducing Domino 5.2, continuing Domino's progress toward helping enterprises become model-driven.

Posted May 09, 2022

Alluxio, the developer of the open source data orchestration platform for data-driven workloads such as large-scale analytics and AI/ML, is releasing version 2.8 of its Data Orchestration Platform, featuring enhanced interface support for the Amazon S3 REST API; security improvements for sensitive applications with strict encryption compliance and regulatory requirements; and strengthened automated data movement functionality across heterogeneous storage systems.

Posted May 04, 2022

The volume, velocity, and veracity of today's data deluge have put immense pressure on underlying data platforms and organizations' abilities to manage them effectively. And the pandemic has only exacerbated the problem. According to a 2021 survey, nearly half of digital architects are under high or extremely high pressure to deliver digital projects, but 61% blame legacy technology for making it difficult to complete modernization efforts. That said, databases of all types (SQL, NoSQL, or NewSQL), be they on-prem, cloud, hybrid, or edge, are struggling to navigate this new reality.

Posted May 04, 2022

The value of normalization is in understanding the data well enough to create the normalized design. Pulling out the business rules, business terms, and relationships from the mass of jumbled-together raw content is critical. The business rules that result from performing the normalization exercise establish the requirements that need to be satisfied by solutions, whether they are built or purchased. When an organization creates and maintains a normalized design for the data within the important areas of its business, it reduces work on all future systems.

Posted May 04, 2022

The 9th annual Data Summit conference will be held May 17-18, 2022, at the Hyatt Regency Boston. Pre-conference workshops will take place on May 16, 2022. The program is available for review and a variety of pass options are available to suit individual requirements.

Posted May 04, 2022

It is well known that a database is the fundamental building block for any data-based initiative. Databases are used when collecting, storing, processing, and analyzing data. A database is the silent component that drives business decisions and operational improvements or simply keeps track of inventory. As much as the database should be the almost invisible part of these processes, it is crucial to make the right choice. While it might look easy to select a suitable database, there are a few things to evaluate when making a decision.

Posted May 04, 2022

Having access to the latest version of open source databases is important to optimize your workloads for availability, performance, security, and more. In February 2022, AWS launched MariaDB version 10.6 for Amazon RDS for MariaDB alongside a number of other exciting capabilities.

Posted May 03, 2022

Many organizations are working hard to move to the cloud, but find that migration brings complexity. Recently, Derek Swanson, CTO of Silk, offered advice on what to evaluate to successfully take advantage of all the cloud has to offer, the issues to consider when determining what infrastructure will best serve each workload, and the risks of going to the cloud with the wrong strategy.

Posted May 02, 2022

Ocient, a hyperscale data analytics solutions company, is releasing version 19 of the Ocient Hyperscale Data Warehouse, enabling organizations to execute previously infeasible workloads in interactive time. With Ocient, organizations can tackle CPU-intensive workloads with ease, including large-scale joins and full-table scans with extreme I/O performance, returning results in seconds or minutes versus hours or days, according to the vendor.

Posted April 29, 2022

LogDNA, a leading observability data platform, is introducing several platform capabilities that empower companies to get more out of log data while maintaining control over costs. Enterprise users can now access Variable Retention and Enterprise Organizations, while all users benefit from new log control features, including Log Data Restoration, Usage Quotas, and Index Rate Alerting.

Posted April 29, 2022

Arcion is partnering with Databricks to offer preconfigured, validated data replication for users of Databricks through that company's new Partner Connect program. Arcion's product enables faster, more agile analytics and AI/ML by empowering enterprises to integrate mission-critical transactional systems with their Databricks Lakehouse in real time, at scale, and with guaranteed transactional integrity, according to the vendor.

Posted April 28, 2022

Airbyte, creator of an open-source data integration platform, is releasing its cloud service for data movement in the U.S. "With Airbyte Cloud, we remove the headache of building and maintaining custom data infrastructure by providing a simple, economical way for enterprises to move data as needed," said Michel Tricot, co-founder and CEO, Airbyte.

Posted April 28, 2022
