02.09.2012

Open-Source Software Addresses Big Data

02.09.2012
Terry Flanagan

Cassandra and Hadoop promise orders of magnitude gains over relational databases.

Capital markets firms are experimenting with open-source database technology capable of capturing, storing, and analyzing enormous amounts of data.

Open-source data storage systems such as Hadoop and Cassandra are ideal for capital markets apps because they can process, store and trigger actions based on a high-volume real-time event stream, perform analytics on historical data, and update models directly into the application.

“A number of our customers are running projects to evaluate and test new tools such as Hadoop and Cassandra,” Roji Oommen, senior director, business development for financial services at Savvis, told Markets Media.

The explosion of Big Data has affected all industries, but the capital markets has its own unique set of issues, such as the need to capture time-series data and merge it with real-time event processing systems.

“As electronic trading becomes pervasive, and you’re collecting full depth tick data feeds, it’s a staggering amount of data,” said Oommen. “The data management issues associated with storing and transforming information are complex.”
Cassandra is an open source distributed database management system designed to store and allow very low-latency access to large amounts of data.

The Cassandra data model is designed for distributed data on a very large scale.

In a relational database, data is stored in tables and the tables comprising an application are typically related to each other.
Cassandra, is a column-oriented database, meaning that it stores its content by column rather than by row. This has advantages for heavy-duty-number crunching apps that involve complex queries.

“Columnar databases such are faster for processing time-series data than relational databases,” said Oommen. “Cassandra is an open-source columnar database, and firms are testing its applicability to tick data management.”

Hadoop is an open-source framework that allows for distributed processing of large data sets across clusters of computers. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage.

“Hadoop is a distributed computing framework developed by Yahoo,” said Oommen. “Hadoop distributes data and workload to commodity services and can scale arbitrarily large, up to exobytes.”

Pension funds, sovereign wealth funds, endowments and other institutional asset owners are sitting on vast troves of data -- but extracting value from that data is more challenging than ever.

#AssetOwners #DataQuality

Technology costs in asset management have grown disproportionately, but McKinsey research finds the increased spending hasn’t consistently translated into higher productivity.
#AI #Fiance

We're in the FINAL WEEK for the European Women in Finance Awards nominations – don't miss your chance to spotlight the incredible women driving change in finance!
#WomenInFinance #FinanceAwards #FinanceCommunity #EuropeanFinance @WomeninFinanceM

ICYMI: @marketsmedia sat down with EDXM CEO Tony Acuña-Rohter to discuss the launch of EDXM International’s perpetual futures platform in Singapore and what it means for institutional crypto trading.
Read the full interview: https://bit.ly/45xRUWh

Load More

Related articles

  1. Chainlink enables 21X to bring real-time, verifiable market data for tokenized securities onchain.

  2. The typology will help trading firms ready themselves for the pending European consolidated tape.

  3. This enables traders to anticipate volatility, minimize market impact & optimize execution in real time.

  4. This is a significant milestone towards mainstream adoption of onchain finance.

  5. From The Markets

    SIX Selects Corvil Analytics

    Corvil Analytics provides improved data transparency and helps optimize low-latency data delivery.

We're Enhancing Your Experience with Smart Technology

We've updated our Terms & Conditions and Privacy Policy to introduce AI tools that will personalize your content, improve our market analysis, and deliver more relevant insights.These changes take effect on Aug 25, 2025.
Your data remains protected—we're simply using smart technology to serve you better. [Review Full Terms] | [Review Privacy Policy] By continuing to use our services after Aug 25, 2025, you agree to these updates.

Close the CTA