Share this
Calculated Data Types Now Available in DataSignals® Cloud
by Sam Lockshin
A major arctic blast will bring cold temperatures to a large swath of the Central US with the coldest temperatures – and resulting peak electricity demand – expected Thursday morning. In contrast to the winter storm in mid-January, the cold air will not reach as far south; however, the most extreme temperatures will affect more of the Great Plains and Midwest regions this time around.
Additionally, the peak load hours are expected to coincide with low renewable production levels, meaning that the net load on the system could set new winter records. If that happens, the risk of market-wide energy price spikes will grow, particularly if temperatures come in even a few degrees below forecast.
Why Add Calculated Data to the Cloud?
Calculated data types, also known as functional, derived, or feature-engineered data sets, are generated by combining multiple base data types to produce new, meaningful data signals.
Estimated congestion is one example of a calculated data type. Some Independent System Operators (ISOs), such as the Electric Reliability Council of Texas (ERCOT), don’t report the congestion component of their locational marginal pricing (LMP) data – they only provide the price and the energy components from multiple reports.
To understand a location’s congestion at a point in time, data scientists, data engineers, or financial traders must manually calculate congestion at the node using the LMP and energy data ERCOT publishes. Or they can leverage the Yes Energy DataSignals API.
The DataSignals API calculates multiple derived datasets, including congestion for each ERCOT node. It enables you to search one location and data type at a time for a specific data range.
While ideal for smaller, targeted use cases, the API has a limited historical bandwidth and throttle limits that dictate how much and how frequently users can pull data. The requested data points are calculated in real time and displayed to the user but aren’t materialized as tables anywhere in the system. Users looking to analyze multiple data sets must pull each separately and concatenate them, creating their own materializations.
This API’s inherent lack of materialization and limited scalability can bottleneck broader historical studies. DataSignals Cloud, on the other hand, is well suited to facilitate such analyses.
Introducing Calculated Data Types in DataSignals Cloud
DataSignals Cloud, Yes Energy’s data warehouse, is built on the Snowflake platform. Snowflake excels at storing large amounts of historical data at scale, enabling us to provide customers with a faster, more convenient way to access derived data sets than is possible with just the API.
Our new calculated data type feature leverages the hundreds of millions of rows of historical data stored in DataSignals Cloud to populate well-organized tables. Because these tables live in the cloud, users can query them to quickly gather all the data necessary to complete their analysis, rather than piecing the information together from multiple data sets extracted with the API.
Calculated data types help you simplify your data pipelines.
In the first phase of our calculated data type rollout, DataSignals Cloud has been updated to include 15 new searchable data types.
First, we’ve added the complete history of calculated congestion for ERCOT’s day ahead and real-time markets, including for every single price node and historical date-time record.
Sample data of our new calculated day ahead estimated congestion for ERCOT, showing the deep history and data structure.
We’ve also integrated the 13 data types that comprise our calculated weather normals data series across North America, including temperature and dew point norms, cloud cover, wind speed, and other data points critical to understanding the weather patterns and historical trends that inform energy demand predictions.
Sample data of our new calculated weather normals data, showing the various data series and data structure.
Let’s note one significant difference between the data in the API and the more robust historical data in the DS Cloud. While the API pulls data in real time, the calculated data in the cloud is near real time, updated at least once a day. Because this poses some challenges in orchestrating data updates, we opted for daily updates so we can materialize the vast historical data as tables or views that users can filter. We are looking to increase the data refresh frequency as we continue to develop this feature, assisted by Right Triangle Consulting.
Streamline Your Operations with Calculated Data Types in DS Cloud
Whether you're a financial trader working in the day-ahead market, an asset developer making decisions about where you want to site a plant, or a wind farm operator evaluating how to best optimize an asset, this new approach to gathering calculated data will streamline your operations. By materializing historical values in the cloud, this new feature saves you time and money and better supports your trading and business decisions.
With DS Cloud’s derived data, you no longer have to establish multiple data-scraping mechanisms – the complete historical data set is already calculated and available to input into your backcast models, streamlining your energy price and grid condition predictions.
Additionally, the feature reduces your extract-load-transform processes because each derived data type is consolidated into an easy-to-use, familiar table structure.
Finally, the feature accelerates workflows for those wanting to write the calculated data to their storage systems before conducting grid analytics and backcast modeling.
Further Expansion of Calculated Data on the Horizon
Stay tuned. Phase one of the calculated data rollout includes 15 new data types – the day ahead and real-time congestion data sets for all ERCOT price nodes and our 13 weather normals data types for all weather stations. These tables are live and available for use, but they’re just the beginning.
Our team will continue to release additional calculated data types that enrich your data collections for grid analytics, forecasting, and modeling purposes. Ultimately, we plan to have calculated datasets available in the DataSignals Cloud for each ISO and Regional Transmission Organization (RTO).
We also plan to add metadata in the Snowflake metadata views for each derived data point.
Finally, while the calculated data type tables are currently only available in DataSignals Cloud, we foresee these datasets being available in DataSignals Lake in the near future.
The addition of calculated data types to the DataSignals Cloud is just one way Yes Energy demonstrates its commitment to better data and evolving and expanding our state-of-the-art DataSignals product. We believe this new feature will save you time and money and streamline your forecasting and backcasting operations to help you make better market decisions.
If you’re a current customer, you can learn more about this on Knowledge Owl, and you can check out more in-depth examples such as code snippets and workflows in our code repository.
If you want to see more, contact Yes Energy to schedule a demo and learn how our new cloud-based calculated data types can help you leverage the latest power market data to improve your energy price predictions, help you make better business and trading decisions, and drive more revenue.

Share this
- Industry News & Trends (91)
- Power Traders (77)
- Asset Managers (40)
- Data, Digital Transformation & Data Journey (35)
- Asset Developers (29)
- Market Events (29)
- ERCOT (26)
- Utilities (25)
- Market Driver Alerts - Live Power (24)
- PowerSignals (22)
- ISO Changes & Expansion (21)
- Infrastructure Insights Dataset (21)
- Yes Energy Demand Forecasts (20)
- DataSignals (19)
- Live Power (19)
- Renewable Energy (18)
- Risk Management (17)
- Data Scientists (16)
- CAISO (12)
- Energy Storage / Battery Technology (12)
- PJM (12)
- QuickSignals (10)
- MISO (9)
- Position Management (9)
- SPP (9)
- EnCompass (8)
- Power Markets 101 (8)
- Submission Services (8)
- Financial Transmission Rights (7)
- Snowflake (6)
- Powered by Yes Energy (5)
- Asset Developers/Managers (4)
- Data Centers (4)
- FTR Positions Dataset (4)
- Geo Data (4)
- Solutions Developers (4)
- Commercial Vendors (3)
- ISO-NE (3)
- NYISO (3)
- Battery Operators (2)
- Independent Power Producers (2)
- PeopleOps (2)
- data quality (2)
- AI and Machine Learning (1)
- Crypto Mining (1)
- Europe (1)
- FERC (1)
- Ireland (1)
- Japanese Power Markets (1)
- Natural Gas (1)
- Western Markets (1)
- hydro storage (1)
- nuclear power (1)
- February 2025 (7)
- January 2025 (7)
- December 2024 (4)
- November 2024 (7)
- October 2024 (6)
- September 2024 (5)
- August 2024 (10)
- July 2024 (9)
- June 2024 (4)
- May 2024 (7)
- April 2024 (8)
- March 2024 (6)
- February 2024 (9)
- January 2024 (4)
- December 2023 (4)
- November 2023 (5)
- October 2023 (6)
- September 2023 (2)
- August 2023 (5)
- July 2023 (3)
- May 2023 (4)
- April 2023 (2)
- March 2023 (1)
- February 2023 (2)
- January 2023 (4)
- December 2022 (2)
- November 2022 (1)
- October 2022 (3)
- September 2022 (5)
- August 2022 (4)
- July 2022 (3)
- June 2022 (2)
- May 2022 (1)
- April 2022 (3)
- March 2022 (3)
- February 2022 (6)
- January 2022 (3)
- December 2021 (1)
- November 2021 (2)
- October 2021 (4)
- September 2021 (1)
- August 2021 (1)
- July 2021 (1)
- June 2021 (2)
- May 2021 (3)
- April 2021 (3)
- March 2021 (3)
- February 2021 (3)
- December 2020 (3)
- November 2020 (4)
- October 2020 (2)
- September 2020 (3)
- August 2020 (2)
- July 2020 (2)
- June 2020 (1)
- May 2020 (8)
- November 2019 (1)
- August 2019 (2)
- June 2019 (1)
- May 2019 (2)
- January 2019 (1)