From Raw Data to Engineering Results: The NI Technical Data Management Solution
Overview
This paper is designed to help managers, engineers, and scientists understand their options for managing large amounts of test and simulation data. It explores the different approaches to test data management, including their advantages and drawbacks. It introduces a new National Instruments solution to the increasingly costly and time-consuming challenge of data management.
Table of Contents
The Pain Points of Data Management
Today’s complex products require data acquisition throughout the complete design, development, and verification process. Test engineers continue to face the challenge of testing increasingly complicated designs with ever-shrinking timelines and budgets to meet consumer demand for higher-quality products at lower prices. From the cost of simulation systems, data acquisition hardware, and automation systems to the associated personnel required to perform and analyze the tests, companies make a significant investment in the data they collect. However, when planning a complete test system, companies often fail to design an all-encompassing solution; they do not fully consider and invest in ways to manage their data once they collect it. Increasing microprocessor speed and storage capacity and decreasing hardware and software costs have resulted in an explosion of collected data being stored in files and databases. While technology is enabling faster and richer data retention, managing and making good use of this data remains the real challenge. More than ever in today’s fiercely competitive business environment, companies need to rapidly turn test and simulation data into usable information to efficiently drive product development and reduce time to market.
Not all companies can afford the luxury of designing test systems from the ground up. Many simulation and test systems have grown over several years, often independently of each other and with equipment from different suppliers. As a result, companies store data in different file formats and in different locations, with little or no descriptive information. All of these factors present roadblocks that impede the optimal exchange and extraction of information, and make it extremely difficult to locate a particular data set and derive decisions from it.
Even if you have a history of consistent data formats containing descriptive information, the challenge of making the data available for others to both easily access and efficiently use still remains. How do you ensure that all data consumers in your organization can find the data they need? Once they do find the desired data, how do you guarantee that they have the right tools to quickly turn that data into usable information? Common approaches to these challenges range from primitive file and folder naming conventions to complex and costly database solutions. Your ability to quickly turn raw data into interpretable results and easily share these results throughout your organization is critical to the success of your overall development process.
Traditional Data Management Approaches
The challenges involved in building and maintaining scalable data management systems have caused many companies to continue managing their critical data assets by using simple file- and folder-naming conventions. Though the file and folder approach requires no up-front cost, you realize this cost many times over in the long run when you must manually search through deep directory structures to find the data you need. Often this data is stored on different machines and in different file formats, adding even more complexity to your situation. Correlation of data and test results across multiple machines and formats under these circumstances can be challenging and time consuming. File- and folder-naming data management systems are highly ineffective, resulting in lost productivity, and are easily corrupted when files are inadvertently moved out of their designated folders or renamed.

Figure 1. File and Folder Approach across Multiple Test Machines
You may tolerate these challenges because you can load much of this data into readily available spreadsheet tools such as Microsoft Excel. However, the cost saved by not having to purchase dedicated engineering software is lost when you spend time and resources recreating common engineering analysis and reporting routines unavailable in business-centric spreadsheet packages.
At the opposite end of the spectrum is the approach of using dedicated databases for managing technical data. Because of their reputation for organizing data for easy search and retrieval, databases have garnered the attention of many test engineers as one way to manage their test data. However, there are several drawbacks to the implementation of a standard database such as Oracle or Access for your data management system. These databases are not set up to work out of the box with your test data; first you need to complete a significant amount of data model and database design work that can take months of preparation and planning. For larger solutions, the initial cost of setting up a database solution can easily reach six-digit dollar figures; more important is the often overlooked ongoing maintenance required to keep databases up and running. Extra time and effort is required to expand the database design and custom client software every time your test needs change. This maintenance and scaling over time not only costs more and more money but usually requires you to engage IT experts who know how to keep these systems running. Often, the IT group does not have the bandwidth to help test groups.

Figure 2. Database Approach of Storing Data in a Central Location
In the end, neither the file and folder approach nor the database approach meets the needs of most test groups. While they may seem like viable solutions at design time, you are still left with maintenance headaches and inefficiencies as your test needs change over time and you continue to store more data.
NI Technical Data Management Solution
National Instruments recognizes that engineers and scientists face these data management challenges every day when designing and improving their test systems. Whether your current solution involves spending hours a week tediously searching through directories and files to find just one specific test or redesigning an expensive database and client application to accommodate additional information, both approaches lack the same thing – engineering intelligence from test data. You must spend a lot of time designing and performing tests that allow you to move quickly from test data to test results. All too often today, you are expected to make the crucial transition from test data to test results without proper data management tools; therefore, NI has developed a data management solution to meet these challenges and make it easier and faster for to move from data collection to usable results.

Figure 3. The NI TDM Solution Consists of Three Components
NI has identified three key pieces that complete the data management puzzle: flexible and organized file storage, comprehensive search capabilities, and an interactive post-processing environment. As a result, the NI technical data management (TDM) solution consists of three components: the TDM data model for storing descriptive information with your test files, NI DataFinder for searching and mining your test data regardless of file format, and NI DIAdem software for analysis and reporting. Though many engineers already have components in place for raw data file writing as well as analysis and reporting, with the lowest common denominator being saving ASCII files and performing analysis in Excel, most engineers are lacking the middle component, NI DataFinder, to complete their data management approaches.
The foundation of the NI data management solution involves storing rich meta information with your data, which is then searchable using NI DataFinder. With the NI TDM data model, you can easily store properties and organize your data hierarchically within your test file. TDM and TDM Streaming files, written natively using NI DIAdem, LabVIEW, LabVIEW SignalExpress, or LabWindows™/CVI, provide three levels of hierarchy to structure your test data – file, group, and channel levels. The TDM data model removes the burden of designing and maintaining your own custom file format. Designed to scale with your needs, the TDM data model provides the most efficient way to properly document your test and simulation data in LabVIEW and LabWindows/CVI. A rich and comprehensive set of properties associated with your files, groups, and channels dramatically increases the range of possible search conditions.

Figure 4. The TDM Data Model for Saving Well-Documented Test Data
Once you have properly documented your data with properties, NI DataFinder gives you the out-of-the-box ability to search and mine your test data without the headache and expense of setting up and maintaining a large database. With NI DataFinder, you can perform Internet-like searches across all your data files, regardless of format and location within your company intranet. Simply install NI DataFinder with DIAdem, point it to the location of your data files, and within seconds you can search for your files just as you would search for information on the Internet.
NI DataFinder does this by automatically building and maintaining an index of all files that meet the file type and location criteria in the NI DataFinder configuration. You can use properties that are automatically stored in the NI DataFinder index in query conditions. When a valid data file is created, deleted, or edited, NI DataFinder automatically notices and reindexes the hierarchy and properties of the file. When you save properties not yet in NI DataFinder in a newly created file, these properties are automatically added to the index. NI DataFinder dynamically manages its own data tables and updates them based on file events and the contents of each file. Therefore, unlike many expensive database solutions, you can change and add information as your needs change without redesigning your data management solution.
The final component of the NI TDM solution for moving quickly between raw data and engineering results is NI DIAdem for posttest analysis and reporting. Using DIAdem, you can take advantage of hundreds of built-in engineering analysis functions to extract meaningful results from your raw data, and then share these results with easy-to-build, reusable report templates. DIAdem also provides a built-in interface to both DIAdem DataFinder and NI DataFinder Server Edition. Therefore, you can run interactive searches within DIAdem as well as save and automate your queries. Within one environment, you can mine your data for trends, analyze these trends, and then save these results to reports. Based on the size of your test group and the amount of data you are collecting, there is an NI DataFinder solution to meet your needs.
NI DataFinder
With this pivotal middle component of the NI TDM solution, you can customize your data management and mining solution. Once you have installed NI DataFinder, configure it to search through particular folders and directories for specific file types. NI DataFinder then automatically builds a fully searchable index of the metadata of these files. You can add and remove search areas with one click as well as configure NI DataFinder to index your own custom file formats.
Once you have set up your search areas, you can query your data from two interfaces in DIAdem – the simple and advanced search. In the simple search, you type in words or values to run a quick search and return files that contain all of these values somewhere within the file. Using the advanced search, you can search for properties on specific hierarchy levels (file, group, and channel) and return your results as channel, group, or file listings. For example, you could search on “Group.Test_Status = fail” and return all the groups that contain this property and failed. In this case, you can search for all channels across multiple files and formats that meet your search criteria and then load just the returned channels from each of the files. You now have more control of your data because you only load the channels or properties of interest from each file.
Figure 5. Using the Advanced Search, you can quickly find trends and correlations within your test data.
Out of the box, NI DataFinder automatically indexes TDM and TDM Streaming files within the search areas you specify. However, you may have a number of existing files in different formats that you also want to search using NI DataFinder. For these, you can use a DataPlugin to load your files in DIAdem or LabVIEW. Once you have a DataPlugin for an arbitrary data file, you can index, search, and load it just like a TDM file. There are more than 75 free DataPlugins available for download at ni.com/dataplugins. You also can use the DataPlugin API, which includes examples, to help you write your own DataPlugin for your custom formats.
Figure 6. The index stores all the descriptive information included with a file, so you can mine and search on these values.
As mentioned previously, the NI DataFinder index is completely self-configurable and self-maintainable. Once you point NI DataFinder to the folders and directories you want to index, it builds the index of all the metadata stored within the files. As you collect more test data and store the files within the search areas, NI DataFinder receives a Windows File Event notification and automatically indexes the new data files. NI DataFinder also monitors file events that are triggered when a file is changed or deleted, so you always have the most up-to-date index of your data. NI DataFinder has a number of native properties, such as name and description, which are stored with all files. However, in many tests that you perform, you may want to store your own custom properties such as Test Status or Test Procedure that are also indexed and searchable. Because NI DataFinder is a completely self-scaling index of all your metadata, the index automatically adapts to these new properties and indexes them along with the native properties. With a traditional database solution, adding new properties can be a costly undertaking if the database is not designed properly from the beginning.
DIAdem DataFinder and NI DataFinder Server Edition
National Instruments offers two data management options to meet your needs – DIAdem DataFinder and NI DataFinder Server Edition. DIAdem DataFinder, included in all packages of DIAdem, helps you search data stored locally on your machine or on a network. Larger groups accessing data across multiple machines with limited network bandwidth and numerous file formats require a more unified and robust approach.
NI DataFinder Server Edition is software installed on a server machine, but it is designed for easy installation and configuration without the need for IT involvement. Just like the local DIAdem DataFinder, you configure particular search areas and file formats for indexing. Once you have configured these search areas, multiple client computers using DIAdem have simultaneous visibility to collected test data using the one central shared index on the server machine. Your entire group or department can find and analyze data without the hassle of searching through files across several machines. Just select to connect to NI DataFinder Server Edition, and you are immediately connected to the server index just as if you were connected to your local DIAdem DataFinder index.

Figure 7. NI DataFinder Server Edition centralizes the metadata from multiple test stations so you can easily access and mine it with multiple clients simultaneously.
NI DataFinder Server Edition offers faster and easier access to data stored not only on your local machine but also on all your network machines. Expedited access to more data helps you focus on the important tasks of design and analysis instead of wasting valuable time and resources searching for data. Besides being able to find the data you are looking for, you can inspect multiple files at once to find important trends or anomalies.
From the ability to support multiple concurrent connections to built-in user management and archiving capabilities, NI DataFinder Server Edition provides numerous advantages over NI DataFinder for large groups. For more information on how NI DataFinder Server Edition can further streamline your simulation and test data management solution, read the NI DataFinder Server Edition technical white paper.
Analyzing and Reporting Your Test Data
The last element in the NI TDM solution is the query and reporting client, DIAdem. DIAdem provides a built-in user interface to NI DataFinder to help you search through all indexed data files with a simple Internet-like keyword search or a powerful advanced search. DIAdem combines this data-searching functionality with general-purpose data visualization, analysis, and reporting environments. Once you have found the data you need – either through DIAdem DataFinder or NI DataFinder Server Edition – you can analyze the data using hundreds of built-in engineering analysis functions and create consistent reports to share your results. DIAdem provides a customizable postprocessing solution due to its ability to export reusable templates from your most commonly used queries, reports, and interactive views. In addition to using DIAdem and NI DataFinder as a data-handling environment, you can fully automate them with VBScript and custom dialogs. Engineers often use this solution as the basis for fully customized data management and evaluation applications.

[+] Enlarge Image
Figure 8. DIAdem for Data Postanalysis, Reporting, and Automation
Conclusion
The NI data management solution covers all the bases of a scalable, cost-effective, search-ready data management system. You no longer need to worry about the details of file parsing, data model and data table structure, database management, search GUI design, or support from outside groups such as the IT department. All you have to do is use the easy-to-configure DIAdem DataFinder or NI DataFinder Server Edition to automatically create and maintain a data index. Finally, you can take advantage of the DIAdem built-in search GUIs and easy-to-use interactive data analysis and reporting environment to quickly turn your saved data into the reports and result information you need.
DIAdem DataFinder and NI DataFinder Server Edition benefits include:
- Low cost of initial purchase and ongoing maintenance
- No need to install, design, and maintain expensive and complex databases
- Simple setup – simply point at the directories you wish to index and install the correct DataPlugins for all your data formats
- Automatic indexing of data files added to NI DataFinder search areas
- Automatic scaling from single users to complete engineering groups and enterprises
- Easy DataPlugin creation for your custom file formats
- Interactive or automatic data analysis and reporting with DIAdem
The mark LabWindows is used under a license from Microsoft Corporation. Windows is a registered trademark of Microsoft Corporation in the United States and other countries.
Additional Resources
- Learn more about NI DIAdem
- Learn more about NI DataFinder Server Edition
- Watch a webcast on the NI Technical Data Management Platform
Reader Comments | Submit a comment »
Legal
This tutorial (this "tutorial") was developed by National Instruments ("NI"). Although technical support of this tutorial may be made available by National Instruments, the content in this tutorial may not be completely tested and verified, and NI does not guarantee its quality in any way or that NI will continue to support this content with each new revision of related products and drivers. THIS TUTORIAL IS PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND AND SUBJECT TO CERTAIN RESTRICTIONS AS MORE SPECIFICALLY SET FORTH IN NI.COM'S TERMS OF USE (http://ni.com/legal/termsofuse/unitedstates/us/).


