US9672257B2 - Time-series data storage and processing database system - Google Patents

Time-series data storage and processing database system Download PDF

Info

Publication number
US9672257B2
US9672257B2 US15/171,494 US201615171494A US9672257B2 US 9672257 B2 US9672257 B2 US 9672257B2 US 201615171494 A US201615171494 A US 201615171494A US 9672257 B2 US9672257 B2 US 9672257B2
Authority
US
United States
Prior art keywords
time
data
series data
series
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US15/171,494
Other versions
US20160357828A1 (en
Inventor
David Tobin
Dylan Scott
Orcun Simsek
Steven Fackler
Wilson Wong
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Palantir Technologies Inc
Original Assignee
Palantir Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority to US15/171,494 priority Critical patent/US9672257B2/en
Application filed by Palantir Technologies Inc filed Critical Palantir Technologies Inc
Priority to EP16173056.9A priority patent/EP3101560B1/en
Assigned to Palantir Technologies Inc. reassignment Palantir Technologies Inc. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: WONG, WILSON, Fackler, Steven, SIMSEK, ORCUN, Scott, Dylan, TOBIN, DAVID
Publication of US20160357828A1 publication Critical patent/US20160357828A1/en
Priority to US15/614,388 priority patent/US10585907B2/en
Publication of US9672257B2 publication Critical patent/US9672257B2/en
Application granted granted Critical
Assigned to MORGAN STANLEY SENIOR FUNDING, INC., AS ADMINISTRATIVE AGENT reassignment MORGAN STANLEY SENIOR FUNDING, INC., AS ADMINISTRATIVE AGENT SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Palantir Technologies Inc.
Assigned to ROYAL BANK OF CANADA, AS ADMINISTRATIVE AGENT reassignment ROYAL BANK OF CANADA, AS ADMINISTRATIVE AGENT SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Palantir Technologies Inc.
Priority to US16/805,257 priority patent/US11687543B2/en
Assigned to MORGAN STANLEY SENIOR FUNDING, INC. reassignment MORGAN STANLEY SENIOR FUNDING, INC. SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Palantir Technologies Inc.
Assigned to Palantir Technologies Inc. reassignment Palantir Technologies Inc. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: ROYAL BANK OF CANADA
Assigned to Palantir Technologies Inc. reassignment Palantir Technologies Inc. CORRECTIVE ASSIGNMENT TO CORRECT THE ERRONEOUSLY LISTED PATENT BY REMOVING APPLICATION NO. 16/832267 FROM THE RELEASE OF SECURITY INTEREST PREVIOUSLY RECORDED ON REEL 052856 FRAME 0382. ASSIGNOR(S) HEREBY CONFIRMS THE RELEASE OF SECURITY INTEREST. Assignors: ROYAL BANK OF CANADA
Assigned to WELLS FARGO BANK, N.A. reassignment WELLS FARGO BANK, N.A. ASSIGNMENT OF INTELLECTUAL PROPERTY SECURITY AGREEMENTS Assignors: MORGAN STANLEY SENIOR FUNDING, INC.
Assigned to WELLS FARGO BANK, N.A. reassignment WELLS FARGO BANK, N.A. SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: Palantir Technologies Inc.
Priority to US18/316,894 priority patent/US20230359638A1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • G06F17/30551
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2477Temporal data queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2428Query predicate definition using graphical user interfaces, including menus and forms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/248Presentation of query results
    • G06F17/30398
    • G06F17/30554

Definitions

  • the present disclosure relates to database systems that store and process data for display in an interactive user interface.
  • a database may store a large quantity of data.
  • a system may comprise a large number of sensors that each collect measurements at regular intervals, and the measurements may be stored in the database.
  • the measurement data can be supplemented with other data, such as information regarding events that occurred while the system was operational, and the supplemental data can also be stored in the database.
  • a user may attempt to analyze a portion of the stored data. For example, the user may attempt to analyze a portion of the stored data that is associated with a specific time period. In response, the user's device may retrieve the appropriate data from the database. However, as the quantity of data stored in the database increases over time, retrieving the appropriate data from the database and performing the analysis can become complicated and time consuming. Thus, the user may experience noticeable delay in the display of the desired data.
  • a database system that includes components for storing time-series data and executing custom, user-defined computational expressions in substantially real-time such that the results can be provided to a user device for display in an interactive user interface.
  • the database system may include memory storage, disk storage, and/or one or more processors.
  • Data received from a data source may include value and timestamp pairs and, once written to disk, may be immutable.
  • the database system may not overwrite a portion of the data or append additional data to the written data once the data is written to disk. Because the data is immutable, all data written to disk can be memory mapped given that the location of the data will not change.
  • the database system may process stored time-series data in response to requests from a user device.
  • the user may request to view time-series data by manipulating an interactive user interface.
  • the request received by the database system from the user device (possibly via a server), may include a start time, an end time, a period, and/or a computational expression.
  • the start time and end time may correspond with a range of timestamp values for which associated time-series data values should be retrieved.
  • the period may indicate, when analyzed in conjunction with the start time and end time, a number of data points requested by the user device for display in the interactive user interface.
  • the computational expression may indicate an arithmetic (and/or other type of) operation, if any, that the user wishes to perform on one or more sets of time-series data.
  • Example arithmetic operations include a sum, a difference, a product, a ratio, a zScore, a square root, and/or the like.
  • the database system may begin retrieving the appropriate time-series data and performing the indicated arithmetic (and/or other types of) operations via the one or more processors.
  • the one or more processors may perform pointwise operations or sliding window operations.
  • the one or more processors can access the data files from memory, rather than from disk, to perform the indicated operations.
  • the database system described herein may then achieve better performance when generating the new data values as compared with conventional databases.
  • the database system may transmit the new data values to the user device (for example, via the server) for display in the interactive user interface.
  • FIG. 1A illustrates a block diagram showing the various components of a time-series data storage and processing database system.
  • FIG. 1B illustrates a more detailed block diagram of a processing node, such as a processing node of FIG. 1A .
  • FIGS. 2A-2C illustrate example state diagrams that depict the process of retrieving and manipulating time-series data.
  • FIGS. 3A-3B illustrate an interactive user interface depicting graphs of time-series data that may be generated and displayed by a user device, such as the user device of FIG. 1A .
  • FIG. 3C illustrates an example nested set of arithmetic operations.
  • FIGS. 4A-4C illustrate an example file structure as stored in a node data store, such as a node data store of FIG. 1B .
  • FIG. 5A is a flowchart depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface.
  • FIG. 5B is another flowchart depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface.
  • FIG. 6 illustrates a computer system with which certain methods discussed herein may be implemented, according to an embodiment.
  • a user may attempt to analyze a portion of data stored in a database. For example, the user may attempt to analyze time-series data measured or captured by a data source. The user may attempt to analyze a single set of time-series data (for example, data measured or captured by a single data source over a period of time) or multiple sets of time-series data (for example, data measured or captured by different data sources over a period of time) at once.
  • time-series data measured or captured by a data source over a period of time
  • multiple sets of time-series data for example, data measured or captured by different data sources over a period of time
  • Such analysis may include viewing the time-series data at different periods of time, viewing the time-series data at different zoom levels, simultaneously viewing different time-series data over the same time period, combining (for example, adding, subtracting, dividing, multiplying, determining a ratio, determining a zScore, determining a square root, etc.) different time-series data to generate new time-series data, and/or the like.
  • a system that allows users to analyze the data stored in the database may include the database, a server, and a user device.
  • the user device may provide information to the server regarding the type of analysis desired by the user.
  • the server may retrieve the appropriate data from the database, perform the analysis, and provide the results to the user device for display.
  • the database may take a longer period of time to search in memory or on disk for the desired data.
  • the server and the database may be in communication via a wired or wireless network.
  • the bandwidth of the network may be limited such that data can only be streamed from the database to the server at certain speeds. This can be problematic and inefficient because the computational power of the server may be at a level high enough such that the server can finish analyzing a first portion of data before a second portion is received from the database. Because the data may not be transmitted from the server to the user device until all of the data has been analyzed, the user may notice a delay in the display of the desired data.
  • the database could be configured to perform the analysis.
  • the database may retrieve the desired data from memory or disk, perform the analysis, and provide the results to the server. The server may then immediately forward the results to the user device for display.
  • While some conventional databases perform a basic level of computation, these databases may not be suitable for replacing the functionality of the server.
  • some conventional databases can generate generic statistics based on the stored data. Such statistics may describe attributes of the stored data, such as mean data values, time ranges over which stored data is available, and/or the like. Furthermore, the statistics may be organized over preset time periods, such as days, months, or years.
  • these conventional databases are not configured to perform arbitrary or custom analyses or computations.
  • these conventional databases do not allow for organizing data over custom time periods.
  • these conventional databases are not designed to receive a computational expression generated by another device (such as the server or the user device), execute the received computational expression on only the portion of stored data identified by the other device, and provide the results. Thus, conventional databases may be inadequately designed to reduce or eliminate the inefficiencies described above.
  • a database system that includes components for storing time-series data and executing custom, user-defined computational expressions in substantially real-time such that the results can be provided to a user device for display in an interactive user interface.
  • the database system may include memory storage, disk storage, and/or one or more processors.
  • Data received from a data source may include value and timestamp pairs.
  • the data When the data is initially received from a data source, the data may be written to a write ahead log that is stored in memory and/or written to disk (e.g., such that the write ahead log can be restored if the database system crashes or experiences a forced shutdown). Although the write ahead log may be written to disk, the write ahead log may not be immutable.
  • the data may be written in the write ahead log in the order received from the data source rather than in an order based on the timestamp values.
  • the database system may maintain a mapping that indicates what portion of the data stored in the write ahead log is in order according to the timestamp values and what portion of the data stored in the write ahead log is not in order according to the timestamp values.
  • the write ahead log may have a size limit and once the size limit is reached or about to be reached (or some other criteria is met, such as the passage of a threshold period of time), the data in the write ahead log may be flushed and written to disk. Using the write ahead log as a buffer and only writing to disk periodically or aperiodically may reduce the overhead associated with writing to disk.
  • the database system may use the mapping to reorder the data such that the data is written to disk in an order according to the timestamp values.
  • Data from a single data source may be written to a single data file or may be written to multiple data files. If written to multiple data files and the two or more of the data files include overlapping timestamp values, the data may be merged before any computations are performed, as described in greater detail below. Alternatively, the two or more data files may be compacted prior to any computations to generate a new data file, and the new data file may be used for any subsequent computations. Compacting the data files may be more efficient because then a merge operation may not need to be performed each time the same data files with overlapping timestamp values are requested by the user device.
  • Data written to disk may be immutable.
  • the database system may not overwrite a portion of the data or append additional data to the written data once the data is written to disk.
  • all data written to disk can be memory mapped (for example, a segment of virtual or non-virtual memory can be assigned a direct byte-for-byte or bit-for-bit correlation with at least a portion of a data file) given that the location of the data will not change.
  • Memory mapping the data files may decrease data retrieval times, especially for large data files, because a read operation may not be necessary to access the data (for example, because the data can be stored directly in memory) and/or the database system may not need to copy the data retrieved from disk into memory before the data is usable.
  • the database system may process stored time-series data in response to requests from a user device.
  • the user may request to view time-series data by manipulating an interactive user interface.
  • the request received by the database system from the user device (possibly via a server), may include a start time, an end time, a period, and/or a computational expression.
  • the start time and end time may correspond with a range of timestamp values for which associated time-series data values should be retrieved.
  • the period may indicate, when analyzed in conjunction with the start time and end time, a number of data points requested by the user device for display in the interactive user interface.
  • the period may correspond with a time period that falls within the width of a pixel, where the interactive user interface displays a time-series data value associated with the start time and a time-series data value associated with the end time N pixels apart (e.g., where N corresponds to the number of data points requested by the user device).
  • the computational expression may indicate an arithmetic operation, if any, that the user wishes to perform on one or more sets of time-series data.
  • Example arithmetic operations include a sum, a difference, a product, a ratio, a zScore, a square root, and/or the like.
  • the user may wish to combine the values in two different sets of time-series data.
  • the user device may generate a computational expression that indicates that an addition operation is to be performed on data values in the first time-series data set and in the second time-series data set that correspond with timestamp values that fall between the start time and the end time.
  • the computational expression may identify a single arithmetic operation or may identify a nested or recursive set of arithmetic (and/or other types of) operations.
  • the computational expression may indicate that data values in a first time-series data set are to be added to data values in a second time-series data set, and the result of the addition is to be subtracted from data values in a third time-series data set.
  • the database system may begin retrieving the appropriate time-series data and performing the indicated arithmetic (and/or other types of) operations via the one or more processors.
  • the one or more processors may perform pointwise operations or sliding window operations. For example, if performing an addition operation, the one or more processors may take single data values from the same time-series data file or from different time-series data files and execute the operation on the single data values to generate a new data value.
  • the one or more processors may take a window of data values (for example, a plurality of data values) and execute the operation on the window of data values taken as a whole to generate a new data value.
  • a window of data values for example, a plurality of data values
  • the one or more processors can perform the indicated operations on time-series data sets that have matching timestamp values. For example, the one or more processors can execute an operation on data values from different time-series data sets if the data values correspond to the same timestamp value. In some cases, however, the timestamp values from two or more different time-series data sets may not align. In such a situation, the one or more processors may perform interpolation to estimate data values that may correspond to any missing timestamp values. The interpolation may occur prior to executing the operation or during execution of the operation (for example, interpolation may occur once the one or more processors receives a data value for a timestamp not present in another time-series data set that is being processed).
  • time-series data originating from a single data source may be stored in multiple data files. If the multiple data files include overlapping ranges of timestamp values (e.g., a first data file includes timestamps at a first time, a second time, and a third time, and a second data file includes timestamps at the second time, the third time, and a fourth time), then the data values in the multiple data files may be merged by the one or more processors before executing the operation (if, for example, the multiple data files were not already compacted into a new data file as described above).
  • timestamp values e.g., a first data file includes timestamps at a first time, a second time, and a third time
  • a second data file includes timestamps at the second time, the third time, and a fourth time
  • the one or more processors may choose a data value from a most-recently modified file (or least-recently modified file) as the data value to be used for the timestamp value when executing the operation.
  • a reserved value can be written in association with the timestamp value to indicate that the previously written data value at the timestamp value should be deleted.
  • the one or more processors may generate new data values by performing a sequential scan of existing time-series data.
  • the one or more processors can access the data files from memory, rather than from disk, to perform the indicated operations.
  • the database system described herein may then achieve better performance while performing the sequential scan to produce the new data values when compared with conventional databases.
  • the database system may transmit the new data values to the user device (for example, via the server) for display in the interactive user interface.
  • the immutable status of the data files written to disk enables the database system to generate quick and efficient data backups. For example, because the data files are immutable, the stored location of the data files on disk will not change. If new data is received from a data source, the new data may be stored in a different data file on disk. Generally, backups include a copy of the actual data file. However, because the location of a data file will not change, the backup can include a link to the location of the data file rather than a copy of the actual data file itself. Thus, the database system described herein may generate backups faster than conventional databases given that the process of generating links may be faster than the process of copying actual data files.
  • FIG. 1A illustrates a block diagram showing the various components of a time-series data storage and processing database system 100 .
  • the time-series data storage and processing database system 100 may include a data source 110 , a data server 140 , a time-series data store 150 , and a user device 160 .
  • the data source 110 may be any computing or mechanical device that can determine, measure, and/or capture data values.
  • the data source 110 may be a sensor, such as a sensor that measures physical parameters, a financial system, a medical electronic records system, and/or the like. While FIG. 1A illustrates a single data source 110 , this is not meant to be limiting.
  • the time-series data storage and processing database system 100 may include any number of data sources 110 .
  • the data source 110 may transmit determined, measured, and/or captured time-series data to the time-series data store 150 .
  • the data source 110 and the time-series data store 150 communicate via a network 120 .
  • the network 120 may include any communications network, such as the Internet.
  • the network 120 may be a wired network, a wireless network, or a combination of the two.
  • network 120 may be a local area network (LAN) and/or a wireless area network (WAN).
  • the network 120 may include cables and/or other equipment that allow the transport of data from underwater locations to above-ground locations and/or vice-versa.
  • the time-series data store 150 may store time-series data received from the data source 110 and perform analyses on the stored data based on requests received from the user device 160 via the data server 140 .
  • the time-series data store 150 may include a discovery node 152 , a time-series mapping data store 153 , processing nodes 154 A-C, and node data stores 156 A-C.
  • the discovery node 152 can be a single node or a cluster of nodes. Three processing nodes 154 A-C and node data stores 156 A-C are depicted for illustrative purposes only and is not meant to be limiting.
  • the time-series data store 150 may include any number of processing nodes and/or node data stores.
  • the time-series data received from various data sources 110 may be stored in different node data stores 156 A-C.
  • the time-series mapping data store 153 may store a mapping that identifies the processing node 154 A-C that is associated with a time-series data set (and thus the node data store 156 A-C in which the time-series data set is stored).
  • the discovery node 152 may receive the time-series data, communicate with the time-series mapping data store 153 to determine the processing node 154 A-C associated with the time-series data, and transmit the time-series data to the appropriate processing node 154 A-C.
  • the data source 110 can cache information indicating the processing node 154 A-C that is associated with a time-series data set such that the data source 110 can transmit the time-series data directly to the appropriate processing node 154 A-C.
  • the processing node 154 A-C may then store the received time-series data in the associated node data store 156 A-C (after, for example, the write ahead log of the processing node 154 A-C is flushed, as described in greater detail below with respect to FIG. 1B ).
  • the discovery node 152 may analyze the computational expression to identify the time-series data upon which an arithmetic (and/or other type of) operation may be performed.
  • the discovery node 152 may communicate with the time-series mapping data store 153 to identify the processing node(s) 154 A-C associated with the identified time-series data.
  • the computational expression, along with the start time, the end time, and the period, may be transmitted to the processing node 154 A that is associated with the identified time-series data.
  • the user device 160 or data server 140 can cache information indicating the processing node 154 A-C that is associated with a time-series data set such that the user device 160 or data server 140 can analyze the computational expression to identify the time-series data upon which the arithmetic (and/or other type of) operation may be performed and transmit the computational expression, along with the start time, the end time, and the period, directly to the appropriate processing node 154 A-C. If the computational expression identifies a plurality of time-series data sets that are associated with different processing nodes 154 A-C, then the discovery node 152 may select one of the processing nodes 154 A-C to perform the arithmetic operation(s).
  • the selected processing node 154 A-C may retrieve time-series data from another processing node 154 A-C (for example, time-series data that is not associated with the selected processing node 154 A-C) in order to perform the arithmetic operation(s). While the description herein refers to “arithmetic operations” for simplicity, any other type of mathematical operation may similarly be performed on the time-series data.
  • a processing node 154 A-C may use the start time, the end time, the period (for example, a value that identifies a duration of time), and the computational expression to manipulate time-series data and/or to generate new time-series data.
  • the processing node 154 A-C may perform the arithmetic operation(s) identified by the computational expression on the time-series data identified by the computational expression for data values in the time-series data that correspond with timestamp values that fall between the start time and the end time (interpolating when necessary as described herein).
  • the processing node 154 A-C may aggregate data values (for example, average, sum, subtract, minimum, maximum, etc.) after (or prior to) applying the arithmetic operation(s) such that the number of data values equals the number of periods between the start time and the end time. For example, if the start time is 1:00 pm, the end time is 1:01 pm, the period is 10 seconds, and the timestamp values of a time-series data set increment every 1 second, then the period of time between each timestamp value (for example, 1 second) is less than the period (for example, 10 seconds) and the number of periods between the start time and the end time is 6.
  • data values for example, average, sum, subtract, minimum, maximum, etc.
  • the processing node 154 A-C may aggregate data values corresponding to the first 10 timestamp values (for example, data values corresponding to times 1:00:01 pm through 1:00:10 pm), the second 10 timestamp values (for example, data values corresponding to times 1:00:11 pm through 1:00:20 pm), and so on until the processing node 154 A-C has generated 6 aggregated data values.
  • the processing node 154 A-C may repeat this process for each identified time-series data set. In some cases, a single time-series data set may not have a fixed period between data values.
  • the processing node 154 A-C may aggregate a portion of data values (e.g., the portion of data values for which the period of time between each timestamp value is less than the period received from the user device 160 or data server 140 ). In some embodiments, the processing node 154 A-C performs the arithmetic operation(s) before aggregating the data values. In other embodiments, the processing node 154 A-C performs the arithmetic operation(s) using the aggregated data values.
  • the processing nodes 154 A-C can perform pointwise operations and/or sliding window operations.
  • the processing nodes 154 A-C may apply the arithmetic operation(s) on single data values (for example, data values corresponding to the same timestamp value).
  • the processing nodes 154 A-C may apply the arithmetic operation(s) on a window of data values (for example, data values corresponding to a range of timestamp values).
  • the processing nodes 154 A-C may aggregate the results into a new time-series data set.
  • the new time-series data set may be stored in the associated node data store 156 A-C.
  • the new time-series data set may be transmitted to the data server 140 (which then forwards the new time-series data set to the user device 160 ) and/or the user device 160 .
  • the computational expression includes no arithmetic operations to be performed. For example, this may occur if the user scrolls or pans within a time-series data set displayed in the interactive user interface, thus requesting to view time-series data that was not previously visible within the interactive user interface. In such a situation, the processing nodes 154 A-C may not generate new time-series data, but may instead retrieve and provide a different set of data values than was previously provided for display in the interactive user interface.
  • the data server 140 may receive requests from the user device 160 (for example, the computational expression, the start time, the end time, and the period) and forward such requests to the time-series data store 150 .
  • the data server 140 may also receive updated time-series data and/or new time-series data from the time-series data store 150 and forward such data to the user device 160 for display in the interactive user interface.
  • the data server 140 may be implemented as a special-purpose computer system having logical elements.
  • the logical elements may comprise program instructions recorded on one or more machine-readable storage media.
  • the logical elements may be implemented in hardware, firmware, or a combination thereof.
  • the data server 140 may be implemented in a Java Virtual Machine (JVM) that is executing in a distributed or non-distributed computer system.
  • JVM Java Virtual Machine
  • the data server 140 may be implemented as a combination of programming instructions written in any programming language (e.g. C++, Visual Basic, Python, etc.) and hardware components (e.g., memory, CPU time) that have been allocated for executing the program instructions.
  • the user device 160 may transmit requests for updated or new time-series data to the data server 140 for transmission to the time-series data store.
  • requests may include the start time, the end time, the period, and/or the computational expression.
  • the requests may be generated in response to the manipulation of the interactive user interface by a user. Manipulations may include panning, scrolling, zooming, selecting an option to modify, combine and/or aggregate one or more time-series data sets to produce a new time-series data set, and/or the like.
  • the user may be viewing, via the interactive user interface, a first time-series data set that illustrates a first physical parameter (e.g., temperature) associated with a component and a second time-series data set that illustrates a second physical parameter (e.g., humidity) associated with the component.
  • the user may then select an option to view the values of the first and second physical parameters associated with the component. Selection of this option may cause the user device 160 to generate a computational expression that identifies the first time-series data set, the second time-series data set, and an arithmetic operation (for example, addition).
  • the selection may also cause the user device 160 to identify the start time and the end time, which can be user-defined and/or based on an earliest timestamp value and a latest timestamp value currently viewable in the interactive user interface.
  • the selection may also cause the user device 160 to identify the period, which may be user-defined and/or may be the range of time between the start time and the end time that corresponds with the width of a pixel.
  • the range of time may be determined based on the zoom level of a graph depicting time-series data.
  • the period may be dependent on the number of pixels in the horizontal direction (if time is along the x-axis) or vertical direction (if time is along the y-axis) that are devoted to displaying the requested time-series data.
  • the user device 160 may update user interface data used by the user device 160 to render and display the interactive user interface to display the data and timestamp value pairs.
  • the data server 140 may update the user interface data and provide the updated user interface data to the user device 160 .
  • the user device 160 can include a wide variety of computing devices, including personal computing devices, terminal computing devices, laptop computing devices, tablet computing devices, electronic reader devices, mobile devices (e.g., mobile phones, media players, handheld gaming devices, etc.), wearable devices with network access and program execution capabilities (e.g., “smart watches” or “smart eyewear”), wireless devices, set-top boxes, gaming consoles, entertainment systems, televisions with network access and program execution capabilities (e.g., “smart TVs”), and various other electronic devices and appliances.
  • the user devices 160 may execute a browser application to communicate with the data server 140 .
  • FIG. 1B illustrates a more detailed block diagram of a processing node, such as a processing node 154 A-C of FIG. 1A .
  • the processing node 154 may include a processor 170 and memory 180 . While a single processor 170 is depicted, this is not meant to be limiting.
  • the processing node 154 may include any number of processors 170 .
  • the processor 170 may retrieve time-series data from the memory 180 and/or the node data store 156 to perform requested arithmetic operation(s).
  • the memory 180 may store a write ahead log 182 , a memory map 184 , and one or more time-series data files 186 .
  • the processing node 154 may initially store the received time-series data in the write ahead log 182 .
  • the time-series data may be written in the write ahead log 182 in the order received from the data source 110 rather than in an order based on the timestamp values in the time-series data.
  • the processing node 154 in the memory 180 or in another hardware component, may maintain a mapping that indicates what portion of the time-series data stored in the write ahead log 182 is in order according to the timestamp values and what portion of the time-series data stored in the write ahead log 182 is not in order according to the timestamp values.
  • the first four entries in the write ahead log 182 may be in order according to timestamp values and the second four entries in the write ahead log 182 may be in order according to timestamp values.
  • the first four entries and the second four entries may not be in order according to timestamp values.
  • the write ahead log 182 may have a data size limit and once the size limit is reached or about to be reached (or some other criteria is met, such as the passage of a threshold period of time), the time-series data in the write ahead log 182 may be flushed and written to disk (for example, the node data store 156 ).
  • the processing node 154 may use the mapping to reorder the time-series data such that the time-series data is written to disk in an order according to the timestamp values. For example, using the example above, the processing node 154 may reorder the first four entries and the second four entries such that all eight entries are written to disk in an order according to the timestamp values.
  • the memory map 184 may identify the segments of the memory 180 that are assigned to at least a portion of a data file of a time-series (for example, the time-series data files 186 ).
  • the memory 180 may use the memory map 184 to identify the location of at least a portion of the time-series data files 186 requested by the processor 170 .
  • the use of the memory map 184 may decrease data retrieval times, especially for large time-series data files, because a read operation on the node data store 156 may not be necessary to access the time-series data (for example, because the time-series data can be stored directly in the memory 180 ) and/or the processing node 154 may not need to copy the time-series data retrieved from the node data store 156 into the memory 180 before the data is usable.
  • the processor 170 may request a data value stored in a particular page of a time-series data file.
  • processor 170 may then subsequently request another data value that follows the initial data value or that is within a range of the initial data value, where both data values are stored in the same page or in contiguous pages.
  • a read operation on the node data store 156 may not be necessary.
  • the time-series data files 186 may be stored in the memory 180 (for example, after the time-series data files have been written to the node data store 156 ). Alternatively, a portion of the time-series data files 186 may be stored in the memory 180 , such as one or more pages of a respective time-series data file 186 .
  • the operating system of the processing node 154 may determine if or when to perform read operations to pull data from the node data store 156 into the memory 180 .
  • FIGS. 2A-2C illustrate example state diagrams that depict the process of retrieving and manipulating time-series data.
  • the data source 110 may transmit time-series data (1) to the time-series data store 150 .
  • the time-series data store 150 may store the time-series data (2). Once written to disk, the file including the time-series data may be immutable.
  • a user may manipulate the interactive user interface displayed by the user device 160 in a way that causes the user device 160 to request processing of time-series data (3) from the data server 140 .
  • the user may pan, scroll, zoom, select an option to modify, combine, and/or aggregate time-series data, and/or the like.
  • the request may include the start time, the end time, the period, and/or the computational expression.
  • the data server 140 may forward the request to the time-series data store 150 .
  • the time-series data store 150 may retrieve time-series data identified in the computational expression and execute arithmetic operation(s) on data values in the retrieved time-series data to generate new time-series data (4). Execution of the arithmetic operation(s) may involve pointwise operations and/or sliding window operations.
  • the time-series data store 150 may transmit the new time-series data (5) to the data server 140 .
  • the data server 140 may then forward the new time-series data to the user device 160 .
  • the data server 140 aggregates the new time-series data into a different format (e.g., a format more understandable by humans, a format that can be more easily displayed by the user device 160 , etc.) before forwarding the new time-series data to the user device 160 .
  • the user device 160 may display the new time-series data (6) in the interactive user interface.
  • the state diagram depicted in FIG. 2B is similar to the state diagram depicted in FIG. 2A .
  • the user manipulation of the interactive user interface displayed by the user device 160 may cause the user device 160 to transmit an indication of an adjustment in the time-series data time scale (3).
  • such manipulation may include panning a graph displayed in the interactive user interface, scrolling through the graph displayed in the interactive user interface, and/or changing a zoom level depicted in the graph displayed in the interactive user interface.
  • the indication may include the start time, the end time, the period, and/or the computational expression.
  • the indication may be received by the data server 140 and forwarded to the time-series data store 150 .
  • the time-series data store 150 may retrieve time-series data identified in the computational expression and generate updated values corresponding to the new time scale using the retrieved time-series data (4).
  • the time-series data store 150 may generate the updated values by executing arithmetic operation(s) on data values in the retrieved time-series data.
  • the time-series data store 150 may transmit the updated values corresponding to the new time scale (5) to the data server 140 .
  • the data server 140 may then forward the updated values to the user device 160 .
  • the user device 160 may display the updated values (6) in the interactive user interface.
  • FIG. 2C illustrates an example state diagram depicting the processes performed by a processing node 154 in the time-series data store 150 when analyzing and performing the computational expression (for example, steps (4) in FIGS. 2A-2B ).
  • the processor 170 may request first time-series data between a start time and an end time (1) from the memory 180 .
  • the memory 180 may then transmit the first time-series data (2) to the processor 170 .
  • the processor 170 may request second time-series data between a start time and an end time (3) from the memory 180 (if, for example, the computational expression identifies the second time-series data).
  • the memory 180 may then transmit the second time-series data (4) to the processor 170 .
  • the processor 170 may execute a pointwise or sliding window operation using the retrieved time-series data (5).
  • the processor 170 (with or without the use of the memory 180 or additional memory to store intermediate states) may execute a pointwise or sliding window operation based on the type of arithmetic operation identified in the computational expression. For example, the processor 170 may execute a pointwise operation if the arithmetic operation is addition and the processor 170 may execute a sliding window operation if the arithmetic operation is a moving average.
  • FIGS. 3A-3B illustrate an interactive user interface 300 depicting graphs of time-series data that may be generated and displayed by a user device, such as the user device 160 .
  • the interactive user interface 300 includes a graph 310 displaying time-series data showing water allocation values over time and a graph 320 displaying time-series data showing temperature values over time.
  • the user device 160 may generate a start time, an end time, a period, and a computational expression to transmit to the time-series data store 150 .
  • the start time may be 1:01 pm and the end time may be 1:05 pm given that these times correspond with the earliest timestamp and the latest timestamp visible in the interactive user interface 300 .
  • the number of pixels in the horizontal direction between a data value corresponding to the earlier timestamp and a data value corresponding to the latest timestamp may dictate the value of the period. For example, if 240 pixels exist between these two data points, then the period may be 1 second (for example, there may be 240 seconds between 1:05 pm and 1:01 pm and thus each pixel may correspond to 1 second).
  • the computational expression may identify the time-series data set displayed in the first graph 310 .
  • the computational expression may also identify the time-series data set displayed in the second graph 320 if, for example, the user selects an option to view a time-series data set that comprises some combination (for example, addition, subtraction, ratio, etc.) of the time-series data sets displayed in the graphs 310 and 320 .
  • the computational expression may also identify any arithmetic operation(s) to be performed.
  • portion 350 in the graph 310 includes gaps in data values.
  • data values in the gaps may not have been stored in the time-series data store 150 .
  • the time-series data store 150 may use interpolation to estimate possible data values associated with the timestamp values in which data is missing. The interpolated data, along with the actual stored data, may then be used to generate the new time-series data set.
  • the computational expression may include a nested set of arithmetic operations.
  • FIG. 3C illustrates an example nested set of arithmetic operations.
  • a first arithmetic operation may include the addition of data values from the time-series data displayed in the graph 310 with data values from the time-series data displayed in the graph 320 .
  • the second arithmetic operation may include the division of data values from the time-series data displayed in the graph 320 over the results from the first arithmetic operation.
  • a computed set of time-series data 330 is output as a result of the computational expression.
  • the computed set of time-series data 330 may then be displayed to the user in one of the graphs 310 or 330 , and/or another graph of the user interface. While two arithmetic operations are depicted, this is not meant to be limiting. A computational expression may include any number of nested or un-nested arithmetic operations. Further, while, for clarity of description, FIG. 3C illustrates a computational expression performed on displayed time series data, in other embodiments computational expressions are performed on time-series data that may not be displayed. For example, the user may request a display of a graph of time-series data that may only be produced by execution of a computational expression on two or more time-series of data.
  • the system may automatically access the necessary time-series of data (as described above and below), execute the computational expression (as described above and below), and then provide the requested time-series date for display to the user (e.g., to the user device, as described above).
  • FIGS. 4A-4C illustrate an example file structure as stored in a node data store, such as a node data store 156 A-C.
  • a data directory may include a lock file and a time-series directory.
  • the lock file may be used to prevent another instance from writing and/or reading from the data directory when a write operation is occurring.
  • the time-series directory may include one or more subfolders that are each associated with a time-series data set.
  • subfolder series-2721235254 may be selected. Selection of the subfolder displays additional links, files, and subdirectories that may be representative of the contents of the other time-series directory subfolders, as illustrated in FIG. 4B .
  • Series-2721235254 may include a data directory that includes one or more time-series data files, where the time-series data files may be immutable. While the series-2721235254 subfolder may correspond to a single time-series data set, the data may be separated into separate files. In some cases, the different time-series data files may include overlapping timestamp values.
  • the processing node 154 may perform a merge operation as the data is used when executing arithmetic operation(s). For example, when the processing node 154 comes across a timestamp value corresponding to two or more different values, the processing node 154 may select the data value from the most-recently modified file (or the least-recently modified file) as the data value to use in computations. A special value (e.g., a reserved value) may be written at the time of the data value that is not selected to be used in computations. The special value will eventually be removed once the time-series data files are compacted.
  • a merge operation as the data is used when executing arithmetic operation(s). For example, when the processing node 154 comes across a timestamp value corresponding to two or more different values, the processing node 154 may select the data value from the most-recently modified file (or the least-recently modified file) as the data value to use in computations.
  • a special value e.g.,
  • the processing node 154 may compact two or more time-series data files to generate a single new time-series data file in which overlapping timestamp value issues are resolved (for example, the new time-series data file includes a single data value for each timestamp value, where the data value is determined in a manner as described above).
  • the new time-series data file may then be used in future computations in place of the original time-series data files used to generate the new time-series data file.
  • Series-2721235254 may also include a current snapshot link that links to the snapshots directory.
  • the snapshots directory can be a staging location for an in-progress snapshot when a time-series data file is being added or compacted. As described above, the time-series data files may be immutable and thus the location in the node data store 156 is unchanged. Thus, the snapshots directory may include links, rather than actual copies of data. Each of the links in the snapshots directory may link to the time-series data file identified by the name of the link.
  • the state of a time-series can be changed with a single file system operation (e.g., a single atomic file system operation) to change a current snapshot link associated with the time-series.
  • Series-2721235254 may also include a log file, which may be the write ahead log described herein.
  • Series-2721235254 may also include a metadata file that includes key-value pairs about the time-series.
  • the information in the metadata file may include a Unicode identification of the time-series and an indication of whether the data values are stored as floats or doubles.
  • Raw data 450 received from a data source 110 and stored in the time-series data files may be in the form of data and timestamp value pairs, as illustrated in FIG. 4C .
  • the data values may be stored as integers, floats, doubles, and/or the like.
  • the timestamp values may be absolute values (for example, wall clock time) or relative values (for example, the amount of time that has passed since the last data value was measured).
  • the data and/or timestamp values can be compressed prior to storage in the time-series data files.
  • the time-series data files or a separate file may further include an index indicating the location of the compressed data and/or timestamp values.
  • FIG. 5A is a flowchart 500 depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface.
  • the method of FIG. 5A may be performed by various computing devices, such as by the time-series data store 150 described above.
  • the method of FIG. 5A may include fewer and/or additional blocks and the blocks may be performed in an order different than illustrated.
  • an identification of a series expression, a start time, an end time, and a period may be received from a user device.
  • the series expression may be a computational expression.
  • the series expression may identify one or more arithmetic operations and one or more time-series data sets upon which the arithmetic operations are to be performed.
  • a time-series data file corresponding to the series expression may be retrieved from memory.
  • the time-series data file may be a data file associated with a time-series data set identified by the series expression.
  • a value based on a computation identified by the series expression applied to a portion of the time-series data file is generated.
  • the computation may be one or more arithmetic operations.
  • the computation may be applied to data values stored in the time-series data file that are associated with timestamp values that fall between the start time and the end time.
  • the generated values may be transmitted to the user device for display in a user interface, such as an interactive user interface.
  • the generated values are also stored in the node data store 156 .
  • FIG. 5B is another flowchart 550 depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface.
  • the method of FIG. 5B may be performed by various computing devices, such as by the time-series data store 150 described above.
  • the method of FIG. 5B may include fewer and/or additional blocks and the blocks may be performed in an order different than illustrated.
  • an identification of a series expression, a start time, an end time, and a period may be received from a user device.
  • the series expression may be a computational expression.
  • the series expression may identify one or more arithmetic operations and one or more time-series data sets upon which the arithmetic operations are to be performed.
  • a first time-series data file and a second time-series data file corresponding to the series expression may be retrieved from memory.
  • the time-series data files may be a data files associated with time-series data sets identified by the series expression.
  • a value based on a computation identified by the series expression applied to a portion of the first time-series data file and a portion of the second time-series data file is generated.
  • the computation may be one or more arithmetic operations.
  • the computation may be applied to data values stored in the first time-series data file and in the second time-series data file that are associated with timestamp values that fall between the start time and the end time.
  • the generated values may be transmitted to the user device for display in a user interface, such as an interactive user interface.
  • the generated values are also stored in the node data store 156 .
  • the techniques described herein are implemented by one or more special-purpose computing devices.
  • the special-purpose computing devices may be hard-wired to perform the techniques, or may include digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the techniques, or may include one or more general purpose hardware processors programmed to perform the techniques pursuant to program instructions in firmware, memory, other storage, or a combination.
  • ASICs application-specific integrated circuits
  • FPGAs field programmable gate arrays
  • Such special-purpose computing devices may also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the techniques.
  • the special-purpose computing devices may be desktop computer systems, server computer systems, portable computer systems, handheld devices, networking devices or any other device or combination of devices that incorporate hard-wired and/or program logic to implement the techniques.
  • Computing device(s) are generally controlled and coordinated by operating system software, such as iOS, Android, Chrome OS, Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server, Windows CE, Unix, Linux, SunOS, Solaris, iOS, Blackberry OS, VxWorks, or other compatible operating systems.
  • operating system software such as iOS, Android, Chrome OS, Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server, Windows CE, Unix, Linux, SunOS, Solaris, iOS, Blackberry OS, VxWorks, or other compatible operating systems.
  • the computing device may be controlled by a proprietary operating system.
  • Conventional operating systems control and schedule computer processes for execution, perform memory management, provide file system, networking, I/O services, and provide a user interface functionality, such as a graphical user interface (“GUI”), among other things.
  • GUI graphical user interface
  • FIG. 6 is a block diagram that illustrates a computer system 600 upon which an embodiment may be implemented.
  • any of the computing devices discussed herein, such as the data source 110 , the data server 140 , the time-series data store 150 , and/or the user device 160 may include some or all of the components and/or functionality of the computer system 600 .
  • Computer system 600 includes a bus 602 or other communication mechanism for communicating information, and a hardware processor, or multiple processors, 604 coupled with bus 602 for processing information.
  • Hardware processor(s) 604 may be, for example, one or more general purpose microprocessors.
  • Computer system 600 also includes a main memory 606 , such as a random access memory (RAM), cache and/or other dynamic storage devices, coupled to bus 602 for storing information and instructions to be executed by processor 604 .
  • Main memory 606 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 604 .
  • Such instructions when stored in storage media accessible to processor 604 , render computer system 600 into a special-purpose machine that is customized to perform the operations specified in the instructions.
  • Main memory 606 may also store cached data, such as zoom levels and maximum and minimum sensor values at each zoom level.
  • Computer system 600 further includes a read only memory (ROM) 608 or other static storage device coupled to bus 602 for storing static information and instructions for processor 604 .
  • ROM read only memory
  • a storage device 610 such as a magnetic disk, optical disk, or USB thumb drive (Flash drive), etc., is provided and coupled to bus 602 for storing information and instructions.
  • the storage device 610 may store measurement data obtained from a plurality of sensors.
  • Computer system 600 may be coupled via bus 602 to a display 612 , such as a cathode ray tube (CRT) or LCD display (or touch screen), for displaying information to a computer user.
  • a display 612 such as a cathode ray tube (CRT) or LCD display (or touch screen)
  • the display 612 can be used to display any of the user interfaces described herein with respect to FIGS. 2A through 8 .
  • An input device 614 is coupled to bus 602 for communicating information and command selections to processor 604 .
  • cursor control 616 is Another type of user input device, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 604 and for controlling cursor movement on display 612 .
  • This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane.
  • a first axis e.g., x
  • a second axis e.g., y
  • the same direction information and command selections as cursor control may be implemented via receiving touches on a touch screen without a cursor.
  • Computing system 600 may include a user interface module to implement a GUI that may be stored in a mass storage device as executable software codes that are executed by the computing device(s).
  • This and other modules may include, by way of example, components, such as software components, object-oriented software components, class components and task components, processes, functions, attributes, procedures, subroutines, segments of program code, drivers, firmware, microcode, circuitry, data, databases, data structures, tables, arrays, and variables.
  • module refers to logic embodied in hardware or firmware, or to a collection of software instructions, possibly having entry and exit points, written in a programming language, such as, for example, Java, Lua, C, or C++.
  • a software module may be compiled and linked into an executable program, installed in a dynamic link library, or may be written in an interpreted programming language such as, for example, BASIC, Perl, or Python. It will be appreciated that software modules may be callable from other modules or from themselves, and/or may be invoked in response to detected events or interrupts.
  • Software modules configured for execution on computing devices may be provided on a computer readable medium, such as a compact disc, digital video disc, flash drive, magnetic disc, or any other tangible medium, or as a digital download (and may be originally stored in a compressed or installable format that requires installation, decompression or decryption prior to execution).
  • Such software code may be stored, partially or fully, on a memory device of the executing computing device, for execution by the computing device.
  • Software instructions may be embedded in firmware, such as an EPROM.
  • hardware modules may be comprised of connected logic units, such as gates and flip-flops, and/or may be comprised of programmable units, such as programmable gate arrays or processors.
  • the modules or computing device functionality described herein are preferably implemented as software modules, but may be represented in hardware or firmware. Generally, the modules described herein refer to logical modules that may be combined with other modules or divided into sub-modules despite their physical organization or storage
  • Computer system 600 may implement the techniques described herein using customized hard-wired logic, one or more ASICs or FPGAs, firmware and/or program logic which in combination with the computer system causes or programs computer system 600 to be a special-purpose machine. According to one embodiment, the techniques herein are performed by computer system 600 in response to processor(s) 604 executing one or more sequences of one or more instructions contained in main memory 606 . Such instructions may be read into main memory 606 from another storage medium, such as storage device 610 . Execution of the sequences of instructions contained in main memory 606 causes processor(s) 604 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions.
  • non-transitory media refers to any media that store data and/or instructions that cause a machine to operate in a specific fashion. Such non-transitory media may comprise non-volatile media and/or volatile media.
  • Non-volatile media includes, for example, optical or magnetic disks, such as storage device 610 .
  • Volatile media includes dynamic memory, such as main memory 606 .
  • non-transitory media include, for example, a floppy disk, a flexible disk, hard disk, solid state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge, and networked versions of the same.
  • Non-transitory media is distinct from but may be used in conjunction with transmission media.
  • Transmission media participates in transferring information between non-transitory media.
  • transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 602 .
  • transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
  • Various forms of media may be involved in carrying one or more sequences of one or more instructions to processor 604 for execution.
  • the instructions may initially be carried on a magnetic disk or solid state drive of a remote computer.
  • the remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem.
  • a modem local to computer system 600 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal.
  • An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on bus 602 .
  • Bus 602 carries the data to main memory 406 , from which processor 604 retrieves and executes the instructions.
  • the instructions received by main memory 606 may retrieve and execute the instructions.
  • the instructions received by main memory 606 may optionally be stored on storage device 610 either before or after execution by processor 604 .
  • Computer system 600 also includes a communication interface 618 coupled to bus 602 .
  • Communication interface 618 provides a two-way data communication coupling to a network link 620 that is connected to a local network 622 .
  • communication interface 618 may be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line.
  • ISDN integrated services digital network
  • communication interface 618 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN (or WAN component to communicated with a WAN).
  • LAN local area network
  • Wireless links may also be implemented.
  • communication interface 618 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
  • Network link 620 typically provides data communication through one or more networks to other data devices.
  • network link 620 may provide a connection through local network 622 to a host computer 624 or to data equipment operated by an Internet Service Provider (ISP) 626 .
  • ISP 626 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the “Internet” 628 .
  • Internet 628 uses electrical, electromagnetic or optical signals that carry digital data streams.
  • the signals through the various networks and the signals on network link 620 and through communication interface 618 which carry the digital data to and from computer system 600 , are example forms of transmission media.
  • Computer system 600 can send messages and receive data, including program code, through the network(s), network link 620 and communication interface 618 .
  • a server 630 might transmit a requested code for an application program through Internet 628 , ISP 626 , local network 622 and communication interface 618 .
  • the received code may be executed by processor 604 as it is received, and/or stored in storage device 610 , or other non-volatile storage for later execution.
  • Conditional language such as, among others, “can,” “could,” “might,” or “may,” unless specifically stated otherwise, or otherwise understood within the context as used, is generally intended to convey that certain embodiments include, while other embodiments do not include, certain features, elements and/or steps. Thus, such conditional language is not generally intended to imply that features, elements and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without user input or prompting, whether these features, elements and/or steps are included or are to be performed in any particular embodiment.

Abstract

A database system is described that includes components for storing time-series data and executing custom, user-defined computational expressions in substantially real-time such that the results can be provided to a user device for display in an interactive user interface. For example, the database system may process stored time-series data in response to requests from a user device. The request may include a start time, an end time, a period, and/or a computational expression. The database system may retrieve the time-series data identified by the computational expression and, for each period, perform the arithmetic operation(s) identified by the computational expression on data values corresponding to times within the start time and the end time. Once all new data values have been generated, the database system may transmit the new data values to the user device for display in the interactive user interface.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS
This application claims a priority benefit under 35 U.S.C. §119 to U.S. Provisional Patent Application No. 62/171,875, filed on Jun. 5, 2015, and titled “TIME-SERIES DATA STORAGE AND PROCESSING DATABASE SYSTEM,” which is hereby incorporated by reference herein in its entirety.
TECHNICAL FIELD
The present disclosure relates to database systems that store and process data for display in an interactive user interface.
BACKGROUND
A database may store a large quantity of data. For example, a system may comprise a large number of sensors that each collect measurements at regular intervals, and the measurements may be stored in the database. The measurement data can be supplemented with other data, such as information regarding events that occurred while the system was operational, and the supplemental data can also be stored in the database.
In some cases, a user may attempt to analyze a portion of the stored data. For example, the user may attempt to analyze a portion of the stored data that is associated with a specific time period. In response, the user's device may retrieve the appropriate data from the database. However, as the quantity of data stored in the database increases over time, retrieving the appropriate data from the database and performing the analysis can become complicated and time consuming. Thus, the user may experience noticeable delay in the display of the desired data.
SUMMARY
The systems, methods, and devices described herein each have several aspects, no single one of which is solely responsible for its desirable attributes. Without limiting the scope of this disclosure, several non-limiting features will now be discussed briefly.
Disclosed herein is a database system that includes components for storing time-series data and executing custom, user-defined computational expressions in substantially real-time such that the results can be provided to a user device for display in an interactive user interface. For example, the database system may include memory storage, disk storage, and/or one or more processors. Data received from a data source may include value and timestamp pairs and, once written to disk, may be immutable. Thus, the database system may not overwrite a portion of the data or append additional data to the written data once the data is written to disk. Because the data is immutable, all data written to disk can be memory mapped given that the location of the data will not change.
The database system may process stored time-series data in response to requests from a user device. For example, the user may request to view time-series data by manipulating an interactive user interface. The request, received by the database system from the user device (possibly via a server), may include a start time, an end time, a period, and/or a computational expression. The start time and end time may correspond with a range of timestamp values for which associated time-series data values should be retrieved. The period may indicate, when analyzed in conjunction with the start time and end time, a number of data points requested by the user device for display in the interactive user interface. The computational expression may indicate an arithmetic (and/or other type of) operation, if any, that the user wishes to perform on one or more sets of time-series data. Example arithmetic operations include a sum, a difference, a product, a ratio, a zScore, a square root, and/or the like.
Once the database system receives the request, the database system may begin retrieving the appropriate time-series data and performing the indicated arithmetic (and/or other types of) operations via the one or more processors. Depending on the type of indicated operation(s) to be performed, the one or more processors may perform pointwise operations or sliding window operations. As described above, because the data files may be memory mapped, the one or more processors can access the data files from memory, rather than from disk, to perform the indicated operations. The database system described herein may then achieve better performance when generating the new data values as compared with conventional databases. Once all new data values have been generated, the database system may transmit the new data values to the user device (for example, via the server) for display in the interactive user interface.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1A illustrates a block diagram showing the various components of a time-series data storage and processing database system.
FIG. 1B illustrates a more detailed block diagram of a processing node, such as a processing node of FIG. 1A.
FIGS. 2A-2C illustrate example state diagrams that depict the process of retrieving and manipulating time-series data.
FIGS. 3A-3B illustrate an interactive user interface depicting graphs of time-series data that may be generated and displayed by a user device, such as the user device of FIG. 1A.
FIG. 3C illustrates an example nested set of arithmetic operations.
FIGS. 4A-4C illustrate an example file structure as stored in a node data store, such as a node data store of FIG. 1B.
FIG. 5A is a flowchart depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface.
FIG. 5B is another flowchart depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface.
FIG. 6 illustrates a computer system with which certain methods discussed herein may be implemented, according to an embodiment.
DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS Overview
As described above, a user may attempt to analyze a portion of data stored in a database. For example, the user may attempt to analyze time-series data measured or captured by a data source. The user may attempt to analyze a single set of time-series data (for example, data measured or captured by a single data source over a period of time) or multiple sets of time-series data (for example, data measured or captured by different data sources over a period of time) at once. Such analysis may include viewing the time-series data at different periods of time, viewing the time-series data at different zoom levels, simultaneously viewing different time-series data over the same time period, combining (for example, adding, subtracting, dividing, multiplying, determining a ratio, determining a zScore, determining a square root, etc.) different time-series data to generate new time-series data, and/or the like.
Typically, a system that allows users to analyze the data stored in the database may include the database, a server, and a user device. The user device may provide information to the server regarding the type of analysis desired by the user. The server may retrieve the appropriate data from the database, perform the analysis, and provide the results to the user device for display. However, as the quantity of data stored in the database increases over time, there may be considerable delay in the retrieval of the data from the database by the server. For example, the database may take a longer period of time to search in memory or on disk for the desired data.
In addition, network issues may contribute to the data retrieval delay. For example, the server and the database may be in communication via a wired or wireless network. The bandwidth of the network may be limited such that data can only be streamed from the database to the server at certain speeds. This can be problematic and inefficient because the computational power of the server may be at a level high enough such that the server can finish analyzing a first portion of data before a second portion is received from the database. Because the data may not be transmitted from the server to the user device until all of the data has been analyzed, the user may notice a delay in the display of the desired data.
Thus, it may be desirable to co-locate the hardware components that store the data and the hardware components that execute the instructions to perform the analysis. For example, instead of having the server perform the analysis, the database could be configured to perform the analysis. In such a scenario, the database may retrieve the desired data from memory or disk, perform the analysis, and provide the results to the server. The server may then immediately forward the results to the user device for display.
While some conventional databases perform a basic level of computation, these databases may not be suitable for replacing the functionality of the server. For example, some conventional databases can generate generic statistics based on the stored data. Such statistics may describe attributes of the stored data, such as mean data values, time ranges over which stored data is available, and/or the like. Furthermore, the statistics may be organized over preset time periods, such as days, months, or years. However, these conventional databases are not configured to perform arbitrary or custom analyses or computations. Moreover, these conventional databases do not allow for organizing data over custom time periods. For example, these conventional databases are not designed to receive a computational expression generated by another device (such as the server or the user device), execute the received computational expression on only the portion of stored data identified by the other device, and provide the results. Thus, conventional databases may be inadequately designed to reduce or eliminate the inefficiencies described above.
Accordingly, disclosed herein is a database system that includes components for storing time-series data and executing custom, user-defined computational expressions in substantially real-time such that the results can be provided to a user device for display in an interactive user interface. For example, the database system may include memory storage, disk storage, and/or one or more processors. Data received from a data source may include value and timestamp pairs. When the data is initially received from a data source, the data may be written to a write ahead log that is stored in memory and/or written to disk (e.g., such that the write ahead log can be restored if the database system crashes or experiences a forced shutdown). Although the write ahead log may be written to disk, the write ahead log may not be immutable. The data may be written in the write ahead log in the order received from the data source rather than in an order based on the timestamp values. The database system may maintain a mapping that indicates what portion of the data stored in the write ahead log is in order according to the timestamp values and what portion of the data stored in the write ahead log is not in order according to the timestamp values. The write ahead log may have a size limit and once the size limit is reached or about to be reached (or some other criteria is met, such as the passage of a threshold period of time), the data in the write ahead log may be flushed and written to disk. Using the write ahead log as a buffer and only writing to disk periodically or aperiodically may reduce the overhead associated with writing to disk. When writing to disk, the database system may use the mapping to reorder the data such that the data is written to disk in an order according to the timestamp values.
Data from a single data source may be written to a single data file or may be written to multiple data files. If written to multiple data files and the two or more of the data files include overlapping timestamp values, the data may be merged before any computations are performed, as described in greater detail below. Alternatively, the two or more data files may be compacted prior to any computations to generate a new data file, and the new data file may be used for any subsequent computations. Compacting the data files may be more efficient because then a merge operation may not need to be performed each time the same data files with overlapping timestamp values are requested by the user device.
Data written to disk may be immutable. Thus, the database system may not overwrite a portion of the data or append additional data to the written data once the data is written to disk. Because the data is immutable, all data written to disk can be memory mapped (for example, a segment of virtual or non-virtual memory can be assigned a direct byte-for-byte or bit-for-bit correlation with at least a portion of a data file) given that the location of the data will not change. Memory mapping the data files may decrease data retrieval times, especially for large data files, because a read operation may not be necessary to access the data (for example, because the data can be stored directly in memory) and/or the database system may not need to copy the data retrieved from disk into memory before the data is usable.
The database system may process stored time-series data in response to requests from a user device. For example, the user may request to view time-series data by manipulating an interactive user interface. The request, received by the database system from the user device (possibly via a server), may include a start time, an end time, a period, and/or a computational expression. The start time and end time may correspond with a range of timestamp values for which associated time-series data values should be retrieved. The period may indicate, when analyzed in conjunction with the start time and end time, a number of data points requested by the user device for display in the interactive user interface. As an example, the period may correspond with a time period that falls within the width of a pixel, where the interactive user interface displays a time-series data value associated with the start time and a time-series data value associated with the end time N pixels apart (e.g., where N corresponds to the number of data points requested by the user device).
The computational expression may indicate an arithmetic operation, if any, that the user wishes to perform on one or more sets of time-series data. Example arithmetic operations include a sum, a difference, a product, a ratio, a zScore, a square root, and/or the like. For example, the user may wish to combine the values in two different sets of time-series data. Thus, the user device may generate a computational expression that indicates that an addition operation is to be performed on data values in the first time-series data set and in the second time-series data set that correspond with timestamp values that fall between the start time and the end time.
The computational expression may identify a single arithmetic operation or may identify a nested or recursive set of arithmetic (and/or other types of) operations. For example, the computational expression may indicate that data values in a first time-series data set are to be added to data values in a second time-series data set, and the result of the addition is to be subtracted from data values in a third time-series data set.
Once the database system receives the request, the database system may begin retrieving the appropriate time-series data and performing the indicated arithmetic (and/or other types of) operations via the one or more processors. Depending on the type of indicated operation(s) to be performed, the one or more processors may perform pointwise operations or sliding window operations. For example, if performing an addition operation, the one or more processors may take single data values from the same time-series data file or from different time-series data files and execute the operation on the single data values to generate a new data value. As another example, if performing a zScore operation, the one or more processors may take a window of data values (for example, a plurality of data values) and execute the operation on the window of data values taken as a whole to generate a new data value.
The one or more processors can perform the indicated operations on time-series data sets that have matching timestamp values. For example, the one or more processors can execute an operation on data values from different time-series data sets if the data values correspond to the same timestamp value. In some cases, however, the timestamp values from two or more different time-series data sets may not align. In such a situation, the one or more processors may perform interpolation to estimate data values that may correspond to any missing timestamp values. The interpolation may occur prior to executing the operation or during execution of the operation (for example, interpolation may occur once the one or more processors receives a data value for a timestamp not present in another time-series data set that is being processed).
As described above, time-series data originating from a single data source may be stored in multiple data files. If the multiple data files include overlapping ranges of timestamp values (e.g., a first data file includes timestamps at a first time, a second time, and a third time, and a second data file includes timestamps at the second time, the third time, and a fourth time), then the data values in the multiple data files may be merged by the one or more processors before executing the operation (if, for example, the multiple data files were not already compacted into a new data file as described above). For example, if data values in the multiple data files each correspond to the same timestamp value, then the one or more processors may choose a data value from a most-recently modified file (or least-recently modified file) as the data value to be used for the timestamp value when executing the operation. To delete one or more data values that correspond to the same timestamp value (even if the data files are otherwise immutable), a reserved value can be written in association with the timestamp value to indicate that the previously written data value at the timestamp value should be deleted.
Thus, the one or more processors may generate new data values by performing a sequential scan of existing time-series data. As described above, because the data files may be memory mapped, the one or more processors can access the data files from memory, rather than from disk, to perform the indicated operations. The database system described herein may then achieve better performance while performing the sequential scan to produce the new data values when compared with conventional databases. Once all new data values have been generated, the database system may transmit the new data values to the user device (for example, via the server) for display in the interactive user interface.
In an embodiment, the immutable status of the data files written to disk enables the database system to generate quick and efficient data backups. For example, because the data files are immutable, the stored location of the data files on disk will not change. If new data is received from a data source, the new data may be stored in a different data file on disk. Generally, backups include a copy of the actual data file. However, because the location of a data file will not change, the backup can include a link to the location of the data file rather than a copy of the actual data file itself. Thus, the database system described herein may generate backups faster than conventional databases given that the process of generating links may be faster than the process of copying actual data files.
Example System Overview
FIG. 1A illustrates a block diagram showing the various components of a time-series data storage and processing database system 100. As illustrated in FIG. 1A, the time-series data storage and processing database system 100 may include a data source 110, a data server 140, a time-series data store 150, and a user device 160.
In an embodiment, the data source 110 may be any computing or mechanical device that can determine, measure, and/or capture data values. For example, the data source 110 may be a sensor, such as a sensor that measures physical parameters, a financial system, a medical electronic records system, and/or the like. While FIG. 1A illustrates a single data source 110, this is not meant to be limiting. The time-series data storage and processing database system 100 may include any number of data sources 110.
The data source 110 may transmit determined, measured, and/or captured time-series data to the time-series data store 150. In an embodiment, the data source 110 and the time-series data store 150 communicate via a network 120. The network 120 may include any communications network, such as the Internet. The network 120 may be a wired network, a wireless network, or a combination of the two. For example, network 120 may be a local area network (LAN) and/or a wireless area network (WAN). The network 120 may include cables and/or other equipment that allow the transport of data from underwater locations to above-ground locations and/or vice-versa.
The time-series data store 150 may store time-series data received from the data source 110 and perform analyses on the stored data based on requests received from the user device 160 via the data server 140. For example, as illustrated in FIG. 1, the time-series data store 150 may include a discovery node 152, a time-series mapping data store 153, processing nodes 154A-C, and node data stores 156A-C. The discovery node 152 can be a single node or a cluster of nodes. Three processing nodes 154A-C and node data stores 156A-C are depicted for illustrative purposes only and is not meant to be limiting. The time-series data store 150 may include any number of processing nodes and/or node data stores.
In some embodiments, the time-series data received from various data sources 110 may be stored in different node data stores 156A-C. The time-series mapping data store 153 may store a mapping that identifies the processing node 154A-C that is associated with a time-series data set (and thus the node data store 156A-C in which the time-series data set is stored). When time-series data is received by the time-series data store 150 from a data source 110, the discovery node 152 may receive the time-series data, communicate with the time-series mapping data store 153 to determine the processing node 154A-C associated with the time-series data, and transmit the time-series data to the appropriate processing node 154A-C. Alternatively, the data source 110 can cache information indicating the processing node 154A-C that is associated with a time-series data set such that the data source 110 can transmit the time-series data directly to the appropriate processing node 154A-C. The processing node 154A-C may then store the received time-series data in the associated node data store 156A-C (after, for example, the write ahead log of the processing node 154A-C is flushed, as described in greater detail below with respect to FIG. 1B).
Likewise, when a user device 160 or data server 140 provides a computational expression to the time-series data store 150, the discovery node 152 may analyze the computational expression to identify the time-series data upon which an arithmetic (and/or other type of) operation may be performed. The discovery node 152 may communicate with the time-series mapping data store 153 to identify the processing node(s) 154A-C associated with the identified time-series data. The computational expression, along with the start time, the end time, and the period, may be transmitted to the processing node 154A that is associated with the identified time-series data. Alternatively, the user device 160 or data server 140 can cache information indicating the processing node 154A-C that is associated with a time-series data set such that the user device 160 or data server 140 can analyze the computational expression to identify the time-series data upon which the arithmetic (and/or other type of) operation may be performed and transmit the computational expression, along with the start time, the end time, and the period, directly to the appropriate processing node 154A-C. If the computational expression identifies a plurality of time-series data sets that are associated with different processing nodes 154A-C, then the discovery node 152 may select one of the processing nodes 154A-C to perform the arithmetic operation(s). The selected processing node 154A-C may retrieve time-series data from another processing node 154A-C (for example, time-series data that is not associated with the selected processing node 154A-C) in order to perform the arithmetic operation(s). While the description herein refers to “arithmetic operations” for simplicity, any other type of mathematical operation may similarly be performed on the time-series data.
A processing node 154A-C may use the start time, the end time, the period (for example, a value that identifies a duration of time), and the computational expression to manipulate time-series data and/or to generate new time-series data. For example, the processing node 154A-C may perform the arithmetic operation(s) identified by the computational expression on the time-series data identified by the computational expression for data values in the time-series data that correspond with timestamp values that fall between the start time and the end time (interpolating when necessary as described herein). If the period of time between each timestamp value is less than the period received from the user device 160 or data server 140, then the processing node 154A-C may aggregate data values (for example, average, sum, subtract, minimum, maximum, etc.) after (or prior to) applying the arithmetic operation(s) such that the number of data values equals the number of periods between the start time and the end time. For example, if the start time is 1:00 pm, the end time is 1:01 pm, the period is 10 seconds, and the timestamp values of a time-series data set increment every 1 second, then the period of time between each timestamp value (for example, 1 second) is less than the period (for example, 10 seconds) and the number of periods between the start time and the end time is 6. The processing node 154A-C may aggregate data values corresponding to the first 10 timestamp values (for example, data values corresponding to times 1:00:01 pm through 1:00:10 pm), the second 10 timestamp values (for example, data values corresponding to times 1:00:11 pm through 1:00:20 pm), and so on until the processing node 154A-C has generated 6 aggregated data values. The processing node 154A-C may repeat this process for each identified time-series data set. In some cases, a single time-series data set may not have a fixed period between data values. Thus, the processing node 154A-C may aggregate a portion of data values (e.g., the portion of data values for which the period of time between each timestamp value is less than the period received from the user device 160 or data server 140). In some embodiments, the processing node 154A-C performs the arithmetic operation(s) before aggregating the data values. In other embodiments, the processing node 154A-C performs the arithmetic operation(s) using the aggregated data values.
As described herein, the processing nodes 154A-C can perform pointwise operations and/or sliding window operations. When performing pointwise operations, the processing nodes 154A-C may apply the arithmetic operation(s) on single data values (for example, data values corresponding to the same timestamp value). When performing sliding window operations, the processing nodes 154A-C may apply the arithmetic operation(s) on a window of data values (for example, data values corresponding to a range of timestamp values).
Once the arithmetic operation(s) identified by the computational expression are applied to the appropriate data values, the processing nodes 154A-C may aggregate the results into a new time-series data set. The new time-series data set may be stored in the associated node data store 156A-C. Alternatively or in addition, the new time-series data set may be transmitted to the data server 140 (which then forwards the new time-series data set to the user device 160) and/or the user device 160.
In some embodiments, the computational expression includes no arithmetic operations to be performed. For example, this may occur if the user scrolls or pans within a time-series data set displayed in the interactive user interface, thus requesting to view time-series data that was not previously visible within the interactive user interface. In such a situation, the processing nodes 154A-C may not generate new time-series data, but may instead retrieve and provide a different set of data values than was previously provided for display in the interactive user interface.
The data server 140 may receive requests from the user device 160 (for example, the computational expression, the start time, the end time, and the period) and forward such requests to the time-series data store 150. The data server 140 may also receive updated time-series data and/or new time-series data from the time-series data store 150 and forward such data to the user device 160 for display in the interactive user interface.
The data server 140 may be implemented as a special-purpose computer system having logical elements. In an embodiment, the logical elements may comprise program instructions recorded on one or more machine-readable storage media. Alternatively, the logical elements may be implemented in hardware, firmware, or a combination thereof. In one embodiment, the data server 140 may be implemented in a Java Virtual Machine (JVM) that is executing in a distributed or non-distributed computer system. In other embodiments, the data server 140 may be implemented as a combination of programming instructions written in any programming language (e.g. C++, Visual Basic, Python, etc.) and hardware components (e.g., memory, CPU time) that have been allocated for executing the program instructions.
The user device 160 may transmit requests for updated or new time-series data to the data server 140 for transmission to the time-series data store. Such requests may include the start time, the end time, the period, and/or the computational expression. The requests may be generated in response to the manipulation of the interactive user interface by a user. Manipulations may include panning, scrolling, zooming, selecting an option to modify, combine and/or aggregate one or more time-series data sets to produce a new time-series data set, and/or the like. For example, the user may be viewing, via the interactive user interface, a first time-series data set that illustrates a first physical parameter (e.g., temperature) associated with a component and a second time-series data set that illustrates a second physical parameter (e.g., humidity) associated with the component. The user may then select an option to view the values of the first and second physical parameters associated with the component. Selection of this option may cause the user device 160 to generate a computational expression that identifies the first time-series data set, the second time-series data set, and an arithmetic operation (for example, addition). The selection may also cause the user device 160 to identify the start time and the end time, which can be user-defined and/or based on an earliest timestamp value and a latest timestamp value currently viewable in the interactive user interface. The selection may also cause the user device 160 to identify the period, which may be user-defined and/or may be the range of time between the start time and the end time that corresponds with the width of a pixel. The range of time may be determined based on the zoom level of a graph depicting time-series data. Thus, the period may be dependent on the number of pixels in the horizontal direction (if time is along the x-axis) or vertical direction (if time is along the y-axis) that are devoted to displaying the requested time-series data.
Once updated or new time-series data is received from the data server 140 and/or directly from the time-series data store 150, the user device 160 may update user interface data used by the user device 160 to render and display the interactive user interface to display the data and timestamp value pairs. In other embodiments, the data server 140 may update the user interface data and provide the updated user interface data to the user device 160.
The user device 160 can include a wide variety of computing devices, including personal computing devices, terminal computing devices, laptop computing devices, tablet computing devices, electronic reader devices, mobile devices (e.g., mobile phones, media players, handheld gaming devices, etc.), wearable devices with network access and program execution capabilities (e.g., “smart watches” or “smart eyewear”), wireless devices, set-top boxes, gaming consoles, entertainment systems, televisions with network access and program execution capabilities (e.g., “smart TVs”), and various other electronic devices and appliances. The user devices 160 may execute a browser application to communicate with the data server 140.
FIG. 1B illustrates a more detailed block diagram of a processing node, such as a processing node 154A-C of FIG. 1A. As illustrated in FIG. 1B, the processing node 154 may include a processor 170 and memory 180. While a single processor 170 is depicted, this is not meant to be limiting. The processing node 154 may include any number of processors 170. The processor 170 may retrieve time-series data from the memory 180 and/or the node data store 156 to perform requested arithmetic operation(s).
The memory 180 may store a write ahead log 182, a memory map 184, and one or more time-series data files 186. For example, as described herein, when time-series data is initially received from the data source 110, the processing node 154 may initially store the received time-series data in the write ahead log 182. The time-series data may be written in the write ahead log 182 in the order received from the data source 110 rather than in an order based on the timestamp values in the time-series data. The processing node 154, in the memory 180 or in another hardware component, may maintain a mapping that indicates what portion of the time-series data stored in the write ahead log 182 is in order according to the timestamp values and what portion of the time-series data stored in the write ahead log 182 is not in order according to the timestamp values. For example, the first four entries in the write ahead log 182 may be in order according to timestamp values and the second four entries in the write ahead log 182 may be in order according to timestamp values. However, the first four entries and the second four entries may not be in order according to timestamp values.
The write ahead log 182 may have a data size limit and once the size limit is reached or about to be reached (or some other criteria is met, such as the passage of a threshold period of time), the time-series data in the write ahead log 182 may be flushed and written to disk (for example, the node data store 156). When writing to disk, the processing node 154 may use the mapping to reorder the time-series data such that the time-series data is written to disk in an order according to the timestamp values. For example, using the example above, the processing node 154 may reorder the first four entries and the second four entries such that all eight entries are written to disk in an order according to the timestamp values.
The memory map 184 may identify the segments of the memory 180 that are assigned to at least a portion of a data file of a time-series (for example, the time-series data files 186). The memory 180 may use the memory map 184 to identify the location of at least a portion of the time-series data files 186 requested by the processor 170. As described herein, the use of the memory map 184 may decrease data retrieval times, especially for large time-series data files, because a read operation on the node data store 156 may not be necessary to access the time-series data (for example, because the time-series data can be stored directly in the memory 180) and/or the processing node 154 may not need to copy the time-series data retrieved from the node data store 156 into the memory 180 before the data is usable. For example, the processor 170 may request a data value stored in a particular page of a time-series data file. An expectation may be that the processor 170 may then subsequently request another data value that follows the initial data value or that is within a range of the initial data value, where both data values are stored in the same page or in contiguous pages. Thus, by loading at least a portion (e.g., at least a page) of a time-series data file 186 into memory 180, a read operation on the node data store 156 may not be necessary.
The time-series data files 186 may be stored in the memory 180 (for example, after the time-series data files have been written to the node data store 156). Alternatively, a portion of the time-series data files 186 may be stored in the memory 180, such as one or more pages of a respective time-series data file 186. The operating system of the processing node 154 may determine if or when to perform read operations to pull data from the node data store 156 into the memory 180.
Example State Diagrams
FIGS. 2A-2C illustrate example state diagrams that depict the process of retrieving and manipulating time-series data. As illustrated in FIG. 2A, the data source 110 may transmit time-series data (1) to the time-series data store 150. The time-series data store 150 may store the time-series data (2). Once written to disk, the file including the time-series data may be immutable.
At some time after the time-series data is stored in the time-series data store 150, a user may manipulate the interactive user interface displayed by the user device 160 in a way that causes the user device 160 to request processing of time-series data (3) from the data server 140. For example, the user may pan, scroll, zoom, select an option to modify, combine, and/or aggregate time-series data, and/or the like. The request may include the start time, the end time, the period, and/or the computational expression.
The data server 140 may forward the request to the time-series data store 150. Using information in the request, the time-series data store 150 may retrieve time-series data identified in the computational expression and execute arithmetic operation(s) on data values in the retrieved time-series data to generate new time-series data (4). Execution of the arithmetic operation(s) may involve pointwise operations and/or sliding window operations.
The time-series data store 150 may transmit the new time-series data (5) to the data server 140. The data server 140 may then forward the new time-series data to the user device 160. In some embodiments, the data server 140 aggregates the new time-series data into a different format (e.g., a format more understandable by humans, a format that can be more easily displayed by the user device 160, etc.) before forwarding the new time-series data to the user device 160. The user device 160 may display the new time-series data (6) in the interactive user interface.
The state diagram depicted in FIG. 2B is similar to the state diagram depicted in FIG. 2A. However, the user manipulation of the interactive user interface displayed by the user device 160 may cause the user device 160 to transmit an indication of an adjustment in the time-series data time scale (3). For example, such manipulation may include panning a graph displayed in the interactive user interface, scrolling through the graph displayed in the interactive user interface, and/or changing a zoom level depicted in the graph displayed in the interactive user interface. The indication may include the start time, the end time, the period, and/or the computational expression.
The indication may be received by the data server 140 and forwarded to the time-series data store 150. Using information in the indication, the time-series data store 150 may retrieve time-series data identified in the computational expression and generate updated values corresponding to the new time scale using the retrieved time-series data (4). For example, the time-series data store 150 may generate the updated values by executing arithmetic operation(s) on data values in the retrieved time-series data.
The time-series data store 150 may transmit the updated values corresponding to the new time scale (5) to the data server 140. The data server 140 may then forward the updated values to the user device 160. The user device 160 may display the updated values (6) in the interactive user interface.
FIG. 2C illustrates an example state diagram depicting the processes performed by a processing node 154 in the time-series data store 150 when analyzing and performing the computational expression (for example, steps (4) in FIGS. 2A-2B). For example, the processor 170 may request first time-series data between a start time and an end time (1) from the memory 180. The memory 180 may then transmit the first time-series data (2) to the processor 170.
Optionally, the processor 170 may request second time-series data between a start time and an end time (3) from the memory 180 (if, for example, the computational expression identifies the second time-series data). The memory 180 may then transmit the second time-series data (4) to the processor 170.
For each period, the processor 170 may execute a pointwise or sliding window operation using the retrieved time-series data (5). The processor 170 (with or without the use of the memory 180 or additional memory to store intermediate states) may execute a pointwise or sliding window operation based on the type of arithmetic operation identified in the computational expression. For example, the processor 170 may execute a pointwise operation if the arithmetic operation is addition and the processor 170 may execute a sliding window operation if the arithmetic operation is a moving average.
Example Interactive User Interfaces
FIGS. 3A-3B illustrate an interactive user interface 300 depicting graphs of time-series data that may be generated and displayed by a user device, such as the user device 160. As illustrated in FIG. 3A, the interactive user interface 300 includes a graph 310 displaying time-series data showing water allocation values over time and a graph 320 displaying time-series data showing temperature values over time.
If, for example, the user manipulates the graph 310, then the user device 160 may generate a start time, an end time, a period, and a computational expression to transmit to the time-series data store 150. The start time may be 1:01 pm and the end time may be 1:05 pm given that these times correspond with the earliest timestamp and the latest timestamp visible in the interactive user interface 300. The number of pixels in the horizontal direction between a data value corresponding to the earlier timestamp and a data value corresponding to the latest timestamp may dictate the value of the period. For example, if 240 pixels exist between these two data points, then the period may be 1 second (for example, there may be 240 seconds between 1:05 pm and 1:01 pm and thus each pixel may correspond to 1 second).
The computational expression may identify the time-series data set displayed in the first graph 310. The computational expression may also identify the time-series data set displayed in the second graph 320 if, for example, the user selects an option to view a time-series data set that comprises some combination (for example, addition, subtraction, ratio, etc.) of the time-series data sets displayed in the graphs 310 and 320. Finally, the computational expression may also identify any arithmetic operation(s) to be performed.
As illustrated in FIG. 3B, portion 350 in the graph 310 includes gaps in data values. For example, data values in the gaps may not have been stored in the time-series data store 150. If the user desired to view a time-series data set that comprised some combination of the time-series data sets in the graphs 310 and 320, then the time-series data store 150 may use interpolation to estimate possible data values associated with the timestamp values in which data is missing. The interpolated data, along with the actual stored data, may then be used to generate the new time-series data set.
As described herein, the computational expression may include a nested set of arithmetic operations. FIG. 3C illustrates an example nested set of arithmetic operations. As illustrated in FIG. 3C, a first arithmetic operation may include the addition of data values from the time-series data displayed in the graph 310 with data values from the time-series data displayed in the graph 320. The second arithmetic operation may include the division of data values from the time-series data displayed in the graph 320 over the results from the first arithmetic operation. A computed set of time-series data 330 is output as a result of the computational expression. The computed set of time-series data 330 may then be displayed to the user in one of the graphs 310 or 330, and/or another graph of the user interface. While two arithmetic operations are depicted, this is not meant to be limiting. A computational expression may include any number of nested or un-nested arithmetic operations. Further, while, for clarity of description, FIG. 3C illustrates a computational expression performed on displayed time series data, in other embodiments computational expressions are performed on time-series data that may not be displayed. For example, the user may request a display of a graph of time-series data that may only be produced by execution of a computational expression on two or more time-series of data. As a result, the system may automatically access the necessary time-series of data (as described above and below), execute the computational expression (as described above and below), and then provide the requested time-series date for display to the user (e.g., to the user device, as described above).
Example Data Directory
FIGS. 4A-4C illustrate an example file structure as stored in a node data store, such as a node data store 156A-C. As illustrated in FIG. 4A, a data directory may include a lock file and a time-series directory. The lock file may be used to prevent another instance from writing and/or reading from the data directory when a write operation is occurring. The time-series directory may include one or more subfolders that are each associated with a time-series data set.
For example, using cursor 410, subfolder series-2721235254 may be selected. Selection of the subfolder displays additional links, files, and subdirectories that may be representative of the contents of the other time-series directory subfolders, as illustrated in FIG. 4B. Series-2721235254 may include a data directory that includes one or more time-series data files, where the time-series data files may be immutable. While the series-2721235254 subfolder may correspond to a single time-series data set, the data may be separated into separate files. In some cases, the different time-series data files may include overlapping timestamp values. In such a situation, the processing node 154 may perform a merge operation as the data is used when executing arithmetic operation(s). For example, when the processing node 154 comes across a timestamp value corresponding to two or more different values, the processing node 154 may select the data value from the most-recently modified file (or the least-recently modified file) as the data value to use in computations. A special value (e.g., a reserved value) may be written at the time of the data value that is not selected to be used in computations. The special value will eventually be removed once the time-series data files are compacted. Alternatively, the processing node 154 may compact two or more time-series data files to generate a single new time-series data file in which overlapping timestamp value issues are resolved (for example, the new time-series data file includes a single data value for each timestamp value, where the data value is determined in a manner as described above). The new time-series data file may then be used in future computations in place of the original time-series data files used to generate the new time-series data file.
Series-2721235254 may also include a current snapshot link that links to the snapshots directory. The snapshots directory can be a staging location for an in-progress snapshot when a time-series data file is being added or compacted. As described above, the time-series data files may be immutable and thus the location in the node data store 156 is unchanged. Thus, the snapshots directory may include links, rather than actual copies of data. Each of the links in the snapshots directory may link to the time-series data file identified by the name of the link. The state of a time-series can be changed with a single file system operation (e.g., a single atomic file system operation) to change a current snapshot link associated with the time-series.
Series-2721235254 may also include a log file, which may be the write ahead log described herein. Series-2721235254 may also include a metadata file that includes key-value pairs about the time-series. For example, the information in the metadata file may include a Unicode identification of the time-series and an indication of whether the data values are stored as floats or doubles.
Raw data 450 received from a data source 110 and stored in the time-series data files may be in the form of data and timestamp value pairs, as illustrated in FIG. 4C. The data values may be stored as integers, floats, doubles, and/or the like. The timestamp values may be absolute values (for example, wall clock time) or relative values (for example, the amount of time that has passed since the last data value was measured). The data and/or timestamp values can be compressed prior to storage in the time-series data files. The time-series data files or a separate file may further include an index indicating the location of the compressed data and/or timestamp values.
Example Process Flows
FIG. 5A is a flowchart 500 depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface. Depending on the embodiment, the method of FIG. 5A may be performed by various computing devices, such as by the time-series data store 150 described above. Depending on the embodiment, the method of FIG. 5A may include fewer and/or additional blocks and the blocks may be performed in an order different than illustrated.
In block 502, an identification of a series expression, a start time, an end time, and a period may be received from a user device. For example, the series expression may be a computational expression. The series expression may identify one or more arithmetic operations and one or more time-series data sets upon which the arithmetic operations are to be performed.
In block 504, a time-series data file corresponding to the series expression may be retrieved from memory. The time-series data file may be a data file associated with a time-series data set identified by the series expression.
In block 506, within each period between the start time and the end time, a value based on a computation identified by the series expression applied to a portion of the time-series data file is generated. For example, the computation may be one or more arithmetic operations. The computation may be applied to data values stored in the time-series data file that are associated with timestamp values that fall between the start time and the end time.
In block 508, the generated values may be transmitted to the user device for display in a user interface, such as an interactive user interface. In some embodiments, the generated values are also stored in the node data store 156.
FIG. 5B is another flowchart 550 depicting an illustrative operation of processing time-series data by a database for display in an interactive user interface. Depending on the embodiment, the method of FIG. 5B may be performed by various computing devices, such as by the time-series data store 150 described above. Depending on the embodiment, the method of FIG. 5B may include fewer and/or additional blocks and the blocks may be performed in an order different than illustrated.
In block 552, an identification of a series expression, a start time, an end time, and a period may be received from a user device. For example, the series expression may be a computational expression. The series expression may identify one or more arithmetic operations and one or more time-series data sets upon which the arithmetic operations are to be performed.
In block 554, a first time-series data file and a second time-series data file corresponding to the series expression may be retrieved from memory. The time-series data files may be a data files associated with time-series data sets identified by the series expression.
In block 556, within each period between the start time and the end time, a value based on a computation identified by the series expression applied to a portion of the first time-series data file and a portion of the second time-series data file is generated. For example, the computation may be one or more arithmetic operations. The computation may be applied to data values stored in the first time-series data file and in the second time-series data file that are associated with timestamp values that fall between the start time and the end time.
In block 558, the generated values may be transmitted to the user device for display in a user interface, such as an interactive user interface. In some embodiments, the generated values are also stored in the node data store 156.
Implementation Mechanisms
According to one embodiment, the techniques described herein are implemented by one or more special-purpose computing devices. The special-purpose computing devices may be hard-wired to perform the techniques, or may include digital electronic devices such as one or more application-specific integrated circuits (ASICs) or field programmable gate arrays (FPGAs) that are persistently programmed to perform the techniques, or may include one or more general purpose hardware processors programmed to perform the techniques pursuant to program instructions in firmware, memory, other storage, or a combination. Such special-purpose computing devices may also combine custom hard-wired logic, ASICs, or FPGAs with custom programming to accomplish the techniques. The special-purpose computing devices may be desktop computer systems, server computer systems, portable computer systems, handheld devices, networking devices or any other device or combination of devices that incorporate hard-wired and/or program logic to implement the techniques.
Computing device(s) are generally controlled and coordinated by operating system software, such as iOS, Android, Chrome OS, Windows XP, Windows Vista, Windows 7, Windows 8, Windows Server, Windows CE, Unix, Linux, SunOS, Solaris, iOS, Blackberry OS, VxWorks, or other compatible operating systems. In other embodiments, the computing device may be controlled by a proprietary operating system. Conventional operating systems control and schedule computer processes for execution, perform memory management, provide file system, networking, I/O services, and provide a user interface functionality, such as a graphical user interface (“GUI”), among other things.
For example, FIG. 6 is a block diagram that illustrates a computer system 600 upon which an embodiment may be implemented. For example, any of the computing devices discussed herein, such as the data source 110, the data server 140, the time-series data store 150, and/or the user device 160, may include some or all of the components and/or functionality of the computer system 600.
Computer system 600 includes a bus 602 or other communication mechanism for communicating information, and a hardware processor, or multiple processors, 604 coupled with bus 602 for processing information. Hardware processor(s) 604 may be, for example, one or more general purpose microprocessors.
Computer system 600 also includes a main memory 606, such as a random access memory (RAM), cache and/or other dynamic storage devices, coupled to bus 602 for storing information and instructions to be executed by processor 604. Main memory 606 also may be used for storing temporary variables or other intermediate information during execution of instructions to be executed by processor 604. Such instructions, when stored in storage media accessible to processor 604, render computer system 600 into a special-purpose machine that is customized to perform the operations specified in the instructions. Main memory 606 may also store cached data, such as zoom levels and maximum and minimum sensor values at each zoom level.
Computer system 600 further includes a read only memory (ROM) 608 or other static storage device coupled to bus 602 for storing static information and instructions for processor 604. A storage device 610, such as a magnetic disk, optical disk, or USB thumb drive (Flash drive), etc., is provided and coupled to bus 602 for storing information and instructions. For example, the storage device 610 may store measurement data obtained from a plurality of sensors.
Computer system 600 may be coupled via bus 602 to a display 612, such as a cathode ray tube (CRT) or LCD display (or touch screen), for displaying information to a computer user. For example, the display 612 can be used to display any of the user interfaces described herein with respect to FIGS. 2A through 8. An input device 614, including alphanumeric and other keys, is coupled to bus 602 for communicating information and command selections to processor 604. Another type of user input device is cursor control 616, such as a mouse, a trackball, or cursor direction keys for communicating direction information and command selections to processor 604 and for controlling cursor movement on display 612. This input device typically has two degrees of freedom in two axes, a first axis (e.g., x) and a second axis (e.g., y), that allows the device to specify positions in a plane. In some embodiments, the same direction information and command selections as cursor control may be implemented via receiving touches on a touch screen without a cursor.
Computing system 600 may include a user interface module to implement a GUI that may be stored in a mass storage device as executable software codes that are executed by the computing device(s). This and other modules may include, by way of example, components, such as software components, object-oriented software components, class components and task components, processes, functions, attributes, procedures, subroutines, segments of program code, drivers, firmware, microcode, circuitry, data, databases, data structures, tables, arrays, and variables.
In general, the word “module,” as used herein, refers to logic embodied in hardware or firmware, or to a collection of software instructions, possibly having entry and exit points, written in a programming language, such as, for example, Java, Lua, C, or C++. A software module may be compiled and linked into an executable program, installed in a dynamic link library, or may be written in an interpreted programming language such as, for example, BASIC, Perl, or Python. It will be appreciated that software modules may be callable from other modules or from themselves, and/or may be invoked in response to detected events or interrupts. Software modules configured for execution on computing devices may be provided on a computer readable medium, such as a compact disc, digital video disc, flash drive, magnetic disc, or any other tangible medium, or as a digital download (and may be originally stored in a compressed or installable format that requires installation, decompression or decryption prior to execution). Such software code may be stored, partially or fully, on a memory device of the executing computing device, for execution by the computing device. Software instructions may be embedded in firmware, such as an EPROM. It will be further appreciated that hardware modules may be comprised of connected logic units, such as gates and flip-flops, and/or may be comprised of programmable units, such as programmable gate arrays or processors. The modules or computing device functionality described herein are preferably implemented as software modules, but may be represented in hardware or firmware. Generally, the modules described herein refer to logical modules that may be combined with other modules or divided into sub-modules despite their physical organization or storage
Computer system 600 may implement the techniques described herein using customized hard-wired logic, one or more ASICs or FPGAs, firmware and/or program logic which in combination with the computer system causes or programs computer system 600 to be a special-purpose machine. According to one embodiment, the techniques herein are performed by computer system 600 in response to processor(s) 604 executing one or more sequences of one or more instructions contained in main memory 606. Such instructions may be read into main memory 606 from another storage medium, such as storage device 610. Execution of the sequences of instructions contained in main memory 606 causes processor(s) 604 to perform the process steps described herein. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions.
The term “non-transitory media,” and similar terms, as used herein refers to any media that store data and/or instructions that cause a machine to operate in a specific fashion. Such non-transitory media may comprise non-volatile media and/or volatile media. Non-volatile media includes, for example, optical or magnetic disks, such as storage device 610. Volatile media includes dynamic memory, such as main memory 606. Common forms of non-transitory media include, for example, a floppy disk, a flexible disk, hard disk, solid state drive, magnetic tape, or any other magnetic data storage medium, a CD-ROM, any other optical data storage medium, any physical medium with patterns of holes, a RAM, a PROM, and EPROM, a FLASH-EPROM, NVRAM, any other memory chip or cartridge, and networked versions of the same.
Non-transitory media is distinct from but may be used in conjunction with transmission media. Transmission media participates in transferring information between non-transitory media. For example, transmission media includes coaxial cables, copper wire and fiber optics, including the wires that comprise bus 602. Transmission media can also take the form of acoustic or light waves, such as those generated during radio-wave and infra-red data communications.
Various forms of media may be involved in carrying one or more sequences of one or more instructions to processor 604 for execution. For example, the instructions may initially be carried on a magnetic disk or solid state drive of a remote computer. The remote computer can load the instructions into its dynamic memory and send the instructions over a telephone line using a modem. A modem local to computer system 600 can receive the data on the telephone line and use an infra-red transmitter to convert the data to an infra-red signal. An infra-red detector can receive the data carried in the infra-red signal and appropriate circuitry can place the data on bus 602. Bus 602 carries the data to main memory 406, from which processor 604 retrieves and executes the instructions. The instructions received by main memory 606 may retrieve and execute the instructions. The instructions received by main memory 606 may optionally be stored on storage device 610 either before or after execution by processor 604.
Computer system 600 also includes a communication interface 618 coupled to bus 602. Communication interface 618 provides a two-way data communication coupling to a network link 620 that is connected to a local network 622. For example, communication interface 618 may be an integrated services digital network (ISDN) card, cable modem, satellite modem, or a modem to provide a data communication connection to a corresponding type of telephone line. As another example, communication interface 618 may be a local area network (LAN) card to provide a data communication connection to a compatible LAN (or WAN component to communicated with a WAN). Wireless links may also be implemented. In any such implementation, communication interface 618 sends and receives electrical, electromagnetic or optical signals that carry digital data streams representing various types of information.
Network link 620 typically provides data communication through one or more networks to other data devices. For example, network link 620 may provide a connection through local network 622 to a host computer 624 or to data equipment operated by an Internet Service Provider (ISP) 626. ISP 626 in turn provides data communication services through the world wide packet data communication network now commonly referred to as the “Internet” 628. Local network 622 and Internet 628 both use electrical, electromagnetic or optical signals that carry digital data streams. The signals through the various networks and the signals on network link 620 and through communication interface 618, which carry the digital data to and from computer system 600, are example forms of transmission media.
Computer system 600 can send messages and receive data, including program code, through the network(s), network link 620 and communication interface 618. In the Internet example, a server 630 might transmit a requested code for an application program through Internet 628, ISP 626, local network 622 and communication interface 618.
The received code may be executed by processor 604 as it is received, and/or stored in storage device 610, or other non-volatile storage for later execution.
TERMINOLOGY
Each of the processes, methods, and algorithms described in the preceding sections may be embodied in, and fully or partially automated by, code modules executed by one or more computer systems or computer processors comprising computer hardware. The processes and algorithms may be implemented partially or wholly in application-specific circuitry.
The various features and processes described above may be used independently of one another, or may be combined in various ways. All possible combinations and subcombinations are intended to fall within the scope of this disclosure. In addition, certain method or process blocks may be omitted in some implementations. The methods and processes described herein are also not limited to any particular sequence, and the blocks or states relating thereto can be performed in other sequences that are appropriate. For example, described blocks or states may be performed in an order other than that specifically disclosed, or multiple blocks or states may be combined in a single block or state. The example blocks or states may be performed in serial, in parallel, or in some other manner. Blocks or states may be added to or removed from the disclosed example embodiments. The example systems and components described herein may be configured differently than described. For example, elements may be added to, removed from, or rearranged compared to the disclosed example embodiments.
Conditional language, such as, among others, “can,” “could,” “might,” or “may,” unless specifically stated otherwise, or otherwise understood within the context as used, is generally intended to convey that certain embodiments include, while other embodiments do not include, certain features, elements and/or steps. Thus, such conditional language is not generally intended to imply that features, elements and/or steps are in any way required for one or more embodiments or that one or more embodiments necessarily include logic for deciding, with or without user input or prompting, whether these features, elements and/or steps are included or are to be performed in any particular embodiment.
Any process descriptions, elements, or blocks in the flow diagrams described herein and/or depicted in the attached figures should be understood as potentially representing modules, segments, or portions of code which include one or more executable instructions for implementing specific logical functions or steps in the process. Alternate implementations are included within the scope of the embodiments described herein in which elements or functions may be deleted, executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those skilled in the art.
It should be emphasized that many variations and modifications may be made to the above-described embodiments, the elements of which are to be understood as being among other acceptable examples. All such modifications and variations are intended to be included herein within the scope of this disclosure. The foregoing description details certain embodiments of the invention. It will be appreciated, however, that no matter how detailed the foregoing appears in text, the invention can be practiced in many ways. As is also stated above, it should be noted that the use of particular terminology when describing certain features or aspects of the invention should not be taken to imply that the terminology is being re-defined herein to be restricted to including any specific characteristics of the features or aspects of the invention with which that terminology is associated. The scope of the invention should therefore be construed in accordance with the appended claims and any equivalents thereof.

Claims (13)

What is claimed is:
1. A database configured to receive and process requests associated with a plurality of stored time-series data and provide results to a user device for display in an interactive user interface, the database comprising:
a discovery node comprising a first computer processor;
a processing node comprising a second computer processor and memory; and
a time-series database storing the plurality of time-series data, wherein the plurality of time-series data comprises first time-series data and second time-series data, and wherein the memory stores a copy of a portion of the plurality of time-series data; and
a computer readable storage medium storing first program instructions and second program instructions,
wherein the first program instructions are configured for execution by the first computer processor in order to cause the database to transmit a series expression, a start time, an end time, and a period received from the user device to the processing node, wherein the first time-series data and the second time-series data correspond to the series expression, and
wherein the second program instructions are configured for execution by the second computer processor in order to cause the database to:
retrieve, from the memory, a portion of the first time-series data and a portion of the second time-series data;
for each period between the start time and the end time,
identify a first data value from the portion of the first time-series data and a second data value from the portion of the second time-series data that are both associated with a same timestamp value, and
generate a value based on a computation identified by the series expression that is applied to the first data value and the second data value.
2. The database of claim 1, wherein the second program instructions are further configured for execution by the second computer processor in order to cause the database to transmit the generated values to the user device for display in the interactive user interface.
3. The database of claim 1, wherein the first program instructions are further configured for execution by the first computer processor in order to cause the database to determine that the processing node is associated with the first time-series data and the second time-series data.
4. The database of claim 1, wherein the second program instructions are further configured for execution by the second computer processor in order to cause the database to retrieve, from the memory, data values from the first time-series data that are associated with timestamp values that fall within the start time and the end time and data values from the second time-series data that are associated with timestamp values that fall within the start time and the end time.
5. The database of claim 1, wherein the first data value comprises a plurality of data values that are each associated with a different timestamp value, and wherein the second data value comprises a plurality of data values that are each associated with a different timestamp value.
6. The database of claim 1, wherein the first program instructions are further configured for execution by the first computer processor in order to cause the database to:
receive, from a data source, third time-series data and fourth time-series data, wherein the third time-series data and the fourth time-series data correspond with a first sensor and comprise overlapping time values; and
compact the third time-series data and the fourth time-series data to generate the second time-series data.
7. The database of claim 6, wherein the first program instructions are further configured for execution by the first computer processor in order to cause the database to:
determine, for each overlapping time value, whether a third data value corresponding to the third time-series data or a fourth data value corresponding to the fourth time-series data is stored in a later-modified file; and
insert the data value stored in the later-modified file into the second time-series data in association with the respective overlapping time value.
8. The database of claim 1, wherein the start time and the end time correspond to a window of data viewed by a user via the interactive user interface.
9. The database of claim 1, wherein the period identifies a period of time that corresponds with a width of a pixel in the interactive user interface.
10. The database of claim 9, wherein the second program instructions are further configured for execution by the second computer processor in order to cause the database to:
determine whether the period is greater than a period of time between each timestamp value in the first time-series data;
aggregate data values such that a period of time between an earliest timestamp value corresponding to a data value in the aggregate and a latest timestamp value corresponding to a data value in the aggregate equals the period in response to a determination that the period is greater than the period of time between each timestamp value in the first time-series data; and
for each period between the start time and the end time,
identify a first aggregated data value from the portion of the first time-series data and the second data value from the portion of the second time-series data that are both associated with a same timestamp value, and
generate a value based on a computation identified by the series expression that is applied to the first aggregated data value and the second data value.
11. The database of claim 1, wherein the series expression identifies a first operation associated with the first time-series data and the second time-series data and a second operation associated with a result of the first operation and the first time-series data.
12. The database of claim 1, wherein the series expression comprises one of a sum, a difference, a product, a ratio, a moving average, a zScore, or a square root.
13. The database of claim 1, wherein the first time-series data stored in the time-series database is immutable.
US15/171,494 2015-06-05 2016-06-02 Time-series data storage and processing database system Active US9672257B2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
US15/171,494 US9672257B2 (en) 2015-06-05 2016-06-02 Time-series data storage and processing database system
EP16173056.9A EP3101560B1 (en) 2015-06-05 2016-06-06 Time-series data storage and processing database system
US15/614,388 US10585907B2 (en) 2015-06-05 2017-06-05 Time-series data storage and processing database system
US16/805,257 US11687543B2 (en) 2015-06-05 2020-02-28 Time-series data storage and processing database system
US18/316,894 US20230359638A1 (en) 2015-06-05 2023-05-12 Time-series data storage and processing database system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201562171875P 2015-06-05 2015-06-05
US15/171,494 US9672257B2 (en) 2015-06-05 2016-06-02 Time-series data storage and processing database system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/614,388 Continuation US10585907B2 (en) 2015-06-05 2017-06-05 Time-series data storage and processing database system

Publications (2)

Publication Number Publication Date
US20160357828A1 US20160357828A1 (en) 2016-12-08
US9672257B2 true US9672257B2 (en) 2017-06-06

Family

ID=56116283

Family Applications (4)

Application Number Title Priority Date Filing Date
US15/171,494 Active US9672257B2 (en) 2015-06-05 2016-06-02 Time-series data storage and processing database system
US15/614,388 Active 2036-09-25 US10585907B2 (en) 2015-06-05 2017-06-05 Time-series data storage and processing database system
US16/805,257 Active 2037-05-26 US11687543B2 (en) 2015-06-05 2020-02-28 Time-series data storage and processing database system
US18/316,894 Pending US20230359638A1 (en) 2015-06-05 2023-05-12 Time-series data storage and processing database system

Family Applications After (3)

Application Number Title Priority Date Filing Date
US15/614,388 Active 2036-09-25 US10585907B2 (en) 2015-06-05 2017-06-05 Time-series data storage and processing database system
US16/805,257 Active 2037-05-26 US11687543B2 (en) 2015-06-05 2020-02-28 Time-series data storage and processing database system
US18/316,894 Pending US20230359638A1 (en) 2015-06-05 2023-05-12 Time-series data storage and processing database system

Country Status (2)

Country Link
US (4) US9672257B2 (en)
EP (1) EP3101560B1 (en)

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10216695B1 (en) 2017-09-21 2019-02-26 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US10417224B2 (en) 2017-08-14 2019-09-17 Palantir Technologies Inc. Time series database processing system
US10585907B2 (en) 2015-06-05 2020-03-10 Palantir Technologies Inc. Time-series data storage and processing database system
US10664444B2 (en) 2016-08-02 2020-05-26 Palantir Technologies Inc. Time-series data storage and processing database system
US10997137B1 (en) 2018-12-13 2021-05-04 Amazon Technologies, Inc. Two-dimensional partition splitting in a time-series database
US11016986B2 (en) 2017-12-04 2021-05-25 Palantir Technologies Inc. Query-based time-series data display and processing system
US11068537B1 (en) * 2018-12-11 2021-07-20 Amazon Technologies, Inc. Partition segmenting in a distributed time-series database
US11216487B1 (en) 2019-09-23 2022-01-04 Amazon Technologies, Inc. Schema-based spatial partitioning in a time-series database
US11250019B1 (en) 2019-02-27 2022-02-15 Amazon Technologies, Inc. Eventually consistent replication in a time-series database
US11256719B1 (en) 2019-06-27 2022-02-22 Amazon Technologies, Inc. Ingestion partition auto-scaling in a time-series database
US11263270B1 (en) 2020-03-26 2022-03-01 Amazon Technologies, Inc. Heat balancing in a distributed time-series database
US20220067021A1 (en) * 2020-09-01 2022-03-03 Palantir Technologies Inc. Data insights
US11281726B2 (en) 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US11314738B2 (en) 2014-12-23 2022-04-26 Palantir Technologies Inc. Searching charts
US11366598B1 (en) 2020-03-26 2022-06-21 Amazon Technologies, Inc. Dynamic lease assignments in a time-series database
US11379453B2 (en) 2017-06-02 2022-07-05 Palantir Technologies Inc. Systems and methods for retrieving and processing data
US11397752B1 (en) * 2019-06-27 2022-07-26 Amazon Technologies, Inc. In-memory ingestion for highly available distributed time-series databases
US11409771B1 (en) 2020-03-26 2022-08-09 Amazon Technologies, Inc. Splitting partitions across clusters in a time-series database
US11409725B1 (en) 2019-02-04 2022-08-09 Amazon Technologies, Inc. Multi-tenant partitioning in a time-series database
US11461347B1 (en) 2021-06-16 2022-10-04 Amazon Technologies, Inc. Adaptive querying of time-series data over tiered storage
US11513854B1 (en) * 2019-06-26 2022-11-29 Amazon Technologies, Inc. Resource usage restrictions in a time-series database
US11573981B1 (en) 2019-09-23 2023-02-07 Amazon Technologies, Inc. Auto-scaling using temporal splits in a time-series database
US11599516B1 (en) 2020-06-24 2023-03-07 Amazon Technologies, Inc. Scalable metadata index for a time-series database
US11853317B1 (en) 2019-03-18 2023-12-26 Amazon Technologies, Inc. Creating replicas using queries to a time series database
US11934409B2 (en) 2018-11-23 2024-03-19 Amazon Technologies, Inc. Continuous functions in a time-series database
US11941014B1 (en) 2021-06-16 2024-03-26 Amazon Technologies, Inc. Versioned metadata management for a time-series database

Families Citing this family (127)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9411327B2 (en) 2012-08-27 2016-08-09 Johnson Controls Technology Company Systems and methods for classifying data in building automation systems
US10534326B2 (en) 2015-10-21 2020-01-14 Johnson Controls Technology Company Building automation system with integrated building information model
US11947785B2 (en) 2016-01-22 2024-04-02 Johnson Controls Technology Company Building system with a building graph
US11268732B2 (en) 2016-01-22 2022-03-08 Johnson Controls Technology Company Building energy management system with energy analytics
CN109154802A (en) 2016-03-31 2019-01-04 江森自控科技公司 HVAC device registration in distributed building management system
US11774920B2 (en) 2016-05-04 2023-10-03 Johnson Controls Technology Company Building system with user presentation composition based on building context
US10417451B2 (en) 2017-09-27 2019-09-17 Johnson Controls Technology Company Building system with smart entity personal identifying information (PII) masking
US10505756B2 (en) 2017-02-10 2019-12-10 Johnson Controls Technology Company Building management system with space graphs
US11860940B1 (en) 2016-09-26 2024-01-02 Splunk Inc. Identifying buckets for query execution using a catalog of buckets
US11620336B1 (en) 2016-09-26 2023-04-04 Splunk Inc. Managing and storing buckets to a remote shared storage system based on a collective bucket size
US10795884B2 (en) 2016-09-26 2020-10-06 Splunk Inc. Dynamic resource allocation for common storage query
US10984044B1 (en) * 2016-09-26 2021-04-20 Splunk Inc. Identifying buckets for query execution using a catalog of buckets stored in a remote shared storage system
US11003714B1 (en) 2016-09-26 2021-05-11 Splunk Inc. Search node and bucket identification using a search node catalog and a data store catalog
US11599541B2 (en) 2016-09-26 2023-03-07 Splunk Inc. Determining records generated by a processing task of a query
US11281706B2 (en) 2016-09-26 2022-03-22 Splunk Inc. Multi-layer partition allocation for query execution
US11222066B1 (en) 2016-09-26 2022-01-11 Splunk Inc. Processing data using containerized state-free indexing nodes in a containerized scalable environment
US11593377B2 (en) 2016-09-26 2023-02-28 Splunk Inc. Assigning processing tasks in a data intake and query system
US11562023B1 (en) 2016-09-26 2023-01-24 Splunk Inc. Merging buckets in a data intake and query system
US11580107B2 (en) 2016-09-26 2023-02-14 Splunk Inc. Bucket data distribution for exporting data to worker nodes
US11416528B2 (en) 2016-09-26 2022-08-16 Splunk Inc. Query acceleration data store
US11232100B2 (en) 2016-09-26 2022-01-25 Splunk Inc. Resource allocation for multiple datasets
US10726009B2 (en) 2016-09-26 2020-07-28 Splunk Inc. Query processing using query-resource usage and node utilization data
US11586627B2 (en) 2016-09-26 2023-02-21 Splunk Inc. Partitioning and reducing records at ingest of a worker node
US11615104B2 (en) 2016-09-26 2023-03-28 Splunk Inc. Subquery generation based on a data ingest estimate of an external data system
US11294941B1 (en) 2016-09-26 2022-04-05 Splunk Inc. Message-based data ingestion to a data intake and query system
US11321321B2 (en) 2016-09-26 2022-05-03 Splunk Inc. Record expansion and reduction based on a processing task in a data intake and query system
US10353965B2 (en) 2016-09-26 2019-07-16 Splunk Inc. Data fabric service system architecture
US11550847B1 (en) 2016-09-26 2023-01-10 Splunk Inc. Hashing bucket identifiers to identify search nodes for efficient query execution
US11250056B1 (en) 2016-09-26 2022-02-15 Splunk Inc. Updating a location marker of an ingestion buffer based on storing buckets in a shared storage system
US11243963B2 (en) 2016-09-26 2022-02-08 Splunk Inc. Distributing partial results to worker nodes from an external data system
US11023463B2 (en) 2016-09-26 2021-06-01 Splunk Inc. Converting and modifying a subquery for an external data system
US11874691B1 (en) 2016-09-26 2024-01-16 Splunk Inc. Managing efficient query execution including mapping of buckets to search nodes
US10977260B2 (en) 2016-09-26 2021-04-13 Splunk Inc. Task distribution in an execution node of a distributed execution environment
US10956415B2 (en) 2016-09-26 2021-03-23 Splunk Inc. Generating a subquery for an external data system using a configuration file
US11269939B1 (en) 2016-09-26 2022-03-08 Splunk Inc. Iterative message-based data processing including streaming analytics
US11126632B2 (en) 2016-09-26 2021-09-21 Splunk Inc. Subquery generation based on search configuration data from an external data system
US20180089324A1 (en) 2016-09-26 2018-03-29 Splunk Inc. Dynamic resource allocation for real-time search
US11461334B2 (en) 2016-09-26 2022-10-04 Splunk Inc. Data conditioning for dataset destination
US10776355B1 (en) * 2016-09-26 2020-09-15 Splunk Inc. Managing, storing, and caching query results and partial query results for combination with additional query results
US11663227B2 (en) 2016-09-26 2023-05-30 Splunk Inc. Generating a subquery for a distinct data intake and query system
US11106734B1 (en) 2016-09-26 2021-08-31 Splunk Inc. Query execution using containerized state-free search nodes in a containerized scalable environment
US11604795B2 (en) 2016-09-26 2023-03-14 Splunk Inc. Distributing partial results from an external data system between worker nodes
US11567993B1 (en) 2016-09-26 2023-01-31 Splunk Inc. Copying buckets from a remote shared storage system to memory associated with a search node for query execution
US11163758B2 (en) 2016-09-26 2021-11-02 Splunk Inc. External dataset capability compensation
US11442935B2 (en) 2016-09-26 2022-09-13 Splunk Inc. Determining a record generation estimate of a processing task
US11314753B2 (en) 2016-09-26 2022-04-26 Splunk Inc. Execution of a query received from a data intake and query system
US10684033B2 (en) 2017-01-06 2020-06-16 Johnson Controls Technology Company HVAC system with automated device pairing
US11900287B2 (en) 2017-05-25 2024-02-13 Johnson Controls Tyco IP Holdings LLP Model predictive maintenance system with budgetary constraints
US11360447B2 (en) 2017-02-10 2022-06-14 Johnson Controls Technology Company Building smart entity system with agent based communication and control
US10095756B2 (en) 2017-02-10 2018-10-09 Johnson Controls Technology Company Building management system with declarative views of timeseries data
US11307538B2 (en) 2017-02-10 2022-04-19 Johnson Controls Technology Company Web services platform with cloud-eased feedback control
US10515098B2 (en) 2017-02-10 2019-12-24 Johnson Controls Technology Company Building management smart entity creation and maintenance using time series data
US10452043B2 (en) * 2017-02-10 2019-10-22 Johnson Controls Technology Company Building management system with nested stream generation
US11280509B2 (en) 2017-07-17 2022-03-22 Johnson Controls Technology Company Systems and methods for agent based building simulation for optimal control
US10854194B2 (en) 2017-02-10 2020-12-01 Johnson Controls Technology Company Building system with digital twin based data ingestion and processing
US11764991B2 (en) 2017-02-10 2023-09-19 Johnson Controls Technology Company Building management system with identity management
US20190361412A1 (en) 2017-02-10 2019-11-28 Johnson Controls Technology Company Building smart entity system with agent based data ingestion and entity creation using time series data
JP2018160057A (en) * 2017-03-22 2018-10-11 株式会社東芝 Information processing system, information processing method, and program
WO2018175912A1 (en) 2017-03-24 2018-09-27 Johnson Controls Technology Company Building management system with dynamic channel communication
US10558346B2 (en) * 2017-04-10 2020-02-11 Palantir Technologies Inc. Alerting system and method
US10817524B2 (en) * 2017-04-10 2020-10-27 Servicenow, Inc. Systems and methods for querying time series data
US10347113B1 (en) * 2017-04-10 2019-07-09 Palantir Technologies Inc. Alerting system and method
US11327737B2 (en) 2017-04-21 2022-05-10 Johnson Controls Tyco IP Holdings LLP Building management system with cloud management of gateway configurations
CN110832514A (en) * 2017-04-22 2020-02-21 潘吉瓦公司 Recording surveys of nowcasting abstractions from individual customs transactions
US10788229B2 (en) 2017-05-10 2020-09-29 Johnson Controls Technology Company Building management system with a distributed blockchain database
US11022947B2 (en) 2017-06-07 2021-06-01 Johnson Controls Technology Company Building energy optimization system with economic load demand response (ELDR) optimization and ELDR user interfaces
WO2018232147A1 (en) 2017-06-15 2018-12-20 Johnson Controls Technology Company Building management system with artificial intelligence for unified agent based control of building subsystems
US11733663B2 (en) 2017-07-21 2023-08-22 Johnson Controls Tyco IP Holdings LLP Building management system with dynamic work order generation with adaptive diagnostic task details
US20190034066A1 (en) 2017-07-27 2019-01-31 Johnson Controls Technology Company Building management system with central plantroom dashboards
US11921672B2 (en) 2017-07-31 2024-03-05 Splunk Inc. Query execution at a remote heterogeneous data store of a data fabric service
US11093548B1 (en) * 2017-08-29 2021-08-17 Vmware, Inc. Dynamic graph for time series data
US11151137B2 (en) 2017-09-25 2021-10-19 Splunk Inc. Multi-partition operation in combination operations
US10896182B2 (en) 2017-09-25 2021-01-19 Splunk Inc. Multi-partitioning determination for combination operations
US11768826B2 (en) 2017-09-27 2023-09-26 Johnson Controls Tyco IP Holdings LLP Web services for creation and maintenance of smart entities for connected devices
US10962945B2 (en) 2017-09-27 2021-03-30 Johnson Controls Technology Company Building management system with integration of data into smart entities
US11258683B2 (en) * 2017-09-27 2022-02-22 Johnson Controls Tyco IP Holdings LLP Web services platform with nested stream generation
US10565844B2 (en) 2017-09-27 2020-02-18 Johnson Controls Technology Company Building risk analysis system with global risk dashboard
US11314788B2 (en) 2017-09-27 2022-04-26 Johnson Controls Tyco IP Holdings LLP Smart entity management for building management systems
US11281169B2 (en) 2017-11-15 2022-03-22 Johnson Controls Tyco IP Holdings LLP Building management system with point virtualization for online meters
US10809682B2 (en) 2017-11-15 2020-10-20 Johnson Controls Technology Company Building management system with optimized processing of building system data
US11127235B2 (en) 2017-11-22 2021-09-21 Johnson Controls Tyco IP Holdings LLP Building campus with integrated smart environment
US11954713B2 (en) 2018-03-13 2024-04-09 Johnson Controls Tyco IP Holdings LLP Variable refrigerant flow system with electricity consumption apportionment
US10740310B2 (en) * 2018-03-19 2020-08-11 Oracle International Corporation Intelligent preprocessing of multi-dimensional time-series data
US10902654B2 (en) * 2018-04-20 2021-01-26 Palantir Technologies Inc. Object time series system
US10895972B1 (en) 2018-04-20 2021-01-19 Palantir Technologies Inc. Object time series system and investigation graphical user interface
US11334543B1 (en) 2018-04-30 2022-05-17 Splunk Inc. Scalable bucket merging for a data intake and query system
US10671624B2 (en) * 2018-06-13 2020-06-02 The Mathworks, Inc. Parallel filtering of large time series of data for filters having recursive dependencies
US10936434B2 (en) * 2018-07-13 2021-03-02 EMC IP Holding Company LLC Backup and tiered policy coordination in time series databases
US11074244B1 (en) * 2018-09-14 2021-07-27 Amazon Technologies, Inc. Transactional range delete in distributed databases
US11016648B2 (en) 2018-10-30 2021-05-25 Johnson Controls Technology Company Systems and methods for entity visualization and management with an entity node editor
KR102443028B1 (en) * 2018-11-06 2022-09-14 삼성전자주식회사 Semiconductor package
US11580164B1 (en) * 2018-11-09 2023-02-14 Palantir Technologies Inc. Ontology-based time series visualization and analysis
US20200162280A1 (en) * 2018-11-19 2020-05-21 Johnson Controls Technology Company Building system with performance identification through equipment exercising and entity relationships
US20200159376A1 (en) 2018-11-19 2020-05-21 Johnson Controls Technology Company Building system with semantic modeling based user interface graphics and visualization generation
US20200167355A1 (en) * 2018-11-23 2020-05-28 Amazon Technologies, Inc. Edge processing in a distributed time-series database
US11769117B2 (en) 2019-01-18 2023-09-26 Johnson Controls Tyco IP Holdings LLP Building automation system with fault analysis and component procurement
US10788798B2 (en) 2019-01-28 2020-09-29 Johnson Controls Technology Company Building management system with hybrid edge-cloud processing
GB2581140A (en) * 2019-01-31 2020-08-12 Ernst & Young Gmbh System and method of obtaining audit evidence
WO2020220216A1 (en) 2019-04-29 2020-11-05 Splunk Inc. Search time estimate in data intake and query system
US11715051B1 (en) 2019-04-30 2023-08-01 Splunk Inc. Service provider instance recommendations using machine-learned classifications and reconciliation
CN112084226A (en) * 2019-06-13 2020-12-15 北京京东尚科信息技术有限公司 Data processing method, system, device and computer readable storage medium
CN112084227A (en) * 2019-06-14 2020-12-15 核桃运算股份有限公司 Data query device, method and computer storage medium thereof
CN110609813B (en) * 2019-08-14 2023-01-31 北京华电天仁电力控制技术有限公司 Data storage system and method
US11494380B2 (en) 2019-10-18 2022-11-08 Splunk Inc. Management of distributed computing framework components in a data fabric service system
US20210200807A1 (en) 2019-12-31 2021-07-01 Johnson Controls Technology Company Building data platform with a graph change feed
US11894944B2 (en) 2019-12-31 2024-02-06 Johnson Controls Tyco IP Holdings LLP Building data platform with an enrichment loop
US11922222B1 (en) 2020-01-30 2024-03-05 Splunk Inc. Generating a modified component for a data intake and query system using an isolated execution environment image
US11537386B2 (en) 2020-04-06 2022-12-27 Johnson Controls Tyco IP Holdings LLP Building system with dynamic configuration of network resources for 5G networks
CN113535770A (en) * 2020-04-22 2021-10-22 杭州海康威视数字技术股份有限公司 Data query method and device
US20210349867A1 (en) * 2020-05-08 2021-11-11 Worthy Technology LLC System and methods for receiving, processing and storing rich time series data
US11874809B2 (en) 2020-06-08 2024-01-16 Johnson Controls Tyco IP Holdings LLP Building system with naming schema encoding entity type and entity relationships
CN113868267A (en) * 2020-06-30 2021-12-31 华为技术有限公司 Method for injecting time sequence data, method for inquiring time sequence data and database system
US11954154B2 (en) 2020-09-30 2024-04-09 Johnson Controls Tyco IP Holdings LLP Building management system with semantic model integration
US11397773B2 (en) 2020-09-30 2022-07-26 Johnson Controls Tyco IP Holdings LLP Building management system with semantic model integration
US11080264B1 (en) * 2020-10-02 2021-08-03 ActionIQ, Inc. Mutable data ingestion and storage
US11704313B1 (en) 2020-10-19 2023-07-18 Splunk Inc. Parallel branch operation using intermediary nodes
US20220138362A1 (en) 2020-10-30 2022-05-05 Johnson Controls Technology Company Building management system with configuration by building model augmentation
CN113779102B (en) * 2020-11-04 2022-11-08 北京沃东天骏信息技术有限公司 Data feature generation method and device, electronic equipment and computer readable medium
CN113078908B (en) * 2021-03-10 2022-03-25 杭州又拍云科技有限公司 Simple encoding and decoding method suitable for time sequence database
EP4309013A1 (en) 2021-03-17 2024-01-24 Johnson Controls Tyco IP Holdings LLP Systems and methods for determining equipment energy waste
US11769066B2 (en) 2021-11-17 2023-09-26 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin triggers and actions
US11899723B2 (en) 2021-06-22 2024-02-13 Johnson Controls Tyco IP Holdings LLP Building data platform with context based twin function processing
US11796974B2 (en) 2021-11-16 2023-10-24 Johnson Controls Tyco IP Holdings LLP Building data platform with schema extensibility for properties and tags of a digital twin
US11934966B2 (en) 2021-11-17 2024-03-19 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin inferences
US11704311B2 (en) 2021-11-24 2023-07-18 Johnson Controls Tyco IP Holdings LLP Building data platform with a distributed digital twin
US11714930B2 (en) 2021-11-29 2023-08-01 Johnson Controls Tyco IP Holdings LLP Building data platform with digital twin based inferences and predictions for a graphical building model
CN114748875B (en) * 2022-05-20 2023-03-24 一点灵犀信息技术(广州)有限公司 Data saving method, device, equipment, storage medium and program product

Citations (140)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0652513A1 (en) 1993-11-04 1995-05-10 International Business Machines Corporation Task scheduler for a multiprocessor system
US5532717A (en) 1994-05-19 1996-07-02 The United States Of America As Represented By The Secretary Of The Navy Method of displaying time series data on finite resolution display device
US5724575A (en) 1994-02-25 1998-03-03 Actamed Corp. Method and system for object-based relational distributed databases
US5872973A (en) 1995-10-26 1999-02-16 Viewsoft, Inc. Method for managing dynamic relations between objects in dynamic object-oriented languages
US5897636A (en) 1996-07-11 1999-04-27 Tandem Corporation Incorporated Distributed object computer system with hierarchical name space versioning
US6073129A (en) 1997-12-29 2000-06-06 Bull Hn Information Systems Inc. Method and apparatus for improving the performance of a database management system through a central cache mechanism
US6094653A (en) 1996-12-25 2000-07-25 Nec Corporation Document classification method and apparatus therefor
US6161098A (en) 1998-09-14 2000-12-12 Folio (Fn), Inc. Method and apparatus for enabling small investors with a portfolio of securities to manage taxable events within the portfolio
US6243717B1 (en) 1998-09-01 2001-06-05 Camstar Systems, Inc. System and method for implementing revision management of linked data entities and user dependent terminology
US6304873B1 (en) 1999-07-06 2001-10-16 Compaq Computer Corporation System and method for performing database operations and for skipping over tuples locked in an incompatible mode
US20010051949A1 (en) 1997-05-09 2001-12-13 Carey Michael J. System, method, and program for object building in queries over object views
US20010056522A1 (en) 1998-06-29 2001-12-27 Raju Satyanarayana Methods and apparatus for memory allocation for object instances in an object-oriented software environment
US6366933B1 (en) 1995-10-27 2002-04-02 At&T Corp. Method and apparatus for tracking and viewing changes on the web
US6418438B1 (en) 1998-12-16 2002-07-09 Microsoft Corporation Dynamic scalable lock mechanism
US20020091694A1 (en) 2000-11-08 2002-07-11 Namik Hrle Method and system for reduced lock contention in SQL transactions
US6549752B2 (en) 2001-01-29 2003-04-15 Fujitsu Limited Apparatus and method accumulating cases to be learned
US6560620B1 (en) 1999-08-03 2003-05-06 Aplix Research, Inc. Hierarchical document comparison system and method
US6574635B2 (en) 1999-03-03 2003-06-03 Siebel Systems, Inc. Application instantiation based upon attributes and values stored in a meta data repository, including tiering of application layers objects and components
US20030105759A1 (en) 2001-10-26 2003-06-05 United Services Automobile Association (Usaa) System and method of providing electronic access to one or more documents
US20030115481A1 (en) 2001-12-18 2003-06-19 Baird Roger T. Controlling the distribution of information
US20030130993A1 (en) 2001-08-08 2003-07-10 Quiver, Inc. Document categorization engine
US20030212718A1 (en) 2002-05-10 2003-11-13 Lsi Logic Corporation Revision control for database of evolved design
US6745382B1 (en) 2000-04-13 2004-06-01 Worldcom, Inc. CORBA wrappers for rules automation technology
US20040111410A1 (en) 2002-10-14 2004-06-10 Burgoon David Alford Information reservoir
US20040117345A1 (en) 2003-08-01 2004-06-17 Oracle International Corporation Ownership reassignment in a shared-nothing database system
US20040117387A1 (en) 2000-02-25 2004-06-17 Vincent Civetta Database sizing and diagnostic utility
US20040148301A1 (en) 2003-01-24 2004-07-29 Mckay Christopher W.T. Compressed data structure for a database
US20050097441A1 (en) 2003-10-31 2005-05-05 Herbach Jonathan D. Distributed document version control
US20050108231A1 (en) 2003-11-17 2005-05-19 Terrascale Technologies Inc. Method for retrieving and modifying data elements on a shared medium
US20050114763A1 (en) 2001-03-30 2005-05-26 Kabushiki Kaisha Toshiba Apparatus, method, and program for retrieving structured documents
US6976210B1 (en) 1999-08-31 2005-12-13 Lucent Technologies Inc. Method and apparatus for web-site-independent personalization from multiple sites having user-determined extraction functionality
US6980984B1 (en) 2001-05-16 2005-12-27 Kanisa, Inc. Content provider systems and methods using structured data
US20050289524A1 (en) 2004-06-22 2005-12-29 Mcginnes Simon Systems and methods for software based on business concepts
US20060074881A1 (en) 2004-10-02 2006-04-06 Adventnet, Inc. Structure independent searching in disparate databases
US20060080316A1 (en) 2004-10-08 2006-04-13 Meridio Ltd Multiple indexing of an electronic document to selectively permit access to the content and metadata thereof
US20060095521A1 (en) 2004-11-04 2006-05-04 Seth Patinkin Method, apparatus, and system for clustering and classification
US20060106847A1 (en) 2004-05-04 2006-05-18 Boston Consulting Group, Inc. Method and apparatus for selecting, analyzing, and visualizing related database records as a network
US20060116991A1 (en) 2004-10-13 2006-06-01 Ciphergrid Limited Remote database technique
US7058648B1 (en) 2000-12-01 2006-06-06 Oracle International Corporation Hierarchy-based secured document repository
US20060161558A1 (en) 2005-01-14 2006-07-20 Microsoft Corporation Schema conformance for database servers
US7111231B1 (en) 1999-02-24 2006-09-19 Intellisync Corporation System and methodology for dynamic application environment employing runtime execution templates
US20060218206A1 (en) 2002-08-12 2006-09-28 International Business Machines Corporation Method, System, and Program for Merging Log Entries From Multiple Recovery Log Files
US20060218491A1 (en) 2005-03-25 2006-09-28 International Business Machines Corporation System, method and program product for community review of documents
US20060218405A1 (en) 2005-03-23 2006-09-28 Business Objects, S.A. Apparatus and method for dynamically auditing data migration to produce metadata
US20060242630A1 (en) 2005-03-09 2006-10-26 Maxis Co., Ltd. Process for preparing design procedure document and apparatus for the same
US20060253502A1 (en) 2005-05-06 2006-11-09 Microsoft Corporation Maintenance of link level consistency between database and file system
US20060265397A1 (en) 2001-03-06 2006-11-23 Knowledge Vector, Inc. Methods, systems, and computer program products for extensible, profile-and context-based information correlation, routing and distribution
US20070050429A1 (en) 2005-08-26 2007-03-01 Centric Software, Inc. Time-range locking for temporal database and branched-and-temporal databases
US20070061487A1 (en) 2005-02-01 2007-03-15 Moore James F Systems and methods for use of structured and unstructured distributed data
US7194680B1 (en) 1999-12-07 2007-03-20 Adobe Systems Incorporated Formatting content by example
US20070143253A1 (en) 2005-12-19 2007-06-21 Pekka Kostamaa Database system
US20070185850A1 (en) 1999-11-10 2007-08-09 Walters Edward J Apparatus and Method for Displaying Records Responsive to a Database Query
US20070233756A1 (en) 2005-02-07 2007-10-04 D Souza Roy P Retro-fitting synthetic full copies of data
US20070271317A1 (en) 2004-08-16 2007-11-22 Beinsync Ltd. System and Method for the Synchronization of Data Across Multiple Computing Devices
US20080015970A1 (en) 2006-04-28 2008-01-17 Townsend Analytics, Ltd. Order Management System and Method for Electronic Securities Trading
WO2008043082A2 (en) 2006-10-05 2008-04-10 Splunk Inc. Time series search engine
US20080104149A1 (en) 2006-11-01 2008-05-01 Ephraim Meriwether Vishniac Managing Storage of Individually Accessible Data Units
US20080104060A1 (en) 2006-10-31 2008-05-01 Business Objects, S.A. Apparatus and method for assessing relevant categories and measures for use in data analyses
US20080195672A1 (en) 2002-05-09 2008-08-14 International Business Machines Corporation System and program product for sequential coordination of external database application events with asynchronous internal database events
US20080201339A1 (en) 2007-02-21 2008-08-21 Mcgrew Robert J Providing unique views of data based on changes or rules
US20080270316A1 (en) 2007-02-28 2008-10-30 Aaron Guidotti Information, document, and compliance management for financial professionals, clients, and supervisors
US7461158B2 (en) 2002-08-07 2008-12-02 Intelliden, Inc. System and method for controlling access rights to network resources
US20080301378A1 (en) 2007-06-01 2008-12-04 Microsoft Corporation Timestamp based transactional memory
US20090031247A1 (en) 2007-07-26 2009-01-29 Walter Wolfgang E Active Tiled User Interface
US20090106308A1 (en) 2007-10-18 2009-04-23 Christopher Killian Complexity estimation of data objects
US20090164387A1 (en) 2007-04-17 2009-06-25 Semandex Networks Inc. Systems and methods for providing semantically enhanced financial information
US20090177962A1 (en) 2008-01-04 2009-07-09 Microsoft Corporation Intelligently representing files in a view
US20090254971A1 (en) 1999-10-27 2009-10-08 Pinpoint, Incorporated Secure data interchange
US20090271435A1 (en) 2008-04-24 2009-10-29 Katsushi Yako Data management method, data management program, and data management device
US20090313223A1 (en) 2008-06-17 2009-12-17 Tekla Corporation Data retrieval
US20090313311A1 (en) 2008-06-12 2009-12-17 Gravic, Inc. Mixed mode synchronous and asynchronous replication system
US20100036831A1 (en) 2008-08-08 2010-02-11 Oracle International Corporation Generating continuous query notifications
US20100070489A1 (en) 2008-09-15 2010-03-18 Palantir Technologies, Inc. Filter chains with associated views for exploring large data sets
US20100076939A1 (en) 2008-09-05 2010-03-25 Hitachi, Ltd. Information processing system, data update method and data update program
US20100082541A1 (en) 2005-12-19 2010-04-01 Commvault Systems, Inc. Systems and methods for performing replication copy storage operations
US20100114887A1 (en) 2008-10-17 2010-05-06 Google Inc. Textual Disambiguation Using Social Connections
US20100114831A1 (en) 2008-10-30 2010-05-06 Gilbert Gary M Building a Synchronized Target Database
US20100114817A1 (en) 2008-10-30 2010-05-06 Broeder Sean L Replication of operations on objects distributed in a storage system
US7725530B2 (en) 2005-12-12 2010-05-25 Google Inc. Proxy server collection of data for module incorporation into a container document
US7730082B2 (en) 2005-12-12 2010-06-01 Google Inc. Remote module incorporation into a container document
US7730109B2 (en) 2005-12-12 2010-06-01 Google, Inc. Message catalogs for remote modules
US20100138842A1 (en) 2008-12-03 2010-06-03 Soren Balko Multithreading And Concurrency Control For A Rule-Based Transaction Engine
US20100145909A1 (en) 2008-12-10 2010-06-10 Commvault Systems, Inc. Systems and methods for managing replicated database data
US20100161688A1 (en) 2008-12-22 2010-06-24 Google Inc. Asynchronous distributed garbage collection for replicated storage clusters
US20100161565A1 (en) 2008-12-18 2010-06-24 Electronics And Telecommunications Research Institute Cluster data management system and method for data restoration using shared redo log in cluster data management system
US7761407B1 (en) 2006-10-10 2010-07-20 Medallia, Inc. Use of primary and secondary indexes to facilitate aggregation of records of an OLAP data cube
US20100191884A1 (en) 2008-06-12 2010-07-29 Gravic, Inc. Method for replicating locks in a data replication engine
US20100211618A1 (en) 2009-02-17 2010-08-19 Agilewaves, Inc. Efficient storage of data allowing for multiple level granularity retrieval
US20100211550A1 (en) 2009-02-17 2010-08-19 Amadeus S.A.S. Method allowing validation in a production database of new entered data prior to their release
US20100235606A1 (en) 2009-03-11 2010-09-16 Oracle America, Inc. Composite hash and list partitioning of database tables
US7814084B2 (en) 2007-03-21 2010-10-12 Schmap Inc. Contact information capture and link redirection
US20100283787A1 (en) 2006-03-03 2010-11-11 Donya Labs Ab Creation and rendering of hierarchical digital multimedia data
US20100325581A1 (en) 2006-11-10 2010-12-23 Microsoft Corporation Data object linking and browsing tool
US20110029498A1 (en) 2009-07-10 2011-02-03 Xkoto, Inc. System and Method for Subunit Operations in a Database
US20110047540A1 (en) 2009-08-24 2011-02-24 Embarcadero Technologies Inc. System and Methodology for Automating Delivery, Licensing, and Availability of Software Products
US7962495B2 (en) 2006-11-20 2011-06-14 Palantir Technologies, Inc. Creating data in a data store using a dynamic ontology
US20110153592A1 (en) 2002-10-16 2011-06-23 Ita Software, Inc. Dividing A Travel Query Into Sub-Queries
US20110173619A1 (en) 2005-10-11 2011-07-14 Eric Ian Fish Apparatus and method for optimized application of batched data to a database
US7984374B2 (en) 1999-07-23 2011-07-19 Adobe Systems Incorporated Computer generation of documents using layout elements and content elements
US20110184813A1 (en) 2009-09-14 2011-07-28 Cbs Interactive, Inc. Targeting offers to users of a web site
US20110218978A1 (en) * 2010-02-22 2011-09-08 Vertica Systems, Inc. Operating on time sequences of data
US20110258158A1 (en) 2010-04-14 2011-10-20 Bank Of America Corporation Data Services Framework Workflow Processing
US20110258242A1 (en) 2010-04-16 2011-10-20 Salesforce.Com, Inc. Methods and systems for appending data to large data volumes in a multi-tenant store
US20110270812A1 (en) 2006-10-11 2011-11-03 Ruby Jonathan P Extended Transactions
US20120013684A1 (en) 2009-09-30 2012-01-19 Videojet Technologies Inc. Thermal ink jet ink compostion
US8126848B2 (en) 2006-12-07 2012-02-28 Robert Edward Wagner Automated method for identifying and repairing logical data discrepancies between database replicas in a database cluster
WO2012025915A1 (en) 2010-07-21 2012-03-01 Sqream Technologies Ltd A system and method for the parallel execution of database queries over cpus and multi core processors
US20120072825A1 (en) 2010-09-20 2012-03-22 Research In Motion Limited Methods and systems for identifying content elements
US20120123989A1 (en) 2010-11-15 2012-05-17 Business Objects Software Limited Dashboard evaluator
US20120124179A1 (en) 2010-11-12 2012-05-17 Realnetworks, Inc. Traffic management in adaptive streaming protocols
US8185819B2 (en) 2005-12-12 2012-05-22 Google Inc. Module specification for a module to be incorporated into a container document
US20120150791A1 (en) 2008-06-02 2012-06-14 Ian Alexander Willson Methods and systems for loading data into a temporal data warehouse
US20120159307A1 (en) 2010-12-17 2012-06-21 Microsoft Corporation Rendering source regions into target regions of web pages
US20120330908A1 (en) 2011-06-23 2012-12-27 Geoffrey Stowe System and method for investigating large amounts of data
EP2555126A2 (en) 2011-08-02 2013-02-06 Palantir Technologies, Inc. System and method for accessing rich objects via spreadsheets
US20130060742A1 (en) 2011-09-02 2013-03-07 Allen Chang Multi-row transactions
US20130097130A1 (en) 2011-10-17 2013-04-18 Yahoo! Inc. Method and system for resolving data inconsistency
US20130151388A1 (en) 2011-12-12 2013-06-13 Visa International Service Association Systems and methods to identify affluence levels of accounts
US20130304770A1 (en) 2012-05-10 2013-11-14 Siemens Aktiengesellschaft Method and system for storing data in a database
US20140040276A1 (en) * 2012-07-31 2014-02-06 International Business Machines Corporation Method and apparatus for processing time series data
US8676857B1 (en) 2012-08-23 2014-03-18 International Business Machines Corporation Context-based search for a data store related to a graph node
US20140095543A1 (en) * 2012-09-28 2014-04-03 Oracle International Corporation Parameterized continuous query templates
US20140149272A1 (en) 2012-08-17 2014-05-29 Trueex Group Llc Interoffice bank offered rate financial product and implementation
US20140181833A1 (en) 2012-12-21 2014-06-26 International Business Machines Corporation Processor provisioning by a middleware system for a plurality of logical processor partitions
US20140324876A1 (en) 2013-04-25 2014-10-30 International Business Machines Corporation Management of a database system
US20150039886A1 (en) 2013-08-01 2015-02-05 Bitglass, Inc. Secure application access system
US20150089353A1 (en) 2013-09-24 2015-03-26 Chad Folkening Platform for building virtual entities using equity systems
US9009827B1 (en) 2014-02-20 2015-04-14 Palantir Technologies Inc. Security sharing system
EP2863326A1 (en) 2013-10-18 2015-04-22 Palantir Technologies, Inc. Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
US9043696B1 (en) 2014-01-03 2015-05-26 Palantir Technologies Inc. Systems and methods for visual definition of data associations
US9092482B2 (en) 2013-03-14 2015-07-28 Palantir Technologies, Inc. Fair scheduling for mixed-query loads
US20150213134A1 (en) 2012-10-11 2015-07-30 Tencent Technology (Shenzhen) Company Limited Data query method and system and storage medium
US20150212663A1 (en) 2014-01-30 2015-07-30 Splunk Inc. Panel templates for visualization of data within an interactive dashboard
US20150213043A1 (en) 2012-07-13 2015-07-30 Hitachi Solutions, Ltd. Retrieval device, method for controlling retrieval device, and recording medium
US20150242397A1 (en) 2013-06-19 2015-08-27 Tencent Technology (Shenzhen) Company Limited Method, server and system for managing content in content delivery network
US20150278325A1 (en) * 2014-03-28 2015-10-01 Hitachi High-Technologies Corporation Information processing apparatus, information processing method, information system and medium
US20150341467A1 (en) 2014-05-26 2015-11-26 Samsung Electronics Co., Ltd Method and of improving http performance on communication network and apparatus adapted thereto
US9230280B1 (en) 2013-03-15 2016-01-05 Palantir Technologies Inc. Clustering data based on indications of financial malfeasance
US20160062555A1 (en) 2014-09-03 2016-03-03 Palantir Technologies Inc. System for providing dynamic linked panels in user interface
EP3101560A1 (en) 2015-06-05 2016-12-07 Palantir Technologies, Inc. Time-series data storage and processing database system

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000048047A (en) 1998-01-19 2000-02-18 Asahi Glass Co Ltd Time series data storing method, time series database system, time series data processing method, time series data processing system, time series data display system, and recording medium
US7233843B2 (en) * 2003-08-08 2007-06-19 Electric Power Group, Llc Real-time performance monitoring and management system
CA2452251C (en) 2003-12-04 2010-02-09 Timothy R. Jewell Data backup system and method
US20060288035A1 (en) * 2005-06-16 2006-12-21 Oracle International Corporation Relational database support for immutable media
US9195700B1 (en) 2007-10-10 2015-11-24 United Services Automobile Association (Usaa) Systems and methods for storing time-series data
US20100070426A1 (en) * 2008-09-15 2010-03-18 Palantir Technologies, Inc. Object modeling for exploring large data sets
US20120221589A1 (en) * 2009-08-25 2012-08-30 Yuval Shahar Method and system for selecting, retrieving, visualizing and exploring time-oriented data in multiple subject records
JP5423553B2 (en) 2010-04-09 2014-02-19 株式会社日立製作所 Database management method, computer, sensor network system, and database search program
US20120150925A1 (en) 2010-12-10 2012-06-14 International Business Machines Corporation Proactive Method for Improved Reliability for Sustained Persistence of Immutable Files in Storage Clouds
US20130066882A1 (en) 2011-09-09 2013-03-14 Onzo Limited Data storage method and system
CN109960688A (en) 2012-08-01 2019-07-02 华为技术有限公司 A kind of file mergences method and apparatus
WO2014181475A1 (en) 2013-05-10 2014-11-13 株式会社日立製作所 Database server storing plurality of versions of data, and database management method
US9189387B1 (en) * 2013-06-24 2015-11-17 Emc Corporation Combined memory and storage tiering
US9450602B2 (en) 2014-01-02 2016-09-20 Sap Se Efficiently query compressed time-series data in a database
US10171491B2 (en) 2014-12-09 2019-01-01 Fortinet, Inc. Near real-time detection of denial-of-service attacks
WO2016122591A1 (en) * 2015-01-30 2016-08-04 Hewlett Packard Enterprise Development Lp Performance testing based on variable length segmentation and clustering of time series data
US20160328432A1 (en) 2015-05-06 2016-11-10 Squigglee LLC System and method for management of time series data sets
US9753935B1 (en) 2016-08-02 2017-09-05 Palantir Technologies Inc. Time-series data storage and processing database system

Patent Citations (163)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0652513A1 (en) 1993-11-04 1995-05-10 International Business Machines Corporation Task scheduler for a multiprocessor system
US5724575A (en) 1994-02-25 1998-03-03 Actamed Corp. Method and system for object-based relational distributed databases
US5532717A (en) 1994-05-19 1996-07-02 The United States Of America As Represented By The Secretary Of The Navy Method of displaying time series data on finite resolution display device
US5872973A (en) 1995-10-26 1999-02-16 Viewsoft, Inc. Method for managing dynamic relations between objects in dynamic object-oriented languages
US6366933B1 (en) 1995-10-27 2002-04-02 At&T Corp. Method and apparatus for tracking and viewing changes on the web
US5897636A (en) 1996-07-11 1999-04-27 Tandem Corporation Incorporated Distributed object computer system with hierarchical name space versioning
US6094653A (en) 1996-12-25 2000-07-25 Nec Corporation Document classification method and apparatus therefor
US20010051949A1 (en) 1997-05-09 2001-12-13 Carey Michael J. System, method, and program for object building in queries over object views
US6073129A (en) 1997-12-29 2000-06-06 Bull Hn Information Systems Inc. Method and apparatus for improving the performance of a database management system through a central cache mechanism
US20010056522A1 (en) 1998-06-29 2001-12-27 Raju Satyanarayana Methods and apparatus for memory allocation for object instances in an object-oriented software environment
US6510504B2 (en) 1998-06-29 2003-01-21 Oracle Corporation Methods and apparatus for memory allocation for object instances in an object-oriented software environment
US6243717B1 (en) 1998-09-01 2001-06-05 Camstar Systems, Inc. System and method for implementing revision management of linked data entities and user dependent terminology
US6161098A (en) 1998-09-14 2000-12-12 Folio (Fn), Inc. Method and apparatus for enabling small investors with a portfolio of securities to manage taxable events within the portfolio
US6418438B1 (en) 1998-12-16 2002-07-09 Microsoft Corporation Dynamic scalable lock mechanism
US7111231B1 (en) 1999-02-24 2006-09-19 Intellisync Corporation System and methodology for dynamic application environment employing runtime execution templates
US6574635B2 (en) 1999-03-03 2003-06-03 Siebel Systems, Inc. Application instantiation based upon attributes and values stored in a meta data repository, including tiering of application layers objects and components
US20030120675A1 (en) 1999-03-03 2003-06-26 Siebel Systems, Inc. Application instantiation based upon attributes and values stored in a meta data repository, including tiering of application layers, objects, and components
US6304873B1 (en) 1999-07-06 2001-10-16 Compaq Computer Corporation System and method for performing database operations and for skipping over tuples locked in an incompatible mode
US7984374B2 (en) 1999-07-23 2011-07-19 Adobe Systems Incorporated Computer generation of documents using layout elements and content elements
US6560620B1 (en) 1999-08-03 2003-05-06 Aplix Research, Inc. Hierarchical document comparison system and method
US6976210B1 (en) 1999-08-31 2005-12-13 Lucent Technologies Inc. Method and apparatus for web-site-independent personalization from multiple sites having user-determined extraction functionality
US20090254971A1 (en) 1999-10-27 2009-10-08 Pinpoint, Incorporated Secure data interchange
US20070185850A1 (en) 1999-11-10 2007-08-09 Walters Edward J Apparatus and Method for Displaying Records Responsive to a Database Query
US7194680B1 (en) 1999-12-07 2007-03-20 Adobe Systems Incorporated Formatting content by example
US20040117387A1 (en) 2000-02-25 2004-06-17 Vincent Civetta Database sizing and diagnostic utility
US6745382B1 (en) 2000-04-13 2004-06-01 Worldcom, Inc. CORBA wrappers for rules automation technology
US20020091694A1 (en) 2000-11-08 2002-07-11 Namik Hrle Method and system for reduced lock contention in SQL transactions
US7058648B1 (en) 2000-12-01 2006-06-06 Oracle International Corporation Hierarchy-based secured document repository
US6549752B2 (en) 2001-01-29 2003-04-15 Fujitsu Limited Apparatus and method accumulating cases to be learned
US20060265397A1 (en) 2001-03-06 2006-11-23 Knowledge Vector, Inc. Methods, systems, and computer program products for extensible, profile-and context-based information correlation, routing and distribution
US20050114763A1 (en) 2001-03-30 2005-05-26 Kabushiki Kaisha Toshiba Apparatus, method, and program for retrieving structured documents
US6980984B1 (en) 2001-05-16 2005-12-27 Kanisa, Inc. Content provider systems and methods using structured data
US20030130993A1 (en) 2001-08-08 2003-07-10 Quiver, Inc. Document categorization engine
US20030105759A1 (en) 2001-10-26 2003-06-05 United Services Automobile Association (Usaa) System and method of providing electronic access to one or more documents
US20030115481A1 (en) 2001-12-18 2003-06-19 Baird Roger T. Controlling the distribution of information
US20080195672A1 (en) 2002-05-09 2008-08-14 International Business Machines Corporation System and program product for sequential coordination of external database application events with asynchronous internal database events
US20030212718A1 (en) 2002-05-10 2003-11-13 Lsi Logic Corporation Revision control for database of evolved design
US7461158B2 (en) 2002-08-07 2008-12-02 Intelliden, Inc. System and method for controlling access rights to network resources
US20060218206A1 (en) 2002-08-12 2006-09-28 International Business Machines Corporation Method, System, and Program for Merging Log Entries From Multiple Recovery Log Files
US20040111410A1 (en) 2002-10-14 2004-06-10 Burgoon David Alford Information reservoir
US20110153592A1 (en) 2002-10-16 2011-06-23 Ita Software, Inc. Dividing A Travel Query Into Sub-Queries
US20040148301A1 (en) 2003-01-24 2004-07-29 Mckay Christopher W.T. Compressed data structure for a database
US20040117345A1 (en) 2003-08-01 2004-06-17 Oracle International Corporation Ownership reassignment in a shared-nothing database system
US20050097441A1 (en) 2003-10-31 2005-05-05 Herbach Jonathan D. Distributed document version control
US20050108231A1 (en) 2003-11-17 2005-05-19 Terrascale Technologies Inc. Method for retrieving and modifying data elements on a shared medium
US20060106847A1 (en) 2004-05-04 2006-05-18 Boston Consulting Group, Inc. Method and apparatus for selecting, analyzing, and visualizing related database records as a network
US20050289524A1 (en) 2004-06-22 2005-12-29 Mcginnes Simon Systems and methods for software based on business concepts
US20070271317A1 (en) 2004-08-16 2007-11-22 Beinsync Ltd. System and Method for the Synchronization of Data Across Multiple Computing Devices
US20060074881A1 (en) 2004-10-02 2006-04-06 Adventnet, Inc. Structure independent searching in disparate databases
US20060080316A1 (en) 2004-10-08 2006-04-13 Meridio Ltd Multiple indexing of an electronic document to selectively permit access to the content and metadata thereof
US20060116991A1 (en) 2004-10-13 2006-06-01 Ciphergrid Limited Remote database technique
US20060095521A1 (en) 2004-11-04 2006-05-04 Seth Patinkin Method, apparatus, and system for clustering and classification
US20060161558A1 (en) 2005-01-14 2006-07-20 Microsoft Corporation Schema conformance for database servers
US20070061487A1 (en) 2005-02-01 2007-03-15 Moore James F Systems and methods for use of structured and unstructured distributed data
US20070233756A1 (en) 2005-02-07 2007-10-04 D Souza Roy P Retro-fitting synthetic full copies of data
US20060242630A1 (en) 2005-03-09 2006-10-26 Maxis Co., Ltd. Process for preparing design procedure document and apparatus for the same
US7725728B2 (en) 2005-03-23 2010-05-25 Business Objects Data Integration, Inc. Apparatus and method for dynamically auditing data migration to produce metadata
US20060218405A1 (en) 2005-03-23 2006-09-28 Business Objects, S.A. Apparatus and method for dynamically auditing data migration to produce metadata
US20060218491A1 (en) 2005-03-25 2006-09-28 International Business Machines Corporation System, method and program product for community review of documents
US20060253502A1 (en) 2005-05-06 2006-11-09 Microsoft Corporation Maintenance of link level consistency between database and file system
US20070050429A1 (en) 2005-08-26 2007-03-01 Centric Software, Inc. Time-range locking for temporal database and branched-and-temporal databases
US20110173619A1 (en) 2005-10-11 2011-07-14 Eric Ian Fish Apparatus and method for optimized application of batched data to a database
US8185819B2 (en) 2005-12-12 2012-05-22 Google Inc. Module specification for a module to be incorporated into a container document
US7725530B2 (en) 2005-12-12 2010-05-25 Google Inc. Proxy server collection of data for module incorporation into a container document
US7730082B2 (en) 2005-12-12 2010-06-01 Google Inc. Remote module incorporation into a container document
US7730109B2 (en) 2005-12-12 2010-06-01 Google, Inc. Message catalogs for remote modules
US20100082541A1 (en) 2005-12-19 2010-04-01 Commvault Systems, Inc. Systems and methods for performing replication copy storage operations
US20070143253A1 (en) 2005-12-19 2007-06-21 Pekka Kostamaa Database system
US20100283787A1 (en) 2006-03-03 2010-11-11 Donya Labs Ab Creation and rendering of hierarchical digital multimedia data
US20080015970A1 (en) 2006-04-28 2008-01-17 Townsend Analytics, Ltd. Order Management System and Method for Electronic Securities Trading
US20080215546A1 (en) 2006-10-05 2008-09-04 Baum Michael J Time Series Search Engine
WO2008043082A2 (en) 2006-10-05 2008-04-10 Splunk Inc. Time series search engine
US8112425B2 (en) 2006-10-05 2012-02-07 Splunk Inc. Time series search engine
US7761407B1 (en) 2006-10-10 2010-07-20 Medallia, Inc. Use of primary and secondary indexes to facilitate aggregation of records of an OLAP data cube
US20110270812A1 (en) 2006-10-11 2011-11-03 Ruby Jonathan P Extended Transactions
US20080104060A1 (en) 2006-10-31 2008-05-01 Business Objects, S.A. Apparatus and method for assessing relevant categories and measures for use in data analyses
US20080104149A1 (en) 2006-11-01 2008-05-01 Ephraim Meriwether Vishniac Managing Storage of Individually Accessible Data Units
US20100325581A1 (en) 2006-11-10 2010-12-23 Microsoft Corporation Data object linking and browsing tool
US7962495B2 (en) 2006-11-20 2011-06-14 Palantir Technologies, Inc. Creating data in a data store using a dynamic ontology
US8126848B2 (en) 2006-12-07 2012-02-28 Robert Edward Wagner Automated method for identifying and repairing logical data discrepancies between database replicas in a database cluster
US20150106347A1 (en) 2007-02-21 2015-04-16 Palantir Technologies, Inc. Providing unique views of data based on changes or rules
US20080201339A1 (en) 2007-02-21 2008-08-21 Mcgrew Robert J Providing unique views of data based on changes or rules
US8930331B2 (en) 2007-02-21 2015-01-06 Palantir Technologies Providing unique views of data based on changes or rules
US20080270316A1 (en) 2007-02-28 2008-10-30 Aaron Guidotti Information, document, and compliance management for financial professionals, clients, and supervisors
US7814084B2 (en) 2007-03-21 2010-10-12 Schmap Inc. Contact information capture and link redirection
US20090164387A1 (en) 2007-04-17 2009-06-25 Semandex Networks Inc. Systems and methods for providing semantically enhanced financial information
US20080301378A1 (en) 2007-06-01 2008-12-04 Microsoft Corporation Timestamp based transactional memory
US20090031247A1 (en) 2007-07-26 2009-01-29 Walter Wolfgang E Active Tiled User Interface
US20090106308A1 (en) 2007-10-18 2009-04-23 Christopher Killian Complexity estimation of data objects
US20090177962A1 (en) 2008-01-04 2009-07-09 Microsoft Corporation Intelligently representing files in a view
US20090271435A1 (en) 2008-04-24 2009-10-29 Katsushi Yako Data management method, data management program, and data management device
US20120150791A1 (en) 2008-06-02 2012-06-14 Ian Alexander Willson Methods and systems for loading data into a temporal data warehouse
US20100191884A1 (en) 2008-06-12 2010-07-29 Gravic, Inc. Method for replicating locks in a data replication engine
US20090313311A1 (en) 2008-06-12 2009-12-17 Gravic, Inc. Mixed mode synchronous and asynchronous replication system
US20090313223A1 (en) 2008-06-17 2009-12-17 Tekla Corporation Data retrieval
US20100036831A1 (en) 2008-08-08 2010-02-11 Oracle International Corporation Generating continuous query notifications
US20100076939A1 (en) 2008-09-05 2010-03-25 Hitachi, Ltd. Information processing system, data update method and data update program
US8041714B2 (en) 2008-09-15 2011-10-18 Palantir Technologies, Inc. Filter chains with associated views for exploring large data sets
US20100070489A1 (en) 2008-09-15 2010-03-18 Palantir Technologies, Inc. Filter chains with associated views for exploring large data sets
US20100114887A1 (en) 2008-10-17 2010-05-06 Google Inc. Textual Disambiguation Using Social Connections
US20100114831A1 (en) 2008-10-30 2010-05-06 Gilbert Gary M Building a Synchronized Target Database
US20100114817A1 (en) 2008-10-30 2010-05-06 Broeder Sean L Replication of operations on objects distributed in a storage system
US20100138842A1 (en) 2008-12-03 2010-06-03 Soren Balko Multithreading And Concurrency Control For A Rule-Based Transaction Engine
US20100145909A1 (en) 2008-12-10 2010-06-10 Commvault Systems, Inc. Systems and methods for managing replicated database data
US20100161565A1 (en) 2008-12-18 2010-06-24 Electronics And Telecommunications Research Institute Cluster data management system and method for data restoration using shared redo log in cluster data management system
US20100161688A1 (en) 2008-12-22 2010-06-24 Google Inc. Asynchronous distributed garbage collection for replicated storage clusters
US20100211550A1 (en) 2009-02-17 2010-08-19 Amadeus S.A.S. Method allowing validation in a production database of new entered data prior to their release
US20100211618A1 (en) 2009-02-17 2010-08-19 Agilewaves, Inc. Efficient storage of data allowing for multiple level granularity retrieval
US20100235606A1 (en) 2009-03-11 2010-09-16 Oracle America, Inc. Composite hash and list partitioning of database tables
US20110029498A1 (en) 2009-07-10 2011-02-03 Xkoto, Inc. System and Method for Subunit Operations in a Database
US20110047540A1 (en) 2009-08-24 2011-02-24 Embarcadero Technologies Inc. System and Methodology for Automating Delivery, Licensing, and Availability of Software Products
US20110184813A1 (en) 2009-09-14 2011-07-28 Cbs Interactive, Inc. Targeting offers to users of a web site
US20120013684A1 (en) 2009-09-30 2012-01-19 Videojet Technologies Inc. Thermal ink jet ink compostion
US20110218978A1 (en) * 2010-02-22 2011-09-08 Vertica Systems, Inc. Operating on time sequences of data
US20110258158A1 (en) 2010-04-14 2011-10-20 Bank Of America Corporation Data Services Framework Workflow Processing
US20110258242A1 (en) 2010-04-16 2011-10-20 Salesforce.Com, Inc. Methods and systems for appending data to large data volumes in a multi-tenant store
WO2012025915A1 (en) 2010-07-21 2012-03-01 Sqream Technologies Ltd A system and method for the parallel execution of database queries over cpus and multi core processors
US20120072825A1 (en) 2010-09-20 2012-03-22 Research In Motion Limited Methods and systems for identifying content elements
US20120124179A1 (en) 2010-11-12 2012-05-17 Realnetworks, Inc. Traffic management in adaptive streaming protocols
US20120123989A1 (en) 2010-11-15 2012-05-17 Business Objects Software Limited Dashboard evaluator
US20120159307A1 (en) 2010-12-17 2012-06-21 Microsoft Corporation Rendering source regions into target regions of web pages
US20120330908A1 (en) 2011-06-23 2012-12-27 Geoffrey Stowe System and method for investigating large amounts of data
US9208159B2 (en) 2011-06-23 2015-12-08 Palantir Technologies, Inc. System and method for investigating large amounts of data
US20140344231A1 (en) 2011-06-23 2014-11-20 Palantir Technologies, Inc. System and method for investigating large amounts of data
US20130036346A1 (en) 2011-08-02 2013-02-07 Cicerone Derek Michael System and Method for Accessing Rich Objects Via Spreadsheets
US9280532B2 (en) 2011-08-02 2016-03-08 Palantir Technologies, Inc. System and method for accessing rich objects via spreadsheets
EP2555126A2 (en) 2011-08-02 2013-02-06 Palantir Technologies, Inc. System and method for accessing rich objects via spreadsheets
US20130318060A1 (en) 2011-09-02 2013-11-28 Palantir Technologies, Inc. Multi-row transactions
US8504542B2 (en) 2011-09-02 2013-08-06 Palantir Technologies, Inc. Multi-row transactions
US20150112956A1 (en) 2011-09-02 2015-04-23 Palantir Technologies, Inc. Transaction protocol for reading database values
US20130060742A1 (en) 2011-09-02 2013-03-07 Allen Chang Multi-row transactions
US8954410B2 (en) 2011-09-02 2015-02-10 Palantir Technologies, Inc. Multi-row transactions
AU2014206155A1 (en) 2011-09-02 2014-08-07 Palantir Technologies, Inc. Multi-row transactions
US20130097130A1 (en) 2011-10-17 2013-04-18 Yahoo! Inc. Method and system for resolving data inconsistency
US20130151388A1 (en) 2011-12-12 2013-06-13 Visa International Service Association Systems and methods to identify affluence levels of accounts
US20130304770A1 (en) 2012-05-10 2013-11-14 Siemens Aktiengesellschaft Method and system for storing data in a database
US20150213043A1 (en) 2012-07-13 2015-07-30 Hitachi Solutions, Ltd. Retrieval device, method for controlling retrieval device, and recording medium
US20140040276A1 (en) * 2012-07-31 2014-02-06 International Business Machines Corporation Method and apparatus for processing time series data
US20140149272A1 (en) 2012-08-17 2014-05-29 Trueex Group Llc Interoffice bank offered rate financial product and implementation
US8676857B1 (en) 2012-08-23 2014-03-18 International Business Machines Corporation Context-based search for a data store related to a graph node
US20140095543A1 (en) * 2012-09-28 2014-04-03 Oracle International Corporation Parameterized continuous query templates
US20150213134A1 (en) 2012-10-11 2015-07-30 Tencent Technology (Shenzhen) Company Limited Data query method and system and storage medium
US20140181833A1 (en) 2012-12-21 2014-06-26 International Business Machines Corporation Processor provisioning by a middleware system for a plurality of logical processor partitions
US9092482B2 (en) 2013-03-14 2015-07-28 Palantir Technologies, Inc. Fair scheduling for mixed-query loads
US20150261817A1 (en) 2013-03-14 2015-09-17 Palantir Technologies, Inc. Fair scheduling for mixed-query loads
US9230280B1 (en) 2013-03-15 2016-01-05 Palantir Technologies Inc. Clustering data based on indications of financial malfeasance
US20140324876A1 (en) 2013-04-25 2014-10-30 International Business Machines Corporation Management of a database system
US20150242397A1 (en) 2013-06-19 2015-08-27 Tencent Technology (Shenzhen) Company Limited Method, server and system for managing content in content delivery network
US20150039886A1 (en) 2013-08-01 2015-02-05 Bitglass, Inc. Secure application access system
US20150089353A1 (en) 2013-09-24 2015-03-26 Chad Folkening Platform for building virtual entities using equity systems
EP2863326A1 (en) 2013-10-18 2015-04-22 Palantir Technologies, Inc. Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
US9116975B2 (en) 2013-10-18 2015-08-25 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
US20160034545A1 (en) 2013-10-18 2016-02-04 Palantir Technologies Inc. Systems and user interfaces for dynamic and interactive simultaneous querying of multiple data stores
US20150227295A1 (en) 2014-01-03 2015-08-13 Palantir Technologies, Inc. Systems and methods for visual definition of data associations
US9043696B1 (en) 2014-01-03 2015-05-26 Palantir Technologies Inc. Systems and methods for visual definition of data associations
EP2891992A1 (en) 2014-01-03 2015-07-08 Palantir Technologies, Inc. Systems and methods for visual definition of data associations
US20150212663A1 (en) 2014-01-30 2015-07-30 Splunk Inc. Panel templates for visualization of data within an interactive dashboard
US9009827B1 (en) 2014-02-20 2015-04-14 Palantir Technologies Inc. Security sharing system
US20150278325A1 (en) * 2014-03-28 2015-10-01 Hitachi High-Technologies Corporation Information processing apparatus, information processing method, information system and medium
US20150341467A1 (en) 2014-05-26 2015-11-26 Samsung Electronics Co., Ltd Method and of improving http performance on communication network and apparatus adapted thereto
US20160062555A1 (en) 2014-09-03 2016-03-03 Palantir Technologies Inc. System for providing dynamic linked panels in user interface
EP2993595A1 (en) 2014-09-03 2016-03-09 Palantir Technologies, Inc. Dynamic user interface
EP3101560A1 (en) 2015-06-05 2016-12-07 Palantir Technologies, Inc. Time-series data storage and processing database system

Non-Patent Citations (57)

* Cited by examiner, † Cited by third party
Title
"Apache HBase," http://hbase.apache.org/ printed Sep. 14, 2011 in 1 page.
"The Apache Cassandra Project," http://cassandra.apache.org/ Printed Sep. 14, 2011 in 3 pages.
Anonymous, "BackTult-JD Edwards One World Version Control System", in 1 page, Jul. 23, 2007.
Anonymous, "BackTult—JD Edwards One World Version Control System", in 1 page, Jul. 23, 2007.
Antoshenkov, Gennady, "Dictionary-Based Order-Preserving String Compression", The VLDB Journal, pp. 26-39, 1997.
Baker et al., "Megastore: Providing Scalable, Highly Available Storage for Interactive Services", 5th Biennial Conference on Innovative Data Systems Research (CIDR '11), Asilomar, California, Jan. 9-12, 2011.
Bernstein et al., "Hyder-A Transactional Record Manager for Shared Flash", 5th Biennial Conference on Innovative Data Systems Research (CIDR '11), vol. 12, Asilomar, California, Jan. 9-12, 2011.
Bernstein et al., "Hyder—A Transactional Record Manager for Shared Flash", 5th Biennial Conference on Innovative Data Systems Research (CIDR '11), vol. 12, Asilomar, California, Jan. 9-12, 2011.
Chang et al., "Bigtable: A Distributed Storage System for Structured Data", Google, Inc., OSDI'06: Seventh Symposium on Operating System Design and Implementation, Seattle, WA, Nov. 2006.
Chung, Chin-Wan, "Dataplex: An Access to Heterogeneous Distributed Databases", Communications of the ACM, Association for Computing Machinery, Inc., vol. 33, Issue No. 1, pp. 70-80, Jan. 1, 1990.
Devanbu et al., "Authentic Third-party Data Publication", http://www.cs.ucdavis.edu/˜devanbu/authdbpub.pdf, p. 19, 2000.
Dreyer et al., "An Object-Oriented Data Model for a Time Series Management System", Proceedings of the 7th International Working Conference on Scientific and Statistical Database Management, p. 12, Charlottesville, Virginia, USA, Sep. 28-30, 1994.
Elmasri et al., "Fundamentals of Database Systems", Fourth Edition, pp. 455-491, 2004.
Hogue et al., "Thresher: Automating the Unwrapping of Semantic Content from the World Wide Web", 14th International Conference on World Wide Web, WWW 2005: Chiba, Japan, May 10-14, 2005.
Klemmer et al., "Where Do Web Sites Come From? Capturing and Interacting with Design History," Association for Computing Machinery, CHI 2002, Apr. 20-25, 2002, Minneapolis, MN, pp. 8.
Mentzas et al., "An Architecture for Intelligent Assistance in the Forecasting Process", Proceedings of the Twenty-Eighth Hawaii International Conference on System Sciences, vol. 3, pp. 167-176, Jan. 3-6, 1995.
Miklau et al., "Securing History: Privacy and Accountability in Database Systems", 3rd Biennial Conference on Innovative Data Systems Research (CIDR), pp. 387-396, Asilomar, California, Jan. 7-10, 2007.
Niepert et al., "A Dynamic Ontology for a Dynamic Reference Work", Joint Conference on Digital Libraries, pp. 1-10, Vancouver, British Columbia, Jun. 17-22, 2007.
Nierman, "Evaluating Structural Similarity in XML Documents", 6 pages, 2002.
Notice of Allowance for U.S. Appl. No. 13/196,788 dated Dec. 18, 2015.
Notice of Allowance for U.S. Appl. No. 13/826,228 dated Mar. 27, 2015.
Notice of Allowance for U.S. Appl. No. 14/192,767 dated Dec. 16, 2014.
Notice of Allowance for U.S. Appl. No. 14/278,963 dated Sep. 2, 2015.
Notice of Allowance for U.S. Appl. No. 14/451,221 dated Aug. 4, 2015.
Notice of Allowance for U.S. Appl. No. 14/504,103 dated May 18, 2015.
Notice of Allowance for U.S. Appl. No. 14/734,772 dated Apr. 27, 2016.
Notice of Allowance for U.S. Appl. No. 14/746,671 dated Jan. 21, 2016.
Official Communication for European Patent Application No. 14189344.6 dated Feb. 20, 2015.
Official Communication for European Patent Application No. 14199182.8 dated Mar. 13, 2015.
Official Communication for European Patent Application No. 15183721.8 dated Nov. 23, 2015.
Official Communication for European Patent Application No. 16173056.9 dated Nov. 3, 2016.
Official Communication for Netherlands Patent Application No. 2012436 dated Nov. 6, 2015.
Official Communication for U.S. Appl. No. 13/196,788 dated Nov. 25, 2015.
Official Communication for U.S. Appl. No. 13/196,788 dated Oct. 23, 2015.
Official Communication for U.S. Appl. No. 14/278,963 dated Jan. 30, 2015.
Official Communication for U.S. Appl. No. 14/451,221 dated Apr. 6, 2015.
Official Communication for U.S. Appl. No. 14/504,103 dated Feb. 5, 2015.
Official Communication for U.S. Appl. No. 14/504,103 dated Mar. 31, 2015.
Official Communication for U.S. Appl. No. 14/578,389 dated Apr. 22, 2016.
Official Communication for U.S. Appl. No. 14/578,389 dated Oct. 21, 2015.
Official Communication for U.S. Appl. No. 14/580,218 dated Jun. 26, 2015.
Official Communication for U.S. Appl. No. 14/726,211 dated Apr. 5, 2016.
Official Communication for U.S. Appl. No. 14/734,772 dated Jul. 24, 2015.
Official Communication for U.S. Appl. No. 14/734,772 dated Oct. 30, 2015.
Official Communication for U.S. Appl. No. 14/746,671 dated Nov. 12, 2015.
Official Communication for U.S. Appl. No. 14/746,671 dated Sep. 28, 2015.
Official Communication for U.S. Appl. No. 14/841,338 dated Feb. 18, 2016.
Official Communication for U.S. Appl. No. 14/996,179 dated May 20, 2016.
Peng et al., "Large-scale Incremental Processing Using Distributed Transactions and Notifications", Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation, USENIX, p. 14, 2010.
QUEST, "Toad for ORACLE 11.6-Guide to Using Toad", pp. 1-162, Sep. 24, 2012.
QUEST, "Toad for ORACLE 11.6—Guide to Using Toad", pp. 1-162, Sep. 24, 2012.
Thomson et al., "The Case for Determinism in Database Systems", The 36th International Conference on Very Large Data Bases, Proceedings of the VLDB Endowment, vol. 3, Issue No. 1, p. 11, Singapore, Sep. 13-17, 2010.
Wikipedia, "Federated Database System," Sep. 7, 2013, retrieved from the internet on Jan. 27, 2015 http://en.wikipedia.org/w/index.php?title=Federated-database-system&oldid=571954221.
Wikipedia, "Federated Database System," Sep. 7, 2013, retrieved from the internet on Jan. 27, 2015 http://en.wikipedia.org/w/index.php?title=Federated—database—system&oldid=571954221.
Wollrath et al., "A Distributed Object Model for the Java System", Conference on Object-Oriented Technologies and Systems, pp. 219-231, Jun. 17-21, 1996.
Yang et al., "HTML Page Analysis Based on Visual Cues", A129, pp. 859-864, 2001.
Zhao et al., "Exploratory Analysis of Time-Series with ChronoLenses", IEEE Transactions on Visualization and Computer Graphics, vol. 17, No. 12, Dec. 2011, pp. 2422-2431.

Cited By (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11314738B2 (en) 2014-12-23 2022-04-26 Palantir Technologies Inc. Searching charts
US10585907B2 (en) 2015-06-05 2020-03-10 Palantir Technologies Inc. Time-series data storage and processing database system
US11687543B2 (en) 2015-06-05 2023-06-27 Palantir Technologies Inc. Time-series data storage and processing database system
US10664444B2 (en) 2016-08-02 2020-05-26 Palantir Technologies Inc. Time-series data storage and processing database system
US11379453B2 (en) 2017-06-02 2022-07-05 Palantir Technologies Inc. Systems and methods for retrieving and processing data
US10417224B2 (en) 2017-08-14 2019-09-17 Palantir Technologies Inc. Time series database processing system
US11397730B2 (en) 2017-08-14 2022-07-26 Palantir Technologies Inc. Time series database processing system
US20230169082A1 (en) * 2017-09-21 2023-06-01 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US11573970B2 (en) 2017-09-21 2023-02-07 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US10216695B1 (en) 2017-09-21 2019-02-26 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US11914605B2 (en) * 2017-09-21 2024-02-27 Palantir Technologies Inc. Database system for time series data storage, processing, and analysis
US11281726B2 (en) 2017-12-01 2022-03-22 Palantir Technologies Inc. System and methods for faster processor comparisons of visual graph features
US11016986B2 (en) 2017-12-04 2021-05-25 Palantir Technologies Inc. Query-based time-series data display and processing system
US11934409B2 (en) 2018-11-23 2024-03-19 Amazon Technologies, Inc. Continuous functions in a time-series database
US11068537B1 (en) * 2018-12-11 2021-07-20 Amazon Technologies, Inc. Partition segmenting in a distributed time-series database
US10997137B1 (en) 2018-12-13 2021-05-04 Amazon Technologies, Inc. Two-dimensional partition splitting in a time-series database
US11409725B1 (en) 2019-02-04 2022-08-09 Amazon Technologies, Inc. Multi-tenant partitioning in a time-series database
US11250019B1 (en) 2019-02-27 2022-02-15 Amazon Technologies, Inc. Eventually consistent replication in a time-series database
US11853317B1 (en) 2019-03-18 2023-12-26 Amazon Technologies, Inc. Creating replicas using queries to a time series database
US11513854B1 (en) * 2019-06-26 2022-11-29 Amazon Technologies, Inc. Resource usage restrictions in a time-series database
US11397752B1 (en) * 2019-06-27 2022-07-26 Amazon Technologies, Inc. In-memory ingestion for highly available distributed time-series databases
US11256719B1 (en) 2019-06-27 2022-02-22 Amazon Technologies, Inc. Ingestion partition auto-scaling in a time-series database
US11803572B2 (en) 2019-09-23 2023-10-31 Amazon Technologies, Inc. Schema-based spatial partitioning in a time-series database
US11573981B1 (en) 2019-09-23 2023-02-07 Amazon Technologies, Inc. Auto-scaling using temporal splits in a time-series database
US11216487B1 (en) 2019-09-23 2022-01-04 Amazon Technologies, Inc. Schema-based spatial partitioning in a time-series database
US11263270B1 (en) 2020-03-26 2022-03-01 Amazon Technologies, Inc. Heat balancing in a distributed time-series database
US11409771B1 (en) 2020-03-26 2022-08-09 Amazon Technologies, Inc. Splitting partitions across clusters in a time-series database
US11366598B1 (en) 2020-03-26 2022-06-21 Amazon Technologies, Inc. Dynamic lease assignments in a time-series database
US11599516B1 (en) 2020-06-24 2023-03-07 Amazon Technologies, Inc. Scalable metadata index for a time-series database
US20220067021A1 (en) * 2020-09-01 2022-03-03 Palantir Technologies Inc. Data insights
US11461347B1 (en) 2021-06-16 2022-10-04 Amazon Technologies, Inc. Adaptive querying of time-series data over tiered storage
US11941014B1 (en) 2021-06-16 2024-03-26 Amazon Technologies, Inc. Versioned metadata management for a time-series database

Also Published As

Publication number Publication date
EP3101560B1 (en) 2021-05-19
EP3101560A1 (en) 2016-12-07
US20200201859A1 (en) 2020-06-25
US10585907B2 (en) 2020-03-10
US20230359638A1 (en) 2023-11-09
US20170270172A1 (en) 2017-09-21
US11687543B2 (en) 2023-06-27
US20160357828A1 (en) 2016-12-08

Similar Documents

Publication Publication Date Title
US11687543B2 (en) Time-series data storage and processing database system
US20200285617A1 (en) Time-series data storage and processing database system
US11531680B2 (en) Data aggregation and analysis system
US11709852B2 (en) Query-based time-series data display and processing system
AU2014253499B2 (en) Space-optimized display of multi-column tables with selective text truncation based on a combined text width
US10423527B2 (en) Memory management and image display for mobile devices
US9811577B2 (en) Asynchronous data replication using an external buffer table
US20200285651A1 (en) Systems and methods for data analysis and visualization and managing data conflicts
EP3107014A1 (en) Data aggregation and analysis system
US11150917B2 (en) System for data aggregation and analysis of data from a plurality of data sources
US11494444B2 (en) Systems and methods for visualizing and analyzing multi-dimensional data
US10769125B2 (en) Ordering records for timed meta-data generation in a blocked record environment
EP3812922A1 (en) Methods and systems for data synchronization
US11392348B2 (en) Ordering records for timed meta-data generation in a blocked record environment
EP4016322A1 (en) Data structure based on event compaction and read-offsets

Legal Events

Date Code Title Description
AS Assignment

Owner name: PALANTIR TECHNOLOGIES INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TOBIN, DAVID;SCOTT, DYLAN;SIMSEK, ORCUN;AND OTHERS;SIGNING DATES FROM 20160629 TO 20160726;REEL/FRAME:039348/0220

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: MORGAN STANLEY SENIOR FUNDING, INC., AS ADMINISTRATIVE AGENT, NEW YORK

Free format text: SECURITY INTEREST;ASSIGNOR:PALANTIR TECHNOLOGIES INC.;REEL/FRAME:051713/0149

Effective date: 20200127

Owner name: ROYAL BANK OF CANADA, AS ADMINISTRATIVE AGENT, CANADA

Free format text: SECURITY INTEREST;ASSIGNOR:PALANTIR TECHNOLOGIES INC.;REEL/FRAME:051709/0471

Effective date: 20200127

AS Assignment

Owner name: PALANTIR TECHNOLOGIES INC., CALIFORNIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:ROYAL BANK OF CANADA;REEL/FRAME:052856/0382

Effective date: 20200604

Owner name: MORGAN STANLEY SENIOR FUNDING, INC., NEW YORK

Free format text: SECURITY INTEREST;ASSIGNOR:PALANTIR TECHNOLOGIES INC.;REEL/FRAME:052856/0817

Effective date: 20200604

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 4

AS Assignment

Owner name: PALANTIR TECHNOLOGIES INC., CALIFORNIA

Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE ERRONEOUSLY LISTED PATENT BY REMOVING APPLICATION NO. 16/832267 FROM THE RELEASE OF SECURITY INTEREST PREVIOUSLY RECORDED ON REEL 052856 FRAME 0382. ASSIGNOR(S) HEREBY CONFIRMS THE RELEASE OF SECURITY INTEREST;ASSIGNOR:ROYAL BANK OF CANADA;REEL/FRAME:057335/0753

Effective date: 20200604

AS Assignment

Owner name: WELLS FARGO BANK, N.A., NORTH CAROLINA

Free format text: ASSIGNMENT OF INTELLECTUAL PROPERTY SECURITY AGREEMENTS;ASSIGNOR:MORGAN STANLEY SENIOR FUNDING, INC.;REEL/FRAME:060572/0640

Effective date: 20220701

Owner name: WELLS FARGO BANK, N.A., NORTH CAROLINA

Free format text: SECURITY INTEREST;ASSIGNOR:PALANTIR TECHNOLOGIES INC.;REEL/FRAME:060572/0506

Effective date: 20220701