CA2550832A1 - Query framework system and method - Google Patents

Query framework system and method Download PDF

Info

Publication number
CA2550832A1
CA2550832A1 CA002550832A CA2550832A CA2550832A1 CA 2550832 A1 CA2550832 A1 CA 2550832A1 CA 002550832 A CA002550832 A CA 002550832A CA 2550832 A CA2550832 A CA 2550832A CA 2550832 A1 CA2550832 A1 CA 2550832A1
Authority
CA
Canada
Prior art keywords
query
provider
providers
framework system
operations
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
CA002550832A
Other languages
French (fr)
Inventor
Soufiane Azizi
Vladimir Mordvinov
Mike C. Potter
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cognos Inc
Original Assignee
Cognos Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CA002519010A external-priority patent/CA2519010A1/en
Application filed by Cognos Inc filed Critical Cognos Inc
Priority to CA002550832A priority Critical patent/CA2550832A1/en
Publication of CA2550832A1 publication Critical patent/CA2550832A1/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2452Query translation

Abstract

A query framework system is provided. The query framework system compris es a query framework service for consuming a query and returning a result set, a provider application programming interface for defining a consistent application programming interface for providers, one or more providers incorporating query processin g logic, and a configuration repository for storing the list of the providers in the query framework system and their properties.

Description

Query Framework System and Method FIELD OF THE INVENTION
[001 ] The invention relates generally to data access middleware, and in particular to a query framework system and method.
BACKGROUND OF THE INVENTION
[002] Prior business intelligence (BI) query engines are not extensible, are not data agnostic, and can not support the various requirements of a corporate performance management or a BI solution in a consistent manner. They can only support some of the BI requirements using fragmented and inconsistent sub-engines, each providing a subset of the solution. This inconsistency filters up to the front end of the solution and present the BI user with an inconsistent user experience.
SUMMARY OF THE INVENTION
[003] In accordance with an embodiment of the present invention, there is provided a query framework system. The query framework system comprises a query framework service for consuming a query and returning a result set, a provider application programming interface for defining a consistent application programming interface for providers, one or more providers incorporating query processing logic, and a configuration repository for storing the list of the providers in the query framework system and their properties.
[004] In accordance with another aspect of the present invention, a system with query processing operations distributed across multiple reusable components, query framework providers, is provided. A set of providers is defined that comprises a system performing processing of data agnostic BI queries.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows in a block diagram an example of a query framework system 10, in accordance with an embodiment of the present invention.
Figure 2 shows in a block diagram another example of a query framework system 10, in accordance with an embodiment of the present invention.

Figure 3 shows in a block diagram a more detailed example of a query framework system.
Figure 4 shows a high level view of an example of a design of a planner, data, or service provider within the query framework system.
Figure 5 illustrates a sample scenario of a provider use, in accordance with an embodiment of the query framework system.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
[005] The present inventions allows for the building of business intelligence (BI) and corporate performance management (CPM) systems that query data using a BI
query language that is data agnostic. The invention can also be applied to query processing systems supporting any other structured query language, for example SQL or MDX.
[006] Figure 1 shows in a block diagram an example of a query framework system 10, in accordance with an embodiment of the present invention. The query framework system 10 comprises a query framework service 11 for consuming a query and returning a result set, a provider application programming interface (API) 12 for defining a consistent provider API, one or more providers 14 and a configuration repository 16.
[007] Figure 2 shows in a block diagram another example of a query framework system 10, in accordance with an embodiment of the present invention. The query framework system 10 comprises a query framework service 11 for consuming a query and returning a result set, a provider application programming interface (API) 12 for defining a consistent provider API, one or more providers 14, an operation support table 15 for allowing for an initial distribution of operations involved in a query between providers, and a configuration repository 16.
[008] The ability to support an end to end CPM solution (Planning, Business Intelligence, and Score Carding) using the same query engine and hence the same underlying software platform is an advantage of the query framework system 10.
Other advantages include the flexibility of the overall system architecture which is easily maintainable and extendable. Customers, other solution providers, and OEMs can build on top of this framework their own business logic and hence customize an out of the box CPM. Also, the benefit to the end user in terms of consistency in his/her experience, whether reporting, ad-hoc querying, or analyzing data from various types of data source technology is another advantage.
[009] Query processing usually is done in two phases: query planning and query execution. During query planning a query received from a user is gradually transformed into an execution plan, the tree of nodes representing operations to be performed during execution phase. Distributed query planning in the query framework system is achieved by breaking the full query into portions supported by corresponding providers.
During the planning phase providers receive a designated query portion to apply planning actions. Providers analyze the designated query part and contribute to planning of this portion if they detect that their planning functionality is applicable to a given query portion. Providers may prepare the query portion, i.e., plan the actions to be performed during the execution phase and replace this query portion with its execution plan.
Providers may transform this query portion into another form in order to facilitate its preparation by other providers. The sequence of the provider invocation during query planning is predefined and stored in the configuration repository. The sequence of the provider invocation during the execution phase is driven by the execution plan built as a result of the planning phase. Provider execution method manipulates the data received from underlying datasources or other providers in order to produce the result master dataset returned back to a system user.
[0010] Existing data processing frameworks like ODBC or OLEDB effectively provide an ability to distribute execution of a query across different components. Yet the sequence of these operations had to be defined by a client application that would have to specify exactly what components are to be used to execute given query.
[0011 ] The query framework system 10 allows to distribute not only the query execution phase functionality across multiple components, but also to distribute the planning logic.
It does not require a client application to be responsible for formulation of an operation execution sequence. Instead the configuration of the query framework would determine the execution operation sequence, the execution plan, during distributed query planning phase.
[0012] The main principle the distributed planning is based on is the ability of providers to recognize and extract the operations they can support out of a query portion they receive during the planning phase. Decomposition providers can break a full query into a tree of smaller sub-queries to make it possible for providers to take a responsibility for those sub-queries they can actually support.

[0013] Another cornerstone of the query framework is the unified Provider API
interface 12 that includes not only the methods involved in data retrieval, but also the methods invoked during query planning. All providers are required to implement this interface and hence all providers are capable of participation in the query planning process.
[0014] Provider API 12 as well as the client facing query framework API 11 also serve the data agnostic function, i.e. these interfaces provide an access and ability to manipulate both relational and multidimensional or OLAP data.
Provider Cate og ries [0015] The query framework can be filled with any set of providers. It is recognized, though, that the following categories of providers will provide the completeness and flexibility of the query processing system 10 as well as reusable nature of the system components, query framework providers. The categories are: query transformation providers, service provider and query planning providers.
[0016] Query planner providers provide access to data either through internal operations or by calling external components providing data. The operation of query planner providers typically involves translation of the BIQuery language into the query language of underlying data sources, such as relational databases supporting SQL query language.
Query planner providers plan and execute a query portion or whole query that the coordinator designates to them. A result of execution is one or more master datasets.
[0017] Service providers handle operations that are not supported by planner providers.
During query planning the service provider take the responsibility for the portion of a query not supported by planner providers. During execution service providers perform post-processing of data returned by query planner providers. This category of providers can be replaced, newly added or extracted out the system with minimum disruption to the query framework system.
[0018] Query transformation providers are involved in the query planning process. They are to perform query manipulations required to facilitate query planning process.
Important instances of query transformation providers are query refinement providers, query decomposition providers and query optimization providers. The refinement provider enhances a query specification to disambiguate, clarify and make it easier to work with by other providers in the query framework system. The decomposition providers break a complex query into an equivalent combination of simpler queries.
Decomposition allows the coordinator 104 to distribute only portions of a full query to appropriate providers that do not support all operations involved in the full query. Query optimization providers are to transform a query in order to speed up a query execution.
[0019] The provider of the decomposition into primitive operations can be present in the query framework system to play a special role of insuring the system completeness given the system also has service providers associated with every primitive operation. The primitive operation denotes a simplest operation that constitutes an indivisible query part.
The examples of primitive macro operations that can be recognized in the SQL
query language are: filter operation, grouping and aggregation operation, sorting operation, joining operation, etc.
Configuration repository 16 [0020] Query framework configuration repository contains the parameters of the system as a whole, the list of providers and their properties. A provider becomes integrated into the query framework system once it is described in the configuration repository, so the query framework is easily extendable.
[0021 ] A provider is described by its identifier, the name of a shared library/DLL
containing the provider functionality. It is defined whether a provider is optional or mandatory for the system work, the property that separates essential system functionality from an additional plug-ins.
[0022] Properties of a provider connection and metadata cache can be configured by the system defaults or can be overridden by the configuration parameters defined for specifically for a given provider.
[0023] Provider configuration has the section "provider details" that contains parameters applicable only to a given provider and interpreted by the provider itself.
[0024] The configuration repository also contains operation support tables of those providers that do not support the Discover method. It simplifies the access and maintenance of the operation support tables of those providers that do not require being self described in the system through the Discover method.

Coordination Planner 101 [0025] The query framework system coordination planner 101 (or coordinator) is the component that is responsible for delegating the planning and execution of a BIQuery or its portions to a list of available providers. The coordinator makes use of available query transformation providers, planner providers, and service providers to refine, plan, decompose and execute a query.
[0026] The coordinator 101 is designed to have no specific knowledge about the available providers. Their capabilities, in terms of the supported operations, and the sequence of their invocation, are discovered at run time based on the information stored in the configuration repository and with the information returned by the provider method describing the provider properties. This design is intended to support feature providers to seamlessly plug-in to the coordination layer in order to extend the query framework system 10 functionality (examples are "disclosure avoidance" provider or Custom Add-on Filters).
[0027] During the query planning phase of query processing, the coordinator 101 invokes providers according to the query processing sequence stored in the configuration repository 150. The coordinator 101 extracts the query portion supported by a given provider according to the operation support table of this provider. If the extracted query portion is empty the provider is not invoked. The coordinator 101 then sends the extracted portion to the provider planning method and incorporates the result of the planning back into the query. One or more passes over the query processing sequence are expected to result into a completely planned query comprising a query execution plan, the tree of executable operations.
[0028] The execution plan is used by the coordinator 101 in order to guide the provider invocation during the execution phase of query processing. The coordinator 101 first invokes providers responsible for the leaf nodes of the execution plan tree.
The coordinator 101 sends the executable operation definition to the execution method of corresponding provider and collects result datasets the provider returns. The collected result master datasets are passed by the coordinator to the providers higher in the execution plan tree as non-leaf nodes in the execution plan denote operations applied on top of one or more input master data streams.
[0029] Using this mechanism the coordinator 1 O 1 distributes parts of the decomposed query among one or more planner providers for execution. The planner providers execute the query with the help of data providers and return master datasets pointers to the coordinator 1 O 1. The coordinator 101 then distributes the other parts of the decomposed query among service providers, passing them pointers to the master datasets as input stream. The service providers apply local query operations on these input streams and return result master datasets to the query framework service 100, which in turn, returns them to the consumer application that submitted the original BIQuery for execution. This final result set returned to the client could span multiple planners and data source types.
[0030] Preferably, the coordinator 101 shares the same interface, the provider API 12, with all providers in the query framework 10. The coordinator 101 acts as any other component in the system. Hence as any other component it can be replaced if needed.
Adherence to the same interface also allows for recursive calls from other providers needed in the some cases in order to force execution of the BIQuery portions that have to be processed out of the ordinary sequence.
[0031 ] The coordination layer of the query framework system architecture is the layer composed of the coordinator 1 O l and the available query transformation providers (which encapsulate query refinement, decomposition, and optimization functionality) and the available service providers (which encapsulate local execution, tabular operations, and multi-cube joining functionality). The coordination layer of the query framework 10 breaks any client dependency on the data source technologies available underneath the query framework service 100. In other words, the coordination layer provides data agnostic query capabilities that can span relational, multidimensional, and proprietary vendor data sources to clients of the query framework system 10 that use high level and simple BIQuery semantics. Data agnostic query capability is the main role of the query framework coordination layer. Another important role is the centralization of common query operations across all types of planner providers. Common functionality that would have been implemented repeatedly by each planner provider should be factored out to this layer.
System supporting data agnostic BIQueries.
[0032] The following is an example of a set of providers that comprises a query framework system capable of handling data agnostic BI queries.
[0033] This system can include a relational query planner 110, an OLAP query planner 111, a multidimensional operation decomposition provider 121, a local tabular operation provider 122, a tabular function decomposition provider 123, a tabular operation decomposition provider 124, a master detail provider 127, a report mode provider 128, and a cube builder provider 129.
[0034] The relational query planner 110 is to handle query operation that can be converted into SQL statements. The relational query provider 110 sends its queries to relational data sources.
[0035] The OLAP query provider 111 handles query operations that can be converted into MDX statements. The OLAP query provider 111 sends its queries to OLAP
data sources.
[0036] The multidimensional operation decomposition provider 121 performs a decomposition of queries containing OLAP operations on top of relational datasources. It may decompose a query into 4 sub-queries: a sub-query that contains only relational operations, a sub-query responsible for the pivoting operation, a sub-query containing a OLAP functions, and a sub-query providing a final post-processing of the result of first three sub-queries.
[0037] The cube builder provider 129 performs a pivoting operation of transforming a tabular stream into a cube dynamically created to represent pivoted dataset.
[0038] The local tabular operation provider 122 can apply SQL operations to an input data stream. This provider can be implemented as an extension of the functionality of the relational query provider 111 that treats an input data stream as a special relational datasource.
[0039] The tabular operation decomposition 124 provider is responsible for handling operations over tabular streams, like union, except or intersect on top of OLAP
datasources that do not support these operations natively.
[0040] The tabular function decomposition provider 123 is responsible for decomposing the queries containing SQL-like functions on top of OLAP datasources that do not support these operations natively.
[0041 ] The master detail provider 127 is to handle master-detail links between two independents queries. The master detail provider as well as the tabular operation decomposition provider allow combining data from relational and OLAP
datasources.
[0042] The report mode provider 128 extends the OLAP functionalities by lifting limitation of MDX query language: unioning of members from different dimensions, projecting the same dimension on two or more edges, and unioning member tuples of different size.
[0043] The functionality of the described providers can be rearranged in a different set of component by combining the feature of several providers into a single one or conversely by distributing a functionality of a single provider to a set of several components.
[0044] Other components may be added to the system 100, including a no data mode provider, a translator, a post-processor, a metadata query provider 130, a drill through provider 131, and an OLAP data provider 132. Figure 3 shows in a block diagram a more detailed example of a query framework system 10.
Provider API 12 and Operation Support Table 15 [0045] The provider API 12 and the operation support table (OST) 15 are central to the query framework 10 architecture. In particular, the Prepare() method of the provider API
12 and the OST 15 are be the basis for the query planning process.
[0046] The operation support table 15 is associated with every relevant provider. It defines which operations from a predefined set are supported by a given provider.
Operations are recognizable patterns in a tree of query blocks or in a tree of a query block specification. Operations are associated with a given node of a corresponding tree and searched for in the context of this node. The coordinator defines these patterns and provides the logic of spotting them in a query block tree and reporting "supported/unsupported" association of an operation with each provider.
[0047] The operation support table 15 allows for an initial distribution of the operations involved in the query between providers participating in the query framework.
In most use cases that are handled by a query engine, this initial distribution based on the data source type will be sufficient. That is, all current queries involving only dbQuery subjects and current relational expressions are assigned to the relational planner based on the corresponding data source type in the OST 15. Also, all current queries involving mdQuery subjects and dimensional expressions will be assigned to the OLAP
planner based on the corresponding data source type in the OST 15.
Resource Pool [0048] The coordinator 101 owns two resources pools: a request resource pool and a connection resource pools. These pools contains resources that can be reused between different stages of query processing and shared across different provider participating in an execution of the a given request or associated with a given connection.
Request resource pool contains resources pertaining to the same client query set to the query framework system. Connection resource pool contains resources that depends only on the properties of the connection and hence can be reused for multiple requests performed though the connection. The resource pools are accessible for all providers to add and read instances of the resources. The request resource pool is destroyed by the coordinator 101 once the input request is completely processed. The connection resource pool get destroyed when the connection is closed. Resources are classified into types.
A resource type denotes the semantics of a resource items and also is associated with the interface class used to access the content of a resource item. Some resource item types can be predefined in a query processing system for resources needed by every provider. Other types are specific for a given provider and are dynamically generated.
Example Provider API
[0049] Figure 4 shows a high level view of an example of a design of a planner, data, or service provider within the query framework system 10. Interface classes are implemented in the query framework layer; the boxes are classes implemented by the provider and inherit the QF base implementation) [0050] Each provider implements the following methods:
ProviderInitialize~
NewConnectionObject() ProviderTerminate() ProviderCompatibleWith() ProviderGetVersion() SupportedAPIVersion() They are used to initialize the provider, create a new connection object, and shut provider down, respectively.
[0051 ] Additionally, each provider implements the following classes:
ProviderQFSConnectiQn ProviderQFSMetadataQuery ProviderQFSQuery ProviderQFSMasterDataset ProviderQFSPartialDataset ProviderQFSEdgeIterator ProviderQFSCeIIIterator ProviderQFSRowCopy [0052] In each name, prefix Provider is replaced with abbreviation that uniquely identifies the provider type. Each such class inherits from the standard Query Framework Service (QFS) class with a similar name less the prefix Provider. For example, VendorQFSConnection inherits from QFSConnection.
[0053] Each method that a provider has to implement is defined as "abstract"
in its parent class and has a provider prefix in its name. For example, QFSMetadataQuery requires that the related provider class implements ProviderExcute() method. Provider is not allowed to overload other methods from the parent class.
[0054] Methods with the provider prefix are called by equivalent non-abstract methods of the parent, giving the provider a chance to perform processing that is provider-specific.
For example, ProviderQFSConnection:: ProviderDiscover() is called by method QFSConnection::Discover().
[0055] The following is a list of methods that are implemented in every provider:
Class ~~ Method' Pur ose <global members> NewConnectionObject() Instantiates new connection object Providerlnitialize() Initializes provider (run once upon loading of the shared librar ) ProviderTerminate~ Un-initializes provider (run once upon unloading of the shared library) ProviderCompatibleWith(7Checks if provider is compatible with specified provider version ProviderGetVersion() Retrieves provider version SupportedAPIYersion() Retrieves supported QFS

version ProviderQFSConnection ProviderConnect() Requests a new connection to the data provider ProviderClose~ Closes current connection
11 ProviderDiscover() Reports provider properties ProviderTestQ Tests data source Connection ProviderNewMetadataQuery~Instantiates new metadata query ob' ect ProviderNewQuery() Instantiates new data query object ProviderQFSMetadataQueryProviderExecute~ Executes metadata query ProviderCancel() Cancels metadata uery ProviderCloseQ Closes metadata query ob-ect ProviderQFSQuery ProviderPre areQ Plan query PrividerExecute() Execute data quer ProviderCancel() Cancels data query ProviderClose() Closes data query object ProviderQFSMasterDatasetProviderGetDatasetlnfo()Retrieves general master dataset info ProviderGetPartialDataset~Requests artial dataset ProviderQFSPartialDatasetProviderGetEdgelterator~Retrieves edge iterator ProviderGetCelllterator()Retrieves cell iterator ProviderQFSCeIIIteratorProviderGetNumRows() Retrieves number of rows in iterator ProviderGetRowsetld() Retrieves offset of rowset descriptor ProviderGetPosition~ Returns current iterator osition ProviderNext() Positions iterator at the next row ProviderPrevious~ Positions iterator at the previous row ProviderGoto() Positions iterator at the specified row ProviderGetMemberOrdinal()Retrieves ordinal position of a member along an edge of a crosstab re ort ProviderGetEdgeOrdinal()Retrieves row's ordinal position from a cell rowset along the specified ed a ProviderGetSummaryLevel()Retrieves summary level of the current row ProviderGetColumn() Retrieves column value and state ProvideYGetColumnAsT~ariant()Retrieves column value and state ProviderGetRow() Retrieves entire current row
12 ProviderGetRowCopy~ Saves a the row of data for future use ProviderQFSSetIterator ProviderGetNumRows~ Retrieves number of rows in iterator ProviderGetRowsetld() Retrieves offset of rowset descri for ProviderGetPosition() Returns current iterator position ProviderNextQ Positions iterator at the next row ProviderPrevious~ Positions iterator at the revious row ProviderGoto() Positions iterator at the s ecified row ProviderGetMemberOrdinal()Retrieves ordinal position of a member along an ed a of a crosstab re ort ProviderGetEdgeOrdinal()Retrieves row's ordinal position from a cell rowset along the specified edge ProviderGetSummaryLevel~Retrieves summary level of the current row ProviderGetColumn() Retrieves column value and state ProviderGetColumnAsvariant()Retrieves column value and state ProviderGetRow() Retrieves entire current row ProviderGetRowCopy() Saves a the row of data for future use ProviderGetNumChildren()Returns number of children for the current member ProviderToChildren() Positions iterator on the first child of the current member ProviderToParent() Positions iterator on the parent of the current member ProviderQFSRowCopy ProviderGetRowsetld() Retrieves offset of rowset descriptor ProviderGetPosition~ Returns current iterator position ProviderGetMemberOrdinal()Retrieves ordinal position of a member along an ed a o a crosstab re ort
13 ProviderGetEdgeOrdinal()Retrieves row's ordinal position from a cell rowset along the specified edge ProviderGetSummaryLevel()Retrieves summary level of the current row ProviderGetColumn() Retrieves column value and state ProviderGetColumnAsl~ariant()Retrieves column value and state ProviderGetRow() Retrieves entire current row [0056] Provider authors can use the provided GenericProvider as the basis of their implementation and for more detailed information on methods and classes that are implemented.
[0057] Preferably, Provider is packaged in a form of shared library/DLL that is loaded by QFS on demand. Provider remains loaded in memory as long as there is at least one connection open to it. After all connections to the provider are closed, provider library is unloaded by QFS.
[0058] Figure 5 illustrates a sample scenario of a Provider use, in accordance with an embodiment of the query framework system 10. When provider is loaded, its ::ProviderInitialize() method is called. This gives provider a chance to execute certain operations once. This method is called only once.
[0059] Every time QFS needs to open a new connection it calls provider method NewConnectionObject(), which returns a new instance of a connection object, ProviderQFSConnection. At this point connection to data source has not yet been established. It is opened when ProviderQFSConnection::ProviderConnect() method is called. Concurrent calls can be made to this method from multiple threads and so it must be implemented to be thread-safe.
[0060] After connection is initialized it is used to query provider for properties (ProviderQFSConnection:: ProviderDiscover()), and create new metadata and data query objects (ProviderQFSConnection:: ProviderNewMetadataQuery() and ProviderQFSConnection::ProviderNewQuery()).
14 ProviderQFSMetadataQuery object is used to execute metadata quexies on the data source. ProviderQFSMetadataQuery::ProviderExecute() is called to perform actual execution.
[0061] ProviderQFSQuery object is used to execute data queries on the data source.
ProviderQFSQuery:: ProviderPrepare() is called during planning stage of query processing in order to invoke provider query planning logic.
ProviderQFSQuery::
ProviderExecute() is called to perform actual execution, and its result is an instance of ProviderQFSMasterDataset that is then used to retrieve results of query execution in binary form. ProviderQFSMasterDataset is used to request information about different portions of the result set in the form of ProviderQFSPartialDataset objects.
Each of those data sets is then queried using instances of ProviderQFSIterator.
[0062] ProviderClose() method is called to destroy connections and queries of both types.
[0063] When provider is no longer needed it is unloaded from memory. At this moment its ::ProviderTerminate() method is executed, giving it a chance to free system resources.
This method is called only once.
[0064] ProviderCancel() method is used to cancel query that is currently being executed.
This method is called at any point in time during query processing from a separate thread and therefore must be thread-safe.
[0065] The systems and methods according to the present invention may be implemented by any hardware, software or a combination of hardware and software having the functions described above. The software code, either in its entirety or a part thereof, may be stored in a computer readable memory. Further, a computer data signal representing the software code that may be embedded in a carrier wave may be transmitted via a communication network. Such a computer readable memory and a computer data signal are also within the scope of the present invention, as well as the hardware, software and the combination thereof.
[0066] While particular embodiments of the present invention have been shown and described, changes and modifications may be made to such embodiments without departing from the true scope of the invention.

Claims (13)

WHAT IS CLAIMED IS:
1. A query framework system comprising:

a query framework service for consuming a query and returning a result set;

a provider application programming interface for defining a consistent application programming interface for providers;

one or more providers incorporating query processing logic; and a configuration repository for storing the list of the providers in the query framework system and their properties.
2. The query framework system as claimed in claim 1, wherein the provider application programming interface includes query planning means for analyzing a designated query portion and for preparing an execution of the designated query portion;
3. The query framework system as recited in claim 1, wherein the provider application programming interface is data agnostic.
4. The query framework system as recited in claim 1, wherein the query service include a resource pool module for managing resources used by the one or more providers.
5. The query framework system as recited in claim 4, wherein the resource pool module registers, stores, providers access to and deletes the resources.
6. The query framework system as recited in claim 1, further comprising a coordination planner for organizing a processing of the query, the coordination planner implements the programming application interface.
7. The query framework system as recited in claim 6, wherein the coordination planner decomposes a query and distributes portions of the query to the one or more query providers.
8. The query framework system as recited in claim 6, wherein the coordination planner is built as a provider and uses the common provider application programming interface.
9. The query framework system as recited in claim 1, wherein the configuration repository includes an operation support table for describing operations supported by the one or more provider.
10. The query framework system as claimed in the claim 1, wherein the one or more providers are categorized into one or more groups for facilitating the provider design.
11. The query framework system as claimed in the claim 1, wherein the one or more providers includes one or more of:

query transformation providers for performing query transformation during query planning phase of query execution;

query planning providers for providing access to external data sources by translating the query portion to a query language of the underlying data source.
12. The query framework system as claimed in claim 11, wherein service providers are responsible for the portion of a query not supported by planner providers and perform designated operation as a post-processing of data returned by planner providers.
13. The query framework system as claimed in claim 1, wherein the one or more providers comprises one or more of:

a relational query planner for handling query operations that can be converted into SQL statements and sends its queries to relational data sources;

an OLAP query planner for handling query operations that can be converted into MDX statements and sends its queries to OLAP data sources;

a local tabular operation and function provider for applying SQL operations to an input data stream;

a multidimensional operation decomposition provider for performing a decomposition of queries containing OLAP operations on top of relational datasources;

a tabular operation decomposition provider for handling operations over tabular streams, like union, except or intersect on top of OLAP datasources that do not support these operations natively;

a tabular function decomposition provider for decomposing the queries containing SQL-like functions on top of OLAP datasources that do not support these operations natively;

a master detail provider for handling master-detail links between two independents queries;

a report mode provider extending the OLAP functionalities by lifting limitation of MDX query language: unioning of members from different dimensions, projecting the same dimension on two or more edges, and unioning member tuples of different size; and a cube builder provider for performing a pivoting operation of transforming a tabular stream into a cube dynamically created to represent pivoted dataset.
CA002550832A 2005-09-13 2006-06-22 Query framework system and method Abandoned CA2550832A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA002550832A CA2550832A1 (en) 2005-09-13 2006-06-22 Query framework system and method

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CA002519010A CA2519010A1 (en) 2005-09-13 2005-09-13 Query framework system and method
CA2,519,010 2005-09-13
CA002550832A CA2550832A1 (en) 2005-09-13 2006-06-22 Query framework system and method

Publications (1)

Publication Number Publication Date
CA2550832A1 true CA2550832A1 (en) 2007-03-13

Family

ID=37872093

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002550832A Abandoned CA2550832A1 (en) 2005-09-13 2006-06-22 Query framework system and method

Country Status (1)

Country Link
CA (1) CA2550832A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2003573A1 (en) * 2007-06-11 2008-12-17 Sap Ag Enhanced widget composition platform

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2003573A1 (en) * 2007-06-11 2008-12-17 Sap Ag Enhanced widget composition platform
US8578330B2 (en) 2007-06-11 2013-11-05 Sap Ag Enhanced widget composition platform
US9021423B2 (en) 2007-06-11 2015-04-28 Sap Se Enhanced widget composition platform

Similar Documents

Publication Publication Date Title
US7716212B2 (en) Query framework system and method
US8935273B2 (en) Method of processing and decomposing a multidimensional query against a relational data source
DeWitt et al. Split query processing in polybase
CA2318299C (en) Metadata exchange
Thusoo et al. Hive-a petabyte scale data warehouse using hadoop
US7676493B2 (en) Incremental approach to an object-relational solution
US8918388B1 (en) Custom data warehouse on top of mapreduce
US20040064456A1 (en) Methods for data warehousing based on heterogenous databases
US20080183725A1 (en) Metadata service employing common data model
US20070027849A1 (en) Integrating query-related operators in a programming language
US20080319957A1 (en) Extensible command trees for entity data model platform
US8122044B2 (en) Generation of business intelligence entities from a dimensional model
US9767151B2 (en) Optimizing database queries having hierarchy filters
US20070061305A1 (en) System and method of providing date, arithmetic and other relational functions for OLAP sources
US9146979B2 (en) Optimization of business warehouse queries by calculation engines
CA2534257A1 (en) Storage api for a common data platform
Wang et al. An IDEA: an ingestion framework for data enrichment in AsterixDB
Messina et al. Keep it simple, fast and scalable: a Multi-Model NoSQL DBMS as an (eb) XML-over-SOAP service
Gadepally et al. Version 0.1 of the bigdawg polystore system
CA2550832A1 (en) Query framework system and method
Gadepally et al. BigDAWG version 0.1
US8271463B2 (en) System and method for providing access to data with user defined table functions
Savinov DataCommandr: Column-Oriented Data Integration, Transformation and Analysis
US20220012238A1 (en) Datacube access connectors
Muller Software engineering for the industrial Internet: Situation-aware smart applications

Legal Events

Date Code Title Description
EEER Examination request
FZDE Dead