WO2004012061A3 - Consistent message ordering for semi-active and passive replication - Google Patents

Consistent message ordering for semi-active and passive replication Download PDF

Info

Publication number
WO2004012061A3
WO2004012061A3 PCT/US2003/023778 US0323778W WO2004012061A3 WO 2004012061 A3 WO2004012061 A3 WO 2004012061A3 US 0323778 W US0323778 W US 0323778W WO 2004012061 A3 WO2004012061 A3 WO 2004012061A3
Authority
WO
WIPO (PCT)
Prior art keywords
message ordering
semi
active
passive replication
consistent message
Prior art date
Application number
PCT/US2003/023778
Other languages
French (fr)
Other versions
WO2004012061A2 (en
Inventor
Louise E Moser
Peter M Melliar-Smith
Original Assignee
Eternal Systems Inc
Louise E Moser
Peter M Melliar-Smith
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Eternal Systems Inc, Louise E Moser, Peter M Melliar-Smith filed Critical Eternal Systems Inc
Priority to AT03772073T priority Critical patent/ATE552555T1/en
Priority to EP03772073A priority patent/EP1543420B1/en
Priority to AU2003259297A priority patent/AU2003259297A1/en
Publication of WO2004012061A2 publication Critical patent/WO2004012061A2/en
Publication of WO2004012061A3 publication Critical patent/WO2004012061A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2097Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2038Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant with a single idle spare processing component
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2048Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant where the redundant components share neither address space nor persistent storage
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H04L12/1881Arrangements for providing special services to substations for broadcast or conference, e.g. multicast with schedule organisation, e.g. priority, sequence management
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/40Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass for recovering from a failure of a protocol instance or entity, e.g. service redundancy protocols, protocol state redundancy or protocol service redirection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/82Solving problems relating to consistency
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/02Details
    • H04L12/16Arrangements for providing special services to substations
    • H04L12/18Arrangements for providing special services to substations for broadcast or conference, e.g. multicast
    • H04L12/1863Arrangements for providing special services to substations for broadcast or conference, e.g. multicast comprising mechanisms for improved reliability, e.g. status reports

Abstract

Mechanisms for achieving consistent message ordering within a fault-tolerant distributed computer system based on semi-active or passive replication are described. The mechanisms communicate message ordering information from the primary replica to its backup replicas in such a way as to minimize the end-to-end request/response time, to minimize the number of additional messages that are multicast, and to ensure that, in the event of a fault, a backup replica has, or can obtain, the messages and the message ordering information that it needs to reproduce the actions of the primary replica.
PCT/US2003/023778 2002-07-29 2003-07-29 Consistent message ordering for semi-active and passive replication WO2004012061A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
AT03772073T ATE552555T1 (en) 2002-07-29 2003-07-29 UNIFORM MESSAGE ARRANGEMENT FOR SEMIACTIVE AND PASSIVE DUPLICATION
EP03772073A EP1543420B1 (en) 2002-07-29 2003-07-29 Consistent message ordering for semi-active and passive replication
AU2003259297A AU2003259297A1 (en) 2002-07-29 2003-07-29 Consistent message ordering for semi-active and passive replication

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US39958002P 2002-07-29 2002-07-29
US60/399,580 2002-07-29

Publications (2)

Publication Number Publication Date
WO2004012061A2 WO2004012061A2 (en) 2004-02-05
WO2004012061A3 true WO2004012061A3 (en) 2004-04-22

Family

ID=31188598

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/023778 WO2004012061A2 (en) 2002-07-29 2003-07-29 Consistent message ordering for semi-active and passive replication

Country Status (5)

Country Link
US (1) US6928577B2 (en)
EP (1) EP1543420B1 (en)
AT (1) ATE552555T1 (en)
AU (1) AU2003259297A1 (en)
WO (1) WO2004012061A2 (en)

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7107355B2 (en) * 2002-02-11 2006-09-12 Sun Microsystems, Inc. High availability lightweight directory access protocol service
US7231554B2 (en) * 2002-03-25 2007-06-12 Availigent, Inc. Transparent consistent active replication of multithreaded application programs
US7305582B1 (en) * 2002-08-30 2007-12-04 Availigent, Inc. Consistent asynchronous checkpointing of multithreaded application programs based on active replication
US7206964B2 (en) * 2002-08-30 2007-04-17 Availigent, Inc. Consistent asynchronous checkpointing of multithreaded application programs based on semi-active or passive replication
US7188273B2 (en) * 2003-11-24 2007-03-06 Tsx Inc. System and method for failover
US7526672B2 (en) 2004-02-25 2009-04-28 Microsoft Corporation Mutual exclusion techniques in a dynamic peer-to-peer environment
US20110082928A1 (en) 2004-10-22 2011-04-07 Microsoft Corporation Maintaining consistency within a federation infrastructure
US20080288659A1 (en) * 2006-11-09 2008-11-20 Microsoft Corporation Maintaining consistency within a federation infrastructure
US7337350B2 (en) * 2005-02-09 2008-02-26 Hitachi, Ltd. Clustered storage system with external storage systems
US8214353B2 (en) * 2005-02-18 2012-07-03 International Business Machines Corporation Support for schema evolution in a multi-node peer-to-peer replication environment
US8037056B2 (en) 2005-02-18 2011-10-11 International Business Machines Corporation Online repair of a replicated table
US7376675B2 (en) * 2005-02-18 2008-05-20 International Business Machines Corporation Simulating multi-user activity while maintaining original linear request order for asynchronous transactional events
US9286346B2 (en) * 2005-02-18 2016-03-15 International Business Machines Corporation Replication-only triggers
US8316129B2 (en) 2005-05-25 2012-11-20 Microsoft Corporation Data communication coordination with sequence numbers
US7802257B1 (en) * 2005-06-20 2010-09-21 Oracle America, Inc. Mechanism for bridging a thread-oriented computing paradigm and a job-oriented computing paradigm
US9043640B1 (en) * 2005-08-26 2015-05-26 Open Invention Network, LLP System and method for event-driven live migration of multi-process applications
US8301700B1 (en) 2010-08-06 2012-10-30 Open Invention Network Llc System and method for event-driven live migration of multi-process applications
US8281184B1 (en) 2010-08-06 2012-10-02 Open Invention Network Llc System and method for reliable non-blocking messaging for multi-process application replication
US8584145B1 (en) 2010-08-06 2013-11-12 Open Invention Network, Llc System and method for dynamic transparent consistent application-replication of multi-process multi-threaded applications
US9141481B1 (en) 2010-08-06 2015-09-22 Open Invention Network, Llc System and method for reliable non-blocking messaging for multi-process application replication
US8589953B1 (en) * 2010-08-06 2013-11-19 Open Invention Network, Llc System and method for transparent consistent application-replication of multi-process multi-threaded applications
US8621275B1 (en) 2010-08-06 2013-12-31 Open Invention Network, Llc System and method for event-driven live migration of multi-process applications
US7725764B2 (en) 2006-08-04 2010-05-25 Tsx Inc. Failover system and method
US20080059469A1 (en) * 2006-08-31 2008-03-06 International Business Machines Corporation Replication Token Based Synchronization
WO2008105030A1 (en) * 2007-02-28 2008-09-04 Fujitsu Limited Backup device
US7631214B2 (en) * 2007-05-31 2009-12-08 International Business Machines Corporation Failover processing in multi-tier distributed data-handling systems
US8218549B2 (en) * 2007-06-18 2012-07-10 International Business Machines Corporation Synchronization of message stream in a multi-tier messaging system
US8073922B2 (en) * 2007-07-27 2011-12-06 Twinstrata, Inc System and method for remote asynchronous data replication
US8756204B2 (en) * 2008-01-08 2014-06-17 Microsoft Corporation Asynchronous multi-level undo support in javascript grid
AT507204B1 (en) * 2008-10-09 2010-03-15 Frequentis Ag METHOD AND APPENDIX FOR DISTRIBUTING INSERTED DATA
US8650571B2 (en) * 2009-03-30 2014-02-11 Hewlett-Packard Development Company, L.P. Scheduling data analysis operations in a computer system
US8352482B2 (en) * 2009-07-21 2013-01-08 Vmware, Inc. System and method for replicating disk images in a cloud computing based virtual machine file system
US8762340B2 (en) 2010-05-14 2014-06-24 Salesforce.Com, Inc. Methods and systems for backing up a search index in a multi-tenant database environment
US9135127B1 (en) 2010-08-06 2015-09-15 Open Invention Network, Llc System and method for dynamic transparent consistent application-replication of multi-process multi-threaded applications
US8589732B2 (en) 2010-10-25 2013-11-19 Microsoft Corporation Consistent messaging with replication
US8631277B2 (en) 2010-12-10 2014-01-14 Microsoft Corporation Providing transparent failover in a file system
US9331955B2 (en) 2011-06-29 2016-05-03 Microsoft Technology Licensing, Llc Transporting operations of arbitrary size over remote direct memory access
US8856582B2 (en) 2011-06-30 2014-10-07 Microsoft Corporation Transparent failover
US8788579B2 (en) 2011-09-09 2014-07-22 Microsoft Corporation Clustered client failover
US20130067095A1 (en) 2011-09-09 2013-03-14 Microsoft Corporation Smb2 scaleout
US9319267B1 (en) * 2012-10-04 2016-04-19 Solace Systems, Inc. Replication in assured messaging system
WO2014197963A1 (en) * 2013-06-13 2014-12-18 Tsx Inc. Failover system and method
US9569517B1 (en) * 2013-11-27 2017-02-14 Google Inc. Fault tolerant distributed key-value storage
US10348840B2 (en) * 2017-01-16 2019-07-09 International Business Machines Corporation Dynamic workflow control between network entities
US10938750B2 (en) 2019-03-18 2021-03-02 Advanced New Technologies Co., Ltd. Consensus system downtime recovery
JP6880227B2 (en) * 2019-03-18 2021-06-02 アドバンスド ニュー テクノロジーズ カンパニー リミテッド Recovery of consensus system downtime
SG11201908544UA (en) 2019-03-18 2019-10-30 Alibaba Group Holding Ltd Consensus system downtime recovery
US11803317B2 (en) 2020-12-15 2023-10-31 International Business Machines Corporation Interrupted replicated write recognition

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4725834A (en) * 1984-02-27 1988-02-16 American Telephone And Telegraph Company, At&T Bell Laboratories Reliable broadcast protocol for a token passing bus network
US5799146A (en) * 1996-04-30 1998-08-25 International Business Machines Corporation Communications system involving groups of processors of a distributed computing environment
US6035415A (en) * 1996-01-26 2000-03-07 Hewlett-Packard Company Fault-tolerant processing method
US6178441B1 (en) * 1998-09-21 2001-01-23 International Business Machines Corporation Method and system in a computer network for the reliable and consistent ordering of client requests
US6247141B1 (en) * 1998-09-24 2001-06-12 Telefonaktiebolaget Lm Ericsson (Publ) Protocol for providing replicated servers in a client-server system
US6671821B1 (en) * 1999-11-22 2003-12-30 Massachusetts Institute Of Technology Byzantine fault tolerance

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2894676B2 (en) * 1994-03-21 1999-05-24 インターナショナル・ビジネス・マシーンズ・コーポレイション Asynchronous remote copy system and asynchronous remote copy method
US5941999A (en) * 1997-03-31 1999-08-24 Sun Microsystems Method and system for achieving high availability in networked computer systems
US6338092B1 (en) * 1998-09-24 2002-01-08 International Business Machines Corporation Method, system and computer program for replicating data in a distributed computed environment
US6823474B2 (en) * 2000-05-02 2004-11-23 Sun Microsystems, Inc. Method and system for providing cluster replicated checkpoint services
DE60125400D1 (en) * 2000-10-27 2007-02-01 Availigent Inc ERROR TOLERANCE FOR COMPUTER PROGRAMS OPERATED VIA A COMMUNICATION NETWORK
US6335415B1 (en) * 2001-01-30 2002-01-01 Council Of Scientific & Industrial Research Process for the preparation of a polyester
US7231554B2 (en) * 2002-03-25 2007-06-12 Availigent, Inc. Transparent consistent active replication of multithreaded application programs

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4725834A (en) * 1984-02-27 1988-02-16 American Telephone And Telegraph Company, At&T Bell Laboratories Reliable broadcast protocol for a token passing bus network
US6035415A (en) * 1996-01-26 2000-03-07 Hewlett-Packard Company Fault-tolerant processing method
US5799146A (en) * 1996-04-30 1998-08-25 International Business Machines Corporation Communications system involving groups of processors of a distributed computing environment
US6178441B1 (en) * 1998-09-21 2001-01-23 International Business Machines Corporation Method and system in a computer network for the reliable and consistent ordering of client requests
US6247141B1 (en) * 1998-09-24 2001-06-12 Telefonaktiebolaget Lm Ericsson (Publ) Protocol for providing replicated servers in a client-server system
US6671821B1 (en) * 1999-11-22 2003-12-30 Massachusetts Institute Of Technology Byzantine fault tolerance

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KAASHOEK M.F., TANENBAUM A.S.: "Group communication in the amoeba distributed operating system", PROCEEDINGS OF THE IEEE 11TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ARLINGTON, TX, May 1991 (1991-05-01), pages 222 - 230, XP000221860 *
MOSER L.E. ET AL.: "Totem: a fault-tolerant multicast group communication system", COMMUNICATIONS OF ACM, vol. 39, no. 4, April 1996 (1996-04-01), pages 54 - 63, XP000585187 *

Also Published As

Publication number Publication date
EP1543420B1 (en) 2012-04-04
AU2003259297A8 (en) 2004-02-16
AU2003259297A1 (en) 2004-02-16
US20040103342A1 (en) 2004-05-27
WO2004012061A2 (en) 2004-02-05
US6928577B2 (en) 2005-08-09
EP1543420A4 (en) 2007-03-28
ATE552555T1 (en) 2012-04-15
EP1543420A2 (en) 2005-06-22

Similar Documents

Publication Publication Date Title
WO2004012061A3 (en) Consistent message ordering for semi-active and passive replication
WO2001084314A3 (en) Method and system for providing cluster replicated checkpoint services
DE69800808T2 (en) Redundant, distributed network system
AU3869600A (en) Data distribution in a server cluster
WO2002089341A3 (en) System and method for providing access to resources using a fabric switch
BR0309363A (en) Disaster Recovery Method and System
WO2004004236A3 (en) Portal for distributing business and product information
FI945627A (en) Restoring the home registry of a mobile communication system
WO2002061612A3 (en) Data structure for information systems
WO2006026420A3 (en) Automated failover in a cluster of geographically dispersed server nodes using data replication over a long distance communication link
CA2323106A1 (en) File server storage arrangement
WO2004025404A3 (en) Method and apparatus for server share migration and server recovery using hierarchical storage management
AU2003271982A1 (en) Method and means for tolerating multiple dependent or arbitrary double disk failures in a disk array
WO2003063430A3 (en) System and method for providing a fault tolerant routing data base
AU3729500A (en) Method and system for consistent cluster operational data in a server cluster using a quorum of replicas
TW200515140A (en) System and method of relational configuration mirroring
WO2006073847A3 (en) Systems and methods for dynamic data backup
WO2005025111A3 (en) Redundancy scheme for network processing systems
GB2430286B (en) System and method for information handling system image network communication
WO2004070521A3 (en) Alternate server system
ATE491990T1 (en) REDUNDANCY IN ARRAY STORAGE SYSTEMS
WO2006138308A3 (en) System and corresponding method for providing redundant storage of a data file over a computer network
WO2001042962A3 (en) Method, system, and apparatus for providing message data regarding events associated with websites
AU4911401A (en) Non-fault tolerant network nodes in a multiple fault tolerant network
WO2006017199A3 (en) Autonomous service backup and migration

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003772073

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2003772073

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP