GLORIA — GEOMAR Library Ocean Research Information Access

Hits per page

hits 1 - 7 | 7 hits

Sorting

Online Resource

Middleware infrastructure for parallel and distributed programming models in heterogeneous systems

Al-Jaroodi, J. ; Mohamed, N. ; Hong Jiang ; [et al.]

Institute of Electrical and Electronics Engineers (IEEE) ; 2003

In: IEEE Transactions on Parallel and Distributed Systems Vol. 14, No. 11 ( 2003-11), p. 1100-1111

add to mindlist on the mindlist

Details

In: IEEE Transactions on Parallel and Distributed Systems, Institute of Electrical and Electronics Engineers (IEEE), Vol. 14, No. 11 ( 2003-11), p. 1100-1111

Type of Medium: Online Resource

ISSN: 1045-9219

URL: Article

DOI: 10.1109/TPDS.2003.1247671

RVK:

SQ 1100

RVK:

SA 1000

Language: English

Publisher: Institute of Electrical and Electronics Engineers (IEEE)

Publication Date: 2003

detail.hit.zdb_id: 2027774-X

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Opportunistic Transient-Fault Detection

Gomaa, Mohamed A. ; Vijaykumar, T. N.

Association for Computing Machinery (ACM) ; 2005

In: ACM SIGARCH Computer Architecture News Vol. 33, No. 2 ( 2005-05), p. 172-183

add to mindlist on the mindlist

Details

In: ACM SIGARCH Computer Architecture News, Association for Computing Machinery (ACM), Vol. 33, No. 2 ( 2005-05), p. 172-183

Abstract: CMOS scaling increases susceptibility of microprocessors to transient faults. Most current proposals for transient-fault detection use full redundancy to achieve perfect coverage while incurring significant performance degradation. However, most commodity systems do not need or provide perfect coverage. A recent paper explores this leniency to reduce the soft-error rate of the issue queue during L2 misses while incurring minimal performance degradation. Whereas the previous paper reduces soft-error rate without using any redundancy, we target better coverage while incurring similarly-minimal performance degradation by opportunistically using redundancy. We propose two semi-complementary techniques, called partial explicit redundancy (PER) and implicit redundancy through reuse (IRTR), to explore the trade-off between soft-error rate and performance. PER opportunistically exploits low-ILP phases and L2 misses to introduce explicit redundancy with minimal performance degradation. Because PER covers the entire pipeline and exploits not only L2 misses but all low-ILP phases, PER achieves better coverage than the previous work. To achieve coverage in high-ILP phases as well, we propose implicit redundancy through reuse (IRTR). Previous work exploits the phenomenon of instruction reuse to avoid redundant execution while falling back on redundant execution when there is no reuse. IRTR takes reuse to the extreme of performance-coverage trade-off and completely avoids explicit redundancy by exploiting reuseýs implicit redundancy within the main thread for fault detection with virtually no performance degradation. Using simulations with SPEC2000, we show that PER and IRTR achieve better trade-off between soft-error rate and performance degradation than the previous schemes.

Type of Medium: Online Resource

ISSN: 0163-5964

URL: Article

DOI: 10.1145/1080695.1069985

RVK:

SS 1985

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2005

detail.hit.zdb_id: 2088489-8

detail.hit.zdb_id: 186012-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Learning from obstructions: An effective deep learning approach for minimum vertex cover

Abu-Khzam, Faisal N. ; Abd El-Wahab, Mohamed M. ; Haidous, Moussa ; [et al.]

Springer Science and Business Media LLC ; 2022

In: Annals of Mathematics and Artificial Intelligence

add to mindlist on the mindlist

Details

In: Annals of Mathematics and Artificial Intelligence, Springer Science and Business Media LLC

Type of Medium: Online Resource

ISSN: 1012-2443 , 1573-7470

URL: Article

DOI: 10.1007/s10472-022-09813-2

RVK:

ST 300

Language: English

Publisher: Springer Science and Business Media LLC

Publication Date: 2022

detail.hit.zdb_id: 1045926-1

detail.hit.zdb_id: 2002961-5

SSG: 17,1

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Transient-fault recovery for chip multiprocessors

Gomaa, Mohamed ; Scarbrough, Chad ; Vijaykumar, T. N. ; [et al.]

Association for Computing Machinery (ACM) ; 2003

In: ACM SIGARCH Computer Architecture News Vol. 31, No. 2 ( 2003-05), p. 98-109

add to mindlist on the mindlist

Details

In: ACM SIGARCH Computer Architecture News, Association for Computing Machinery (ACM), Vol. 31, No. 2 ( 2003-05), p. 98-109

Abstract: To address the increasing susceptibility of commodity chip multiprocessors (CMPs) to transient faults, we propose Chiplevel Redundantly Threaded multiprocessor with Recovery (CRTR). CRTR extends the previously-proposed CRT for transient-fault detection in CMPs, and the previously-proposed SRTR for transient-fault recovery in SMT. All these schemes achieve fault tolerance by executing and comparing two copies, called leading and trailing threads, of a given application. Previous recovery schemes for SMT do not perform well on CMPs. In a CMP, the leading and trailing threads execute on different processors to achieve load balancing and reduce the probability of a fault corrupting both threads; whereas in an SMT, both threads execute on the same processor. The inter-processor communication required to compare the threads introduces latency and bandwidth problems not present in an SMT.To hide inter-processor latency, CRTR executes the leading thread ahead of the trailing thread by maintaining a long slack, enabled by asymmetric commit. CRTR commits the leading thread before checking and the trailing thread after checking, so that the trailing thread state may be used for recovery. Previous recovery schemes commit both threads after checking, making a long slack suboptimal. To tackle inter-processor bandwidth, CRTR not only increases the bandwidth supply by pipelining the communication paths, but also reduces the bandwidth demand. By reasoning that faults propagate through dependences, previously-proposed Dependence-Based Checking Elision (DBCE) exploits (true) register dependence chains so that only the value of the last instruction in a chain is checked. However, instructions that mask operand bits may mask faults and limit the use of dependence chains. We propose Death- and Dependence-Based Checking Elision (DDBCE), which chains a masking instruction only if the source operand of the instruction dies after the instruction. Register deaths ensure that masked faults do not corrupt later computation. Using SPEC2000, we show that CRTR incurs negligible performance loss compared to CRT for inter-processor (one-way) latency as high as 30 cycles, and that the bandwidth requirements of CRT and CRTR with DDBCE are 5.2 and 7.1 bytes/cycle, respectively.

Type of Medium: Online Resource

ISSN: 0163-5964

URL: Article

DOI: 10.1145/871656.859631

RVK:

SS 1985

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2003

detail.hit.zdb_id: 2088489-8

detail.hit.zdb_id: 186012-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Heat-and-run : leveraging SMT and CMP to manage power density through the operating system

Gomaa, Mohamed ; Powell, Michael D. ; Vijaykumar, T. N.

Association for Computing Machinery (ACM) ; 2004

In: ACM SIGARCH Computer Architecture News Vol. 32, No. 5 ( 2004-12), p. 260-270

add to mindlist on the mindlist

Details

In: ACM SIGARCH Computer Architecture News, Association for Computing Machinery (ACM), Vol. 32, No. 5 ( 2004-12), p. 260-270

Abstract: Power density in high-performance processors continues to increase with technology generations as scaling of current, clock speed, and device density outpaces the downscaling of supply voltage and thermal ability of packages to dissipate heat. Power density is characterized by localized chip hot spots that can reach critical temperatures and cause failure. Previous architectural approaches to power density have used global clock gating, fetch toggling, dynamic frequency scaling, or resource duplication to either prevent heating or relieve overheated resources in a superscalar processor. Previous approaches also evaluate design technologies where power density is not a major problem and most applications do not overheat the processor. Future processors, however, are likely to be chip multiprocessors (CMPs) with simultaneously-multithreaded (SMT) cores. SMT CMPs pose unique challenges and opportunities for power density. SMT and CMP increase throughput and thus on-chip heat, but also provide natural granularities for managing power-density. This paper is the first work to leverage SMT and CMP to address power density. We propose heat-and-run SMT thread assignment to increase processor-resource utilization before cooling becomes necessary by co-scheduling threads that use complimentary resources. We propose heat-and-run CMP thread migration to migrate threads away from overheated cores and assign them to free SMT contexts on alternate cores, leveraging availability of SMT contexts on alternate CMP cores to maintain throughput while allowing overheated cores to cool. We show that our proposal has an average of 9% and up to 34% higher throughput than a previous superscalar technique running the same number of threads.

Type of Medium: Online Resource

ISSN: 0163-5964

URL: Article

DOI: 10.1145/1037947.1024424

RVK:

SS 1985

Language: English

Publisher: Association for Computing Machinery (ACM)

Publication Date: 2004

detail.hit.zdb_id: 2088489-8

detail.hit.zdb_id: 186012-4

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Cancelable multi-biometric recognition system based on deep learning

Abdellatef, Essam ; Ismail, Nabil A. ; Abd Elrahman, Salah Eldin S. E. ; [et al.]

Springer Science and Business Media LLC ; 2020

In: The Visual Computer Vol. 36, No. 6 ( 2020-6), p. 1097-1109

add to mindlist on the mindlist

Details

In: The Visual Computer, Springer Science and Business Media LLC, Vol. 36, No. 6 ( 2020-6), p. 1097-1109

Type of Medium: Online Resource

ISSN: 0178-2789 , 1432-2315

URL: Article

DOI: 10.1007/s00371-019-01715-5

RVK:

SQ 1100

Language: English

Publisher: Springer Science and Business Media LLC

Publication Date: 2020

detail.hit.zdb_id: 1463287-1

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

Online Resource

Deep Convolutional Neural Networks for Large-scale Speech Tasks

Sainath, Tara N. ; Kingsbury, Brian ; Saon, George ; [et al.]

Elsevier BV ; 2015

In: Neural Networks Vol. 64 ( 2015-04), p. 39-48

add to mindlist on the mindlist

Details

In: Neural Networks, Elsevier BV, Vol. 64 ( 2015-04), p. 39-48

Type of Medium: Online Resource

ISSN: 0893-6080

URL: Article

DOI: 10.1016/j.neunet.2014.08.005

RVK:

SQ 1100

Language: English

Publisher: Elsevier BV

Publication Date: 2015

detail.hit.zdb_id: 1491372-0

detail.hit.zdb_id: 740542-X

Permalink

	Location	Call Number	Limitation	Availability

Others were also interested in ...

Online Resource

Link to publisher

hits 1 - 7 | 7 hits