WEB OF SCIENCE
SCOPUS
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Choi, Dong-Gu | - |
| dc.contributor.author | Lee, Jaehyun | - |
| dc.contributor.author | Koo, Jahyun | - |
| dc.contributor.author | Han, Woo Kyoung | - |
| dc.contributor.author | Park, Dahoon | - |
| dc.contributor.author | Kung, Jaeha | - |
| dc.contributor.author | Lee, Junghyup | - |
| dc.contributor.author | Yoon, Jong-Hyeok | - |
| dc.date.accessioned | 2025-02-28T13:40:14Z | - |
| dc.date.available | 2025-02-28T13:40:14Z | - |
| dc.date.created | 2025-02-27 | - |
| dc.date.issued | 2024-11-19 | - |
| dc.identifier.isbn | 9798350376326 | - |
| dc.identifier.uri | http://hdl.handle.net/20.500.11750/57946 | - |
| dc.description.abstract | Analog compute-in-memory (ACIM) has been intensively investigated, pursuing better energy efficiency, network accuracy, and compatibility with various AI models [1-5]. In particular, SRAM-based ACIM macros achieve the flexibility of input/weight (IN/W) allocation incorporating bit-serial inputs, bitwise weight loading across multiple bitlines (BL), and digital shift-and-add multiplication of partial sum (PSUM) in the output line (OL). However, shift-and-add multiplication inevitably exacerbates the PSUM errors arising from the computing/readout process under device mismatches and a limited sensing margin (SM) in ACIM (Fig. 1). This leads to severely erroneous MAC outputs and substantial accuracy loss, impeding the practical utilization of ACIM. To mitigate the Psum errors, the ACIM macro with high-precision IN/W and truncation at the MAC output was proposed [4]. The truncation filters out the quantization noise to an extent, thereby attaining the mitigated accuracy loss. Nevertheless, prior work still suffers from PSUM errors due to limited VLSB of high-resolution ADCs. Furthermore, the truncated MAC outputs undermine the advantages of high-precision IN/W undergoing frequent weight updates in ACIM macros. An alternative approach is using a low-resolution ADC with quantization for PSUM to secure higher VLSB and suppress the resultant PSUM error [5]. However, under high macro utilization, it eventually suffers from accuracy loss due to quantization error, which is amplified by the shift and adder. To address the challenges, the drive strength-based SRAM compute-in-memory (DS-CIM) macro is proposed featuring: 1) 6b drive strength-mode sensing with adaptive dynamic range that secures up to 39.2x-boosted sensing margin and 97% of error-free Psum readout on 2's-complement 4b-IN/W ResNet-20 benchmarks, 2) row-wise adaptive dynamic range SAR (ADR-SAR) logic enabling concurrent ADC readout at every OL with the area efficiency of 15.83 TOPS/mm2, 3) input-aware binary search (IABS) reducing average ADC conversion cycles by 64% on the ResNet-20 benchmark, and 4) a heterogeneous logic unit (HLU) for column-wise logic reconfigurability. © 2024 IEEE. | - |
| dc.language | English | - |
| dc.publisher | IEEE Solid-State Circuits Society | - |
| dc.relation.ispartof | 2024 IEEE Asian Solid-State Circuits Conference, A-SSCC 2024 | - |
| dc.title | A 65nm 687.5-TOPS/W Drive Strength-based SRAM Compute-In-Memory Macro with Adaptive Dynamic Range for Edge AI applications | - |
| dc.type | Conference Paper | - |
| dc.identifier.doi | 10.1109/A-SSCC60305.2024.10848920 | - |
| dc.identifier.scopusid | 2-s2.0-85218214522 | - |
| dc.identifier.bibliographicCitation | Choi, Dong-Gu. (2024-11-19). A 65nm 687.5-TOPS/W Drive Strength-based SRAM Compute-In-Memory Macro with Adaptive Dynamic Range for Edge AI applications. IEEE Asian Solid-State Circuits Conference. doi: 10.1109/A-SSCC60305.2024.10848920 | - |
| dc.identifier.url | https://a-sscc2024.org/program/ | - |
| dc.citation.conferenceDate | 2024-11-18 | - |
| dc.citation.conferencePlace | JA | - |
| dc.citation.conferencePlace | Hiroshima | - |
| dc.citation.title | IEEE Asian Solid-State Circuits Conference | - |
Department of Electrical Engineering and Computer Science