Misplaced Pages

SuperH: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editContent deleted Content addedVisualWikitext
Revision as of 12:48, 6 July 2010 editWmat (talk | contribs)21 edits Added link to Renesas Open Source portal.← Previous edit Latest revision as of 18:00, 29 December 2024 edit undoElmepi (talk | contribs)Extended confirmed users1,170 edits Models: comparision and SH-6 added 
(208 intermediate revisions by more than 100 users not shown)
Line 1: Line 1:
{{Short description|Instruction set architecture by Hitachi}}
{{Infobox CPU architecture {{Infobox CPU architecture
|name=SuperH (SH) | name = SuperH (SH)
|designer=] | designer = ]
|bits=32 | bits = 32-bit (32 → 64)
| introduced = 1992<ref>{{cite web |title=The Story of the Hitachi SH-2 and the Sega Saturn |url=https://www.sega-16.com/forum/showthread.php?33527-The-Story-of-the-Hitachi-SH-2-and-the-Sega-Saturn |website=www.sega-16.com |publisher=Renesas |access-date=27 February 2023 | archive-url = https://web.archive.org/web/20230227165625/https://www.sega-16.com/forum/showthread.php?33527-The-Story-of-the-Hitachi-SH-2-and-the-Sega-Saturn | archive-date=2023-02-27}}</ref>
|introduced=1990s
|version= | version =
|design=RISC | design = RISC
|type= | type =
| encoding = SH-2: 16-bit instructions<br />SH-2A and newer: mixed 16- and 32-bit instructions
|encoding=Fixed
| page size =
|branching=
| branching =
|endianness=Bi
| endianness = Bi
|extensions=
| extensions =
|open=
| open = Yes, and royalty-free<ref></ref>
|registers=
| registers =
}} }}
'''SuperH''' (or '''SH''') is a ] ] (RISC) ] (ISA) developed by ] and currently produced by ]. It is implemented by ]s and ]s for ]s.


At the time of introduction, SuperH was notable for having fixed-length 16-bit instructions in spite of its 32-bit architecture. Using smaller instructions had consequences: the ] was smaller and instructions were generally two-operand format. However for the market the SuperH was aimed at, this was a small price to pay for the improved memory and ] efficiency.
'''SuperH''' (or '''SH''') is a ] ] (RISC) ] (ISA) developed by ]. It is implemented by ]s and ]s for ]s.

Later versions of the design, starting with SH-5, included both 16- and 32-bit instructions, with the 16-bit versions mapping onto the 32-bit version inside the CPU. This allowed the ] to continue using the shorter instructions to save memory, while not demanding the amount of instruction decoding logic needed if they were completely separate instructions. This concept is now known as a ] and is also used by other companies, the most notable example being ] for its ] instruction set.

In 2015, many of the original ]s for the SuperH architecture expired and the SH-2 CPU was reimplemented as ] under the name ].


==History== ==History==
===SH-1 and SH-2===
The SuperH processor core family was first developed by ] in the early 1990s.
]
Hitachi has developed a complete group of upward compatible instruction set CPU cores. The SH-1 and the SH-2 were used in the ] and ]. These cores have 16-bit instructions for better code density than 32-bit instructions, a benefit at the time as memory was very expensive.
The SuperH processor core family was first developed by ] in the early 1990s. The design concept was for a single ] (ISA) that would be ] across a series of ]s.


In the past, this sort of design problem would have been solved using ], with the low-end models in the series performing non-implemented instructions as a series of more basic instructions. For instance, a "long multiply" (multiplying two 32-bit registers to produce a 64-bit product) might be implemented in hardware on high-end models but instead be performed as a series of additions on low-end models.
A few years later the SH-3 core was added to the SH CPU family; new features included another interrupt concept, a ] (MMU) and a modified cache concept. The SH-3 core also got a ] extension, then called SH-3-DSP. With extended data paths for efficient DSP processing, special accumulators and a dedicated ]-type DSP engine, this core was unifying the DSP and the RISC processor world. A derivative was also used with the original SH-2 core.


One of the key realizations during the development of the ] concept was that the microcode had a finite decoding time, and as processors became faster, this represented an unacceptable performance overhead. To address this, Hitachi instead developed a single ISA for the entire line, with unsupported instructions causing traps on those implementations that didn't include hardware support. For instance, the initial models in the line, the SH-1 and SH-2, differed only in their support for 64-bit multiplication; the SH-2 supported {{code|MUL}}, {{code|DMULS}} and {{code|DMULU}}, whereas the SH-1 would cause a trap if these were encountered.{{sfn|Program|1996|p=1}}
For the ], ] developed the SH-4 architecture. ] (2-way) instruction execution and a ] ] were the highlights of this architecture. SH-4 based standard chips were introduced around 1998.


The SH-1 was the basic model, supporting a total of 56 instructions. The SH-2 added 64-bit multiplication and a few additional commands for branching and other duties, bringing the total to 62 supported instructions.{{sfn|Program|1996|p=1}} The SH-1 and the SH-2 were used in the ], ] and ].<ref>{{cite web |title=CP System III (CPS3) Hardware (Capcom) |url=https://www.system16.com/hardware.php?id=799 |website=www.system16.com |publisher=System 16 |access-date=3 August 2019}}</ref>
In early 2001, ] and ] formed the ] SuperH, Inc., which was going to license the SH-4 core to other companies and was developing the SH-5 architecture, the first move of SuperH into the 64-bit area. SuperH, Inc. sold the IP of these CPU cores to ] in 2004.


The ISA uses ] instructions for better code density than 32-bit instructions, which was important at the time due to the high cost of ] and the implementation cost of cache. As of 2023, code density is still important for small embedded systems and massively multicore processors. The downsides to this approach were that there were fewer bits available to encode a register number or a constant value. In the original SuperH ISA, there were only 16 general registers, requiring four bits for the source and another four for the destination; however some instructions have an implied R0, R15, or a system register as an extra operand. The instruction opcode is four, eight, twelve, or sixteen bits long, and the remaining four-bit fields are used for register or immediate operands in various ways: there are twelve classes of instructions, for a total of 142 instructions in SH-2.{{sfn|Program|1996|pp=30-33}}
The SH-5 design supported two modes of operation. SHcompact mode is equivalent to the user-mode instructions of the SH-4 instruction set. SHmedia mode is very different, using 32-bit instructions with sixty-four 64-bit integer registers and ] instructions. In SHmedia mode the destination of a ] (jump) is loaded into a branch register separately from the actual branch instruction. This allows the processor to prefetch instructions for a branch without having to snoop the instruction stream. The combination of a compact 16-bit instruction encoding with a more powerful 32-bit instruction encoding is not unique to SH-5; recent ] processors have a 16-bit Thumb mode, and ] processors have a MIPS-16 mode. However, SH-5 differs because its backward compatibility mode is the 16-bit encoding rather than the 32-bit encoding.


Delayed branches are introduced for both SH-1 and SH-2. Unconditional branch instructions have one ].<ref>{{cite web |url=https://www.renesas.com/us/en/document/mah/superh-risc-engine-sh7020-and-sh7021-hd6437020-hd6477021-hd6437021-hd6417021?r=469371 |page=19,48 |title=SH7020 and SH7021 Hardware ManualSuperH™ RISC engine |accessdate=2023-12-02}}</ref>
The evolution of the SuperH architecture still continues. The latest evolutionary step happened around 2003 where the cores from SH-2 up to SH-4 were getting unified into a superscalar SH-X core which forms a kind of instruction set superset of the previous architectures.


===SH-3===
Today, the SuperH CPU cores, architecture and products are with ], a merger of the Hitachi and Mitsubishi semiconductor groups and the architecture is consolidated around the SH-2, SH-2A, SH-3, SH4 and SH-4A platforms giving a scalable family.
A few years later, the SH-3 core was added to the family; new features included another ] concept, a ] (MMU), and a modified cache concept. These features required an extended instruction set, adding six new instructions for a total of 68.{{sfn|Program|1996|p=1}} The SH-3 was ], running in either big-endian or little-endian byte ordering.

The SH-3 core also added a ] extension, then called SH-3-DSP. With extended data paths for efficient DSP processing, special accumulators and a dedicated ]-type DSP engine, this core unified the DSP and the RISC processor world. A derivative of the DSP was also used with the original SH-2 core.

Between 1994 and 1996, 35.1 million SuperH devices were shipped worldwide.<ref>{{cite web |url=http://segatech.com/technical/cpu/tech_sh4.html |archive-url=https://web.archive.org/web/20160305103423/http://segatech.com/technical/cpu/tech_sh4.html |archive-date=5 March 2016 |title=360-MIPS SuperH RISC Processor Enables Personal Access Systems SH7750 Launches the SH-4 Series |date = November 1997 }}</ref>

===SH-4===
In 1997, Hitachi and ] (STM) started collaborating on the design of the SH-4 for the ]. SH-4 featured ] (2-way) instruction execution and a ] ] (particularly suited to ]). Standard chips based on the SH-4 were introduced around 1998.<ref name=sh5/>

===Licensing===
In early 2001, Hitachi and STM formed the ] SuperH, Inc., which was going to license the SH-4 core to other companies and was developing the SH-5 architecture, the first move of SuperH into the 64-bit area. The earlier SH-1 through 3 remained the property of Hitachi.<ref name=sh5>{{cite web |title=STMicro, Hitachi plan new company to develop RISC cores|url=http://www.eetimes.com/document.asp?doc_id=1180485 |date=3 April 2001 |website=EE Times|quote=Hitachi created the SH family of processors and developed its first four major iterations, but has worked with ST since 1997, when the companies agreed to share a common high-end microprocessor road map. They jointly developed the 32-bit SH4 RISC processor core, and began development of the SH5 architecture, which will now be completed by SuperH. SuperH's initial product will be the SH4 core. Earlier SH versions will not be part of the spin-off agreement.}}</ref><ref>{{cite web|title=SuperH, Inc. formed by Hitachi and STMicroelectronics to Boost the Proliferation of SuperH Cores in Embedded Microprocessor Applications|url=http://investors.st.com/phoenix.zhtml?c=111941&p=irol-newsArticle_print&ID=1454682}}{{dead link |date=September 2021}}</ref>

In 2003, Hitachi and ] formed a joint-venture called ], with Hitachi controlling 55% of it. In 2004, Renesas Technology bought STMicroelectronics's share of ownership in the SuperH Inc. and with it the licence to the SH cores.<ref>{{cite web|url=https://www.eetimes.com/renesas-to-take-over-superh-core-business/|date=28 September 2004|website=EE Times|title=Renesas to take over SuperH core business |first=Peter |last=Clarke}}</ref> Renesas Technology later became Renesas Electronics, following their merger with ].

The SH-5 design supported two modes of operation: SHcompact mode, which is equivalent to the user-mode instructions of the SH-4 instruction set; and SHmedia mode, which is very different in that it uses 32-bit instructions with sixty-four 64-bit integer registers and ] instructions. In SHmedia mode the destination of a ] (jump) is loaded into a branch register separately from the actual branch instruction. This allows the processor to prefetch instructions for a branch without having to snoop the instruction stream. The combination of a compact 16-bit instruction encoding with a more powerful 32-bit instruction encoding is not unique to SH-5; ] processors have a 16-bit ] mode (ARM licensed several patents from SuperH for Thumb<ref name="lwn"/>) and ] processors have a MIPS-16 mode. However, SH-5 differs because its backward compatibility mode is the 16-bit encoding rather than the 32-bit encoding.

The last evolutionary step happened around 2003 where the cores from SH-2 up to SH-4 were getting unified into a superscalar SH-X core which formed a kind of instruction set superset of the previous architectures, and added support for ].

===Continued availability===
Since 2010, the SuperH CPU cores, architecture and products are with ] and the architecture is consolidated around the SH-2, SH-2A, SH-3, SH-4 and SH-4A platforms. The system-on-chip products based on SH-3, SH-4 and SH-4A microprocessors were subsequently replaced by newer generations based on licensed CPU cores from ], with many of the existing models still marketed and sold until March 2025 through the Renesas Product Longevity Program.<ref>{{cite web|url=https://www.renesas.com/us/en/products/microcontrollers-microprocessors/other-mcus-mpus/superh-risc-engine-family-mcus|website=Renesas Electronics|title="SuperH RISC Engine Family MCUs"}}</ref>

As of 2021, the SH72xx microcontrollers based on the SH-2A continue to be marketed by Renesas with guaranteed availability until February 2029, along with newer products based on several other architectures including ], ], and ].

===J Core===
{{Distinguish|Samsung Galaxy J2 Core}}
The last of the SH-2 patents expired in 2014. At ] Japan 2015, j-core developers presented a ] of the SH-2 ISA with extensions (known as the "J2 core" due to the unexpired ]).<ref name="lwn">{{cite web|url=http://lwn.net/Articles/647636|title=Resurrecting the SuperH architecture|author=Nathan Willis|date=June 10, 2015|publisher=]}}</ref><ref name="0pf_jcore">{{cite web|url=http://j-core.org|title=J Cores|publisher=j-core|access-date=April 27, 2016|url-status=dead|archive-url=https://web.archive.org/web/20160511092659/http://j-core.org/|archive-date=May 11, 2016}}</ref> Subsequently, a design walkthrough was presented at ELC 2016.<ref>{{cite web |url=http://j-core.org/talks/ELC-2016.pdf |archive-url=https://web.archive.org/web/20160617085346/http://j-core.org/talks/ELC-2016.pdf |archive-date=2016-06-17 |url-status=live |title=j-core Design Walkthrough}}</ref>

The ] ]-licensed ] code for the J2 core has been proven on ] ]s and on ]s manufactured on ]'s ] process, and is capable of booting ].<ref name="lwn"/> J2 is backwards ISA compatible with SH-2, implemented as a 5-stage pipeline with separate Instruction and Data memory interfaces, and a machine-generated Instruction Decoder supporting the densely packed and complex (relative to other RISC machines) ISA. Additional instructions are easy to add. J2 implements instructions for dynamic shift (using the SH-3 and later instruction patterns), extended atomic operations (used for threading primitives) and locking/interfaces for symmetric multiprocessor support. Plans to implement the SH-2A (as "J2+") and SH-4 (as "J4") instruction sets as the relevant patents expire<!--expired?--> in 2016–2017.<ref name="lwn"/>{{needs update|date=June 2022}}

Several features of SuperH have been cited as motivations for designing new cores based on this architecture:<ref name="lwn"/>

* High ] compared to other 32-bit ] ]s such as ] or ]<ref name="weaver2015">{{cite news|author=V.M. Weaver|title=Exploring the Limits of Code Density (Tech Report with Newest Results)|date=17 March 2015|url=http://web.eece.maine.edu/~vweaver/papers/iccd09/ll_document.pdf |archive-url=https://web.archive.org/web/20150713143728/http://web.eece.maine.edu/~vweaver/papers/iccd09/ll_document.pdf |archive-date=2015-07-13 |url-status=live}}</ref> important for cache and memory bandwidth performance
* Existing ] and ] support (], ], ]<ref name="0pf_jcore"/>)
* Extremely low ASIC ] costs now that the patents are expiring (around {{USD|0.03}} for a dual-core J2 core on TSMC's 180&nbsp;nm process).
* Patent- and royalty-free (BSD-licensed) implementation
* Full and vibrant community support
* Availability of low cost hardware development platform for zero cost FPGA tools
* CPU and SoC RTL generation and integration tools, producing FPGA and ASIC portable RTL and documentation
* Clean, modern design with open source design, generation, simulation and verification environment


==Models== ==Models==
{|class="wikitable" style="float:right; text-align:right"
]
|-
|+ Generation comparision <ref name="FumioArakawa">{{Cite web |url=https://old.hotchips.org/wp-content/uploads/hc_archives/hc13/2_Mon/05hitachi.pdf |title=SH-5: A First 64-bit SuperH Core with Multimedia Extension |author=Fumio Arakawa |publisher=Hitachi, Ltd. |format=PDF; 617 kB |date=2013-07-27 |access-date=2022-02-08}}</ref>
! rowspan="2" | !! rowspan="2" | Clock<br /> !! colspan="3" | ] !! rowspan="2" | Image
|-
! ] !! MOPS !! ]
|-
! SH-1
| 20 || 20 || || || ]
|-
! SH-2
| 28,7 || 78 || 120 || || ]
|-
! SH-3
| 200 || 260 || 400 || || ]
|-
! SH-4
| 200 || 480 || || 1,9 || ]
|-
! SH-5
| 400 || 700 || 9600 || 2,8 ||
|-
! ''SH-6''
| || >2000 || >24000 || >7,0 ||
|}

The family of SuperH CPU cores includes: The family of SuperH CPU cores includes:
* SH-1 – used in microcontrollers for deeply embedded applications (] drives, ]s, etc.)
* SH-2 – used in microcontrollers with higher performance requirements, networking applications, and also in video game consoles, like the ] and ] add-on. The SH-2 has also found home in many automotive ] applications, including ], ], and ].
* SH-2A – The SH-2A core is an extension of the SH-2 core including a few extra instructions but most importantly moving to a superscalar architecture (it is capable of executing more than one instruction in a single cycle) and two five-stage pipelines. It also incorporates 15 register banks to facilitate an interrupt latency of 6 clock cycles. It is also strong in motor control application but also in multimedia, car audio, powertrain, automotive body control and office + building automation
* SH-DSP – initially developed for the ] market, used later in many consumer applications requiring DSP performance for ] compression etc.
* SH-3 – used for mobile and handheld applications such as the ], strong in ] applications and market for many years in the car navigation market. The ], similar to the ] hardware's CPU, also made use of this CPU. The Korg Electribe EMX and ESX music production units also use the SH-3.<ref>{{Cite web |last=Kuwabara |first=M. |date=25 July 2019 |title=Korg EMX / ESX Service Manual |url=http://dealers.korgusa.com/svcfiles/ESX1_Svc_%20man.pdf |archive-url=https://web.archive.org/web/20190713235231/http://dealers.korgusa.com/svcfiles/ESX1_Svc_%20man.pdf |archive-date=13 July 2019}}</ref>
* SH-3-DSP – used mainly in multimedia terminals and networking applications, also in printers and fax machines
* SH-4 – used whenever high performance is required such as car multimedia terminals, ]s, most notably the ], or ]es
* SH-5 – used in high-end 64-bit multimedia applications
* SH-X – mainstream core used in various flavours (with/without DSP or FPU unit) in engine control unit, car multimedia equipment, set-top boxes or mobile phones
* SH-Mobile – SuperH Mobile Application Processor; designed to offload application processing from the baseband LSI


===SH-2===
*SH-1 - used in microcontrollers for deeply embedded applications (] drives, ]s, etc)
]
*SH-2 - used in microcontrollers with higher performance requirements, also used in automotive such as ]s or in networking applications, and also in video game consoles, like the Sega Saturn. The SH-2 has also found home in many motor control applications.
*SH-2A - The SH-2A core is an extension of the SH-2 core including a few extra instructions but most importantly moving to a superscalar architecture (it is capable of executing more than one instruction in a single cycle) and two five-stage pipelines. It also incorporates 15 register banks to facilitate an interrupt latency of 6 clock cycles. It is also strong in motor control application but also in multimedia, car audio, powertrain, automotive body control and office + building automation
*SH-DSP - initially developed for the ] market, used later in many consumer applications requiring DSP performance for ] compression etc
*SH-3 - used for mobile and handheld applications such as the ], strong in Windows CE applications and market for many years in the car navigation market
*SH-3-DSP - used mainly in multimedia terminals and networking applications, also in printers and fax machines
*SH-4 - used whenever high performance is required such as car multimedia terminals, ]s, or ]es
*SH-5 - used in high-end multimedia applications
*SH-X - mainstream core used in various flavours (with/without DSP or FPU unit) in engine control unit, car multimedia equipment, set-top boxes or mobile phones
*SH-Mobile - SuperH Mobile Application Processor; designed to offload application processing from the baseband LSI


The SH-2 is a 32-bit RISC architecture with a 16-bit fixed instruction length for high code density and features a hardware ] (MAC) block for DSP algorithms and has a five-stage pipeline.
==SH-2==
]


The SH-2 has a cache on all ]-less devices.
The SH-2 is a 32-bit RISC architecture, it has 16 general purpose registers, which makes it well suited for programs written in ].


It provides 16 general-purpose registers, a vector-base register, global-base register, and a procedure register.
It has a 16-bit fixed length instructions for high code density, features a hardware Multiply-Accumulate (MAC) block for DSP algorithms and has a five-stage pipeline.


Today the SH-2 family stretches from 32&nbsp;KB of on-board flash up to ROM-less devices. It is used in a variety of different devices with differing peripherals such as CAN, Ethernet, motor-control timer unit, fast ADC and others.
The SH-2 has a cache on all ROM-less devices.


===SH-2A===
It also provides a vector-base-register, global-base-register and a procedure register.
The SH-2A is an upgrade to the SH-2 core that added some 32-bit instructions. It was announced in early 2006.


New features on the SH-2A core include:
Today the SH-2 family stretches from 32k of on board flash up to ROM-less devices. It is used in a variety of different devices with differing peripherals such as CAN, Ethernet, motor-control timer unit, fast ADC and others.
* Superscalar architecture: execution of 2 instructions simultaneously
* ]
* Two 5-stage pipelines
* Mixed 16-bit and 32-bit instructions
* 15 register banks for interrupt response in 6&nbsp;cycles.
* Optional FPU


The SH-2A family today spans a wide memory field from 16&nbsp;KB up to and includes many ROM-less variations. The devices feature standard peripherals such as ], ], ] and more as well as more application-specific peripherals such as ] timers, ] controllers and peripherals dedicated to automotive powertrain applications.
==SH-2A==
===SH-3===
The SH-2A is an upgrade to the SH-2 core. It was announced in early 2006.
] version)]]


===SH-4===
At launch in 2007 the SH-2A based SH7211 was the world's fastest embedded flash microcontroller running at 160&nbsp;MHz. It has later been superseded by several newer SuperH devices running at up to 200&nbsp;MHz.
The SH-4 is a RISC CPU and was developed for primary use in multimedia applications, such as Sega's ] and ] game systems. It includes a much more powerful floating-point unit{{ref|casio|}} and additional built-in functions, along with the standard 32-bit integer processing and 16-bit instruction size.


New features on the SH-2A core include: SH-4 features include:
* FPU with four floating-point multipliers, supporting 32-bit single-precision and 64-bit double-precision floats
* 4D floating-point ] and ]
* 128-bit floating-point bus allowing 3.2&nbsp;GB/sec transfer rate from the data cache
* 64-bit external data bus with 32-bit memory addressing, allowing a maximum of 4&nbsp;GB addressable memory (see ]) with a transfer rate of 800&nbsp;MB/sec
* Built-in interrupt, DMA, and power management controllers


{{note|casio}}There is no FPU in the custom SH-4 made for Casio, the SH7305.
*Superscalar architecture: execution of 2 instructions simultaneously.
*]
*Two 5-stage pipelines.
*15 register banks for interrupt response in 6 cycles.
*Optional FPU


===SH-5===
The SH-2A family today spans a wide memory field from 16kB up to and includes many ]-less variations. The devices feature standard peripherals such as ], ], ] and more as well as more application specific peripherals such as ] timers, ] controllers and peripherals dedicated to automotive powertrain applications.
The SH-5 is a 64-bit RISC CPU.<ref>{{cite web|title=SH-5 CPU Core, Volume1: Architecture|url=http://lars.nocrew.org/computers/processors/SuperH/cpush5v1.pdf |archive-url=https://web.archive.org/web/20090320050543/http://lars.nocrew.org/computers/processors/SuperH/cpush5v1.pdf |archive-date=2009-03-20 |url-status=live}}</ref>


Almost no non-simulated SH-5 hardware was ever released,<ref>{{cite web |title=Wasabi SH-5 Press Release|url=http://www.bsdnewsletter.com/2002/10/News47.html|date=8 March 2016}}</ref> and, unlike the still-live SH-4, support for SH-5 was dropped from ]<ref>{{cite web |title=GCC 7 Release Series Changes, New Features, and Fixes|url=https://gcc.gnu.org/gcc-7/changes.html|date=2 February 2018}}</ref> and Linux.
==External links==
*
*http://www.RenesasInteractive.com - Online training for Renesas products.
*http://RenesasRulz.com - A community support forum.


===SH-6===
Linux for SuperH
SH-6 was an announced but never implemented further development. It was supposed to achieve over 2 GIPS, over 7 GFLOPS and over 24 GOPS.<ref name="FumioArakawa" />
*http://oss.renesas.com Renesas suppported Open Source Software site.
*http://www.kpitgnutools.com, Official free download of GNUSH-ELF and SH-Linux Toolchains with free Technical Support
*http://linux-sh.org/ Wiki of linux-sh port
*http://sh-linux.org/ gcc toolchain
*http://shlinux.com/ MPC Data SHLinux support
*http://spinics.net/lists/linux-sh/ Linux SuperH Kernel Development mailing list archive
*http://www.littlebluelinux.com/ Little Blue Linux - for SH2A
*http://www.codesourcery.com/sgpp/lite/superh GNU Toolchain for SuperH


==References==
]s for SuperH
===Citations===
*] (Jornada Linux Mobility Edition) has working distribution for ] Jornada 620LX/660LX/680/690 handhelds; JLime is probably the largest SuperH distribution.
{{Reflist}}
*]
*] port is in progress .
*]
*]


===Bibliography===
] on SuperH
* {{cite book |url=https://antime.kapsi.fi/sega/files/h12p0.pdf |title=SuperH RISC Engine SH-1/SH-2 Programming Manual |publisher=Hitachi Americal Ltd. |date=3 September 1996 |access-date=2020-12-06 |ref=CITEREFProgram1996}}
*http://netbsd.org/Ports/sh3/
* {{cite book |url=http://www.st.com/st-web-ui/static/active/en/resource/technical/document/user_manual/CD00147165.pdf |title=SH-4 CPU Core Architecture |date=12 September 2002 |publisher=STMicroelectronics and Hitachi Ltd. |id=ADCS 7182230F |access-date=2020-12-06}}


==External links==
] on SuperH
* , Products, Tools, Manuals, App.Notes, Information
*http://openbsd.org/landisk.html
*
* {{Github|j-core}}
*
* {{webarchive |url=https://web.archive.org/web/20160810170747/http://segatech.com/technical/cpu/index.html |date=August 10, 2016 |title=DCTP - Hitachi 200&nbsp;MHz SH-4}}
*
* {{cite web |title=The SuperH-3, part 1: Introduction |url=https://devblogs.microsoft.com/oldnewthing/20190805-00/?p=102749 |first=Raymond |last=Chen |work=The Old New Thing |type=blog |date=5 August 2019 |access-date=2024-01-22}} A 15-part series on programming for the microprocessor.


{{Renesas Electronics}}
{{Hitachi}}
{{RISC-based processor architectures}} {{RISC-based processor architectures}}
{{CPU technologies}}
{{Multimedia extensions}}


{{DEFAULTSORT:Superh}} {{DEFAULTSORT:Superh}}
] ]
]
] ]
]

]
]
]
]
]
]
]
]
]
]
]
]
]
]

Latest revision as of 18:00, 29 December 2024

Instruction set architecture by Hitachi
SuperH (SH)
DesignerHitachi
Bits32-bit (32 → 64)
Introduced1992
DesignRISC
EncodingSH-2: 16-bit instructions
SH-2A and newer: mixed 16- and 32-bit instructions
EndiannessBi
OpenYes, and royalty-free

SuperH (or SH) is a 32-bit reduced instruction set computing (RISC) instruction set architecture (ISA) developed by Hitachi and currently produced by Renesas. It is implemented by microcontrollers and microprocessors for embedded systems.

At the time of introduction, SuperH was notable for having fixed-length 16-bit instructions in spite of its 32-bit architecture. Using smaller instructions had consequences: the register file was smaller and instructions were generally two-operand format. However for the market the SuperH was aimed at, this was a small price to pay for the improved memory and processor cache efficiency.

Later versions of the design, starting with SH-5, included both 16- and 32-bit instructions, with the 16-bit versions mapping onto the 32-bit version inside the CPU. This allowed the machine code to continue using the shorter instructions to save memory, while not demanding the amount of instruction decoding logic needed if they were completely separate instructions. This concept is now known as a compressed instruction set and is also used by other companies, the most notable example being ARM for its Thumb instruction set.

In 2015, many of the original patents for the SuperH architecture expired and the SH-2 CPU was reimplemented as open source hardware under the name J2.

History

SH-1 and SH-2

SH-2 on Sega 32X and Sega Saturn

The SuperH processor core family was first developed by Hitachi in the early 1990s. The design concept was for a single instruction set (ISA) that would be upward compatible across a series of CPU cores.

In the past, this sort of design problem would have been solved using microcode, with the low-end models in the series performing non-implemented instructions as a series of more basic instructions. For instance, a "long multiply" (multiplying two 32-bit registers to produce a 64-bit product) might be implemented in hardware on high-end models but instead be performed as a series of additions on low-end models.

One of the key realizations during the development of the RISC concept was that the microcode had a finite decoding time, and as processors became faster, this represented an unacceptable performance overhead. To address this, Hitachi instead developed a single ISA for the entire line, with unsupported instructions causing traps on those implementations that didn't include hardware support. For instance, the initial models in the line, the SH-1 and SH-2, differed only in their support for 64-bit multiplication; the SH-2 supported MUL, DMULS and DMULU, whereas the SH-1 would cause a trap if these were encountered.

The SH-1 was the basic model, supporting a total of 56 instructions. The SH-2 added 64-bit multiplication and a few additional commands for branching and other duties, bringing the total to 62 supported instructions. The SH-1 and the SH-2 were used in the Sega Saturn, Sega 32X and Capcom CPS-3.

The ISA uses 16-bit instructions for better code density than 32-bit instructions, which was important at the time due to the high cost of main memory and the implementation cost of cache. As of 2023, code density is still important for small embedded systems and massively multicore processors. The downsides to this approach were that there were fewer bits available to encode a register number or a constant value. In the original SuperH ISA, there were only 16 general registers, requiring four bits for the source and another four for the destination; however some instructions have an implied R0, R15, or a system register as an extra operand. The instruction opcode is four, eight, twelve, or sixteen bits long, and the remaining four-bit fields are used for register or immediate operands in various ways: there are twelve classes of instructions, for a total of 142 instructions in SH-2.

Delayed branches are introduced for both SH-1 and SH-2. Unconditional branch instructions have one delay slot.

SH-3

A few years later, the SH-3 core was added to the family; new features included another interrupt concept, a memory management unit (MMU), and a modified cache concept. These features required an extended instruction set, adding six new instructions for a total of 68. The SH-3 was bi-endian, running in either big-endian or little-endian byte ordering.

The SH-3 core also added a DSP extension, then called SH-3-DSP. With extended data paths for efficient DSP processing, special accumulators and a dedicated MAC-type DSP engine, this core unified the DSP and the RISC processor world. A derivative of the DSP was also used with the original SH-2 core.

Between 1994 and 1996, 35.1 million SuperH devices were shipped worldwide.

SH-4

In 1997, Hitachi and STMicroelectronics (STM) started collaborating on the design of the SH-4 for the Dreamcast. SH-4 featured superscalar (2-way) instruction execution and a vector floating-point unit (particularly suited to 3D graphics). Standard chips based on the SH-4 were introduced around 1998.

Licensing

In early 2001, Hitachi and STM formed the IP company SuperH, Inc., which was going to license the SH-4 core to other companies and was developing the SH-5 architecture, the first move of SuperH into the 64-bit area. The earlier SH-1 through 3 remained the property of Hitachi.

In 2003, Hitachi and Mitsubishi Electric formed a joint-venture called Renesas Technology, with Hitachi controlling 55% of it. In 2004, Renesas Technology bought STMicroelectronics's share of ownership in the SuperH Inc. and with it the licence to the SH cores. Renesas Technology later became Renesas Electronics, following their merger with NEC Electronics.

The SH-5 design supported two modes of operation: SHcompact mode, which is equivalent to the user-mode instructions of the SH-4 instruction set; and SHmedia mode, which is very different in that it uses 32-bit instructions with sixty-four 64-bit integer registers and SIMD instructions. In SHmedia mode the destination of a branch (jump) is loaded into a branch register separately from the actual branch instruction. This allows the processor to prefetch instructions for a branch without having to snoop the instruction stream. The combination of a compact 16-bit instruction encoding with a more powerful 32-bit instruction encoding is not unique to SH-5; ARM processors have a 16-bit Thumb mode (ARM licensed several patents from SuperH for Thumb) and MIPS processors have a MIPS-16 mode. However, SH-5 differs because its backward compatibility mode is the 16-bit encoding rather than the 32-bit encoding.

The last evolutionary step happened around 2003 where the cores from SH-2 up to SH-4 were getting unified into a superscalar SH-X core which formed a kind of instruction set superset of the previous architectures, and added support for symmetric multiprocessing.

Continued availability

Since 2010, the SuperH CPU cores, architecture and products are with Renesas Electronics and the architecture is consolidated around the SH-2, SH-2A, SH-3, SH-4 and SH-4A platforms. The system-on-chip products based on SH-3, SH-4 and SH-4A microprocessors were subsequently replaced by newer generations based on licensed CPU cores from Arm Ltd., with many of the existing models still marketed and sold until March 2025 through the Renesas Product Longevity Program.

As of 2021, the SH72xx microcontrollers based on the SH-2A continue to be marketed by Renesas with guaranteed availability until February 2029, along with newer products based on several other architectures including Arm, RX, and RH850.

J Core

Not to be confused with Samsung Galaxy J2 Core.

The last of the SH-2 patents expired in 2014. At LinuxCon Japan 2015, j-core developers presented a cleanroom reimplemention of the SH-2 ISA with extensions (known as the "J2 core" due to the unexpired trademarks). Subsequently, a design walkthrough was presented at ELC 2016.

The open source BSD-licensed VHDL code for the J2 core has been proven on Xilinx FPGAs and on ASICs manufactured on TSMC's 180 nm process, and is capable of booting μClinux. J2 is backwards ISA compatible with SH-2, implemented as a 5-stage pipeline with separate Instruction and Data memory interfaces, and a machine-generated Instruction Decoder supporting the densely packed and complex (relative to other RISC machines) ISA. Additional instructions are easy to add. J2 implements instructions for dynamic shift (using the SH-3 and later instruction patterns), extended atomic operations (used for threading primitives) and locking/interfaces for symmetric multiprocessor support. Plans to implement the SH-2A (as "J2+") and SH-4 (as "J4") instruction sets as the relevant patents expire in 2016–2017.

Several features of SuperH have been cited as motivations for designing new cores based on this architecture:

  • High code density compared to other 32-bit RISC ISAs such as ARM or MIPS important for cache and memory bandwidth performance
  • Existing compiler and operating system support (Linux, Windows Embedded, QNX)
  • Extremely low ASIC fabrication costs now that the patents are expiring (around US$0.03 for a dual-core J2 core on TSMC's 180 nm process).
  • Patent- and royalty-free (BSD-licensed) implementation
  • Full and vibrant community support
  • Availability of low cost hardware development platform for zero cost FPGA tools
  • CPU and SoC RTL generation and integration tools, producing FPGA and ASIC portable RTL and documentation
  • Clean, modern design with open source design, generation, simulation and verification environment

Models

Generation comparision
Clock
Computer performance Image
MIPS MOPS GFLOPS
SH-1 20 20 SH-1-Prozessor
SH-2 28,7 78 120 SH-2-Prozessor
SH-3 200 260 400 SH-3-Prozessor
SH-4 200 480 1,9 SH-4-Prozessor
SH-5 400 700 9600 2,8
SH-6 >2000 >24000 >7,0

The family of SuperH CPU cores includes:

  • SH-1 – used in microcontrollers for deeply embedded applications (CD-ROM drives, major appliances, etc.)
  • SH-2 – used in microcontrollers with higher performance requirements, networking applications, and also in video game consoles, like the Sega Saturn and Sega 32X add-on. The SH-2 has also found home in many automotive engine control unit applications, including Subaru, Mitsubishi, and Mazda.
  • SH-2A – The SH-2A core is an extension of the SH-2 core including a few extra instructions but most importantly moving to a superscalar architecture (it is capable of executing more than one instruction in a single cycle) and two five-stage pipelines. It also incorporates 15 register banks to facilitate an interrupt latency of 6 clock cycles. It is also strong in motor control application but also in multimedia, car audio, powertrain, automotive body control and office + building automation
  • SH-DSP – initially developed for the mobile phone market, used later in many consumer applications requiring DSP performance for JPEG compression etc.
  • SH-3 – used for mobile and handheld applications such as the Jornada, strong in Windows CE applications and market for many years in the car navigation market. The Cave CV1000, similar to the Sega NAOMI hardware's CPU, also made use of this CPU. The Korg Electribe EMX and ESX music production units also use the SH-3.
  • SH-3-DSP – used mainly in multimedia terminals and networking applications, also in printers and fax machines
  • SH-4 – used whenever high performance is required such as car multimedia terminals, video game consoles, most notably the Dreamcast, or set-top boxes
  • SH-5 – used in high-end 64-bit multimedia applications
  • SH-X – mainstream core used in various flavours (with/without DSP or FPU unit) in engine control unit, car multimedia equipment, set-top boxes or mobile phones
  • SH-Mobile – SuperH Mobile Application Processor; designed to offload application processing from the baseband LSI

SH-2

Hitachi SH-2 CPU

The SH-2 is a 32-bit RISC architecture with a 16-bit fixed instruction length for high code density and features a hardware multiply–accumulate (MAC) block for DSP algorithms and has a five-stage pipeline.

The SH-2 has a cache on all ROM-less devices.

It provides 16 general-purpose registers, a vector-base register, global-base register, and a procedure register.

Today the SH-2 family stretches from 32 KB of on-board flash up to ROM-less devices. It is used in a variety of different devices with differing peripherals such as CAN, Ethernet, motor-control timer unit, fast ADC and others.

SH-2A

The SH-2A is an upgrade to the SH-2 core that added some 32-bit instructions. It was announced in early 2006.

New features on the SH-2A core include:

  • Superscalar architecture: execution of 2 instructions simultaneously
  • Harvard architecture
  • Two 5-stage pipelines
  • Mixed 16-bit and 32-bit instructions
  • 15 register banks for interrupt response in 6 cycles.
  • Optional FPU

The SH-2A family today spans a wide memory field from 16 KB up to and includes many ROM-less variations. The devices feature standard peripherals such as CAN, Ethernet, USB and more as well as more application-specific peripherals such as motor control timers, TFT controllers and peripherals dedicated to automotive powertrain applications.

SH-3

Hitachi SH-3 CPU (BGA version)

SH-4

The SH-4 is a RISC CPU and was developed for primary use in multimedia applications, such as Sega's Dreamcast and NAOMI game systems. It includes a much more powerful floating-point unit and additional built-in functions, along with the standard 32-bit integer processing and 16-bit instruction size.

SH-4 features include:

  • FPU with four floating-point multipliers, supporting 32-bit single-precision and 64-bit double-precision floats
  • 4D floating-point dot-product operation and matrix–vector multiplication
  • 128-bit floating-point bus allowing 3.2 GB/sec transfer rate from the data cache
  • 64-bit external data bus with 32-bit memory addressing, allowing a maximum of 4 GB addressable memory (see Byte addressing) with a transfer rate of 800 MB/sec
  • Built-in interrupt, DMA, and power management controllers

There is no FPU in the custom SH-4 made for Casio, the SH7305.

SH-5

The SH-5 is a 64-bit RISC CPU.

Almost no non-simulated SH-5 hardware was ever released, and, unlike the still-live SH-4, support for SH-5 was dropped from GCC and Linux.

SH-6

SH-6 was an announced but never implemented further development. It was supposed to achieve over 2 GIPS, over 7 GFLOPS and over 24 GOPS.

References

Citations

  1. "The Story of the Hitachi SH-2 and the Sega Saturn". www.sega-16.com. Renesas. Archived from the original on 2023-02-27. Retrieved 27 February 2023.
  2. J-core Open Processor
  3. ^ Program 1996, p. 1.
  4. "CP System III (CPS3) Hardware (Capcom)". www.system16.com. System 16. Retrieved 3 August 2019.
  5. Program 1996, pp. 30–33.
  6. "SH7020 and SH7021 Hardware ManualSuperH™ RISC engine". p. 19,48. Retrieved 2023-12-02.
  7. "360-MIPS SuperH RISC Processor Enables Personal Access Systems SH7750 Launches the SH-4 Series". November 1997. Archived from the original on 5 March 2016.
  8. ^ "STMicro, Hitachi plan new company to develop RISC cores". EE Times. 3 April 2001. Hitachi created the SH family of processors and developed its first four major iterations, but has worked with ST since 1997, when the companies agreed to share a common high-end microprocessor road map. They jointly developed the 32-bit SH4 RISC processor core, and began development of the SH5 architecture, which will now be completed by SuperH. SuperH's initial product will be the SH4 core. Earlier SH versions will not be part of the spin-off agreement.
  9. "SuperH, Inc. formed by Hitachi and STMicroelectronics to Boost the Proliferation of SuperH Cores in Embedded Microprocessor Applications".
  10. Clarke, Peter (28 September 2004). "Renesas to take over SuperH core business". EE Times.
  11. ^ Nathan Willis (June 10, 2015). "Resurrecting the SuperH architecture". LWN.net.
  12. ""SuperH RISC Engine Family MCUs"". Renesas Electronics.
  13. ^ "J Cores". j-core. Archived from the original on May 11, 2016. Retrieved April 27, 2016.
  14. "j-core Design Walkthrough" (PDF). Archived (PDF) from the original on 2016-06-17.
  15. V.M. Weaver (17 March 2015). "Exploring the Limits of Code Density (Tech Report with Newest Results)" (PDF). Archived (PDF) from the original on 2015-07-13.
  16. ^ Fumio Arakawa (2013-07-27). "SH-5: A First 64-bit SuperH Core with Multimedia Extension" (PDF; 617 kB). Hitachi, Ltd. Retrieved 2022-02-08.
  17. Kuwabara, M. (25 July 2019). "Korg EMX / ESX Service Manual" (PDF). Archived from the original (PDF) on 13 July 2019.
  18. "SH-5 CPU Core, Volume1: Architecture" (PDF). Archived (PDF) from the original on 2009-03-20.
  19. "Wasabi SH-5 Press Release". 8 March 2016.
  20. "GCC 7 Release Series Changes, New Features, and Fixes". 2 February 2018.

Bibliography

External links

Renesas Electronics
Products
Hitachi
Divisions and
subsidiaries
Current
Former
Joint ventures and
shareholdings
Current
Former
Products, services
and standards
Current
Defunct
People
Places
Other
  • Now integrated into other Hitachi divisions or business groupings Sold to Western Digital
Reduced instruction set computer (RISC) architectures
Origins
In active development
Development discontinued
Processor technologies
Models
Architecture
Instruction set
architectures
Types
Instruction
sets
Execution
Instruction pipelining
Hazards
Out-of-order
Speculative
Parallelism
Level
Multithreading
Flynn's taxonomy
Processor
performance
Types
By application
Systems
on chip
Hardware
accelerators
Word size
Core count
Components
Functional
units
Logic
Registers
Control unit
Datapath
Circuitry
Power
management
Related
Instruction set extensions
SIMD (RISC)
SIMD (x86)
Bit manipulation
  • BMI (ABM: 2007, BMI1: 2012, BMI2: 2013, TBM: 2012)
  • ADX (2014)
Compressed instructions
Security and cryptography
Transactional memory
Virtualization
Suspended extensions' dates are struck through.
Categories: