Am29200<sup>™</sup> and Am29205<sup>™</sup> RISC Microcontrollers User's Manual Advanced Micro Devices # Am29200™ and Am29205™ RISC Microcontrollers **User's Manual** Rev. 1, 1994 #### © 1994 Advanced Micro Devices, Inc. Advanced Micro Devices reserves the right to make changes in its products without notice in order to improve design or performance characteristics. This publication neither states nor implies any warranty of any kind, including but not limited to implied warrants of merchantability or fitness for a particular application. AMD assumes no responsibility for the use of any circuitry other than the circuitry in an AMD product. The information in this publication is believed to be accurate in all respects at the time of publication, but is subject to change without notice. AMD assumes no responsibility for any errors or omissions, and disclaims responsibility for any consequences resulting from the use of the information included herein. Additionally, AMD assumes no responsibility for the functioning of undescribed features or parameters. #### **Trademarks** AMD and Am29000 are registered trademarks; Am29005, Am29027, Am29030, Am29035, Am29050, Am29200, Am29205, Am29240, Am29243, Am29245, 29K, Laser29K, EB29K, XRAY29K, MiniMON29K, and Design-Made-Easy are trademarks of Advanced Micro Devices, Inc. Fusion29K is a registered trademark of Advanced Micro Devices, Inc. PostScript is a registered trademark of Adobe Systems, Inc. High C is a registered trademark of MetaWare, Inc. Product names used in this publication are for identification purposes only and may be trademarks of their respective companies. ## IF YOU HAVE QUESTIONS, WE'RE HERE TO HELP YOU. #### **Customer Service** AMD's customer service network includes U.S. offices, international offices, and a customer training center. Expert technical assistance is available from AMD's worldwide staff of field application engineers and factory support staff to answer 29K™ Family hardware and software development questions. ## Hotline, Email, and Bulletin Board Support For answers to technical questions, AMD provides a toll-free number for direct access to our engineering support staff. For overseas customers, the easiest way to reach the engineering support staff with your questions is via fax with a short description of your question. AMD 29K Family customers also receive technical support through electronic mail. This worldwide service is available to 29K product users via the international UNIX email service. Also available is the AMD bulletin board service, which provides the latest 29K product information, including technical information and data on upcoming product releases. ## **Engineering Support Staff** (800) 292-9263 ext 2 toll-free for U.S. 0031-11-1163 toll-free for Japan (512) 602-4118 direct dial worldwide 44-(0)256-811101 U.K. and Europe hotline (512) 602-5031 fax 29k-support@amd.com email #### **Bulletin Board** (800) 292-9263 ext 1 toll-free for U.S. (512) 602-4898 worldwide and local for U.S. #### **Documentation and Literature** The 29K Family Customer Support Group responds quickly to information and literature requests. A simple phone call gets you free 29K Family information, such as data books, user's manuals, data sheets, application notes, the Fusion29K<sup>SM</sup> Partner Solutions Catalog and Newsletter, and other literature. Internationally, contact your local AMD sales office for complete 29K Family literature. #### **Customer Support Group** (800) 292-9263 ext 3 toll-free for U.S. (512) 602-5651 local for U.S. (512) 602-5051 fax for U.S. . # **TABLE OF CONTENTS** | ŗ | Preface | INTRODUCTION AND OVERVIEW Am29200 AND Am29205 RISC MICROCONTROLLERS | |---|-----------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | | | INTENDED AUDIENCE xvii USER'S MANUAL OVERVIEW xvii AMD DOCUMENTATION xix RELATED PUBLICATIONS xviii | | ( | Chapter 1 | FEATURES AND PERFORMANCE | | | | 1.1 DISTINCTIVE CHARACTERISTICS 1-1 1.1.1 Am29200 Microcontroller 1-2 1.1.2 Am29205 Microcontroller 1-4 | | | | 1.2 KEY FEATURES AND BENEFITS | | | | 1.3 PERFORMANCE OVERVIEW 1-7 1.3.1 Instruction Timing 1-7 1.3.2 Pipelining 1-8 1.3.3 Burst-Mode and Page-Mode Memories 1-8 1.3.4 Instruction Set Overview 1-8 1.3.5 Data Formats 1-8 1.3.6 Protection 1-9 1.3.7 DRAM Mapping 1-9 1.3.8 Interrupts and Traps 1-9 | | | | 1.4 DEBUGGING AND TESTING | | C | Chapter 2 | PROGRAMMING | | | | 2.1 INSTRUCTION SET 2-1 2.1.1 Integer Arithmetic 2-1 2.1.2 Compare 2-1 2.1.3 Logical 2-4 2.1.4 Shift 2-4 2.1.5 Data Movement 2-4 2.1.6 Constant 2-5 2.1.7 Floating Point 2-6 2.1.8 Branch 2-7 2.1.9 Miscellaneous 2-7 2.1.10 Reserved Instructions 2-8 | | | | 2.2 REGISTER MODEL 2-8 2.2.1 General-Purpose Registers 2-8 | | | 2.2.2 Special-Purpose Registers | |-----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | | 2.3 ADDRESSING REGISTERS INDIRECTLY 2-12 2.3.1 Indirect Pointer C Register (IPC, Register 128) 2-13 2.3.2 Indirect Pointer A Register (IPA, Register 129) 2-13 2.3.3 Indirect Pointer B Register (IPB, Register 130) 2-14 | | | 2.4 INSTRUCTION ENVIRONMENT | | | 2.5 STATUS RESULTS OF INSTRUCTIONS 2-16 2.5.1 ALU Status Register (ALU, Register 132) 2-16 2.5.2 Arithmetic Operation Status Results 2-17 2.5.3 Logical Operation Status Results 2-17 2.5.4 Floating-Point Status Results 2-18 2.5.5 Floating-Point Status Register (FPS, Register 162) 2-18 | | | 2.6 INTEGER MULTIPLICATION AND DIVISION 2-19 2.6.1 Q Register (Q, Register 131) 2-20 2.6.2 Multiplication 2-20 2.6.3 Division 2-22 | | | 2.7 I NEED AN INSTRUCTION FOR 2-24 2.7.1 Run-Time Checking 2-24 2.7.2 Operating-System Calls 2-24 2.7.3 Multiprecision Integer Operations 2-25 2.7.4 Complementing a Boolean 2-25 2.7.5 Large Jump and Call Ranges 2-25 2.7.6 NO-OPs 2-25 | | | 2.8 VIRTUAL ARITHMETIC PROCESSOR 2-26 2.8.1 Trapping Arithmetic Instructions 2-26 2.8.2 Virtual Registers 2-26 | | | 2.9 PROCESSOR INITIALIZATION 2-26 2.9.1 Configuration Register (CFG, Register 3) 2-26 2.9.2 Reset Mode 2-27 | | Chapter 3 | DATA FORMATS AND HANDLING | | | 3.1 INTEGER DATA TYPES 3-1 3.1.1 Character Data 3-1 3.1.2 Half-Word Operations 3-2 3.1.3 Byte Pointer Register (BP, Register 133) 3-2 3.1.4 Bit Strings 3-3 3.1.5 Character-String Operations 3-4 3.1.6 Boolean Data 3-5 3.1.7 Instruction Constants 3-5 | | | 3.2FLOATING-POINT DATA TYPES3-53.2.1Single-Precision Floating-Point Values3-53.2.2Double-Precision Floating-Point Values3-63.2.3Special Floating-Point Values3-6 | | | 3.3 EXTERNAL DATA ACCESSES 3-7 3.3.1 Load/Store Instruction Format 3-7 3.3.2 Load Operations 3-9 3.3.3 Store Operations 3-9 3.3.4 Multiple Accesses 3-9 3.3.5 Addressing and Alignment 3-11 | | Chapter 4 | PROCEDURE LINKAGE | | | 4.1 RUN-TIME STACK ORGANIZATION AND USE | | | | 4.1.1 Management of the Run-Time Stack 4.1.2 Register Stack 4.1.3 Local Registers as a Stack Cache 4.1.4 Memory Stack | 4-3<br>4-4<br>4-5 | |-----------|-----|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------| | | 4.2 | PROCEDURE LINKAGE CONVENTIONS 4.2.1 Argument Passing 4.2.2 Procedure Prologue 4.2.3 Spill Handler 4.2.4 Return Values 4.2.5 Procedure Epilogue 4.2.6 Fill Handlers 4.2.7 Register Stack Leaf Frame 4.2.8 Local Variables and Memory-Stack Frames 4.2.9 Static Link Pointer 4.2.10 Transparent Procedures | 4-7<br>4-8<br>4-10<br>4-10<br>4-11<br>4-11<br>4-11 | | | 4.3 | REGISTER USAGE CONVENTION | 4-13 | | | 4.4 | COMPLEX PROCEDURE CALL EXAMPLE | 1-14 | | | 4.5 | TRACE-BACK TAGS | <b>1-1</b> 5 | | Chapter 5 | | ELINING AND INSTRUCTION SCHEDULING | | | | | FOUR-STAGE PIPELINE | | | | | PIPELINE HOLD MODE | | | | 5.3 | SERIALIZATION | | | | 5.4 | DELAYED BRANCH | 5-2 | | | 5.5 | OVERLAPPED LOADS AND STORES | 5-4 | | | 5.6 | DELAYED EFFECTS OF REGISTERS | 5-5 | | Chapter 6 | | STEM PROTECTION | | | | 6.1 | USER AND SUPERVISOR MODES 6.1.1 Supervisor Mode 6.1.2 User Mode | 6-1 | | | 6.2 | REGISTER PROTECTION | | | Chapter 7 | SYS | STEM OVERVIEW | | | | 7.1 | SIGNAL DESCRIPTION | | | | | 7.1.1 Clocks | | | | | 7.1.2 Processor Signals | | | | | 7.1.4 DRAM Interface | | | | | 7.1.5 Peripheral Interface Adapter (PIA) | 7-4 | | | | 7.1.6 DMA Controller | | | | | 7.1.8 Parallel Port | | | | | 7.1.9 Serial Port | | | | | 7.1.10 Video Interface | | | | | 7.1.12 Pin Changes for the Am29205 Microcontroller | | | | 7.2 | ACCESS PRIORITY | | | | | SYSTEM ADDRESS PARTITION | | | | 7.4 | INTERNAL PERIPHERALS AND CONTROLLERS | 7-8 | | | | | | | Chapter 8 | ROM CONTROLLER | |------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | | 8.1 OVERVIEW 8- | | | 8.2 PROGRAMMABLE REGISTERS 8- 8.2.1 ROM Control Register (RMCT, Address 80000000) 8- 8.2.2 ROM Configuration Register (RMCF, Address 80000004) 8- 8.2.3 Initialization 8- | | | 8.3 ROM ACCESSES 8-4 8.3.1 ROM Address Mapping 8-4 8.3.2 Simple ROM Accesses 8-4 8.3.3 Narrow ROM Accesses 8-4 8.3.4 Writes to the ROM Space 8-7 8.3.5 Burst-Mode ROM Accesses 8-8 8.3.6 Use of WAIT to Extend ROM Cycles 8-8 | | Chapter 9 | DRAM CONTROLLER | | | 9.1 OVERVIEW 9-1 | | | 9.2 PROGRAMMABLE REGISTERS9- | | | 9.2.1 DRAM Control Register (DRCT, Address 80000008) 9-1<br>9.2.2 DRAM Configuration Register (DRCF, Address 8000000C) 9-2 | | | 9.2.2 DRAM Configuration Register (DRCF, Address 8000000C) 9-2<br>9.2.3 DRAM Mapping Register 0 (DRM0, Address 80000010) 9-3 | | | 9.2.4 DRAM Mapping Register 1 (DRM1, Address 80000014) 9-4 | | | 9.2.5 DRAM Mapping Register 2 (DRM2, Address 80000018) 9-4 9.2.6 DRAM Mapping Register 3 (DRM3, Address 8000001C) 9-4 | | | 9.2.7 Initialization9-4 | | | 9.3 DRAM ACCESSES9-2 | | | 9.3.1 DRAM Address Mapping9-2 | | | 9.3.2 Address Multiplexing 9-5 9.3.3 32-Bit DRAM Width 9-7 | | | 9.3.4 16-Bit DRAM Width9-7 | | | 9.3.5 Mapped DRAM Accesses9-8 | | | 9.3.6 Normal Access Timing | | | 9.3.8 DRAM Refresh9-10 | | | 9.3.9 Video DRAM Interface9-12 | | Chapter 10 | PERIPHERAL INTERFACE ADAPTER | | | 10.1 OVERVIEW | | | 10.2 PROGRAMMABLE REGISTERS | | | 10.2.1 PIA Control Register 0/1 (PICT0/1, Address 80000020/24) 10-1 | | | 10.2.2 Initialization | | | 10.3 PIA ACCESSES | | | 10.3.2 Use of WAIT to Extend I/O Cycles | | Chapter 11 | DMA CONTROLLER | | | 11.1 OVERVIEW11-1 | | | 11.2 PROGRAMMABLE REGISTERS | | | 11.2.0 DIVIAT CONTROL REGISTER (DIVICT), Address 60000040) 11-5 | | | 11.2.7 DMA1 Address Register (DMAD1, Address 80000044) 11.2.8 DMA1 Count Register (DMCN1, Address 80000048) 11.2.9 Initialization | 11-7 | |------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------| | | 11.3 DMA TRANSFERS 11.3.1 Specifying the Direction of a DMA Transfer 11.3.2 Programming Internal DMA Transfers 11.3.3 Programming External DMA Transfers 11.3.4 Generating External DMA Requests 11.3.5 External DMA Transfers 11.3.6 Latching External DMA Requests | 11-8<br>11-9<br>11-9<br>11-9 | | | 11.4 DMA QUEUING (DMA CHANNEL 0) | 1-12 | | | 11.5 RANDOM DIRECT MEMORY ACCESS BY EXTERNAL DEVICES 1 | 1-12 | | Chapter 12 | PROGRAMMABLE I/O PORT | | | | 12.1 OVERVIEW | 12-1 | | | 12.2 PROGRAMMABLE REGISTERS 12.2.1 PIO Control Register (POCT, Address 800000D0) 12.2.2 PIO Input Register (PIN, Address 800000D4) 12.2.3 PIO Output Register (POUT, Address 800000D8) 12.2.4 PIO Output Enable Register (POEN, Address 800000DC) 12.2.5 Initialization | 12-1<br>12-2<br>12-2<br>12-3<br>12-3 | | • | 12.3 OPERATING THE I/O PORT | 12-3 | | Chapter 13 | PARALLEL PORT | | | | 13.1 OVERVIEW | 13-1 | | | 13.2 PROGRAMMABLE REGISTERS 13.2.1 Parallel Port Control Register (PPCT, Address 800000C0) 13.2.2 Parallel Port Status Register (PPST, Address 800000C8) 13.2.3 Parallel Port Data Register (PPDT, Address 800000C4) 13.2.4 Initialization 13.3 PARALLEL PORT TRANSFERS 13.3.1 Transfers from the Host 13.3.2 Transfers to the Host | 13-1<br>13-3<br>13-4<br>13-4<br>13-5<br>13-5 | | Chapter 14 | SERIAL PORT | | | | 14.1 OVERVIEW | 14-1 | | | 14.2 PROGRAMMABLE REGISTERS | 14-1<br>14-3 | | | 14.2.3 Serial Port Transmit Holding Register (SPTH, Address 80000088) | | | | (SPRB, Address 8000008C) | 14-4 | | | 14.2.5 Baud Rate Divisor Register (BAUD, Address 80000090) 14.2.6 Initialization | | | Chapter 15 | VIDEO INTERFACE | | | | 15.1 OVERVIEW | | | | 15.2 PROGRAMMABLE REGISTERS | 15-1<br>15 <b>-</b> 3 | | | 15.2.4 Video Data Holding Register (VDT, Address 800000EC) 15-3 | |------------|-----------------------------------------------------------------------------------------------------------------| | | 15.3 VIDEO INTERFACE OPERATION | | | 15.3.1 Transmitting Data on the Video Interface | | | 15.3.2 Receiving Data on the Video Interface | | Chapter 16 | INTERRUPTS AND TRAPS | | | 16.1 OVERVIEW | | | 16.2 INTERRUPTS AND TRAPS | | | 16.2.1 Current Processor Status Register (CPS, Register 2) 16-1 16.2.2 Interrupts | | | 16.2.3 Traps | | | 16.2.4 External Interrupts and Traps | | | 16.2.5 Wait Mode | | | 16.3 VECTOR AREA | | | 16.3.2 Vector Numbers | | | 16.4 INTERRUPT AND TRAP HANDLING | | | 16.4.1 Old Processor Status Register (OPS, Register 1) | | | 16.4.2 Program Counter Stack 16-6 16.4.3 Taking an Interrupt or Trap 16-10 | | | 16.4.4 Returning from an Interrupt or Trap | | | 16.4.5 Lightweight Interrupt Processing | | | 16.4.6 Simulation of Interrupts and Traps | | | 16.5.1 WARN Input | | | 16.6 SEQUENCING OF INTERRUPTS AND TRAPS 16-14 | | | 16.7 EXCEPTION REPORTING AND RESTARTING 16-16 | | | 16.7.1 Instruction Exceptions 16-16 16.7.2 Restarting Mapped DRAM Accesses 16-17 | | | 16.7.2 Restarting Mapped DRAW Accesses | | | 16.7.4 Floating-Point Exceptions | | | 16.7.5 Correcting Out-of-Range Results | | | 16.7.6 Exceptions builing interrupt and map handling | | | 16.8.1 Timer Facility Operation | | | 16.8.2 Timer Facility Initialization | | | 16.8.3 Handling Timer Interrupts | | | 16.8.4 Timer Facility Uses 16-22 16.8.5 Timer Counter Register (TMC, Register 8) 16-22 | | | 16.8.6 Timer Reload Register (TMR, Register 9) | | | 16.9 INTERNAL INTERRUPT CONTROLLER | | | 16.9.1 Interrupt Control Register (ICT, Address 80000028) 16-23 | | | 16.9.2 Interrupt Controller Initialization 16-25 16.9.3 Servicing Internal Interrupts 16-25 | | Chapter 17 | DEBUGGING AND TESTING | | | 17.1 OVERVIEW | | | 17.2 TRACE FACILITY | | | 17.3 INSTRUCTION BREAKPOINTS | | | 17.4 PROCESSOR STATUS OUTPUTS | | | 17.5 CONTROL FIELD IN SCAN PATH | | | | | | 17.6 TEST ACCESS PORT 17.6.1 Boundary-Scan Cells 17.6.2 Instruction Register and Implemented Instructions 17.6.3 Order of Scan Cells in Boundary-Scan Path 17.7 IMPLEMENTING A HARDWARE-DEVELOPMENT SYSTEM 17.7.1 Halt Mode 17.7.2 Step Mode 17.7.3 Load Test Instruction Mode 17.7.4 Accessing Internal State Via Boundary Scan 17.7.5 HALT Instructions as Breakpoints 17.7.6 Forcing Outputs to High Impedance | 17-4<br>17-8<br>17-11<br>17-11<br>17-12<br>17-13<br>17-14<br>17-16 | |------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------| | | 17.8 EMULATING THE Am29205 MICROCONTROLLER | . 17-17 | | Chapter 18 | INSTRUCTION SET | | | | 18.1 INSTRUCTION-DESCRIPTION NOMENCLATURE 18.1.1 Operand Notation and Symbols 18.1.2 Operator Symbols 18.1.3 Control-Flow Terminology 18.1.4 Assembler Syntax 18.2 INSTRUCTION FORMATS 18.3 INSTRUCTION DESCRIPTION | 18-1<br>18-2<br>18-3<br>18-3 | | Appendix A | SPECIAL SETTINGS FOR THE<br>Am29200 AND Am29205 MICROCONTROLLERS | A-1 | | Appendix B | PROCESSOR REGISTER SUMMARY | B-1 | | Appendix C | PERIPHERAL REGISTER SUMMARY | C-1 | | INDEV | | | #### LIST OF FIGURES Figure 1-1 Am29200 Microcontroller Block Diagram ...... 1-3 Figure 1-2 Figure 2-1 Figure 2-2 Special-Purpose Registers ......2-12 Indirect Pointer C Register ......2-13 Figure 2-3 Figure 2-4 Figure 2-5 Figure 2-6 Figure 2-7 Integer Environment Register ......2-15 Figure 2-8 Floating-Point Status ......2-18 Figure 2-9 Figure 2-10 Figure 2-11 Configuration Register ......2-26 Current Processor Status Register In Reset Mode ......2-27 Figure 2-12 Figure 3-1 Figure 3-2 Figure 3-3 Funnel Shift Count Register ......3-3 Figure 3-4 Figure 3-5 Figure 3-6 Figure 3-7 Figure 3-8 Figure 3-9 Run-Time Stack Example ......4-2 Figure 4-1 Figure 4-2 Figure 4-3 Relationship of Stack Cache and Register Stack .......4-4 Figure 4-4 Stack Underflow .......4-7 Figure 4-5 Figure 4-6 Figure 4-7 Figure 6-1 Register Bank Organization ......6-2 Register Bank Protect Register ......6-3 Figure 6-2 ROM Control Register .......8-1 Figure 8-1 Figure 8-2 Simple ROM Read Cycle .......8-5 Figure 8-3 Figure 8-4 Simple ROM Read Cycle—Zero Wait States ...... 8-6 Figure 8-5 Simple Write to ROM Bank ......8-7 Figure 8-6 Byte Write to ROM Bank (using CAS3-CAS0 as byte strobes) ....... 8-9 Figure 8-7 Burst-Mode ROM Read ......8-10 Extending a ROM Read Cycle with WAIT ......8-11 Figure 8-8 Extending a ROM Write Cycle with WAIT ......8-11 Figure 8-9 Figure 9-1 Figure 9-2 Figure 9-3 DRAM Mapping Register 0 ......9-3 Location of Bytes and Half-Words on a 16-Bit Bus ......9-8 Figure 9-4 Figure 9-5 DRAM Read Cycle ......9-9 Figure 9-6 DRAM Write Cycle ......9-9 DRAM Page-Mode Read Cycle .....9-10 Figure 9-7 DRAM Page-Mode Write Cycle ......9-11 Figure 9-8 DRAM Refresh Cycle ...... 9-11 Figure 9-9 Figure 9-10 VDRAM Transfer Cycle ......9-12 Figure 10-1 Figure 10-2 PIA Control Register 1 (PICT1, Address 80000024) ......10-1 Figure 10-3 PIA Write Cycle ......10-4 Figure 10-4 Extending a PIA Read Cycle with WAIT .....10-5 Figure 10-5 | Figure 10-6 | Extending a PIA Write Cycle with WAIT | . 10-5 | |--------------|------------------------------------------------------------|--------| | Figure 11-1 | DMA0 Control Register | | | Figure 11-2 | DMA0 Address Register | | | Figure 11-3 | DMA0 Address Tail Register | | | Figure 11-4 | DMA0 Count Register | | | Figure 11-5 | DMA0 Count Tail Register | | | Figure 11-6 | DMA1 Control Register | | | Figure 11-7 | External DMA PIA Read Cycle | 11-10 | | Figure 11-8 | External DMA PIA Write Cycle | | | Figure 11-9 | External Random DRAM Read Cycle | | | Figure 11-10 | External Random DRAM Write Cycle | 11-14 | | Figure 11-11 | External Random ROM Read Cycle | | | Figure 12-1 | PIO Control Register | | | Figure 12-2 | PIO Input Register | . 12-2 | | Figure 12-3 | PIO Output Register | | | Figure 12-4 | PIO Output Enable Register | | | Figure 13-1 | Parallel Port Control Register | | | Figure 13-2 | Parallel Port Status Register | | | Figure 13-3 | Parallel Port Data Register | | | Figure 13-4 | State Transitions for Transfers from the Host | | | Figure 13-5 | Transfer from the Host on the Parallel Port (BRS=0, ARB=0) | | | Figure 13-6 | Transfer from the Host on the Parallel Port (BRS=0, ARB=1) | | | Figure 13-7 | Transfer from the Host on the Parallel Port (BRS=1, ARB=0) | | | Figure 13-8 | Transfer from the Host on the Parallel Port (BRS=1, ARB=1) | | | Figure 13-9 | Parallel Port Buffer Read Cycle | | | Figure 13-10 | State Transitions for Transfers to the Host | . 13-9 | | Figure 13-11 | Transfer to the Host on the Parallel Port | 13-10 | | Figure 13-12 | Parallel Port Buffer Write Cycle | 13-10 | | Figure 14-1 | Serial Port Control Register | | | Figure 14-2 | Serial Port Status Register | . 14-3 | | Figure 14-3 | Serial Port Transmit Holding Register | | | Figure 14-4 | Serial Port Receive Buffer Register | | | Figure 14-5 | Baud Rate Divisor Register | . 14-5 | | Figure 15-1 | Video Control Register | . 15-1 | | Figure 15-2 | Top Margin Register | | | Figure 15-3 | Side Margin Register | | | Figure 15-4 | Video Data Holding Register | | | Figure 15-5 | VCLK, LSYNC, and VDAT Relationships | | | Figure 16-1 | Current Processor Status Register | | | Figure 16-2 | Vector Table Entry | | | Figure 16-3 | Vector Area Base Address Register | | | Figure 16-4 | Program Counter Unit | | | Figure 16-5 | Program Counter 0 Register | | | Figure 16-6 | Program Counter 1 Register | | | Figure 16-7 | Program Counter 2 Register | | | Figure 16-8 | Current Processor Status After an Interrupt or Trap | | | Figure 16-9 | Current Processor Status Before Interrupt Return | | | Figure 16-10 | | 16-18 | | Figure 16-11 | - · · · - · · · · · · · · · · · · · · · | 16-18 | | Figure 16-12 | | 16-19 | | Figure 16-13 | • | 16-22 | | Figure 16-14 | • | 16-23 | | Figure 16-15 | | 16-24 | | Figure 17-1 | Valid Transitions for CNTL Field | | | Figure 17-2 | Input Boundary-Scan Cell | | | Figure 17-3 | Output Boundary-Scan Cell | | | Figure 17-4 | Processor Status While in Load Test Instruction Mode | 17-13 | | Figure 17-5 | Using an Am29200 Microcontroller to Emulate an Am29205 Microcontroller | 17 10 | |----------------|------------------------------------------------------------------------|-------| | F! 40.4 | | | | Figure 18-1 | Instruction Format | | | Figure 18-2 | Frequently Occurring Instruction Field Uses | | | Figure 18-3 | Instruction-Description Format | | | Figure B-1 | General-Purpose Register Organization | | | Figure B-2 | Register Bank Organization | | | Figure B-3 | Special-Purpose Registers | | | Figure C-1 | On-Chip Peripheral Registers | C-1 | | LIST OF TABLES | | | | Table 1-1 | Am29200 and Am29205 Microcontrollers: Feature Summary . | 1-2 | | Table 2-1 | Integer Arithmetic Instructions | 2-2 | | Table 2-2 | Compare Instructions | 2-3 | | Table 2-3 | Logical Instructions | 2-4 | | Table 2-4 | Shift Instructions | 2-4 | | Table 2-5 | Data Movement Instructions | 2-5 | | Table 2-6 | Constant Instructions | 2-5 | | Table 2-7 | Floating-Point Instructions | 2-6 | | Table 2-8 | Branch Instructions | | | Table 2-9 | Miscellaneous Instructions | 2-8 | | Table 2-10 | Reserved Instructions | 2-8 | | Table 7-1 | Internal Peripheral Address Ranges | 7-8 | | Table 7-2 | Internal Peripheral Address Assignments | 7-10 | | Table 9-1 | Address Multiplexing for 16-bit DRAM Memory | 9-5 | | Table 9-2 | Address Multiplexing for 32-bit DRAM Memory | 9-6 | | Table 9-3 | DRAM Address Multiplexing (by-4 DRAMs) | 9-6 | | Table 9-4 | DRAM Address Connections to Microcontroller (by-4 DRAMs) | 9-7 | | Table 16-1 | Vector Number Assignments | | | Table 16-2 | Interrupt and Trap Priority Table | 16-15 | | Table 17-1 | Instruction Scan Path | 17-8 | | Table 17-2 | Main Data Scan Path | 17-9 | | Table 17-3 | ICTEST1 Scan Path | 17-10 | | Table 17-4 | ICTEST2 Scan Path | 17-11 | | Table B-1 | Processor Register Field Summary | B-7 | | Table C-1 | Peripheral Register Field Summary | C-6 | ## INTRODUCTION AND OVERVIEW #### Am29200 AND Am29205 RISC MICROCONTROLLERS The Am29200 and Am29205 microcontrollers are part of a growing family of 32-bit reduced-instruction set (RISC) processors employing submicron circuits to increase the degree of system integration, yielding very low system cost. Dense circuitry and a large number of on-chip peripherals minimize the number of components required to implement embedded systems, while providing performance superior to that of complex-instruction-set (CISC) microprocessors. Systems implemented with the Am29200 or Am29205 microcontroller can achieve higher performance at lower cost than existing systems. The Am29200 and Am29205 microcontrollers are binary compatible with all other members of the 29K Family, further broadening the price/performance range of the 29K Family. The Am29200 and Am29205 microcontrollers were designed expressly to meet the requirements of embedded applications such as laser printers, graphics processing, application program interface (API) accelerators, X terminals and servers, and scanners. Such applications make the following demands on system design: - Performance at low cost: A processor must interface with memory and peripherals with a minimum number of external components. - Design flexibility: One basic design must be extensible to an entire product line. - Reduced time-to-market: A complete set of development, debug, and benchmarking tools is critical for reducing product development time. - A rational, easy upgrade path: The processor family must provide bus- and software-compatibility so processor upgrades are transparent to both hardware and software. The Am29200 and Am29205 microcontrollers are optimized for any embedded application requiring better-than-CISC performance at minimal system cost. The electronic components for many systems, such as personal laser printers, amount to little more than the Am29200 or Am29205 microcontroller, ROM, DRAM, and electrical buffering. ## **DESIGN PHILOSOPHY** The 29K Family of processors results from a design philosophy that considers processor performance in light of the processor's hardware and software environment. The key to maximizing performance is understanding that the processor is part of an integrated system, and is itself a collection of components that must be properly integrated. Processor features must be considered not only on their own merits, but also in relation to other components of the system. A particular feature that, while considered alone may increase one aspect of processor performance, may actually decrease the performance of the total system, because of the burden it places elsewhere in the system. As an illustration, consider the factors involved in the execution time of any processor task: ## Task Time = (Instructions / Task) \* (Cycles / Instruction) \* (Time / Cycle) To minimize the time taken, it is necessary to minimize the above product. This is not equivalent to minimizing all the terms that contribute to the product; in fact, this is generally not possible due to the interaction of the terms. As an example of the interaction of the previous terms, consider the number of instructions required for a task. An attempt to minimize this number, a more or less traditional approach to processor architecture design, increases the average number of cycles required for the execution of an instruction, because of the increased number of operations performed by each instruction. In addition, cycle time is increased because of instruction-decode time. A second example of the interaction in the previous equation appears in an attempt to reduce the cycle time through the pipelining of operations. In theory, the cycle time can be made arbitrarily small by the definition of an arbitrarily large number of pipeline stages. In practice, at least in the case of general-purpose processors, pipelining rarely yields much of its potential benefit. This is due to situations where the pipeline cannot be kept fully occupied, such as when memory references and branches occur. In these situations, additional pipeline stages increase the number of cycles required for an operation, and thus affect the *Cycles / Instruction* term. ## **Optimum Performance** Each of the terms in the previous equation has some minimum bound for a given implementation technology and task. In general, this minimum bound cannot be approached without an offsetting increase in the other terms, making the overall product less-than-optimum. The question then arises, what combination of terms will yield an optimum product? There are several things to note when answering this question. The first observation is that the number of operations underlying a given task is more or less fixed. Any single processor ultimately limits the time required for a task because it has a single execution unit and a single instruction stream. The operations that must be performed are reflected in the *Instructions / Task* and *Cycles / Instruction* terms. These operations may be performed by relatively few instructions, where each instruction takes multiple cycles to execute, or by a larger number of instructions, where each takes a single cycle to execute. In the first case, the instructions are complex; in the second, they are simple. The point is that the trade-off between simple and complex instructions is not one-to-one. For example, reducing the number of cycles per instruction by a factor of three does not increase the number of instructions per task by the same factor. There are two reasons for this. The first is that even when an instruction set supports complex operations, a large proportion of the instructions that are executed perform operations that could be performed as well by simple instructions. The second is that simple instructions expose more of the internal processor operation to an optimizing compiler. This allows the compiler to tailor the organization and sequence of operations to the task at hand, thereby reducing the total number of instructions executed. ## **Performance Leverage** Another important observation is that there is a tremendous amount of leverage in the *Time / Cycle* and *Cycles / Instruction* terms. As they are made smaller, they have a proportionally greater effect on performance. For example, a reduction of 10 ns in the cycle time of a processor operating with a 200-ns cycle time yields an increase of 5% in the processor's performance. The same improvement in a processor operating with a 50-ns cycle time yields a 20% increase in performance. Correspondingly, a reduction of 0.2 in the number of cycles per instruction in a processor averaging 5 cycles per instruction yields a 4% increase in performance. However, the same reduction yields a 12.5% performance increase in a processor that averages 1.6 cycles per instruction. #### Conclusion It is possible, and desirable, to increase the number of instructions executed for a given task, and more than make up for the performance impact of this increase by reductions in the cycle time and in the number of cycles per instruction. For example, if both the cycle time and the number of cycles per instruction are reduced by a factor of three, while the number of instructions for a given task is allowed to grow by 50%, the resulting task time is reduced by a factor of six. The Am29200 and Am29205 microcontrollers were designed with the above effects in mind. Maximum performance is obtained by the optimization of the product of the number of instructions per task, the number of cycles per instruction, and the cycle time, not by minimizing one factor at the expense of the others. This is accomplished by careful definition of all processor components. In particular: - The Instruction / Task term is optimized by the definition of simple instructions. The processor provides an efficient instruction set and a large number of general-purpose registers to an optimizing, high-level language compiler. Most reductions in this term are accomplished by the compiler. The number of instructions for a given task may be greater than the number of instructions for processors with complex instruction sets. However, this increase is more than offset by other improvements in processor performance. - The Cycles / Instruction term is optimized by the data-flow structure and performance-enhancing features of the processor. A large amount of processor hardware is dedicated to achieving an average instruction-execution rate that is close to single-cycle execution. - The *Time / Cycle* term is optimized by the implementation technology, the processor system interface, and judicious use of pipelining. The simplicity of the instruction set and processor features helps minimize the cycle time. #### **PURPOSE OF THIS MANUAL** This manual describes the technical features, programming interface, on-chip peripherals, and complete instruction set of the Am29200 and Am29205 microcontrollers. ## **INTENDED AUDIENCE** This manual is intended for system hardware and software architects and system engineers who are designing or are considering designing systems based on the Am29200 and Am29205 microcontrollers. #### **USER'S MANUAL OVERVIEW** This manual contains information on the Am29200 and Am29205 microcontrollers and is essential for system hardware and software architects and design engineers. Additional information is available in the form of data sheets, application notes, and other documentation provided with software products and hardware-development tools. The information in this manual is organized into eighteen chapters: - Chapter 1 introduces the features and performance aspects of the Am29200 and Am29205 microcontrollers. - Chapter 2 describes the programmer's model of the Am29200 and Am29205 microcontrollers, including the instruction set and register model. - Chapter 3 expands on the programmer's model, discussing different data formats and data handling. Instructions that manipulate external data are also discussed. - Chapter 4 details the management of the run-time stack and defines the conventions that apply to procedure linkage and register usage. - Chapter 5 describes the internal **pipelining** and the effects of the pipeline on program behavior. - Chapter 6 describes the system-protection features provided by the Am29200 and Am29205 microcontrollers. - Chapter 7 provides an overview of the processor's system interfaces and the system components that are integrated on-chip. - Chapter 8 describes the ROM interface. - Chapter 9 describes the DRAM interface. - Chapter 10 describes the **peripheral interface adapter**, which is used for glueless attachment of a number of peripheral components. - Chapter 11 describes the DMA controller. - Chapter 12 describes the programmable I/O port. - Chapter 13 describes the parallel port. - Chapter 14 describes the serial port. - Chapter 15 describes the video interface. - Chapter 16 provides a description of the interrupt and trap mechanism and the handling of interrupts and traps, including the operation of the on-chip interrupt controller. - Chapter 17 describes the software and hardware facilities for debugging and testing. - Chapter 18 provides a detailed description of the **instruction set**. For those readers desiring only a **brief overview** of the Am29200 and Am29205 microcontrollers, Chapter 1 identifies the outstanding features of each device. This chapter addresses the basic software and hardware concerns. Chapters 2, 3, and 5 are recommended reading for both hardware and software developers. For software architects and system programmers interested mainly in **software-related issues**, Chapters 4, 6, and 16 provide the necessary information. Chapters 17 and 18 also provide related information. For hardware architects and systems hardware designers interested mainly in **hardware-related issues**, Chapters 7 through 15 and Chapter 17 provide most of the required information. Chapters 5 and 18 also provide related information. For users already familiar with other 29K Family processors, Chapters 7–15 describe the **on-chip peripherals and system functions** unique to the Am29200 and Am29205 microcontrollers. ## **AMD DOCUMENTATION** # 29K Family | ORDER NO. | DOCUMENT | |-----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 10620 | Am29000® and Am29005™ Microprocessors User's Manual and Data Sheet Describes the Am29000 and Am29005 microprocessors' technical features, programming interface, and complete instruction set. | | 15723 | Am29030™ and Am29035™ Microprocessors User's Manual and Data Sheet Describes the Am29030 and Am29035 microprocessors' technical features, programming interface, and complete instruction set. | | 14779 | Am29050™ Microprocessor User's Manual Describes the Am29050 microprocessor's technical features, programming interface, and complete instruction set. | | 16361 | Am29200 and Am29205 RISC Microcontrollers Data Sheet Describes the Am29200 and Am29205 microcontrollers' technical features, including electrical and mechanical specifications. | | 17741 | Am29240 <sup>™</sup> , Am29245 <sup>™</sup> , and Am29243 <sup>™</sup> RISC Microcontrollers User's Manual and Data Sheet Describes the Am29240, Am29245, and Am29243 microcontrollers' technical features, programming interface, and complete instruction set. | | 17882 | Am29240, Am29245, and Am29243 RISC Microcontrollers Brochure Describes features, reference designs, and tool support for this series of high-performance RISC microcontrollers. | | 18002 | <b>29K Family Comparison Chart</b> Compares the features of all 29K microprocessors and microcontrollers in a single chart organized for easy reference. | | 11426 | Fusion29K <sup>SM</sup> Catalog Provides information on more than 200 tools that speed a 29K Family | embedded product to market. Includes products from over 100 expert suppliers of embedded development solutions. Design solution chapters include: laser printer and OCR solutions, graphics solutions, and networking solutions. 12990 Fusion Newsletter Contains quarterly updates on developments in the Am186 Family, 29K Family, and E series of microprocessors and features new Fusion Partner solutions. 15176 29K Laser Printer Solutions Brochure Reviews how the 29K Family of microprocessors fits into the laser printer marketplace. Includes a description of AMD's PCL and PostScript® Laser29K™ Low-Cost Raster Image Processor demonstration boards. 10344 29K Family Design-Made-Easy Solutions Brochure Presents an overview of the entire 29K Family of microprocessors and microcontrollers. Features development support products. 16693 RISC Design-Made-Easy™ Applications Guide Presents topics on the 29K Family, including interfaces to integer multipliers, context switching, TLB handlers, benchmarking applications, byte-writable memories for three-bus microprocessors, host interface (HIF) version 2.0 specification, using the Am29000 microprocessor as a high-performance DMA controller, and writing interrupt handlers. ## **Development Tools** 17704 Am29200 and Am29205 RISC Microcontroller Brochure Reviews how the SA-29200 and SA-29205 demonstration boards and the SA-29200 expansion board use the Am29200 or Am29205 microcontroller to meet requirements for low-cost embedded applications. Includes additional support product and ordering information. 10287 MiniMON29K™ Portable Debug Monitor Data Sheet 10626 XRAY29K™ Source-Level Debugger Data Sheet 10957 High C® 29K Development Toolkit Data Sheet To order literature, contact your local AMD sales office or call: 800-2929-AMD, ext. 3 (in the U.S.), or 800-531-5202, ext. 55651 (in Canada), or direct dial from any location: 512-602-5651. #### **RELATED PUBLICATIONS** The IEEE Standard 1149.1-1990 (JTAG) may be ordered from IEEE Computer Society Press Customer Service Center 10662 Los Vaqueros Circle P.O. Box 3014 Los Alamitos, CA 90720-1264 USA IEEE Catalog No. SH13144 1-800-CS-BOOKS 714-821-4010 (fax) ## FEATURES AND PERFORMANCE This chapter provides a general evaluation of the Am29200 and Am29205 microcontrollers. A detailed technical description of the Am29200 and Am29205 microcontrollers is contained in subsequent chapters. This chapter informally describes the features of the two microcontrollers, concentrating on features which distinguish them from other available processors and describing how these features enhance system performance and cost-effectiveness. This chapter consists of the following sections: - Distinctive Characteristics - Key Features and Benefits - Performance Overview - Debugging and Testing ## 1.1 DISTINCTIVE CHARACTERISTICS The Am29200 and Am29205 RISC microcontrollers are highly integrated, 32-bit embedded processors implemented in complementary metal-oxide semiconductor (CMOS) technology. Through submicron technology, the Am29200 and Am29205 microcontrollers incorporate a complete set of system facilities commonly found in printing, imaging, graphics, and other embedded applications. The distinctive features of each microcontroller are compared in Table 1-1. Based on the 29K architecture, the Am29200 and Am29205 microcontrollers are part of a growing family of RISC microcontrollers, which also includes the high-performance Am29240, Am29245, and Am29243 RISC microcontrollers. Table 1-1 Am29200 and Am29205 Microcontrollers: Feature Summary | FEATURE | Am29200 Microcontroller | Am29205 Microcontroller | |-------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------|----------------------------------------------------------| | Data Bus Width<br>Internal<br>External | 32 bits<br>32 bits | 32 bits<br>16 bits | | Address Bus Width | 24 bits | 22 bits | | ROM Interface Banks Width ROM Size (Max/Bank) Boot-up ROM Width Burst-mode access | 4<br>8, 16, 32 bits<br>16 Mbytes<br>8, 16, 32 bits<br>Supported | 3<br>8, 16 bits<br>4 Mbytes<br>16 bits<br>Not Supported | | DRAM Interface Banks Width Size: 32-bit mode Size: 16-bit mode Video DRAM | 4<br>16, 32 bits<br>16 Mbytes/bank<br>8 Mbytes/bank<br>Supported | 4<br>16 bits only<br>—<br>8 Mbytes/bank<br>Not Supported | | On-Chip DMA Width (ext. peripherals) Externally Controlled External Master Access External Terminate Signal | 8, 16, 32 bits<br>2 Channels<br>Yes<br>Yes | 8, 16 bits<br>1 Channel<br>No<br>No | | Peripheral Interface Adapter<br>(PIA)<br>PIA Ports<br>Data Width | 6<br>8, 16, 32 bits | 2<br>8, 16 bits | | Programmable I/O Port (PIO) Signals | 16 | 8 | | Serial Ports Ports DSR/DTR | 1 Port<br>Supported | 1 Port<br>Uses PIO signals | | Interrupt Controller External Interrupt Pins External Trap and Warn Pins | 4<br>3 | 2<br>0 | | Parallel Port Controller<br>32-bit Transfer | Yes | No | | JTAG Debug Support | Yes | No | #### 1.1.1 Am29200 Microcontroller The Am29200 microcontroller meets the common requirements of embedded applications such as industrial control, graphics processing, imaging applications, laser printers, and general purpose applications requiring high-performance in a compact design. Figure 1-1 shows a block diagram of the Am29200 microcontroller. The microcontroller includes the following features: - Completely integrated system for embedded applications - Full 32-bit architecture - 16- and 20-MHz operating frequencies - Price/performance flexibility. Support for several low-cost memory configurations allows performance points of 8, 6, 5, and 3 million instructions per second sustained (at 16 MHz). - 4-Gbyte virtual address space, 304-Mbyte physical address space implemented - 192 general-purpose registers; three-address instruction architecture - Glueless system interfaces with on-chip wait state control - Four banks of ROM, each separately programmable for 8-, 16-, or 32-bit interface - Four banks of DRAM, each separately programmable for 16- or 32-bit interface - Burst-mode and page-mode access support - DRAM mapping on-chip - Advanced debugging support - IEEE Std. 1149.1–1990 (JTAG) compliant Standard Test Access Port and Boundary Scan Architecture - Two-channel DMA controller with queuing on one channel - 6-port peripheral interface adapter - 16-line programmable I/O port - Bidirectional bit serializer/deserializer (video interface) - Serial port (UART) - Bidirectional parallel port controller - Interrupt controller - On-chip timer - Binary compatibility with all 29K Family microcontrollers and microprocessors Figure 1-1 Am29200 Microcontroller Block Diagram #### 1.1.2 Am29205 Microcontroller The Am29205 RISC microcontroller (see Figure 1-2) is a highly integrated, low-cost derivative of the Am29200 microcontroller, with a 16-bit instruction bus, fewer peripheral ports, and no JTAG interface. It includes the following features: - Completely integrated system for embedded applications - Full 16-bit external, 32-bit internal architecture - 12- and 16-MHz operating frequencies - 68-Mbyte address space - Two-channel DMA controller (one external) - Three separately programmable ROM banks with16-bit ROM interface - Fully functional 16-bit DRAM interface complete with address MUXing, Refresh, and RAS/CAS generation - Two-port peripheral interface adapter - Eight-line programmable I/O port - Bidirectional bit serializer/deserializer (video interface) - Serial port (UART) - Bidirectional parallel port controller - Interrupt controller - On-chip timer Figure 1-2 Am29205 Microcontroller Block Diagram ## 1.2 KEY FEATURES AND BENEFITS ## 1.2.1 Complete Set of Common System Peripherals The Am29200 and Am29205 microcontrollers minimize system cost by incorporating a complete set of system facilities commonly found in embedded applications, eliminating the cost of additional components. The on-chip functions include: a ROM controller, a DRAM controller, a peripheral interface adapter, a DMA controller, a programmable I/O port, a parallel port, a serial port, and an interrupt controller. A serializer/deserializer (video interface) is also included for printer, scanner, and other imaging applications. By providing a complete set of common system peripherals on-chip and glueless interfacing to external memories and peripherals, these RISC microcontrollers let product designers capitalize on the very low system cost made possible by the integration of processor and peripherals. Many simple systems can be built using only the Am29200 or Am29205 microcontroller and external ROM and/or DRAM memory. ## 1.2.1.1 ROM Controller (Chapter 8) The ROM controller supports four individual banks of ROM or other static memory in the Am29200 microcontroller and three banks in the Am29205 microcontroller. Each ROM bank has its own timing characteristics, and each bank can be a different size: either 8, 16, or 32 bits wide in the Am29200 microcontroller and 8 or 16 bits wide in the Am29205 microcontroller. The ROM banks can appear as a contiguous memory area of up to 64 Mbytes in size on the Am29200 microcontroller. The ROM controller also supports writes to the ROM memory space for devices such as flash EPROMs and SRAMs. ## 1.2.1.2 DRAM Controller (Chapter 9) The DRAM controller supports four separate banks of dynamic memory. On the Am29200 microcontroller, each bank can be a different size: either 16 or 32 bits wide. DRAM banks on the Am29205 microcontroller are 16 bits wide. The DRAM banks can appear as a contiguous memory area of up to 64 Mbytes in size on the Am29200 microcontroller and 32 Mbytes on the Am29205 microcontroller. To support system functions such as on-the-fly data compression and decompression, four 64-Kbyte regions of the DRAM can be mapped into a 16-Mbyte virtual address space. ## 1.2.1.3 Peripheral Interface Adapter (PIA) (Chapter 10) The peripheral interface adapter allows for additional system features implemented by external peripheral chips. The PIA permits glueless interfacing from the Am29200 microcontroller to as many as six external peripheral regions and from the Am29205 microcontroller to two external peripherals. #### 1.2.1.4 DMA Controller (Chapter 11) The DMA controller in the Am29200 microcontroller provides two channels for transferring data between the DRAM and internal or external peripherals. One of the DMA channels is double buffered to relax the constraints on the reload time. On the Am29205 microcontroller, internal 32-bit transfers are supported on two DMA channels; external transfers are limited to 8-or 16-bit data accesses on one DMA channel. #### 1.2.1.5 Interrupt Controller (Section 16.9) The interrupt controller generates and reports the status of interrupts caused by on-chip peripherals. ## 1.2.1.6 I/O Port (Chapter 12) The Am29200 microcontroller's I/O port permits direct access to 16 individually programmable external input/output signals. Eight signals are available on the Am29205 microcontroller. These eight signals can be configured to cause interrupts on either microcontroller. ## 1.2.1.7 Parallel Port (Chapter 13) The parallel port implements a bidirectional IBM PC-compatible parallel interface to a host processor. ## 1.2.1.8 Serial Port (Chapter 14) The serial port implements a full-duplex UART. #### 1.2.1.9 Serializer/Deserializer (Chapter 15) The bidirectional bit serializer/deserializer (video interface) permits direct connection to a number of laser marking engines, video displays, or raster input devices such as scanners. ## 1.2.2 Wide Range of Price/Performance Points To reduce design costs and time-to-market, one basic system design can be used as the foundation for an entire product line. From this design, numerous implementations of the product at various levels of price and performance may be derived with minimum time, effort, and cost. The Am29200 and Am29205 microcontrollers provide this capability through programmable memory widths, burst-mode and page-mode access support, programmable wait states, and hardware and 29K Family software compatibility. A system can be upgraded without hardware and software redesign using various memory architectures. The ROM controller on the Am29205 microcontroller accommodates memories that are either 8 or 16 bits wide, while that of the Am29200 microcontroller supports either 8-, 16-, or 32-bit memories. The DRAM controller on the Am29205 microcontroller accommodates dynamic memories that are 16 bits wide; the Am29200 microcontroller supports either 16- or 32-bit memories. These unique features provide a flexible interface to low-cost memory as well as a convenient, flexible upgrade path. For example, a system can start with a 16-bit memory design and can subsequently improve performance by migrating to a 32-bit memory design. One particular advantage is the ability to add memory in half-megabyte increments. This provides significant cost savings for applications that do not require larger memory upgrades. The Am29200 microcontroller family allows users to address a wide range of price/performance points, with higher performance and lower cost than existing designs based on CISC microprocessors. ## 1.2.3 Glueless System Interfaces The Am29200 and Am29205 microcontrollers also minimize system cost by providing a glueless attachment to external ROMs, DRAMs, and other peripheral components. Processor outputs have edge-rate control that allows them to drive a wide range of load capacitances with low noise and ringing. This eliminates the cost of external logic and buffering. ## 1.2.4 Bus- and Binary-Compatibility Compatibility within a processor family is critical for achieving a rational, easy upgrade path. The Am29200 and Am29205 microcontrollers are members of a bus-compatible family of RISC microcontrollers, which also includes the high-performance Am29240, Am29245, and Am29243 microcontrollers. Future members of this family will improve in price and performance and system capabilities without requiring that users redesign their system hardware or software. Bus compatibility ensures a convenient upgrade path for future systems. The Am29200 microcontroller is binary compatible with the Am29240, Am29245, and Am29243 microcontrollers, as well as the Am29000, Am29005, Am29030, Am29035, and Am29050 microprocessors. The Am29200 microcontroller family provides a migration path to low-cost, highly integrated systems for products based on other 29K Family microprocessors, without requiring expensive rewrites of application software. ## 1.2.5 Complete Development and Support Environment A complete development and support environment is vital for reducing a product's time-to-market. Advanced Micro Devices has created a standard development environment for the 29K Family of processors. In addition, the Fusion29K third-party support organization provides the most comprehensive customer/partner program in the embedded processor market. Advanced Micro Devices offers a complete set of hardware and software tools for design, integration, debugging, and benchmarking. These tools, which are available now for the 29K Family, include the following: - High C® 29K optimizing C compiler with assembler, linker, ANSI library functions, and 29K architectural simulator - XRAY29K™ source-level debugger - MiniMON29K<sup>™</sup> debug monitor - A complete family of demonstration and development boards In addition, Advanced Micro Devices has developed a standard host interface (HIF) specification for operating system services, the Universal Debug Interface (UDI) for seamless connection of debuggers to ICEs and target hardware, and extensions for the UNIX common object file format (COFF). This support is augmented by an engineering hotline, an on-line bulletin board, and field application engineers. #### 1.3 PERFORMANCE OVERVIEW The Am29200 and Am29205 microcontrollers offer a significant margin of performance over CISC microprocessors in existing embedded designs, since the majority of processor features were defined for the maximum achievable performance at a very low cost. This section describes the features of the Am29200 and Am29205 microcontrollers from the point of view of system performance. ## 1.3.1 Instruction Timing (Section 2.1) The Am29200 and Am29205 microcontrollers use an arithmetic/logic unit, a field shift unit, and a prioritizer to execute most instructions. Each of these is organized to operate on 32-bit operands and provide a 32-bit result. All operations are performed in a single cycle. The performance degradation of load and store operations is minimized in the Am29200 and Am29205 microcontrollers by overlapping them with instruction execution, by taking advantage of pipelining, and by organizing the flow of external data into the processor so that the impact of external accesses is minimized. ## 1.3.2 Pipelining (Chapter 5) Instruction operations are overlapped with instruction fetch, instruction decode and operand fetch, instruction execution, and result write-back to the register file. Pipeline forwarding logic detects pipeline dependencies and routes data as required, avoiding delays that might arise from these dependencies. Pipeline interlocks are implemented by processor hardware. Except for a few special cases, it is not necessary to rearrange programs to avoid pipeline dependencies, although this is sometimes desirable for performance. ## 1.3.3 Burst-Mode and Page-Mode Memories (Sections 8.3.5, 9.3.7) The Am29200 microcontroller directly supports burst-mode memories in ROM address space. The burst-mode memory supplies instructions at the maximum bandwidth, without the complexity of an external cache or the performance degradation due to cache misses. Both the Am29200 and Am29205 microcontrollers can also use the page-mode capability of common DRAMs to improve the access time in cases where page-mode accesses can be used. This is particularly useful in very low-cost systems with 16-bit-wide DRAMs, where the DRAM must be accessed twice for each 32-bit operand. ## 1.3.4 Instruction Set Overview (Section 2.1, Chapter 18) The Am29200 and Am29205 microcontrollers employ a three-address instruction set architecture. The compiler or assembly-language programmer is given complete freedom to allocate register usage. There are 192 general-purpose registers, allowing the retention of intermediate calculations and avoiding needless data destruction. Instruction operands may be contained in any of the general-purpose registers, and the results may be stored into any of the general-purpose registers. The instruction set contains 117 instructions that are divided into nine classes. These classes are integer arithmetic, compare, logical, shift, data movement, constant, floating point, branch, and miscellaneous. The floating-point instructions are not executed directly, but are emulated by trap handlers. All directly implemented instructions are capable of executing in one processor cycle, with the exception of interrupt returns, loads, and stores. ## 1.3.5 Data Formats (Chapter 3) The Am29200 and Am29205 microcontrollers define a word as 32 bits of data, a half-word as 16 bits, and a byte as 8 bits. The hardware provides direct support for word-integer (signed and unsigned), word-logical, word-boolean, half-word integer (signed and unsigned), and character data (signed and unsigned). Word-boolean data is based on the value contained in the most significant bit of the word. The values TRUE and FALSE are represented by the MSB values 1 and 0, respectively. Other data formats, such as character strings, are supported by instruction sequences. Floating-point formats (single and double precision) are defined for the processor; however, there is no direct hardware support for these formats in the Am29200 or Am29205 microcontroller. ## 1.3.6 Protection (Chapter 6) The Am29200 and Am29205 microcontrollers offer two mutually exclusive modes of execution, the user and supervisor modes, that restrict or permit accesses to certain processor registers and external storage locations. The register file may be configured to restrict accesses to supervisor-mode programs on a bank-by-bank basis. ## 1.3.7 DRAM Mapping (Section 9.3.5) The Am29200 and Am29205 microcontrollers provide a 16-Mbyte region of virtual memory that is mapped to one of four 64-Kbyte blocks in the physical DRAM memory. This supports system functions such as on-the-fly data compression and decompression, allowing a large data structure such as a frame buffer to be stored in a compressed format while the application software operates on a region of the structure that is decompressed. Using a mechanism that is analogous to demand paging, system software moves data between the compressed and decompressed formats in a way that is invisible to the applications software. This feature can greatly reduce the amount of memory required for printing, imaging, and graphics applications. ## 1.3.8 Interrupts and Traps (Chapter 16) When the microcontroller takes an interrupt or trap, it does not automatically save its current state information in memory. This lightweight interrupt and trap facility greatly improves the performance of temporary interruptions such as simple operating-system calls that require no saving of state information. In cases where the processor state must be saved, the saving and restoring of state information is under the control of software. The methods and data structures used to handle interrupts—and the amount of state saved—may be tailored to the needs of a particular system. Interrupts and traps are dispatched through a 256-entry vector table that directs the processor to a routine that handles a given interrupt or trap. The vector table may be relocated in memory by the modification of a processor register. There may be multiple vector tables in the system, though only one is active at any given time. The vector table is a table of pointers to the interrupt and trap handlers and requires only 1 Kbyte of memory. The processor performs a vector fetch every time an interrupt or trap is taken. The vector fetch requires at least three cycles, in addition to the number of cycles required for the basic memory access. ## 1.4 DEBUGGING AND TESTING (Chapter 17) Software debugging on the Am29200 and Am29205 microcontrollers is facilitated by the instruction trace facility and instruction breakpoints. Instruction tracing is accomplished by forcing the processor to trap after each instruction has been executed. Instruction breakpoints are implemented by the HALT instruction or by a software trap. The Am29200 microcontroller provides two additional features to assist system debugging and testing: The test/development interface is composed of a group of pins that indicate the state of the processor and control the operation of the processor. ■ An IEEE Standard 1149.1–1990 (JTAG) compliant Standard Test Access Port and Boundary-Scan Architecture. The test access port provides a scan interface for testing system hardware in a production environment, and contains extensions that allow a hardware-development system to control and observe the processor without interposing hardware between the processor and system. Hardware testing and debugging on the Am29205 microcontroller are supported by using an Am29200 microcontroller to emulate an Am29205 microcontroller. # **PROGRAMMING** This chapter focuses on programming the Am29200 and Am29205 microcontrollers. First, this chapter presents an instruction set overview. It then describes the register model, emphasizing the general- and special-purpose registers. This chapter also describes certain special-purpose registers that deal directly with instruction execution. Finally, this chapter describes general considerations related to application programming. #### 2.1 INSTRUCTION SET The Am29200 and Am29205 microcontrollers recognize 117 instructions. All instructions execute in a single cycle, except for IRET, IRETINV, LOADM, STOREM, and certain arithmetic instructions such as floating-point instructions. Floating-point and integer multiply and divide instructions are not implemented directly in hardware, but are implemented by a virtual arithmetic interface invoked using instruction traps (see Section 2.8). Most instructions deal with general-purpose registers for operands and results; however, in most instructions, an 8-bit constant can be used in place of a register-based operand. Some instructions deal with special-purpose registers and external devices and memories. This section describes the nine instruction classes and provides a brief summary of instruction operations. A detailed instruction specification is contained in Chapter 18. Section 18.1 describes the nomenclature used here. If the processor attempts to execute an unimplemented instruction, an Illegal Opcode trap occurs unless the instruction is reserved for emulation (see Section 2.1.10). Reserved instructions are assigned individual traps. ## 2.1.1 Integer Arithmetic The Integer Arithmetic instructions perform add, subtract, multiply, and divide operations on word-length integers. Certain instructions in this class cause traps if signed or unsigned overflow occurs during the execution of the instruction. There is support for multiprecision arithmetic on operands whose lengths are multiples of words. All instructions in this class set the ALU Status Register. The Integer Arithmetic instructions are shown in Table 2-1. In the Am29200 and Am29205 microcontrollers, the integer multiply and divide instructions cause traps to routines which perform the operations. ## 2.1.2 Compare The Compare instructions (Table 2-2) test for various relationships between two values. For all Compare instructions except the CPBYTE instruction, the comparisons are performed on word-length signed or unsigned integers. There are two types of Compare instructions. The first type places a Boolean value reflecting the outcome of the compare into a general-purpose register. For the second type, assert instructions, instruction execution continues only if the comparison is true; otherwise a trap occurs. The assert instructions specify a vector for the trap (see Section 16.3). The assert instructions support run-time operand checking and operating-system calls. If the trap occurs in the User mode, and a trap number between 0 and 63 is specified by the instruction, a Protection Violation trap occurs. Table 2-1 Integer Arithmetic Instructions | Mnemonic | Operation Description | | |----------|-----------------------------------------------------------------------------|--| | ADD | DEST ← SRCA + SRCB | | | ADDS | DEST ← SRCA + SRCB IF signed overflow THEN Trap (Out of Range) | | | ADDU | DEST ← SRCA + SRCB IF unsigned overflow THEN Trap (Out of Range) | | | ADDC | DEST ← SRCA + SRCB + C | | | ADDCS | DEST ← SRCA + SRCB + C IF signed overflow THEN Trap (Out of Range) | | | ADDCU | DEST ← SRCA + SRCB + C IF unsigned overflow THEN Trap (Out of Range) | | | SUB | DEST ← SRCA – SRCB | | | SUBS | DEST ← SRCA – SRCB IF signed overflow THEN Trap (Out of Range) | | | SUBU | DEST ← SRCA – SRCB IF unsigned underflow THEN Trap (Out of Range) | | | SUBC | DEST ← SRCA – SRCB – 1 + C | | | SUBCS | DEST ← SRCA – SRCB – 1 + C<br>IF signed overflow THEN Trap (Out of Range) | | | SUBCU | DEST ← SRCA – SRCB – 1 + C IF unsigned underflow THEN Trap (Out of Range) | | | SUBR | DEST ← SRCB –SRCA | | | SUBRS | DEST ← SRCB – SRCA IF signed overflow THEN Trap (Out of Range) | | | SUBRU | DEST ← SRCB - SRCA<br>IF unsigned underflow THEN Trap (Out of Range) | | | SUBRC | DEST ← SRCB − SRCA − 1 + C | | | SUBRCS | DEST ← SRCB – SRCA – 1 + C<br>IF signed overflow THEN Trap (Out of Range) | | | SUBRCU | DEST ← SRCB – SRCA – 1 +C IF unsigned underflow THEN Trap (Out of Range) | | | MULTIPLU | DEST ← SRCA · SRCB (unsigned) | | | MULTIPLY | $DEST \leftarrow SRCA \cdot SRCB \ (signed)$ | | | MUL | Perform one-bit step of a multiply operation (signed) | | | MULL | Complete a sequence of multiply steps | | | MULTM | $DEST \leftarrow SRCA \cdot SRCB \text{ (signed), most significant bits}$ | | | MULTMU | $DEST \leftarrow SRCA \cdot SRCB \text{ (unsigned), most significant bits}$ | | | MULU | Perform one-bit step of a multiply operation (unsigned) | | | DIVIDE | DEST ← (Q//SRCA)/SRCB (signed)<br>Q ← Remainder | | | DIVIDU | DEST ← (Q//SRCA)/SRCB (unsigned) Q ← Remainder | | | DIVO | Initialize for a sequence of divide steps (unsigned) | | | DIV | Perform one-bit step of a divide operation (unsigned) | | | DIVL | Complete a sequence of divide steps (unsigned) | | | DIVREM | Generate remainder for divide operation (unsigned) | | Table 2-2 Compare Instructions | CPEQ | IF SRCA = SRCB THEN DEST ← TRUE<br>ELSE DEST ← FALSE | |--------|--------------------------------------------------------------------------------------------------------------------------------------------------------| | CPNEQ | IF SRCA <> SRCB THEN DEST $\leftarrow$ TRUE ELSE DEST $\leftarrow$ FALSE | | CPLT | IF SRCA < SRCB THEN DEST $\leftarrow$ TRUE ELSE DEST $\leftarrow$ FALSE | | CPLTU | IF SRCA < SRCB (unsigned) THEN DEST $\leftarrow$ TRUE ELSE DEST $\leftarrow$ FALSE | | CPLE | IF SRCA ≤ SRCB THEN DEST ← TRUE<br>ELSE DEST ← FALSE | | CPLEU | IF SRCA $\leq$ SRCB (unsigned) THEN DEST $\leftarrow$ TRUE ELSE DEST $\leftarrow$ FALSE | | CPGT | IF SRCA > SRCB THEN DEST ← TRUE<br>ELSE DEST ← FALSE | | CPGTU | IF SRCA > SRCB (unsigned) THEN DEST $\leftarrow$ TRUE ELSE DEST $\leftarrow$ FALSE | | CPGE | If SRCA ≥ SRCB THEN DEST ← TRUE<br>ELSE DEST ← FALSE | | CPGEU | IF SRCA ≥ SRCB (unsigned) THEN DEST $\leftarrow$ TRUE ELSE DEST $\leftarrow$ FALSE | | СРВҮТЕ | IF (SRCA.BYTE0 = SRCB.BYTE0) OR (SRCA.BYTE1 = SRCB.BYTE1) OR (SRCA.BYTE2 = SRCB.BYTE2) OR (SRCA.BYTE3 = SRCB.BYTE3) THEN DEST ← TRUE ELSE DEST ← FALSE | | ASEQ | IF SRCA = SRCB THEN Continue<br>ELSE Trap (VN) | | ASNEQ | IF SRCA <> SRCB THEN Continue ELSE Trap (VN) | | ASLT | IF SRCA < SRCB THEN Continue<br>ELSE Trap (VN) | | ASLTU | IF SRCA < SRCB (unsigned) THEN Continue ELSE Trap (VN) | | ASLE | IF SRCA ≤ SRCB THEN Continue<br>ELSE Trap (VN) | | ASLEU | IF SRCA ≤ SRCB (unsigned) THEN Continue<br>ELSE Trap (VN) | | ASGT | IF SRCA > SRCB THEN Continue<br>ELSE Trap (VN) | | ASGTU | IF SRCA > SRCB (unsigned) THEN Continue ELSE Trap (VN) | | ASGE | IF SRCA ≥ SRCB THEN Continue<br>ELSE Trap (VN) | | ASGEU | IF SRCA ≥ SRCB (unsigned) THEN Continue ELSE Trap (VN) | ## 2.1.3 Logical The Logical instructions (Table 2-3) perform a set of bit-by-bit Boolean functions on word-length bit strings. All instructions in this class set the ALU Status Register. ## Table 2-3 Logical Instructions | Mnemonic | Operation Description | | |----------|------------------------|--| | AND | DEST ← SRCA & SRCB | | | ANDN | DEST ← SRCA & ~ SRCB | | | NAND | DEST ←~ (SRCA & SRCB) | | | OR | DEST ← SRCA I SRCB | | | NOR | DEST ← ~ (SRCA I SRCB) | | | XOR | DEST ← SRCA ^ SRCB | | | XNOR | DEST ← ~ (SRCA ^ SRCB) | | #### 2.1.4 Shift The Shift instructions (Table 2-4) perform arithmetic and logical shifts. All but the EXTRACT instruction operate on word-length data and produce a word-length result. The EXTRACT instruction operates on double-word data and produces a word-length result. If both parts of the double-word for the EXTRACT instruction are from the same source, the EXTRACT operation is equivalent to a rotate operation. For each operation, the shift count is a 5-bit integer, specifying a shift amount in the range of 0 to 31 bits. #### **Table 2-4** Shift Instructions | Mnemonic | Operation Description | | |----------|----------------------------------------------|--| | SLL | DEST ← SRCA << SRCB (zero fill) | | | SRL | DEST ← SRCA >> SRCB (zero fill) | | | SRA | DEST ← SRCA >> SRCB (sign fill) | | | EXTRACT | DEST ← high-order word of (SRCA//SRCB << FC) | | #### 2.1.5 Data Movement The Data Movement instructions (Table 2-5) move bytes, half-words, and words between processor registers. In addition, they move data between general-purpose registers and external devices, and memories. The instructions LOADL and STOREL are provided for compatibility with other 29K processors and are treated as LOAD and STORE instructions. Similarly, the instructions MFTLB and MTTLB perform no operation, except that both are privileged instructions. **Table 2-5** Data Movement Instructions | Mnemonic | Operation Description | |----------|------------------------------------------------------------------------------------------------------------------------------------------------------------------| | LOAD | DEST ← EXTERNAL WORD [SRCB] | | LOADL | Implemented as LOAD | | LOADSET | DEST ← EXTERNAL WORD [SRCB]<br>EXTERNAL WORD [SRCB] ← h'FFFFFFFF' | | LOADM | DEST DEST + COUNT ← EXTERNAL WORD [SRCB] EXTERNAL WORD [SRCB + COUNT · 4] | | STORE | EXTERNAL WORD [SRCB] ← SRCA | | STOREL | Implemented as STORE | | STOREM | EXTERNAL WORD [SRCB]<br>EXTERNAL WORD [SRCB + COUNT · 4] ←<br>SRCA SRCA + COUNT | | EXBYTE | DEST ← SRCB, with low-order byte replaced by byte in SRCA selected by BP | | EXHW | $\label{eq:DEST} \begin{tabular}{l} \textbf{DEST} \leftarrow \textbf{SRCB}, with low-order half-word replaced by half-word in SRCA selected by BP \end{tabular}$ | | EXHWS | DEST ← half-word in SRCA selected by BP, sign-extended to 32 bits | | INBYTE | $\label{eq:def:DEST} \textbf{DEST} \leftarrow \textbf{SRCA}, \text{ with byte selected by BP replaced by low-order byte of SRCB}$ | | INHW | $\label{eq:DEST} \begin{tabular}{l} \textbf{DEST} \leftarrow \textbf{SRCA, with half-word selected by BP replaced by low-order half-word of SRCB} \end{tabular}$ | | MFSR | DEST ← SPECIAL | | MFTLB | no operation (privileged) | | MTSR | SPDEST ← SRCB | | MTSRIM | SPDEST ← 0l16 | | MTTLB | no operation (privileged) | ## 2.1.6 Constant The Constant instructions (Table 2-6) provide the ability to place half-word and word constants into registers. Most instructions in the instruction set allow an 8-bit constant as an operand. The Constant instructions allow the construction of larger constants. **Table 2-6 Constant Instructions** | Mnemonic | Operation Description | | |----------|---------------------------------------------|--| | CONST | DEST ← 0l16 | | | CONSTH | Replace high-order half-word of SRCA by I16 | | | CONSTN | DEST ← 1I16 | | ## 2.1.7 Floating Point The Floating-Point instructions (Table 2-7) provide operations on single-precision (32-bit) or double-precision (64-bit) floating-point data. They also provide conversions between single-precision, double-precision, and integer number representations. In the Am29200 and Am29205 microcontrollers, these instructions cause traps (specified by the vector numbers listed in the table) to routines which perform the floating-point operations. **Table 2-7 Floating-Point Instructions** | Mnemonic | <b>Operation Description</b> | | Vector Number | |----------|--------------------------------------------------------------------|-------------------------------------------------------------------------------|---------------| | FADD | DEST (single-precision) | ← SRCA (single-precision)<br>+ SRCB (single-precision) | 48 | | DADD | DEST (double-precision) | ← SRCA (double-precision) + SRCB (double-precision) | 49 | | FSUB | DEST (single-precision) | ← SRCA (double-precision) – SRCB (single-precision) | 50 | | DSUB | DEST (double-precision) | ← SRCA (double-precision)<br>– SRCB (double-precision) | 51 | | FMUL | DEST (single-precision) | ← SRCA (single-precision) · SRCB (single-precision) | 52 | | FDMUL | DEST (double-precision) | <ul><li>← SRCA (single-precision)</li><li>∙ SRCB (single-precision)</li></ul> | 57 | | DMUL | DEST (double-precision) | ← SRCA (double-precision) · SRCB (double-precision) | 53 | | FDIV | DEST (single-precision) | ← SRCA (single-precision / SRCB (single-precision) | 54 | | DDIV | DEST (double-precision) | ← SRCA (double-precision) / SRCB (double-precision) | 55 | | FEQ | IF SRCA (single-precision) THEN DEST ← TRUE ELSE DEST ← FALSE | = SRCB (single-precision) | 42 | | DEQ | IF SRCA (double-precisior<br>THEN DEST ← TRUE<br>ELSE DEST ← FALSE | n) = SRCB (double-precision) | 43 | | FGE | IF SRCA (single-precision) THEN DEST ← TRUE ELSE DEST ← FALSE | >= SRCB (single-precision) | 46 | | DGE | IF SRCA (double-precisior<br>THEN DEST ← TRUE<br>ELSE DEST ← FALSE | ) >= SRCB (double-precision) | 47 | | FGT | IF SRCA (single-precision) THEN DEST ← TRUE ELSE DEST ← FALSE | > SRCB (single-precision) | 44 | | DGT | IF SRCA (double-precisior<br>THEN DEST ← TRUE<br>ELSE DEST ← FALSE | ) > SRCB (double-precision) | 45 | | SQRT | DEST (single-precision, do<br>← SQRT [SRCA (single-pr | | 37 | | CONVERT | DEST (integer, single-prec<br>← SRCA (integer, single-p | ision, double-precision) | 36 | | CLASS | | ingle-precision, double-precision | n)] 38 | | | | | | #### 2.1.8 Branch The Branch instructions (Table 2-8) control the execution flow of instructions. Branch target addresses may be absolute, relative to the program counter (with the offset given by a signed instruction constant), or contained in a general-purpose register. For conditional jumps, the outcome of the jump is based on a Boolean value in a general-purpose register. Procedure calls are unconditional and save the return address in a general-purpose register. All branches have a delayed effect; the instruction following the branch is executed regardless of the outcome of the branch. Table 2-8 Branch Instructions | Mnemonic | Operation Description | |----------|-------------------------------------------------------------------------------------------------| | CALL | DEST ← PC//00 + 8 PC ← TARGET Execute delay instruction | | CALLI | DEST ← PC//00 + 8 PC ← SRCB Execute delay instruction | | JMP | PC ← TARGET<br>Execute delay instruction | | JMPI | PC ← SRCB<br>Execute delay instruction | | JMPT | IF SRCA = TRUE THEN PC ← TARGET Execute delay instruction | | JMPTI | IF SRCA = TRUE THEN PC ← SRCB Execute delay instruction | | JMPF | IF SRCA = FALSE THEN PC ← TARGET Execute delay instruction | | JMPFI | IF SRCA = FALSE THEN PC ← SRCB Execute delay instruction | | JMPFDEC | IF SRCA = FALSE THEN SRCA ← SRCA − 1 PC ← TARGET ELSE SRCA ← SRCA − 1 Execute delay instruction | #### 2.1.9 Miscellaneous The Miscellaneous instructions (Table 2-9) perform various operations that cannot be grouped into other instruction classes. In certain cases, these are control functions available only to Supervisor-mode programs. The instructions INV and IRETINV are provided for compatibility with other 29K processors. INV performs no operation, and IRETINV performs the same operations as IRET. Both are privileged instructions. #### Table 2-9 Miscellaneous Instructions | Mnemonic | Operation Description | | |----------|--------------------------------------------------------------|--| | CLZ | Determine number of leading zeros in a word | | | SETIP | Set IPA, IPB, and IPC with operand register numbers | | | EMULATE | Load IPA and IPB with operand register numbers and Trap (VN) | | | INV | No operation | | | IRET | Perform an interrupt return sequence | | | IRETINV | Perform an interrupt return sequence | | | HALT | Enter Halt mode | | #### 2.1.10 Reserved Instructions Sixteen operation codes are reserved for instruction emulation. Each of these instructions causes a trap and sets the indirect pointers IPC, IPA, and IPB. The relevant operation codes, and the corresponding trap vectors, are as follows: #### Table 2-10 Reserved Instructions | Operation Codes (Hexadecimal) | Trap Vector Numbers (Decimal) | |-------------------------------|-------------------------------| | D8-DD | 24–29 | | E7-E9 | 39–41 | | F8 | 56 | | FA-FF | 58–63 | The reserved instructions are intended for future processor enhancements, and users desiring compatibility with future processor versions should not use them for any purpose. #### 2.2 REGISTER MODEL The microcontroller has two classes of registers that are accessible by instructions. These are the general-purpose registers and the special-purpose registers. Any operation available to the microcontroller can be performed on the general-purpose registers, while special-purpose registers are accessed only by the instructions MTSR, MTSRIM, and MFSR. This section describes the general-purpose and special-purpose registers. #### 2.2.1 General-Purpose Registers The microcontroller incorporates 192 general-purpose registers. The organization of the general-purpose registers is diagrammed in Figure 2-1. General-purpose registers hold the following types of operands for program use: - 32-bit addresses - 32-bit signed or unsigned integers - 32-bit branch-target addresses - 32-bit logical bit strings - 8-bit signed or unsigned characters Figure 2-1 General-Purpose Register Organization - 16-bit signed or unsigned integers - Word-length Booleans - Single-precision floating-point numbers - Double-precision floating-point numbers (in two register locations) Because a large number of general-purpose registers are provided, a large amount of frequently used data can be kept on-chip, where access time is fastest. Instructions can specify two general-purpose registers for source operands and one general-purpose register for storing the instruction result. These registers are specified by three 8-bit instruction fields containing register numbers. A register may be specified directly by the instruction, or indirectly by one of three special-purpose registers. ## 2.2.1.1 Register Addressing The general-purpose registers are partitioned into 64 global registers and 128 local registers, differentiated by the most significant bit of the register number. The distinction between global and local registers is the result of register-addressing considerations. The following terminology is used to describe the addressing of general-purpose registers: - Register number, a software-level number for a general-purpose register. For example, this is the number contained in an instruction field. Register numbers range from 0 to 255. - Global-register number, a software-level number for a global register. Global-register numbers range from 0 to 127. - Local-register number, a software-level number for a local register. Local-register numbers range from 0 to 127. - Absolute-register number, a hardware-level number used to select a general-purpose register in the register file. Absolute-register numbers range from 0 to 255. #### 2.2.1.2 Global Registers When the most significant bit of a register number is 0, a global register is selected. The seven least significant bits of the register number give the global-register number. For global registers, the absolute-register number is equivalent to the register number. Global registers 2 through 63 are not implemented. An attempt to access these registers yields unpredictable results; however, they may be protected from User-mode access by the Register Bank Protect Register (see Section 6.2.1). The register numbers associated with Global Registers 0 and 1 have special meaning. The number for Global Register 0 specifies that an indirect pointer is to be used as the source of the register number (see Section 2.3); there is an indirect pointer for each of the instruction operand/result registers. Global Register 1 contains the Stack Pointer, which is used in the addressing of local registers. #### 2.2.1.3 Local Registers When the most significant bit of a register number is 1, a local register is selected. The seven least significant bits of the register number give the local-register number. For local registers, the absolute-register number is obtained by adding the local-register number to bits 8–2 of the Stack Pointer and truncating the result to seven bits; the most significant bit of the original register number is unchanged (i.e., it remains a 1). The Stack Pointer addition applied to local-register numbers provides a limited form of base-plus-offset addressing within the local registers. The Stack Pointer contains the 32-bit base address. This assists run-time storage management of variables for dynamically nested procedures (see Chapter 4). #### 2.2.1.4 Local-Register Stack Pointer The Stack Pointer is a 32-bit register that may be an operand of an instruction as any other general-purpose register. However, a shadow copy of Global Register 1 is maintained by processor hardware for use in local-register addressing. This shadow copy is set only with the results of Arithmetic and Logical instructions. If the Stack Pointer is set with the result of any other instruction class, local registers cannot be accessed predictably until the Stack Pointer is set once again with an Arithmetic or Logical instruction. A modification of the Stack Pointer has a delayed effect on the addressing of local registers, as discussed in Section 5.6. ## 2.2.2 Special-Purpose Registers The microcontroller contains 24 special-purpose registers. The organization of the special-purpose registers is shown in Figure 2-2. Special-purpose registers provide controls and data for certain processor operations. Some special-purpose registers are updated dynamically by the processor, independent of software controls. Because of this, a read of a special-purpose register following a write does not necessarily get the data that was written. Some special-purpose registers have fields reserved for future processor implementations. When a special-purpose register is read, a bit in a reserved field is read as a 0. An attempt to write a reserved bit with a 1 has no effect; however, this should be avoided because of upward-compatibility considerations, except for bits 5 and 6 of the Current Processor Status Register. These bits are used to disable address translation in other 29K processors and may be written with 1 in the Am29200 and Am29205 microcontrollers. The special-purpose registers are accessed by explicit data movement only. Instructions that move data to or from a special-purpose register specify the special-purpose register by an 8-bit field containing a special-purpose register number. Register numbers are specified directly by instructions. The special-purpose registers are partitioned into protected and unprotected registers. Special-purpose registers numbered 0–127 and 160–255 are protected (note that not all of these are implemented). Special-purpose registers numbered 128–159 are unprotected (again, not all are implemented). Protected special-purpose registers numbered 0–127 are accessible only by programs executing in the Supervisor mode. An attempted read or write of a special-purpose register by a User-mode program causes a Protection Violation trap to occur. Special-purpose registers numbered 160–255, though architecturally unprotected, are not accessible by programs in the User mode or the Supervisor mode. These register numbers are reserved for virtual registers in the arithmetic architecture, and any attempted access causes a Protection Violation trap. The Floating-Point Environment Register, Integer Environment Register, and Floating-Point Status Register are not implemented in processor hardware. These registers are implemented via the virtual arithmetic interface provided on the Am29200 and Am29205 microcontrollers (see Section 2.8). Figure 2-2 Special-Purpose Registers | Register Numb | per Protected Registers | Mnemonic | |---------------|--------------------------------------|----------| | ٥ ٢ | Vector Area Base Address | VAB | | 1 | Old Processor Status | OPS | | 2 | Current Processor Status | CPS | | 3 | Configuration | CFG | | 4 | Channel Address | CHA | | 5 | Channel Data | CHD | | 6 | Channel Control | CHC | | 7 | Register Bank Protect | RBP | | 8 | Timer Counter | TMC | | 9 | Timer Reload | TMR | | 10 | Program Counter 0 | PC0 | | 11 | Program Counter 1 | PC1 | | 12 | Program Counter 2 | PC2 | | | Unprotected Registers | | | 128 | Indirect Pointer C | IPC | | 129 | Indirect Pointer A | IPA | | 130 | Indirect Pointer B | IPB | | 131 | Q | Q | | 132 | ALU Status | ALU | | 133 | Byte Pointer | BP | | 134 | Funnel Shift Count | FC | | 135 | Load/Store Count Remaining | CR | | • | | • | | 160 | Floating-Point Environment (virtual) | FPE | | 161 | Integer Environment (virtual) | INTE | | 162 | Floating-Point Status (virtual) | FPS | An attempted read of an unimplemented special-purpose register yields an unpredictable value. An attempted write of an unimplemented, protected special-purpose register has an unpredictable effect on processor operation, unless the write causes a Protection Violation. An attempted write of an unimplemented, unprotected special-purpose register has no effect; however, this should be avoided because of upward-compatibility considerations. ## 2.3 ADDRESSING REGISTERS INDIRECTLY Specifying Global Register 0 as an instruction operand register or result register causes an indirect access to the general-purpose registers. In this case, the absolute-register number is provided by an indirect pointer contained in a special-purpose register. Each of the three possible registers for instruction execution has an associated 8-bit indirect pointer. Indirect register numbers can be selected independently for each of the three operands. Since the indirect pointers contain absolute-register numbers, the number in an indirect pointer is not added to the Stack Pointer when local registers are selected. The indirect pointers are set by the MTSR, MTSRIM, SETIP, and EMULATE instructions, and by floating-point, MULTIPLY, MULTM, MULTIPLU, MULTMU, DIVIDE, and DIVIDU instructions. For a move-to-special-register instruction, an indirect pointer is set with bits 9–2 of the 32-bit source operand. This provides consistency between the addressing of words in general-purpose registers and the addressing of words in external devices or memories. A modification of an indirect pointer using a move-to-special-register instruction has a delayed effect on the addressing of general-purpose registers (see Section 5.6). For the remaining instructions, all three indirect pointers are set simultaneously with the absolute-register numbers derived from the register numbers specified by the instruction. For any local registers selected by the instruction, the Stack-Pointer addition is applied to the register numbers before the indirect pointers are set. Except when an indirect pointer is set by a move-to-special-register instruction, register numbers stored into the indirect pointers are checked for bank-protection violations at the time the indirect pointers are set. ## 2.3.1 Indirect Pointer C Register (IPC, Register 128) This unprotected special-purpose register (Figure 2-3) provides the RC-operand register number (see Section 18.3) when an instruction RC field has the value zero (i.e., when Global Register 0 is specified). Figure 2-3 Indirect Pointer C Register #### Bits 31-10: Reserved Bits 9–2: Indirect Pointer C (IPC)—The 8-bit IPC field contains an absolute-register number for a general-purpose register. This number directly selects a register. (Stack-Pointer addition is not performed in the case of local registers.) Bits 1-0: Zeros—The IPC field is aligned for compatibility with word addresses. ## 2.3.2 Indirect Pointer A Register (IPA, Register 129) This unprotected special-purpose register (Figure 2-4) provides the RA-operand register number (see Section 18.3) when an instruction RA field has the value zero (i.e., when Global Register 0 is specified). Figure 2-4 Indirect Pointer A Register #### Bits 31-10: Reserved Bits 9–2: Indirect Pointer A (IPA)—The 8-bit IPA field contains an absolute-register number for either a general-purpose register or a local register. This number directly selects a register. (Stack-Pointer addition is not performed in the case of local registers.) Bits 1-0: Zeros—The IPA field is aligned for compatibility with word addresses. ## 2.3.3 Indirect Pointer B Register (IPB, Register 130) This unprotected special-purpose register (Figure 2-5) provides the RB-operand register number (see Section 18.3) when an instruction RB field has the value zero (i.e., when Global Register 0 is specified). ## Figure 2-5 Indirect Pointer B Register #### Bits 31-10: Reserved Bits 9–2: Indirect Pointer B (IPB)—The 8-bit IPB field contains an absolute-register number for a general-purpose register. This number directly selects a register. (Stack-Pointer addition is not performed in the case of local registers.) Bits 1–0: Zeros—The IPB field is aligned for compatibility with word addresses. #### 2.4 INSTRUCTION ENVIRONMENT This section describes the special-purpose registers that affect the execution of Floating-Point and Integer Arithmetic instructions. ### 2.4.1 Floating-Point Environment Register (FPE, Register 160) This unprotected special-purpose register (Figure 2-6) contains control bits that affect the execution of floating-point operations. This register is not implemented directly by processor hardware, but is implemented by the virtual arithmetic software. Figure 2-6 Bits 31-9: Reserved **Bit 8: Fast Float Select (FF)**—The FF bit being 1 enables fast floating-point operations, in which certain requirements of the IEEE floating-point specification are not met. This improves the performance of certain operations by sacrificing conformance to the IEEE specification. **Bits 7–6: Floating-Point Round Mode (FRM)**—This field specifies the default mode used to round the results of floating-point operations, as follows: | FRM1-0 | Round Mode | , | |----------------------|-------------------------------------------------------------------|---| | 00<br>01<br>10<br>11 | Round to nearest<br>Round to – ∞<br>Round to + ∞<br>Round to zero | | - **Bit 5: Floating-Point Divide-By-Zero Mask (DM)**—If the DM bit is 0, a Floating-Point Exception trap occurs when the divisor of a floating-point division operation is zero and the dividend is a non-zero, finite number. If the DM bit is 1, a Floating-Point Exception trap does not occur for divide-by-zero. - **Bit 4: Floating-Point Inexact Result Mask (XM)**—If the XM bit is 0, a Floating-Point Exception trap occurs when the result of a floating-point operation is not equal to the infinitely precise result. If the XM bit is 1, a Floating-Point Exception trap does not occur for an inexact result. - **Bit 3: Floating-Point Underflow Mask (UM)**—If the UM bit is 0, a Floating-Point Exception trap occurs when the result of a floating-point operation is too small to be expressed in the destination format. If the UM bit is 1, a Floating-Point Exception trap does not occur for underflow. - **Bit 2: Floating-Point Overflow Mask (VM)**—If the VM bit is 0, a Floating-Point Exception trap occurs when the result of a floating-point operation is too large to be expressed in the destination format. If the VM bit is 1, a Floating-Point Exception trap does not occur for overflow. - **Bit 1: Floating-Point Reserved Operand Mask (RM)**—If the RM bit is 0, a Floating-Point Exception trap occurs when one or more input operands to a floating-point operation is a reserved value, or when the result of a floating-point operation is a reserved value. If the RM bit is 1, a Floating-Point Exception trap does not occur for reserved operands. - **Bit 0: Floating-Point Invalid Operation Mask (NM)**—If the NM bit is 0, a Floating-Point Exception trap occurs when the input operands to a floating-point operation produce an indeterminate result (e.g., $\infty$ times 0). If the NM bit is 1, a Floating-Point Exception trap does not occur for invalid operations. ## 2.4.2 Integer Environment Register (INTE, Register 161) This unprotected special-purpose register (Figure 2-7) contains control bits that affect the execution of integer multiplication and division operations. This register is not implemented directly by processor hardware, but is implemented by the virtual arithmetic interface. ### Bits 31-2: Reserved **Bit 1: Integer Division Overflow Mask (DO)**—If the DO bit is 0, an Out-of-Range trap occurs when overflow of a signed or unsigned 32-bit result occurs during a DIVIDE or DIVIDU instruction, respectively. If the DO bit is 1, an Out-of-Range trap does not occur for overflow during integer divide operations. The DIVIDE and DIVIDU instructions always cause an Out-of-Range Trap upon division by zero, regardless of the value of the DO bit. **Bit 0: Integer Multiplication Overflow Exception Mask (MO)**—If the MO bit is 0, an Out-of-Range trap occurs when overflow of a signed or unsigned 32-bit result occurs during a MULTIPLY or MULTIPLU instruction, respectively. If the MO bit is 1, an Out-of-Range trap does not occur for overflow during integer multiply operations. Because 64-bit results cannot overflow, this bit should be set to 1 when obtaining a 64-bit result for multiplication to avoid Out-of-Range traps. ## 2.5 STATUS RESULTS OF INSTRUCTIONS This section discusses the status information generated by arithmetic, logical and floating-point operations, and the special registers that contain this status information. ## 2.5.1 ALU Status Register (ALU, Register 132) This unprotected special-purpose register (Figure 2-8) holds information about the outcome of Arithmetic/Logic Unit (ALU) operations as well as control for certain operations performed by the execution unit. #### Bits 31-12: Reserved **Bit 11: Divide Flag (DF)**—The DF bit is used by the instructions that implement division. This bit is set at the end of the division instructions either to 1 or to the complement of the 33rd bit of the ALU. When a Divide Step instruction is executed, the DF bit determines whether an addition or subtraction operation is performed by the ALU. **Bit 10: Overflow (V)**—The V bit indicates that the result of a signed, two's-complement ALU operation required more than 32 bits to represent the result correctly. The value of this bit is determined by exclusive-ORing the ALU carry-out with the carry-in to the most significant bit for signed, two's-complement operations. This bit is not used for any special purpose in the processor and is provided for information only. **Bit 9: Negative (N)**—The N bit is set with the value of the most significant bit of the result of an arithmetic or logical operation. If two's-complement overflow occurs, the N bit does not reflect the true sign of the result. This bit is used in divide operations. **Bit 8: Zero (Z)**—The Z bit indicates that the result of an arithmetic or logical operation is zero. This bit is not used for any special purpose in the processor, and is provided for information only. **Bit 7: Carry (C)**—The C bit stores the carry-out of the ALU for arithmetic operations. It is used by the add-with-carry and subtract-with-carry instructions to generate the carry into the Arithmetic/Logic Unit. **Bits 6–5: Byte Pointer (BP)**—The BP field holds a 2-bit pointer to a byte within a word. It is used by Insert Byte and Extract Byte instructions. The most significant bit of the BP field is used to determine the position of a half-word within a word for the Insert Half-Word, Extract Half-Word, and Extract Half-Word, Sign-Extended instructions. The BP field is set by a Move To Special Register instruction with either the ALU Status Register or the Byte Pointer Register as the destination. It is also set by a load or store instruction if the Set Byte Pointer (SB) bit in the instruction is 1. A load or store sets the BP field with 11. The Byte Pointer Register (Section 3.1.3) provides direct access to this field. Bits 4–0: Funnel Shift Count (FC)—The FC field contains a 5-bit shift count for the funnel shifter. The funnel shifter concatenates two source operands into a single 64-bit operand and extracts a 32-bit result from this 64-bit operand; the FC field specifies the number of bit positions from the most significant bit of the 64-bit operand to the most significant bit of the 32-bit result. The FC field is used by the EXTRACT instruction. The FC field is set by a Move To Special Register instruction with either the ALU Status Register or the Funnel Shift Count Register as the destination. ## 2.5.2 Arithmetic Operation Status Results The Arithmetic instructions modify the V, N, Z, and C bits. These bits are set according to the result of the operation performed by the instruction. All instructions in the Arithmetic class—except for MULTIPLY, MULTM, DIVIDE, MULTI-PLU, MULTMU, and DIVIDU—perform an add. In the case of subtraction, the subtract is performed by adding the two's-complement or one's-complement of an operand to the other operand. The multiply-step and divide-step operations also perform adds, again possibly complementing one of the operands before the operation is performed. In general, the status bits are based on the results of the add. If two's-complement overflow occurs during the add, the V bit of the ALU Status Register is set; otherwise it is reset. Two's-complement overflow occurs when the carry-in to the most significant bit of the intermediate result differs from the carry-out. When this occurs, the result cannot be represented by a signed word integer. Note that the V bit always is set in this manner, even when the result is unsigned. The N bit of the ALU Status Register is set to the value of the most significant bit of the result of the add. Note that the divide-step and multiply-step operations may shift the result after the operation is performed. In the cases where shifting occurs, the N bit may not agree with the result that is written into a general-purpose register, since the N bit is based only on the result of the add, not on the shift. If the result of the add causes a zero word to be written to a general-purpose register, the Z bit of the ALU Status Register is set; otherwise, it is reset. The Z bit always reflects the result written into a general-purpose register; if shifting is performed by a multiply or divide step, the Z bit reflects the shifted value. If there is a carry out of the add operation, the C bit is set; otherwise it is reset. ## 2.5.3 Logical Operation Status Results The Logical instructions modify the N and Z bits. These bits are set according the result of the instruction. The V and C bits are meaningless in regard to the Logical instructions, so they are not modified. The N bit of the ALU Status Register is set to the value of the most significant bit of the result of the logical operation. If the result of the logical operation is a zero word, the Z bit of the ALU Status Register is set; otherwise, it is reset. ## 2.5.4 Floating-Point Status Results The Floating-Point instructions check for a number of exceptional conditions, and report these exceptions by setting bits of the Floating-Point Status Register. The exceptional conditions may also cause traps, depending on the state of mask bits in the Floating-Point Environment Register. There are two groups of status bits in the Floating-Point Status Register: trap status bits and sticky status bits. When an exception is detected, the virtual arithmetic processor on the microcontroller sets the trap status bit and/or the sticky status bit associated with the exception, depending on the corresponding exception mask bit and on whether or not a trap occurs. The sticky status bit is set whenever the corresponding exception is masked, regardless of whether or not a trap occurs. A trap status bit is set whenever a trap occurs, regardless of the state of the corresponding mask bit. A trap status bit is reset when a trap occurs and the indicated status does not apply to the trapping operation. A sticky status bit is reset only by software. ## 2.5.5 Floating-Point Status Register (FPS, Register 162) This unprotected special-purpose register (Figure 2-9) contains status bits indicating the outcome of floating-point operations. This register is not implemented directly by processor hardware, but is implemented by the virtual arithmetic software. The floating-point status bits are divided into two groups. The first group consists of the sticky status bits (DS, XS, US, VS, RS, and NS), which, once set, remain set until explicitly cleared by a Move-to-Special-Register (MTSR) or Move-to-Special-Register-Immediate (MTSRIM) instruction. Only those sticky status bits corresponding to masked exceptions are updated. The update occurs at the end of instruction execution. The second group consists of the trap status bits (DT, XT, UT, VT, RT, and NT) that report the status of an operation for which a Floating-Point Exception trap is taken. These bits are updated only by an operation that takes a trap as a result of an unmasked Floating-Point Exception; all other operations leave these bits unchanged. A trap status bit is updated regardless of the state of the corresponding exception mask in the Floating-Point Environment Register. #### Figure 2-9 Floating-Point Status Bits 31-14: Reserved Bit 13: Floating-Point Divide-By-Zero Trap (DT)—The DT bit is set when a Floating-Point Exception trap occurs and the associated floating-point operation is a divide with a zero divisor and a non-zero, finite dividend. Otherwise, this bit is reset when a Floating-Point Exception trap occurs. - **Bit 12: Floating-Point Inexact Result Trap (XT)**—The XT bit is set when a Floating-Point Exception trap occurs and the result of the associated floating-point operation is not equal to the infinitely-precise result. Otherwise, this bit is reset when a Floating-Point Exception trap occurs. - **Bit 11: Floating-Point Underflow Trap (UT)**—The UT bit is set when a Floating-Point Exception trap occurs and the result of the associated floating-point operation is too small to be expressed in the destination format. Otherwise, this bit is reset when a Floating-Point Exception trap occurs. - **Bit 10:** Floating-Point Overflow Trap (VT)—The VT bit is set when a Floating-Point Exception trap occurs and the result of the associated floating-point operation is too large to be expressed in the destination format. Otherwise, this bit is reset when a Floating-Point Exception trap occurs. - **Bit 9: Floating-Point Reserved Operand Trap (RT)**—The RT bit is set when a Floating-Point Exception trap occurs and the result of the associated floating-point operation is a reserved value. Otherwise, this bit is reset when a Floating-Point Exception trap occurs. - **Bit 8: Floating-Point Invalid Operation Trap (NT)**—The NT bit is set when a Floating-Point Exception trap occurs and the input operands to the associated floating-point operation produce an indeterminate result. Otherwise, this bit is reset when a Floating-Point Exception trap occurs. #### Bits 7-6: Reserved - **Bit 5: Floating-Point Divide-By-Zero Sticky (DS)**—The DS bit is set when the DM bit of the Floating-Point Environment Register is 1, the divisor of a floating-point division operation is a zero, and the dividend is a non-zero, finite number. - **Bit 4: Floating-Point Inexact Result Sticky (XS)**—The XS bit is set when the XM bit of the Floating-Point Environment Register is 1 and the result of a floating-point operation is not equal to the infinitely precise result. - **Bit 3: Floating-Point Underflow Sticky (US)**—The US bit is set when the UM bit of the Floating-Point Environment Register is 1 and the result of a floating-point operation is too small to be expressed in the destination format. - **Bit 2: Floating-Point Overflow Sticky (VS)**—The VS bit is set when the VM bit of the Floating-Point Environment Register is 1 and the result of a floating-point operation is too large to be expressed in the destination format. - **Bit 1: Floating-Point Reserved Operand Sticky (RS)**—The RS bit is set when the RM bit of the Floating-Point Environment Register is 1 and either one or more input operands to a floating-point operation is a reserved value or the result of a floating-point operation is a reserved value. - **Bit 0: Floating-Point Invalid Operation Sticky (NS)**—The NS bit is set when the NM bit of the Floating-Point Environment Register is 1 and the input operands to a floating-point operation produce an indeterminate result. #### 2.6 INTEGER MULTIPLICATION AND DIVISION The Am29200 and Am29205 microcontrollers do not directly support the instructions MULTIPLU, MULTMU, MULTIPLY, MULTM, DIVIDE, and DIVIDU. The processor is capable of performing these instructions as a sequence of multiply or divide steps, which are directly supported by hardware. A special register, Q, is used in conjunction with the SRCA and SRCB operands to execute the multiply or divide step. This section describes the Q register and discusses the general method for multiplication and division. ## 2.6.1 Q Register (Q, Register 131) The Q Register is an unprotected special-purpose register (Figure 2-10). #### Figure 2-10 Q Register **Bits 31–0: Quotient/Multiplier (Q)**—During a sequence of divide steps, this field holds the low-order bits of the dividend; it contains the quotient at the end of the divide. During a sequence of multiply steps, this field holds the multiplier; the field contains the low-order bits of the result at the end of the multiply. For an integer divide instruction, the Q field contains the high-order bits of the dividend at the beginning of the instruction, and contains the remainder upon completion of the instruction. ## 2.6.2 Multiplication The processor performs integer multiplication by a series of multiply-step instructions. Note that when the product of a constant and a variable is to be computed, a more efficient sequence of shift and add instructions can usually be found. Many compilers use this technique automatically. If a program requires the multiplication of two integers, the required sequence of multiply steps may be executed in-line or executed in a multiply routine called as a procedure. It may be beneficial to precede a full multiply procedure with a routine to discover whether or not the number of multiply steps may be reduced. This reduction is possible when the operands do not use all of the available 32 bits of precision. The following routine multiplies two 32-bit signed integers, giving a 64-bit result. Unsigned multiplication can be performed by substituting the MULU instruction for the MUL and MULL instructions. ``` ; 32 bit * 32 bit ->64 bit signed multiply multiplicand in Ir2, multiplier in Ir3 : Input: result most significant word in gr96, result least significant word in gr97 ; Output: SMul64: ; put multiplier in the Q register mtsr Q, Ir3 gr96, Ir2, 0 perform initial multiply step mul .rep 30 expand out 30 copies of the next instruction in-line mul gr96, Ir2, gr96 ; total of 30 more multiply steps .endr ; perform last sign correcting step mull gr96, Ir2,gr96 mfsr gr97, Q ; get the least significant result word ``` The following routine multiplies two 32-bit integers, returning a 32-bit result. It attempts to minimize the number of multiply-step instructions by checking the input operands. It is coded as a subroutine, with pointers to its operands passed in the indirect pointers IPC, IPA, and IPB. This allows the routine to operate on any combination of registers, rather than forcing the operands to be in fixed registers. ``` ; 32 bit * 32 bit -> 32 bit signed or unsigned multiply called by: call toc. MUL32 ; call the multiply routine setip dst_reg, src1_reg, src2_reg ; passing pointers to the operand registers ; in the delay slot ; input: operands in the registers pointed to by indirect-pointer registers IPA and IPB Output: result least significant word in the register pointed to by IPC Used: return address in tpc, special registers Q and FC Destroy: previous contents of registers tpc, Temp0 - Temp2 ; Symbolic register names: .reg Temp0, gr116 .reg Temp1, gr119 Temp2, gr120 .reg tpc, gr122 .reg 0x00200000 ; Debugger tag word .word Mul32: ; need an instruction to separate SETIP (probably last instruction) from access of indirect ; pointers mtsrim FC.8 ; useful when one operand is 8-bit Temp0, ar0, 0 ; copy value of IPA register or ; next check to see that the operand with the most leading zeros becomes the multiplier Temp1,gr0,gr0 cpgtu : the operands are already ordered correctly impf Temp1.do8 Temp1,Temp1,gr0 or ; if it jumps, Temp1 holds 0, so this copies ; the value of the IPB register const Temp0.0 ; swap the operands Temp0,Temp0,gr0 or Temp1,gr0,0 or do8: cpleu Temp2, Temp1, 0x7f ; less than 8 bits? Temp2,do16 impf ; no, check for 16 bits mtsr Q.Temp0 Temp0,Temp1,0 mulu .rep : expand out 7 copies of the next instruction in-line Temp0,Temp1,Temp0 mulu ; total of 7 more multiply steps .endr ; the top 24 bits of the result are in the lower 24 bits of Temp0, and the bottom 8 bits are in the : top of Q mfsr Temp1,Q impi ; return to the calling routine tpc extract gr0,Temp0,Temp1 extract the result in the delay-slot of the ; jump do16: Temp2,0x7fff : less than 16 bits? const cplequ Temp2, Temp0, Temp2 ; no, perform all 32 steps impf Temp2,do32 mulu Temp0,Temp1,0 ; perform initial multiply-step ``` ``` .rep 15 ; expand out 15 copies of next instruction ; in-line mulu Temp0,Temp1,Temp0 ; total of 15 more multiply-steps .endr ; the top 16 bits of the result will be in the lower 16 bits of Temp0, the bottom 16 bits in the top ; of Q mtsrim FC.16 ; extract on bit-16 boundary mfsr Temp1,Q impi ; return to the calling routine tpc extract gr0,Temp0,Temp1 : extracting the result in the delay-slot of the ; jump do32: mulu temp0,Temp1,0 ; perform initial step ; expand out 32 copies of the next instruction .rep : in-line mulu Temp0,Temp1,Temp0 ; total of 31 more multiply steps .endr impi ; return to calling routine copy the result to the return register in the mfsr gr0,Q ; delay slot ``` #### 2.6.3 Division The processor performs integer division by a series of divide-step instructions. When the divisor is a power of 2 and the dividend is unsigned, the divide should be accomplished by a right shift. If a program requires the division of two integers, the required sequence of divide steps may be executed in-line or executed in a divide routine called as a procedure. It may be beneficial to precede a full divide procedure with a routine to discover whether or not the number of divide steps may be reduced. This reduction is possible when the operands do not use all of the available 32 bits of precision. The following routine divides a 64-bit, unsigned dividend by a 32-bit unsigned divisor. ``` : 64 bit / 32 bit \rightarrow 32 bit unsigned divide Input: most significant dividend word in Ir2, least significant dividend word in Ir3, divisor in Ir4 Output: quotient in gr96, remainder in gr97 UDiv64: mtsr Q, lr3 ; put least significant word of the dividend in ; the Q register div0 ar97. Ir2 ; perform initial divide step expand out 31 copies of the next .rep instruction in-line div gr97, gr97, Ir4 ; total of 30 more divide steps .endr divl gr97, gr97, lr4 ; perform last step divrem ; compute remainder gr97, gr97, lr4 mfsr gr96, Q ; get the quotient ``` The following routine divides a 32-bit unsigned dividend by a 32-bit unsigned divisor. ``` ; 32 bit / 32 bit → 32 bit unsigned divide ; Input: dividend word in Ir2, divisor in Ir4 ; Output: quotient in gr96, remainder in gr97 ``` ``` UDiv32: mtsr Q. lr2 ; put the dividend in the Q register div0 ; perform initial divide step, zeroing out gr97, 0 ; the upper bits of the dividend ; expand out 31 copies of the next .rep 31 ; instruction in-line div gr97, gr97, lr4 : total of 30 more divide steps .endr divl ; perform last step gr97, gr97, lr4 divrem gr97, gr97, Ir4 ; compute remainder mfsr ; get the quotient gr96, Q The following routine divides a 32-bit signed dividend by a 32-bit signed divisor. It also traps division by zero. Because the divide-step instructions only operate on unsigned operands, extra code is required to perform sign checking and conversion. ; 32 bit / 32 bit signed divide, called by: call tpc, SDiv32 : call the divide routine setip dst_reg, src1_reg, src2_reg ; passing pointers to the operand registers in the delay slot dividend and divisor in the registers pointed to by the indirect-pointer : Input: registers IPA and IPB Output: result quotient in the register pointed to by IPC, remainder left in Temp0 : Used: return address in tpc, special register Q Destroyed: previous contents of registers tpc, Temp0 – Temp2 : Symbolic register names: .reg Temp0, ar116 Temp1, gr119 .reg .reg Temp2, gr120 tpc, gr122 .reg .word 0x00200000 ; Debugger tag word SDiv32: const Temp1, 0 V_DIVBYZERO, Temp1, gr0 asneq check for divide by zero with an assert get dividend from indirect pointer add Temp0, gr0, 0 impf Temp0, pdividend is it negative? (impf is also "imppos") add Temp2, Temp1, gr0 get divisor from indirect pointer const Temp1, 3 set negative result and remainder flags ; make dividend positive subr Temp0, Temp0, 0 pdividend: impf Temp2, pdivisor : is divisor negative? Q, Temp0 copy dividend to Q register in delay slot mtsr of the jump Temp1, Temp1, 1 xor ; turn off negative result flag subr Temp2, Temp2, 0 ; make divisor positive pdivisor: Temp0, 0 : initialize div0 : expand out 31 copies of the next .rep ; instruction in-line Temp0, Temp0, Temp2 div ; total of 30 more divide steps .endr ``` ; perform last divide step Temp0, Temp0, Temp2 divl divrem Temp0, Temp0, Temp2 get positive remainder mfsr Temp2, Q get positive quotient sII Temp1, Temp1, 30 copy negative remainder flag to test bit Temp1, premainder impf ; if it is not set, remainder is ok sll Temp1, Temp1, 1 ; copy negative result flag to test bit Temp0, Temp0, 0 ; negate remainder subr premainder: impfi Temp1, tpc ; return to caller if result is positive add gr0, Temp2, 0 copying quotient to the result register in the delay slot impi ; else return to caller, subr gr0, Temp2, 0 ; negating the quotient in the delay slot #### 2.7 I NEED AN INSTRUCTION FOR... This section discusses topics of general concern in the implementation of application programs. ## 2.7.1 Run-Time Checking The assert instructions provide programs with an efficient means of comparing two values and causing a trap when a specified relation between the two values is not satisfied. The instructions assert that some specified relation is true and trap if the relation is not true. This allows run-time checking, such as checking that a computed array index is within the boundaries of the storage for an array, to be performed with a minimum performance penalty. Assert instructions are available for comparing two signed or unsigned operands. The following relations are supported: equal-to, not-equal-to, less-than, less-than-or-equal-to, greater-than, and greater-than-or-equal-to. The assert instructions specify a vector number for the trap. However, only vector numbers 64 through 255 (inclusive) may be specified by User-mode programs. If a User-mode assert instruction causes a trap and the vector number is between 0 and 63 inclusive, a Protection Violation trap occurs, instead of the specified trap. Since the assert instructions allow the specification of the vector number, several traps may be defined in the system for different situations detected by the assert instructions. #### 2.7.2 Operating-System Calls An applications program can request a service from the operating system by using the following instruction: ``` asneq System_Routine, gr1, gr1 ``` This instruction always creates a trap since it attempts to assert that the content of a register is not equal to itself (the register number used here is irrelevant, as long as the register is otherwise accessible). The System\_Routine vector number specified by the instruction invokes the execution of the operating system routine that provides the requested service. This vector number may have any value between 64 and 255, inclusive (vector numbers 0 through 63 are pre-defined or reserved). Thus, as many as 192 different operating-system routines may be invoked from the applications program. In cases where the indirect pointers may be used, the EMULATE instruction allows two operand/result registers to be specified to the operating-system routine. The instruction is as follows: emulate System\_Routine, Ir3, Ir6 In this case, the System\_Routine vector number performs the same function as in the previous example. Here, however, LR3 and LR6 are specified as operand registers and/or result registers (these particular registers are used only for illustration). The operating-system routine has access to these registers via the indirect pointers, which allows flexible communication. ## 2.7.3 Multiprecision Integer Operations The processor allows the Carry (C) bit of the ALU Status Register to be used as an operand for add and subtract instructions. This provides for the addition and subtraction of operands that are greater than 32 bits in length. For example, the following code implements a 96-bit addition with signed overflow detection. add lr7, gr96, lr2 addc lr8, gr97, lr3 addcs lr9, gr98, lr4 Global registers GR96–GR98 contain the first operand, local registers LR2–LR4 contain the second operand, and local registers LR7–LR9 contain the result. The first two add instructions (ADD and ADDC) set the C bit, which is used by the second two instructions (ADDC and ADDCS). If the addition causes a signed overflow, then an Out-of-Range trap occurs; overflow is detected by the final instruction. ## 2.7.4 Complementing a Boolean To complement a Boolean in the processor's format, only the most significant bit of the Boolean word should be considered, since the least significant 31 bits may or may not be zeros. This is accomplished by the following instruction: cpge gr96, gr96, 0 The Boolean is in GR96 in this example. This instruction is based on the observation that a Boolean TRUE is a negative integer, since the Boolean bit coincides with the integer sign bit. If the operand of this instruction is a negative integer (i.e., TRUE), the result is the Boolean FALSE. If the operand is non-negative (i.e., the Boolean FALSE), the result is TRUE. Note that this instruction clears the least significant 31 bits. ## 2.7.5 Large Jump and Call Ranges The 16-bit relative branch displacement provided by processor instructions is sufficient in the majority of cases. However, addresses with a greater range are occasionally needed. In these cases, the CONST and CONSTH instructions generate the large branch-target address in a register. An indirect jump or call then uses this address to branch to the appropriate location. #### 2.7.6 NO-OPs When a NO-OP is required for proper operation (e.g., as described in Section 5.6), it is important that the selected instruction not perform any operation, regardless of program operating conditions. For example, the NO-OP cannot access general-purpose registers because a register may be protected from access in some situations. The suggested NO-OP is: aseq 0x40, gr1, gr1 This instruction asserts that the Stack Pointer (GR1) is equal to itself. Since the assertion is always true, there is no trap. Note also that the Stack Pointer cannot be protected, and that the assert instruction cannot affect any processor state. #### 2.8 VIRTUAL ARITHMETIC PROCESSOR In order to be object-code compatible with present and future implementations of the 29K Family of microprocessors, the Am29200 and Am29205 microcontrollers provide virtual arithmetic software. A virtual implementation is the means by which a processor appears to perform functions that it does not actually perform. In the case of the Am29200 and Am29205 microcontrollers, the processor defines arithmetic instructions, control, and status which are not directly supported by hardware, but which are implemented by system software. ## 2.8.1 Trapping Arithmetic Instructions The processor does not incorporate hardware to directly support floating-point operations, nor does it directly support full multiply and divide instructions. However, instructions to perform these operations are included in the instruction set. These instructions are included for compatibility with processor implementations, such as the Am29050 microprocessor, that have hardware to perform these operations. In application programs that must be fully object-code compatible across several processor versions—while taking advantage of the performance of the versions having arithmetic hardware—the defined instructions should be used to perform floating-point, multiplication, and division operations. In the Am29200 and Am29205 microcontrollers, the Floating-Point, CLASS, CONVERT, MULTIPLY, MULTIM, MULTIPLU, MULTMU, DIVIDE, DIVIDU, and SQRT instructions cause traps. The indirect pointers are set at the time the trap occurs, so a trap handler can gain access to the operands of the instruction and can determine where the result is to be stored. A trap handler can directly emulate the execution of the instruction. ## 2.8.2 Virtual Registers The processor does not incorporate hardware to directly support the Floating-Point Environment Register (FPE), Integer Environment Register (INTE), or Floating-Point Status Register (FPS). When one of these registers is referenced by a MTSR/MFSR instruction (or a variant), a Protection Violation trap occurs. The Protection Violation trap handler must establish that the faulting instruction is a MTSR/MFSR and that the register specified by the instruction is one of the registers supported by the virtual interface. This is accomplished by obtaining the faulting instruction from memory and examining the OPCODE and SRC/DEST fields. The trap handler then simulates the operation of the register. ### 2.9 PROCESSOR INITIALIZATION When power is first applied to the processor, it is in an indeterminate state and must be placed in a known state. Also, under certain circumstances, it may be necessary to place the processor in a defined state. This is accomplished by the Reset mode, which places the processor into a predefined state. ## 2.9.1 Configuration Register (CFG, Register 3) This protected special-purpose register (Figure 2-11) controls certain processor and system options. The Configuration Register is defined as follows: ### Figure 2-11 Configuration Register Bits 31–24: Processor Release Level (PRL)—The PRL field is an 8-bit, read-only identification number which specifies the processor version. Bits 23-0: Reserved #### 2.9.2 Reset Mode The Reset mode is invoked by asserting the RESET input. The Reset mode is entered within four processor cycles (MEMCLK cycles) after RESET is asserted. The RESET input must be asserted for at least four processor cycles to accomplish a processor reset. The Reset mode can be entered at any point during operation. If the RESET input is asserted at the time power is first applied to the processor, the processor enters the Reset mode only after four cycles have occurred on the MEMCLK pin. The Reset mode configures the processor state as follows: - 1. Instruction execution is suspended. - 2. Instruction fetching is suspended. - 3. Any interrupt or trap conditions are ignored. - 4. The Current Processor Status Register (see Section 16.2.1) is set as shown in Figure 2-12. - The Contents Valid (CV) bit of the Channel Control Register (see Section 16.7.2.3) is reset. Except as previously noted, the contents of all general-purpose registers and special-purpose registers are undefined. The Reset mode is exited when the RESET input is deasserted. Either four or five cycles after RESET is deasserted (depending on internal synchronization time), the processor performs an initial instruction access on the external interface. The initial instruction access is directed to address 0, which is in ROM Bank 0 after a reset. Setting the characteristics of the ROM in Bank 0 during reset is described in Section 8.2.3. A processor reset configures the internal peripherals as follows: In the ROM controller, ROM Bank 0 on the Am29200 microcontroller is configured by the BOOTW signal; the boot ROM in ROM Bank 0 on the Am29205 microcontroller is always 16 bits wide. The other banks are set so as not to interfere with accesses to ROM Bank 0. Figure 2-12 Current Processor Status Register In Reset Mode - The DRAM configuration is not set by a processor reset, DRAM mapping is disabled, and the refresh rate is set to the slowest possible value (refresh every 511 MEMCLK cycles). - 3. The configuration of the peripheral interface adapter is not set by a processor reset. - 4. The DMA controller is disabled, DRM fields are reset to 0, and all state machines are reset. - 5. The POEN field of the PIO Output Enable Register (see Section 12.2.4) is reset to 0, making all PIO pins inputs. - 6. The parallel port is disabled and all state machines are reset. - 7. The serial port is disabled and all state machines are reset. - 8. The video interface is disabled and all state machines are reset. All signals that may be either inputs or outputs are configured as inputs. # 4 # PROCEDURE LINKAGE This chapter describes the run-time storage organization recommended for the Am29200 and Am29205 microcontrollers and describes the use of the local registers to improve the performance of procedure calls. The presentation in this chapter is intended as a guide for implementing microcontroller software systems, not necessarily as a strict definition of how these systems must be implemented. Programming languages that use recursive procedures, such as C, generally use a stack to store data objects dynamically allocated at run-time. The organization of the run-time storage, including the run-time stack, determines how data objects are stored and how procedures are called at the machine level. The microcontroller is designed to minimize the overhead of calling a procedure, passing parameters to a procedure, and returning results from a procedure. This chapter describes the run-time storage organization and procedure-calling conventions. #### 4.1 RUN-TIME STACK ORGANIZATION AND USE A run-time stack consists of consecutive overlapping structures called activation records. An activation record contains dynamically allocated information specific to a particular activation (or call) of a procedure (such as local data objects). Because of recursion, multiple copies of a procedure may be active at any given time. Each active procedure has its own unique activation record allocated somewhere on the run-time stack. The local variables required by a particular procedure activation are contained in the activation record associated with that activation. Thus, the local variables for different activations do not interfere with one another. A compiler generates the instructions to create and manage the run-time stack, and compiler-generated instructions are based on its existence. As an example, Figure 4-1 shows three activation records on a run-time stack. This stack configuration was generated by procedure A calling procedure B, which in turn called procedure C. The fact that procedure C is the currently active procedure is reflected by its activation record being on the top of the run-time stack. The Stack Pointer points to the top of procedure C's activation record. In Figure 4-1, the storage areas labeled Out args and In args are the outgoing arguments area (for the caller) or the incoming arguments area (for the callee). These are shared between the caller procedure and the callee for the communication of parameters and results. The areas labeled Locals contain storage for local variables, temporary variables (for example, for expression evaluation), and any other items required for the proper execution of the procedure. ## 4.1.1 Management of the Run-Time Stack A run-time stack starts at a high address in memory and grows toward lower memory addresses as procedures are called. The bottom of the stack is the location with a high address at which the stack starts; the top of the stack is the location with a lower address at which the most recent activation record has been allocated. When a procedure is called, a new activation record might need to be allocated on the run-time stack. An activation record is allocated by subtracting from the stack pointer the Figure 4-1 Run-Time Stack Example number of locations needed by the new activation record. The stack pointer is decremented so that variables referenced during procedure execution are referenced in terms of positive offsets from the stack pointer. When storage for an activation record is allocated, the number of storage locations allocated is the sum of the number of locations needed for: - 1. Local variables - 2. Restarting the caller, such as locations for return addresses - Arguments of procedures that may be called in turn by the called procedure (the outgoing arguments area) In some cases storage is not required for one or more of the above items. Also, the incoming arguments area, though part of the activation record of the callee, is not allocated storage at this time, because this storage was allocated as the outgoing arguments area of the calling procedure. An activation record is deallocated, just prior to returning to the caller, by adding to the stack pointer the value subtracted during allocation. In the Am29200 and Am29205 microcontrollers, run-time storage is actually implemented as two stacks: the Register Stack and the Memory Stack. Storage is allocated and deallocated on these stacks at the same time. The Register Stack stores activation records associated with all active procedures (except leaf routines, as described later). The Memory Stack stores activation-record information that does not fit into the Register Stack or that must be kept in memory for other reasons (e.g., because of pointer dereferences). Both the Register Stack and the Memory Stack are stored in the external data memory. However, a portion of the Register Stack is kept in the processor's local registers for performance. The term *stack cache* in this section refers to the use of the local registers to contain a portion of the Register Stack. #### 4.1.2 Register Stack The Register Stack contains activation records for active procedures (Figure 4-2). An activation record in the Register Stack stores the following information: - Input arguments to the called procedure. This portion of the activation record is shared between a caller and the callee. It is allocated by the caller as part of the caller's activation record. - The caller's frame pointer. This is the address of the lowest-addressed byte above the highest-address word of the caller's activation record, and is used to manage the Register Stack. This portion of the activation record is shared between a caller and the callee. It is allocated by the caller as part of the caller's activation record. - The caller's return address. This is used to resume the execution of the caller after the called procedure terminates. This is also part of the caller's activation record. - The memory frame pointer. This is the address of the top of the caller's Memory Stack (see below). This address is stored by the callee (if required), and used to restore the memory stack upon return. - The local variables of the called procedure, if any. - Outgoing parameters of the called procedure, if any. - The frame pointer of the called procedure, if the procedure calls another procedure. - The return address for the called procedure, if the procedure calls another procedure. This location is allocated in the Register Stack, and is used when the called procedure calls another procedure. **Activation Record in the Register Stack** Incoming Arguments Frame Pointer LR1 (Caller) Before and After Call Return Address LR0 (Caller) Memory Frame Pointer Caller's Stack Pointer Callee's Activation Local Variables Record of Callee **Outgoing Arguments** LR1 (Callee) Frame Pointer During Call LR0 (Callee) Return Address Callee's Stack Pointer Figure 4-2 ## 4.1.3 Local Registers as a Stack Cache The Am29200 and Am29205 microcontrollers are designed for efficient implementation of the Register Stack. Specifically, each microcontroller can use the large number of relatively addressed local registers to cache portions of the Register Stack, yielding a significant gain in performance. Allocation and deallocation of activation records occurs largely within the confines of the high-speed local registers, and most procedure calls occur without external references. Furthermore, during procedure execution, most data accesses occur without external references, because activation-record data are referenced most frequently. The principle of locality of reference, which allows any cache to be effective, also applies to the stack cache. The entries in the stack cache are likely to remain there for re-use, because the size of the Register Stack does not change very much over long intervals of program execution. Activation records are typically small, so the 128 locations in the local register file can hold many activation records. Allocating Register-Stack activation records in the local registers is facilitated by the Stack Pointer in Global Register 1. During the execution of a procedure, the Stack Pointer points simultaneously to the top of the Register Stack in memory and to the local register at the top of the stack cache. In other words, Global Register 1, a word-length register, contains the 32-bit address of the top of the Register Stack, while bits 8–2 of Global Register 1 (with a 1 appended to the most significant bit) indicate the absolute register number of Local Register 0. Allocation and deallocation of the Register Stack is accomplished by subtracting from or adding to, respectively, the value of the Stack Pointer. Using this register-addressing scheme, locations from the Register Stack are automatically mapped into the local register file. Figure 4-3 shows the relationship between the Register Stack and the stack cache in the local registers. As shown, pointers are required to define the boundaries between the Register Stack and the stack cache. Figure 4-3 Relationship of Stack Cache and Register Stack - The register free bound pointer (*rfb*, gr127) defines the boundary between the portion of the Register Stack cached in the local registers and the portion stored in the external data memory. The *rfb* pointer contains the address of the first word in the Register Stack that is not contained in the local registers, but which is in memory. - The frame pointer (fp, Ir1) contains the memory address of the lowest-addressed word not in the current activation record. The fp is used to determine whether the caller's complete activation record is contained in the local registers when a procedure returns from a call, as described later. - The register stack pointer (*rsp*, gr1) points to the top of the Register Stack either in the local registers or the memory. The *rsp* is contained in the local-register Stack Pointer (Global Register 1). The top of the Register Stack may or may not be contained in the data memory. The *rsp* simply defines the location of the top of the Register Stack. - The register allocate bound pointer (*rab*, gr126) defines the lowest-addressed stack location that can be cached within the local registers. This defines the limit to which local registers can be allocated in the Register Stack. Several activation records may exist in the Register Stack at any given time, but only one stack location may be mapped to a local register at a given time. When the Register Stack grows beyond the 128-word capacity of the local registers, some movement of data between the stack cache and the Register Stack in data memory must occur. Stack overflow occurs when a procedure is called, but the activation record of the callee requires more registers than can be allocated in the stack cache (this is detected by comparing *rsp* with *rab*). Figure 4-4 illustrates stack overflow. In this case, the contents of a number of registers must be moved to data memory. The number of registers involved must be sufficient to allow the entire activation record of the callee to reside in the local registers. A block of the registers is copied, or *spilled*, into an area of external data memory, freeing space in the local register file for the most recent procedure call. Stack underflow occurs when a procedure returns to the caller, but the entire activation record of the caller is not resident in the stack cache (this is detected by comparing fp with rfb). Figure 4-5 illustrates stack underflow. In this case, the non-resident portion of the caller's stack must be moved from data memory to the local registers. Underflow occurs because overflow occurred at some previous point during program execution, causing part of the Register Stack to be moved to memory. The processor performs no hardware management of the stack cache and cannot detect a reference to a quantity that is not in the stack cache. Consequently, software must keep the size of an activation record less than or equal to the size of the local register file (128 words). Any additional storage requirements are satisfied by the Memory Stack. ## 4.1.4 Memory Stack In general, the Memory Stack is used to augment the Register Stack, holding additional information associated with activation records. For example, the Memory Stack holds large data structures that cannot fit into the Register Stack. Similar to the Register Stack, the Memory Stack contains a series of (possibly overlapping) activation records, each corresponding to a procedure activation. However, a Memory Stack activation record need not exist for a procedure that does not need a Memory Stack Area. The Memory Stack contains the following information: Overflow incoming arguments. These are incoming arguments that do not fit in the allowed incoming arguments area of the Register Stack activation record. Figure 4-4 Stack Overflow - Spilled incoming arguments. These are incoming arguments that cannot be kept in the Register Stack. For example, if the address of an argument is used in a called procedure, the associated value must be in the Memory Stack. - Any procedure-local variable not allocated to a register. - Local block space. This storage is allocated dynamically on the Memory Stack. It is used to implement functions such as the *alloca()* function in the C programming language. - Overflow outgoing arguments. These are outgoing arguments that do not fit in the allowed outgoing arguments area of the Register Stack activation record. In contrast to the Register Stack, the Memory Stack is not cached and has no fixed size limit. The top of the Memory Stack is defined by the memory stack pointer (*msp*), which is stored in Global Register 125 by convention. #### 4.2 PROCEDURE LINKAGE CONVENTIONS The procedure linkage conventions define the standard sequences of instructions used to call and return from procedures. These instruction sequences perform the following operations (other, more general operations may also be required, as described later): ■ Put procedure arguments into the outgoing arguments area of the activation record. This may or may not involve copying the arguments; copying is not necessary if the arguments are placed into the appropriate registers as the result of computation. Figure 4-5 Stack Underflow - Branch to the procedure using a call instruction, which also places the return address in a register. - Allocate a frame on the Register Stack. A frame is the storage that contains the procedure's activation record. - If overflow occurs during frame allocation, spill the least recently used locations of the Register Stack. The number of spilled locations must be sufficient to allow the new frame to reside entirely within the local registers. - Determine the frame-pointer value of the called procedure, if this procedure may call another procedure. - Execute the procedure. - Place return values into the appropriate registers. - Deallocate the activation-record frame. - Fill locations of the local registers from the Register Stack in external memory, if underflow occurs. - Branch to the procedure's return address. This section describes the routines that implement the procedure linkage conventions. The operations described here are not required on every procedure call. In some cases, operations can be omitted or simpler routines used; these cases and the accompanying simplifications are also described here. ## 4.2.1 Argument Passing The linkage convention allows up to 16 words of arguments to be passed from the caller to the callee in local registers. These arguments are passed in Local Register 2 through ## AMD Local Register 17 of the caller (note that the local-register numbers are different for the caller and the callee, because of Stack-Pointer addressing). When more than 16 words are required to pass arguments, the additional words are passed on the Memory Stack. In this case, the memory stack pointer (in Global Register 125) points to the seventeenth word of the arguments, and the remaining argument words have higher memory addresses. Multiword arguments may be split across the Register Stack and the Memory Stack. For example, if a multiword argument starts on the sixteenth word of the outgoing arguments, the first word of the argument is passed in the Register Stack, and the remainder of the argument is passed in the Memory Stack. All arguments occupy at least one word. Arguments that are a byte or half-word in length (for example, a character) are padded to 32 bits and passed as a full word. However, an array or structure composed of multiple byte or half-word components can be passed as a single, packed array or structure of bytes or half-words rather than an array or structure of padded bytes or half-words. No argument is aligned to anything other than a word address boundary, including multiword arguments. Some multiword arguments are referenced as a single object (for example, double-precision floating-point values). It may be necessary to copy such arguments to an aligned memory or register area before use. ## 4.2.2 Procedure Prologue When a procedure is called and the procedure may call another procedure, the callee must allocate a frame for itself on the Register Stack (this is not required for *leaf* procedures that do not call other procedures, as described later). A frame is allocated by decrementing the register stack pointer to accommodate the size of the required activation record. The procedure *prologue* is the instruction sequence that allocates the callee's Register Stack frame. To allocate the stack frame, the prologue routine decrements the register stack pointer by the amount *rsize* (see Figure 4-6). The value of *rsize* must be an even number given by the following formula: rsize ≥ (size of local variable area) + (size of outgoing arguments area) + 2 The value 2 in this formula accounts for the space required by the return address (in Local Register 0) and the frame pointer (in Local Register 1). The size of the local variable area includes the space for the memory frame pointer, if required. If the formula total is an odd value, the total must be adjusted (by adding 1) so the resulting *rsize* value is even. This aligns the top of the Register Stack on a double-word boundary. The reason for this alignment is that double-precision floating-point values must be aligned to registers with even absolute-register numbers. Alignment of double-precision values is accomplished by placing these values into even-numbered local registers and making *rsize* even (it is also assumed that the register stack pointer is initialized on an even-word boundary). Rsize is not the size of the entire activation record of the callee, because the callee's activation record includes storage that was allocated as part of the caller's activation record frame (e.g., the caller's outgoing arguments area, which is the callee's incoming arguments area). The size of the callee's entire activation record is denoted size and is given by the following formula: size = rsize + (size of the incoming arguments area) + 2 Incoming Arguments\* Frame Pointer LR1 (Caller) Return Address LR0 (Caller) Caller's Stack Pointer Memory Frame Pointer\* Callee's Activation Local Variables Record of Callee\* size rsize Outgoing Arguments\* Frame Pointer\* LR1 (Callee) LR0 (Callee) Return Address\* Callee's Stack Pointer \*May not be required Figure 4-6 Definition of size and rsize Values In the prologue routine, the following instruction is used to allocate the stack frame (rsp = gr1): prologue: sub rsp,rsp,rsize\*4 ; \*4 converts words to bytes However, this instruction does not account for the fact that there may not be enough room in the local registers to contain the activation record. There must be additional instructions to detect stack overflow and to cause spilling if overflow occurs. This is accomplished by comparing the new value of the register stack pointer with the value of the register allocate bound and invoking a trap handler (with vector number V\_SPILL) if overflow is detected. Furthermore, if the procedure calls another procedure, the prologue must compute a frame pointer. The frame pointer will be used by procedures called in turn by the callee to insure that the callee's activation record is in the local registers upon return (i.e., that it has not been spilled onto the Register Stack in data memory). The frame pointer is computed in the prologue because it need only be computed once, regardless of how many procedures are called by a given procedure. The complete procedure prologue is then (fp = Ir1): proloque: sub rsp, rsp, rsize\*4 asgeu V\_SPILL, rsp, rab ; allocate frame ; call spill handler if needed add fp, rsp, size\*4 ; compute frame pointer ## 4.2.3 Spill Handler If overflow occurs, the assert instruction in the prologue fails, causing a trap. The trap handler invokes a User-mode routine in the trapping process to spill Register Stack locations from the local registers to external memory. Having most of the spill handling in a User-mode routine minimizes the amount of time that interrupts are disabled and insures that spilling is performed using the correct virtual-memory configuration. The spill handler uses two registers. The first register, Global Register 121, normally contains a trap handler argument (*tav*), but is used by the spill handler as a temporary register. The second register, Global Register 122, stores a trap handler return address (*tpc*). This register is used by the User-mode spill handler to return to the trapping procedure. It is assumed that the address of the User-mode spill handler is contained in a global register, denoted user spill reg in the following instruction sequence. The complete spill handler is: ``` Spill: ; operating-system routine ; save return address mfsr tpc, PC1 ; branch to User spill via interrupt return PC1, user_spill_reg mtsr add tav, user_spill_reg, 4 PC0, tav mtsr iret : User-mode spill handler user_spill: : compute spill: allocate bound - rsp sub tav, rab, rsp srl tav, tav, 2 ; shift to get number of words sub tav, tav, 1 ; count is one less mtsr CR, tav ; set Count Remaining Register tav, rab, rsp sub tav. rfb. tav : compute new free bound sub add rab, rsp, 0 ; adjust allocate bound storem 0, 0, Ir0, tav : spill impi toc ; return to trapping procedure ; adjust free bound add rfb. tav. 0 ``` #### 4.2.4 Return Values If the called procedure returns one or more results, the first 16 words of the result(s) are returned in Global Register 96 through Global Register 111, starting with Global Register 96. If more than 16 words are required for the results, the additional words are returned in memory locations allocated by the caller. In this case a large return pointer (*Irp*) provided by the caller in Global Register 123 at the time of the call points to the seventeenth word of the results, and subsequent words are stored at higher memory addresses. ## 4.2.5 Procedure Epilogue The procedure epilogue deallocates the stack frame allocated by the procedure prologue and returns to the calling procedure. Stack deallocation is accomplished by adding the *rsize* value back to the register stack pointer, after which the deallocated registers are no longer used and are considered invalid. The epilogue also detects stack underflow and causes register filling if underflow occurs. This is accomplished by comparing the value of the caller's frame pointer with the register free bound and invoking a trap handler (with vector number V\_FILL) if underflow is detected. Finally, the epilogue returns to the caller using the caller's return address. The complete procedure epilogue is: ``` epilogue: ``` ``` add rsp, rsp, rsize*4; add back rsize count nop; cannot reference a local register here asleu V_FILL, fp, rfb; call fill handler if needed jmpi lr0; jump to return address nop; delay slot ``` #### 4.2.6 Fill Handlers If underflow occurs, the assert instruction in the epilogue fails, causing a trap. The trap handler invokes a User-mode routine in the trapping process to fill Register Stack locations from the external memory to local registers. The fill handler is similar in organization to the spill handler discussed above. The complete fill handler is: ``` Fill: ; operating-system routine tpc, PC1 mfsr ; save return address mtsr PC1, user_fill_reg ; branch to User fill via interrupt return add tav. user fill reg. 4 mtsr PC0. tav iret user_fill: ; User-mode fill handler const tav. (0x80<<2) ; local register has high bit set tav, tav, rfb ; put starting register number into Indirect or : Pointer A mtsr IPA, tav sub tav, fp, rfb ; compute number of bytes to fill add rab, rab, tav ; adjust the allocate bound srl tav, tav, 2 ; change byte count to word count sub tav. tav. 1 : make count zero-based mtsr CR, tav ; set Count Remaining register loadm 0, 0, gr0, rfb igmi tpc ; return to trapping procedure rfb, Ir1, 0 add ; adjust the free bound ``` ## 4.2.7 Register Stack Leaf Frame A leaf procedure is one that does not call any other procedure. The incoming arguments of a leaf procedure are already allocated in the calling procedure's activation-record frame, and the leaf routine is not required to allocate locations for any outgoing arguments, frame pointer, or return address (since it performs no call). Hence, a leaf procedure need not allocate a stack frame in the local registers, and can avoid the overhead of the procedure prologue and epilogue routines. Instead, a leaf routine can use a set of global registers for local variables; Global Register 96 through Global Register 124 are reserved for this purpose (among other purposes). If there is an insufficient number of global registers, the leaf procedure may allocate a frame on the Register Stack. ## 4.2.8 Local Variables and Memory-Stack Frames A called procedure can store its local variables and temporaries in space allocated in the Register Stack frame by the procedure prologue. The values are referenced as an offset from the *rsp* base address, using the Stack-Pointer addressing of the local registers. No object in a register is aligned on anything smaller than a register boundary, and all objects take at least one register. Because there are 128 local registers, the total Register Stack activation-record size cannot be greater than 128 words. If the callee needs more space for local variables and temporaries, it must allocate a frame on the Memory Stack to hold these objects. To allocate a Memory-Stack frame, the procedure prologue decrements the memory stack pointer (msp, in gr125). The procedure epilogue deallocates the Memory-Stack frame by incrementing the msp. A procedure that extends the Memory Stack dynamically (e.g., using alloca()) must make a copy of the msp at procedure entry before allocating the Memory-Stack frame. The msp is stored in the memory frame pointer (mfp) entry of the activation record in the Register Stack. The procedure can then change the msp during execution, according to the needs of dynamic allocation. On procedure return, the Memory-Stack frame is deallocated using the mfp to restore the msp. A procedure that does not extend the Memory Stack dynamically need not have an mfp entry in its activation record. The following prologue and epilogue routines are used if there is no dynamic allocation of the Memory Stack during procedure execution, but a Memory Stack frame is otherwise required (Figure 4-6 contains a diagram of register usage): ``` prologue: ``` ``` sub rsp, rsp, <rsize>*4 : allocate register frame V_SPILL, rsp, rab ; call spill handler if needed asgeu ; compute register frame pointer add fp, rsp, <size>*4 sub msp, msp, <msize> ; allocate memory frame ; msize = size of memory frame in words epilogue: add rsp, rsp, <rsize>*4 ; deallocate register frame ; deallocate memory frame add msp, msp, <msize> impi ; return IrO. ; call fill handler if needed asleu V_FILL, fp, rfb ``` The following prologue and epilogue routines are used if there is dynamic allocation of the Memory Stack during procedure execution: ``` prologue: ``` ``` sub rsp, rsp, <rsize>*4 ; allocate register frame V_SPILL, rsp, rab ; call spill handler if needed asgeu add fp, rsp, <size>*4 ; compute register frame pointer add Ir{\langle rsize \rangle - 1}, msp, 0 ; save memory frame pointer ; Ir{rsize-1} is last reg in new frame sub msp, msp, <msize> ; allocate memory frame, : msize = size of memory frame in words epilogue: add msp, Ir{\langle rsize \rangle - 1},0 ; restore memory stack pointer ; deallocate memory frame add rsp, rsp, <rsize>*4 ; deallocate register frame nop ; cannot reference a local register here impi ; return asleu V FILL, fp. rfb : call fill handler if needed ``` #### 4.2.9 Static Link Pointer Some programming languages permit nested procedure declarations, introducing the possibility that a procedure may reference variables and arguments that are defined and managed by another procedure. This other procedure is a static parent of the callee. A static parent is determined by the declarations of procedures in the program source and is not necessarily the calling procedure; the calling procedure is the dynamic parent. Since procedures can be nested at a number of levels, a given procedure may have a number of hierarchically organized static parents. A called procedure can locate its dynamic parent and the variables of the dynamic parent because of the return address and frame pointer in the Register Stack. However, these are not adequate to locate variables of the static parent that may be referenced in the procedure. If such references appear in a procedure, the procedure must be provided with a static link pointer (*slp*). In the run-time organization, the *slp* is stored in Global Register 124. Since there can be a hierarchy of static parents, the *slp* points to the *slp* of the immediate parent, which in turn points to the *slp* of its immediate parent, and so on. Note that the contents of Global Register 124 may be destroyed by a procedure call, so a procedure needing to reference the variables of a static parent may need to preserve the *slp* until these references are no longer necessary. # 4.2.10 Transparent Procedures A transparent procedure is one that requires very little overhead for managing run-time storage. Transparent procedures are used primarily to implement compiler-specific support functions, such as integer divide. A transparent routine does not allocate any activation-record frames. Parameters are passed to a transparent procedure using *tav* and the Indirect Pointer A, B, and C registers. The return address is stored in *tpc*. This convention allows a leaf procedure to call a transparent procedure without changing its status as a leaf procedure. There is a tight relationship between a compiler and the transparent procedures it calls. Some transparent procedures may need more temporary registers and the compiler must account for this. # 4.3 REGISTER USAGE CONVENTION The run-time organization standardizes the uses of the local and global registers. This section summarizes register use and the nomenclature for register values: - GR1: Register stack pointer (*rsp*) - GR2-GR63: Unimplemented - GR64-GR95: Reserved for operating-system use - GR96–GR111: Procedure return values. Lower-numbered registers are used before higher-numbered registers. If more than 16 words are needed, the additional words are stored in the Memory Stack (see GR123, large return pointer). These registers are also used for temporary values that are destroyed upon a procedure call. - GR112-GR115: Reserved for programmer. These registers are not used by the compiler, except as directed by the programmer. - GR116-GR120: Compiler temporaries - GR121: Trap handler argument/temporary (tav)—This register is used to communicate arguments to a software-invoked trap routine. It can be destroyed by the trap, but not by other traps and interrupts not explicitly generated by the program (for example, a Timer trap). - GR122: Trap handler return address/temporary (*tpc*). This register is also used by software-invoked traps. It can be destroyed by the trap, but not by other traps and interrupts not explicitly generated by the program (for example, a Timer trap). - GR123: Large return pointer/temporary (*Irp*) - GR124: Static link pointer/temporary (slp) - GR125: Memory stack pointer (*msp*) - GR126: Register allocate bound (rab) - GR127: Register free bound (rfb) - LR0: Return address - LR1: Frame pointer (fp) In this convention, registers must be handled by software according to system requirements. The following practices are recommended: - GR64–GR95 should be protected from User-mode access by the Register Bank Protect Register. - The contents of GR96–GR124 should be assumed destroyed by a procedure call, unless the procedure is a transparent procedure. - The contents of GR121 and GR122 should be assumed destroyed by any procedure call or any program-generated trap. - The contents of GR125 are always preserved by a procedure call. - The contents of GR126 and GR127 are managed by the spill and fill handlers and should not be modified except by these handlers. ## 4.4 COMPLEX PROCEDURE CALL EXAMPLE The following code sequence demonstrates a complex procedure call, illustrating how registers are used in the run-time organization: ### caller: ``` (other code) add Irp, msp, 32 ; pass Irp add sip, msp, 120 ; pass a static link call Ir0. callee const Ir2. 1 ; 1 as first argument (other code) callee: const tav, (126-2)*4 ; maximum register allocation allocate register frame sub rsp, rsp, tav assert will be taken V_SPILL, rsp, rab asgeu tav, (126-2)*4 + (3*4) incoming arguments and overhead const add create frame pointer fp, rsp, tav add Ir123, msp, 0 for dynamic Memory-Stack allocation const tav, memory_frame_size big msize (>65535 bytes) high half of msize consth tav, memory_frame_size allocate memory frame sub msp, msp, tav add Ir18, Irp, 0 save Irp for later ; save slp for later add Ir19, slp, 0 (other code) add msp, Ir123, 0 ; deallocate memory frame ; maximum allocation size const tav, (126-2)*4 add rsp, rsp, tav ; deallocate register frame ; return value const gr96, 1 ; return to caller impi lr0 ; ensure caller's registers in frame V_FILL, fp, rfb asleu ``` # 4.5 TRACE-BACK TAGS A trace-back tag is either one or two words of information included at the beginning of every procedure. This information permits a debug routine to determine the sequence of procedure calls and the values of program variables at a given point in execution. The trace-back tag describes the memory frame size and the number of local registers used by the associated procedure. A one-word tag is used if the memory frame size is less than 2K words; otherwise, the two-word tag is used. Regardless of tag length, the tag directly precedes the first instruction of the procedure. Figure 4-7 shows the format of the trace-back tags. The first word of a trace-back tag starts with the invalid operation code 00 (hexadecimal). This unique, invalid instruction operation code allows the debugger to locate the beginning of the procedure in the absence of other information related to the beginning of the procedure, such as from a symbol table. This is particularly useful after a program crash, in which case the debug routine may have only an arbitrary instruction address within a procedure. The call sequence up to the current point in execution can be determined from the *argcount* and *msize* values in the trace-back tag. However, for procedures that perform dynamic stack allocation (e.g., using *alloca()*), the memory frame pointer must be used. The tag word immediately preceding a procedure contains the following fields. Reserved fields must be zero. | 1-Word<br>Tag Bits | 2-Word<br>Tag Bits | ltem | Description | |--------------------|--------------------|-------------|-------------------------------------------------------------| | 31–24 | 31–24 (word 2) | opcode | 00h (an invalid opcode) | | 23 | 23 (word 2) | tag type | 0=one-word tag; 1=two-word tag | | 22 | 22 (word 2) | Mfp | 0=no mfp; 1=mfp used | | 21 | 21 (word 2) | Transparent | 0=normal;1=transparent procedure | | 20–16 | 20–16 (word 2) | argcount | Number of arguments in<br>registers (including Ir0 and Ir1) | | 15-11 | 15-0 (word 2) | reserved | Reserved, must be zero | | 10-3 | 31-2 (word 1) | msize | Memory frame size in doublewords | | 2-0 | 1–0 (word 1) | reserved | Reserved, must be zero | If the procedure uses a Memory-Stack frame size 2K words or more, the *msize* field is contained in the second tag word immediately preceding the first tag word. # 口 # 5 PIPELINING AND INSTRUCTION SCHEDULING This chapter offers a general overview of the internal operation of the pipeline, to help the programmer understand how the pipeline affects the program execution and the microcontroller's behavior under certain conditions. The operation of the functional units is coordinated by Pipeline Hold mode, which ensures that operations are performed in the proper order. In certain cases, the pipeline is exposed during instruction execution, because execution of certain instructions is dependent on the execution of previous instructions. This chapter discusses the cases where the pipeline is exposed to software and describes the resulting effect on instruction execution. ## 5.1 FOUR-STAGE PIPELINE The Am29200 and Am29205 microcontrollers implement a four-stage pipeline for instruction execution. The four stages are fetch, decode, execute, and write-back. For operations, the pipeline is organized so the effective instruction-execution rate may be as high as one instruction per cycle. During the fetch stage, the instruction fetch unit determines the location of the next processor instruction and issues the instruction to the decode stage. The instruction is fetched from an external instruction memory. During the decode stage, the instruction issued from the fetch stage is decoded, and the required operands are fetched and/or assembled. Addresses for branches, loads, and stores are also evaluated. During the execute stage, the execution unit performs the operation specified by the instruction. During the write-back stage, the results of the operation performed during the execute stage are stored. In the case of branches, loads, and stores, an address is transmitted to a memory or a peripheral. Most pipeline dependencies internal to the processor are handled by forwarding logic in the processor. For those dependencies that result from the external system, the Pipeline Hold mode ensures proper operation. In a few special cases, the processor pipeline is exposed to software executing on the microcontroller (see Sections 5.4, 5.5, and 5.6). # 5.2 PIPELINE HOLD MODE The Pipeline Hold mode is activated whenever sequential processor operation cannot be guaranteed. When this mode is active, the pipeline stages do not advance, and most internal processor state is not modified. The processor places itself in the Pipeline Hold mode in the following situations: The processor requires an instruction that has either not been fetched or not been returned by the external instruction memory. - The processor requires data from an in-progress load and the operation has not completed. - The processor attempts to execute a load or store instruction while another load or store is in progress. - The processor must perform a serialization operation as described in Section 5.3. - The processor is performing a sequence of load-multiple or store-multiple accesses. The Pipeline Hold mode in this case prevents further instruction execution until the completion of the load-multiple or store-multiple sequence. - The processor has taken an interrupt or trap, and the first instruction of the interrupt or trap handler has not entered the execute stage. The Pipeline Hold mode in this case prevents the processor pipeline from advancing until the interrupt or trap handler can begin execution. - The processor has executed an interrupt return, and the target instruction of the interrupt return has not entered the execute stage. The Pipeline Hold mode in this case prevents the processor pipeline from advancing until the interrupt return sequence is complete. The Pipeline Hold mode is exited whenever the causing conditions no longer exist, or when the WARN or RESET input is asserted. ## 5.3 SERIALIZATION The Am29200 and Am29205 microcontrollers overlap external data references with other operations. When an external data reference might have to be restarted, however, the processor context must be the same as when the operation was first attempted. To insure this, certain operations are serialized. The processor serializes by entering the Pipeline Hold mode in any of the following circumstances: An external access is not yet completed, and one of the following instructions is encountered: Move to Special Register (MTSR) Move to Special Register Immediate (MTSRIM) Move to TLB (MTTLB)—even though this performs no operation Interrupt Return (IRET) Interrupt Return and Invalidate (IRETINV) Halt (HALT) An external access is not yet completed, and an interrupt or trap, other than a WARN trap, is taken. If the processor is in the Pipeline Hold mode due to serialization, it enters the Executing mode once the external access is completed. #### 5.4 DELAYED BRANCH The effect of jump and call instructions is delayed by one cycle to allow the processor pipeline to achieve maximum throughput. When one of these branches is successful, the instruction immediately following the jump or call is executed before the target instruction of the jump or call is executed. Jump and call instructions collectively are referred to as delayed branches, and the instruction immediately following is called the delay instruction (sometimes referred to as a delay slot). For example, in the following code fragment: | | • | | | |--------|-------|----------------|-----| | | cpeq | gr96, lr6, lr7 | (1) | | | jmpf | gr96, label | (2) | | | sub | lr6, lr6, 1 | (3) | | | const | lr6, 0 | (4) | | label: | call | lr0, sort | (5) | | | add | lr2, lr5, 0 | (6) | | | cpneq | lr3, gr96, 0 | (7) | The SUB instruction (3) is executed regardless of the outcome of the JMPF instruction (2). Of course, if the JMPF is not successful, the CONST instruction (4) is also executed. If the JMPF is successful, then the instruction sequence is: (2), (3), (5), (6), and then the first instruction of the sort procedure. Note that the CALL instruction (5) is also a delayed branch, so the instruction immediately following it, (6), is always executed. After the sort procedure executes the return sequence, the CPNEQ instruction (7) is the next instruction executed. The benefit of delayed branches is improved performance and a simplified processor implementation. Performance is improved because the processor pipeline executes useful instructions in a larger number of cycles, compared to an implementation without delayed branches. For example, ignoring all other effects on performance and assuming 15% of all instructions are taken branches, then a processor without delayed branches would take at least two cycles for 15% of its instructions, leading to 0.85(1) + 0.15(2) = 1.15 cycles per instruction, on average. This represents a 15% performance degradation compared to a processor with delayed branches (assuming, for this simple example, the delay instruction is always useful). The cost of having delayed branches is either the extra effort required when the compiler takes advantage of delayed branches (by re-organizing code), or the extra NO-OP instruction that the compiler inserts after every branch to guarantee correct program operation. Since the compiler expends only a small amount of effort to avoid wasting time and space with NO-OPs, and since the performance improvement resulting from this effort is significant, delayed branches are beneficial overall. When two immediately adjacent branches are taken, the target of the first branch pre-empts execution of the delay cycle of the second branch, and the target of the second branch then follows the target of the first branch. For example, in the following code fragment: | | jmp L1<br>jmp L2<br>add | lr4, lr4, lr5 | (1)<br>(2)<br>(3) | |-----|-------------------------|---------------|-------------------| | L1: | sub | gr96, gr96, 1 | (4) | | | subc | gr97, gr97, 0 | (5) | | L2: | const | gr100, 0xff0f | (6) | |-----|-------|---------------------|-----| | | subr | gr101, gr101, 1 | (7) | | | or | gr100, gr100, gr101 | (8) | | | | 0 /0 /0 | ` ' | an unconditional JMP instruction (1) is followed immediately by another unconditional JMP instruction (2). (In this example, unconditional JMPs are used; however, any two immediately adjacent taken branches exhibit the same behavior.) The sequence of executed instructions in this case is: JMP instruction (1), JMP instruction (2), SUB instruction (4), CONST instruction (6), SUBR instruction (7), OR instruction (8), and so on. Note that the ADD instruction (3) is not executed. Also, the target of the first JMP instruction (1) was merely visited; control did not continue sequentially from L1, but rather continued from L2. # 5.5 OVERLAPPED LOADS AND STORES The Am29200 and Am29205 microcontrollers overlap external data references with other operations. Certain programming practices are necessary to exploit this parallelism to improve program performance. In order to make full use of overlapped storage accesses, some instruction reorganization may be necessary. For example, in the following sequence: | loop: | • | | | |-------|---------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------| | юр: | sil<br>add<br>load<br>add<br>sub<br>add<br>cplt<br>jmpt | gr121, gr119, 2<br>gr121, gr120, gr121<br>0, 0, gr121, gr121<br>gr96, gr96, gr121<br>gr98, gr98, 3<br>gr119, gr119, 1<br>gr122, gr119, lr2<br>gr122, loop | (1)<br>(2)<br>(3)<br>(4)<br>(5)<br>(6)<br>(7)<br>(8)<br>(9) | | | nop | | (8 | the ADD instruction (4) uses the result of the LOAD instruction (3). However, the following four instructions do not depend on the result of the LOAD. Therefore, the ADD instruction (4) can be moved past the JMPT (8), since it always will be executed even ifthe JMPT is taken, and can replace the NO-OP instruction (9). The resulting sequence is: | loop: | • | | | |-------|---------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------| | юф. | sll<br>add<br>load<br>sub<br>add<br>cplt<br>jmpt<br>add | gr121, gr119, 2<br>gr121, gr120, gr121<br>0, 0, gr121, gr121<br>gr98, gr98, 3<br>gr119, gr119, 1<br>gr122, gr119, lr2<br>gr122, loop<br>gr96, gr96, gr121 | (1)<br>(2)<br>(3)<br>(4)<br>(5)<br>(6)<br>(7)<br>(8) | | | | g. 00, g. 00, g. 12. | (-) | The instructions (4) through (7) are likely to be executed while external memory satisfies the load request, resulting in improved throughput. The processor thus allows parallelism to be exploited by instruction reordering. The overlapped load feature may be used to improve processor performance, but imposes no constraints on instruction sequences, as delayed branches do. The processor implements the proper pipeline interlocks to make this parallelism transparent to a running program. # 5.6 DELAYED EFFECTS OF REGISTERS The modification of some registers has a delayed effect on processor behavior, because of the processor pipeline. The affected registers are the Stack Pointer (Global Register 1), Indirect Pointers A, B, and C, and the Current Processor Status Register. An instruction that writes to the Stack Pointer can be followed immediately by an instruction that reads the Stack Pointer. However, any instruction that references a local register also uses the value of the Stack Pointer to calculate an absolute-register number. At least one cycle of delay must separate an instruction that updates the Stack Pointer and an instruction that references a local register. In most systems, this affects procedure call and return only (see Section 4.2). In general, though, an instruction that immediately follows a change to the Stack Pointer should not reference a local register (however, note that this restriction does not apply to a reference of a local register via an indirect pointer). The indirect pointers have an implementation similar to the Stack Pointer and exhibit similar behavior. At least one cycle of delay must separate an instruction that modifies an indirect pointer and an instruction that uses that indirect pointer to access a register. Note that it normally is not possible to guarantee that the delayed effect of the Stack Pointer and indirect pointers is visible to a program. If an interrupt or trap is taken immediately after one of these registers is set, then the interrupted routine sees the effect of the setting in the following instruction, because many interrupt or trap execution cycles elapse between the two instructions of the interrupted routine. For this reason, a program should not be written in a manner that relies on the delayed effect; the results of this practice may be unpredictable. If the Freeze (FZ) bit of the Current Processor Status Register is reset from 1 to 0, two cycles are required before all program state is reflected properly in the registers affected by the FZ bit. This implies that interrupts and traps cannot be enabled until two cycles after the FZ bit is reset, for proper sequencing of program state. There is no delay associated with setting the FZ bit from 0 to 1. • # SYSTEM PROTECTION The Am29200 and Am29205 microcontrollers provide protection for general-purpose registers and special-purpose registers. Certain processor operations are also protected. This chapter describes the processor's protection mechanisms. # 6.1 USER AND SUPERVISOR MODES At any given time, the microcontroller operates in one of two mutually exclusive program modes: the Supervisor mode or the User mode. All system-protection features of the microcontroller are based on the difference between these two modes. # 6.1.1 Supervisor Mode The processor operates in the Supervisor mode whenever the Supervisor Mode (SM) bit of the Current Processor Status Register is 1 (see Section 16.2.1). In the Supervisor mode, executing programs have access to all processor resources. Any attempt to access a special-purpose register in the range of 160 to 255 causes a Protection Violation to occur in either Supervisor or User mode. This permits virtualization of these registers. Supervisor-mode accesses are permitted for any general-purpose register, regardless of protection. # 6.1.2 User Mode The processor operates in the User mode whenever the SM bit in the Current Processor Status Register is 0. In the User mode, any of the following actions by an executing program causes a Protection Violation trap to occur: - An attempted access of any general-purpose register for which a bit in the Register Bank Protect Register is 1 (see Section 6.2). - An attempted execution of one of the following instructions: Interrupt Return, Interrupt Return and Invalidate, Invalidate, or Halt. However, a hardware-development system can disable protection checking for the Halt instruction, so this instruction may be used to implement instruction breakpoints in User-mode programs (see Sections 17.3 and 17.7.5). - An attempted access of special-purpose register in the range of 0 to 127 or 160 to 255. - An attempted execution of an assert or EMULATE instruction that specifies a vector number between 0 and 63, inclusive (see Section 16.3.2). # 6.2 REGISTER PROTECTION General-purpose registers are divided into register banks and are protected by the Register Bank Protect Register. The Register Bank Protect Register allows parameters for the operating system to be kept in general-purpose registers and protected from corruption by User-mode programs. Register banks consist of 16 registers (except for Bank 0, which contains Registers 2 through 15) and are partitioned according to absolute-register numbers, as shown in Figure 6-1. Figure 6-1 Register Bank Organization | Register Bank<br>Protect Register Bit | Absolute-Register<br>Numbers | General-Purpose<br>Registers | |---------------------------------------|------------------------------|------------------------------| | 0 | 2 through 15 | Bank 0 (not implemented) | | 1 | 16 through 31 | Bank 1 (not implemented) | | 2 | 32 through 47 | Bank 2 (not implemented) | | 3 | 48 through 63 | Bank 3 (not implemented) | | 4 . | 64 through 79 | Bank 4 | | 5 | 80 through 95 | Bank 5 | | 6 | 96 through 111 | Bank 6 | | 7 | 112 through 127 | Bank 7 | | 8 | 128 through 143 | Bank 8 | | 9 | 144 through 159 | Bank 9 | | 10 | 160 through 175 | Bank 10 | | . 11 | 176 through 191 | Bank 11 | | 12 | 192 through 207 | Bank 12 | | 13 | 208 through 223 | Bank 13 | | 14 | 224 through 239 | Bank 14 | | 15 | 240 through 255 | Bank 15 | The Register Bank Protect Register contains 16 protection bits, where each bit controls User-mode accesses (read or write) to a bank of registers. Bits 0–15 of the Register Bank Protect Register, protect Register Banks 0 through 15, respectively. When a bit in the Register Bank Protect Register is 1 and a register in the corresponding bank is specified as an operand register or result register by a User-mode instruction, a Protection Violation trap occurs. Note that protection is based on absolute-register numbers. In the case of local registers, Stack-Pointer addition is performed before protection checking. When the processor is in the Supervisor mode, the Register Bank Protect Register has no effect on general-purpose register accesses. # 6.2.1 Register Bank Protect Register (RBP, Register 7) This protected special-purpose register (Figure 6-2) protects banks of general-purpose registers from User-mode program accesses. The general-purpose registers are partitioned into 16 banks of 16 registers each (except that Bank 0 contains 14 registers). The banks are organized as shown in Figure 6-1. # Bits 31-16: Reserved **Bits 15–0:** Bank 15 through Bank 0 Protection Bits (B15–B0)—In the Register Bank Protect Register, each bit is associated with a particular bank of registers, and the bit number gives the associated bank number (e.g., B11 determines the protection for Bank 11). # **SYSTEM OVERVIEW** The Am29200 and Am29205 microcontrollers significantly reduce system cost because each microcontroller integrates many system functions onto a single chip. This chapter overviews the system interfaces and on-chip peripherals of the Am29200 and Am29205 microcontrollers. # 7.1 SIGNAL DESCRIPTION The Am29200 microcontroller uses 140 pins for signal inputs and outputs. It uses 28 pins for power and ground. The Am29205 microcontroller uses 84 pins for signal inputs and outputs. It uses 16 pins for power and ground. Section 7.1.12 summarizes the signal differences between the Am29200 and Am29205 microcontrollers. Note: The UCLK signal must be tied High if the serial port is not used. The TRST signal must be tied to RESET, whether or not the JTAG port is used. See Appendix A for other important hardware configuration notes. # 7.1.1 Clocks INCLK Input Clock (input) This is an oscillator input at twice the processor and system operating frequency. It can be driven at TTL levels. MEMCLK Memory Clock (output) This is a clock output at one-half of the frequency of INCLK. Most processor outputs, and many inputs, are synchronous to MEMCLK. MEMCLK drives out with CMOS levels. # 7.1.2 Processor Signals A23-A0 Address Bus (output, synchronous) The address bus supplies the byte address for all accesses, except for DRAM accesses. For DRAM accesses, multiplexed row and column addresses are provided on A14–A1. A2–A0 are also used to provide a clock to an optional burst-mode EPROM. The signals A23–A22 and burst-mode devices are not supported on the Am29205 microcontroller. ID31-ID0 Instruction/Data Bus (bidirectional, synchronous) The instruction/data bus (ID bus) transfers instructions to, and data to and from the processor. The signals ID15–ID0 are not supported on the Am29205 microcontroller. WAIT **Add Wait States** (input, synchronous, weak internal pull-up transistor) External accesses are normally timed by the Am29200 microcontroller. However, the WAIT signal may be asserted during a PIA, ROM, or DMA access to extend the access indefinitely. The WAIT pin is not available on the Am29205 microcontroller; see the WAIT/TRIST signal description. # WAIT/TRIST Add Wait States/Three-State Control (input, synchronous, weak internal pull-up transistor) The WAIT signal may be asserted during a PIA, ROM, or DMA access to extend the access indefinitely. The WAIT/TRIST pin also used for threestate control during test. When asserted during a processor reset, all output pins go into a high impedance state. For normal operation, this pin must be pulled High during processor reset. This pin is not available on the Am29200 microcontroller; see the WAIT signal description. #### R/W # Read/Write (output, synchronous) During an external ROM, DRAM, DMA, or PIA access, this signal indicates the direction of transfer: High for a read and Low for a write. #### RESET ## Reset (input, asynchronous) This input places the processor in the Reset mode. This signal has special hardening against metastable states, allowing it to be driven with a slow-rise-time signal. ## WARN # Warn (input, asynchronous, edge-sensitive, internal pull-up) A High-to-Low transition on this input causes a non-maskable WARN trap to occur. This trap bypasses the normal trap vector fetch sequence, and is useful in situations where the vector fetch may not work (e.g., when data memory is faulty). This signal has special hardening against metastable states, allowing it to be driven with a slow-transition-time signal. This signal is not supported on the Am29205 microcontroller. # INTR3-INTR0 Interrupt Requests 3-0 (input, asynchronous, internal pull-ups) These inputs generate prioritized interrupt requests. The interrupt caused by INTRO has the highest priority, and the interrupt caused by INTR3 has the lowest priority. The interrupt requests are masked in prioritized order by the Interrupt Mask field in the Current Processor Status Register and are disabled by the DA and DI bits of the Current Processor Status Register. These signals have special hardening against metastable states. allowing them to be driven with slow-transition-time signals. The INTR1-INTR0 signals are not supported on the Am29205 microcontroller. #### STAT2-STAT0 # **CPU Status (output, synchronous)** These outputs indicate information about the processor or the current access for the purposes of hardware debug. They are encoded as follows: | STAT2 | STAT1 | STAT0 | Condition | |-------|-------|-------|-------------------------------------------------| | 0 | 0 | 0 | Halt or Step mode | | 0 | 0 | 1 | Interrupt/trap vector fetch (vector valid) | | 0 | 1 | 0 | Load Test Instruction mode, Halt/Freeze | | 0 | 1 | 1 | Branch target fetch (instruction valid) | | 1 | 0 | 0 | External data access (data valid) | | 1 | 0 | 1 | External instruction access (instruction valid) | | 1 | 1 | . 0 | Internal peripheral access (data valid) | | 1 | 1 | . 1 | Idle or data/instruction not valid | Note that in all cases, a condition is reflected on the STAT pins on the second cycle following the condition. These signals are described in more detail in Section 17.4. The STAT2-STAT0 signals are not supported on the Am29205 microcontroller. #### TRAP1-TRAP0 # Trap Requests 1-0 (input, asynchronous, internal pull-ups) These inputs generate prioritized trap requests. The trap caused by TRAPO has the highest priority. These trap requests are disabled by the DA bit of the Current Processor Status Register. These signals have special hardening against metastable states, allowing them to be driven with slow-transition-time signals. These signals are not supported on the Am29205 microcontroller. # 7.1.3 ROM Interface ## ROMCS3-ROMCS0 # ROM Chip Selects, Banks 3-0 (output, synchronous) A Low level on one of these signals selects the memory devices in the corresponding ROM bank. ROMCS3 selects devices in ROM Bank 3, and so on. The timing and access parameters of each bank are individually programmable. ROMCS3 is not supported on the Am29205 microcontroller. # **ROMOE** ROM Output Enable (output, synchronous) This signal enables the selected ROM Bank to drive the ID bus. It is used to prevent bus contention when switching between different ROM banks or switching between a ROM bank and another device or DRAM bank. ## **BURST** Burst-Mode Access (output, synchronous) This signal is asserted to perform sequential accesses from a burst-mode device. This signal is not supported on the Am29205 microcontroller. # RSWE ROM Space Write Enable (output, synchronous) This signal is used to write an alterable memory in a ROM bank (such as an SRAM or Flash EPROM). RSWE supports only writes of width equal to or greater than the width of the memory, and the memory must be at least 16 bits wide. The CASx signals, described in Section 7.1.4, serve as individual byte strobes for writes to the ROM space, if ROM byte writes are enabled. # **BOOTW** Boot ROM Width (input, asynchronous) This input configures the width of ROM Bank 0, so the ROM can be accessed before the ROM configuration has been set by the system initialization software. The BOOTW signal is sampled during and after a processor reset. If BOOTW is High before and after reset (tied High), the boot ROM is 32 bits wide. If BOOTW is Low before and after reset (tied Low), the boot ROM is 16 bits wide. If BOOTW is Low before reset and High after reset (tied to $\overline{\text{RESET}}$ ), the boot ROM is 8 bits wide. This signal has special hardening against metastable states, allowing it to be driven with a slow-rise-time signal and permitting it to be tied to $\overline{\text{RESET}}$ . This signal is not supported on the Am29205 microcontroller. ROM Bank 0 is set to 16 bits during a processor reset; this setting cannot be changed. #### 7.1.4 DRAM Interface ## RAS3-RAS0 Row Address Strobe, Banks 3-0 (output, synchronous) A High-to-Low transition on one of these signals causes a DRAM in the corresponding bank to latch the row address and begin an access. RAS3 starts an access in DRAM Bank 3, and so on. These signals also are used in other special DRAM cycles. # CAS3-CAS0 Column Address Strobes, Byte 3-0 (output, synchronous) A High-to-Low transition on these signals causes the DRAM selected by RAS3-RAS0 to latch the column address and complete the access. To support byte and half-word writes, column address strobes are provided for individual DRAM bytes. CAS3 is the column address strobe for the DRAMs, in all banks, attached to ID31-ID24. CAS2 is for the DRAMs attached to ID23-ID16, and so on. These signals are also used in other special DRAM cycles. The CASx signals can be enabled to act as individual byte strobes for byte writes to the ROM space. In this configuration, ROM accesses do not conflict with DRAM accesses or refresh even though the CASx may be used by both the ROM and DRAM. Refresh is delayed during byte reads and writes to ROM space. The CAS1-CAS0 signals are not supported on the Am29205 microcontroller. #### WE # Write Enable (output, synchronous) This signal is used to write the selected DRAM bank. "Early write" cycles are used so the DRAM data inputs and outputs can be tied to the common ID bus. #### TR/OE # **Video DRAM Transfer/Output Enable (output, synchronous)** This signal is used with video DRAMs to transfer data to the video shift register. It is also used as an output enable in normal video DRAM read cycles. This signal is not supported on the Am29205 microcontroller. # 7.1.5 Peripheral Interface Adapter (PIA) #### PIACS5-PIACS0 ## Peripheral Chip Selects, Regions 5-0 (output, synchronous) These signals are used to select individual peripheral devices. DMA Channel 0 may be programmed to use PIACS0 during an external peripheral access, and DMA Channel 1 may be programmed to use PIACS1. PIACS5—PIACS2 are not supported on the Am29205 microcontroller. #### PIAOE # Peripheral Output Enable (output, synchronous) This signal enables the selected peripheral device to drive the ID bus. # PIAWE #### Peripheral Write Enable (output, synchronous) This signal causes data on the ID bus to be written into the selected peripheral. # 7.1.6 DMA Controller #### DREQ1-DREQ0 **DMA Request, Channels 1-0 (input, asynchronous, internal pull-ups)** These signals request an external transfer on DMA Channel 0 (DREQ0) or DMA Channel 1 (DREQ1). These requests are individually programmable to be either level- or edge-sensitive for either polarity of level or edge. DMA transfers can occur to and from internal peripherals independent of these requests. DREQ0 is not supported on the Am29205 microcontroller. #### DACK1-DACK0 # DMA Acknowledge, Channels 1-0 (output, synchronous) These signals acknowledge an external transfer on DMA Channel 0 (DREQ0) or DMA Channel 1 (DREQ1). DMA transfers can occur to and from internal peripherals independent of these acknowledgments. DACKO is not supported on the Am29205 microcontroller. # TDMA Terminate DMA (input, synchronous) This signal can be asserted during an external DMA transfer to terminate the transfer after the current access. This signal is not supported on the Am29205 microcontroller. # GREQ External Memory Grant Request (input, synchronous, internal pull-up) This signal is used by an external device to request an access to the Am29200 microprocessor's ROM or DRAM. To perform this access, the external device supplies an address to the Am29200 microcontroller's ROM controller or DRAM controller. To support a hardware-development system, GREQ should be either tied High or held at a high-impedance state during a processor reset. This signal is not supported on the Am29205 microcontroller. # GACK External Memory Grant Acknowledge (output, synchronous) This signal indicates to an external device that it has been granted an access to the Am29200 microcontroller's ROM or DRAM, and that the device should provide an address. This signal is not supported on the Am29205 microcontroller. ## 7.1.7 I/O Port # PIO15-PIO0 Programmable Input/Output (input/output, asynchronous) These signals are available for direct software control and inspection. PIO15–PIO8 may be individually programmed to cause processor interrupts. These signals have special hardening against metastable states, allowing them to be driven with slow-transition-time signals. The signals PIO7–PIO0 are not supported on the Am29205 microcontroller. ### 7.1.8 Parallel Port # PSTROBE Parallel Port Strobe (input, asynchronous) This signal is used by the host to indicate that data is on the parallel port or to acknowledge a transfer from the microcontroller. # PBUSY Parallel Port Busy (output, synchronous) This indicates to the host that the parallel port is busy and cannot accept a data transfer. # PACK Parallel Port Acknowledge (output, synchronous) This signal is used by the microcontroller to acknowledge a transfer from the host or to indicate to the host that data has been placed on the port. # PAUTOFD Parallel Port Autofeed (input, asynchronous) This signal is used by the host to indicate how line feeds should be performed or is used to indicate that the host is busy and cannot accept a data transfer. # POE Parallel Port Output Enable (output, synchronous) This signal enables an external data buffer containing data from the host to drive the ID bus. # PWE Parallel Port Write Enable (output, synchronous) This signal writes a buffer with data on the ID bus. Then, the buffer drives data to the host. # 7.1.9 Serial Port UCLK UART Clock (input) This is an oscillator input for generating the UART (serial port) clock. To generate the UART clock, the oscillator frequency may be divided by any amount up to 65,536. The UART clock operates at 16 times the serial port's baud rate. As an option, UCLK may be driven with MEMCLK or INCLK. It can be driven with TTL levels. UCLK must be tied High if unused. TXD Transmit Data (output, asynchronous) This output is used to transmit serial data. RXD Receive Data (input, asynchronous) This input is used to receive serial data. DSR Data Set Ready (output, synchronous) This indicates to the host that the serial port on the Am29200 microcontroller is ready to transmit or receive data. This signal is not supported on the Am29205 microcontroller. DTR Data Terminal Ready (input, asynchronous) This indicates to the Am29200 microcontroller that the host is ready to transmit or receive data. This signal is not supported on the Am29205 microcontroller. # 7.1.10 Video Interface VCLK Video Clock (input, asynchronous) This clock is used to synchronize the transfer of video data. As an option, VCLK may be driven with MEMCLK or INCLK. It can be driven with TTL levels. VDAT Video Data (input/output, synchronous to VCLK) This is serial data to or from the video device. LSYNC Line Synchronization (input, asynchronous) This signal indicates the start of a raster line. PSYNC Page Synchronization (input/output, asynchronous) This signal indicates the beginning of a raster page. # 7.1.11 JTAG 1149.1 Boundary Scan Interface (Am29200 Microcontroller) TCK Test Clock Input (asynchronous input, internal pull-up) This input is used to operate the test access port. The state of the test access port must be held if this clock is held either High or Low. This clock is internally synchronized to MEMCLK for certain operations of the test access port controller, so signals internally driven and sampled by the test access port are synchronous to processor internal clocks. This signal is not available on the Am29205 microcontroller. TMS Test Mode Select (input, synchronous to TCK, internal pull-up) This input is used to control the test access port. If it is not driven, it appears High internally. This signal is not available on the Am29205 microcontroller. TDI Test Data Input (input, synchronous to TCK, internal pull-up) This input supplies data to the test logic from an external source. It is sampled on the rising edge of TCK. If it is not driven, it appears High internally. This signal is not available on the Am29205 microcontroller. TDO Test Data Output (three-state output, synchronous to TCK) This output supplies data from the test logic to an external destination. It changes on the falling edge of TCK. It is in the high-impedance state except when scanning is in progress. This signal is not available on the Am29205 microcontroller. TRST Test Reset Input (asynchronous input, internal pull-up) This input asynchronously resets the test access port. This input places the test logic in a state such that no output driver is enabled. The TRST input must be asserted in conjunction with the RESET input for correct processor initialization, whether or not the JTAG port is used. (See Appendix A.) This signal is not available on the Am29205 microcontroller. # 7.1.12 Pin Changes for the Am29205 Microcontroller The reduced pin count of the Am29205 microcontroller comes from having a 16-bit instruction/data bus, fewer ports on some of the peripherals, and no JTAG interface. The following signals supported on the Am29200 microcontroller are not available on the Am29205 microcontroller. - Processor signals: A23–A22, ID15–ID0, WARN, INTR1–INTR0, TRAP1–TRAP0, STAT2–STAT0 - ROM interface signals: ROMCS3, BURST, BOOTW - DRAM interface signals: CAS1-CAS0, TR/OE - PIA signals: PIACS5—PIACS2 - DMA signals: DREQ0, DACK0, TDMA, GREQ, GACK - I/O port signals: PIO7-PIO0 - Serial port signals: DSR, DTR - JTAG signals: TCK, TDI, TMS, TDO, TRST In addition, the Am29200 microcontroller's WAIT pin is defined as a WAIT/TRIST pin on the Am29205 microcontroller. # 7.2 ACCESS PRIORITY Many of the processor interface signals are shared between various types of accesses. If more than one access request occurs at the same time, the requests are prioritized as follows, in decreasing order of priority: - 1. "Panic mode" DRAM Refresh (see Section 9.3.8) - 2. DMA Channel 0 transfer - 3. DMA Channel 1 transfer - 4. Memory access request by an external device (see Section 11.5) - 5. Processor DRAM, PIA, or ROM access for data - 6. Processor DRAM or ROM access for an instruction External DMA transfers require two accesses: one to read the data from a peripheral or the DRAM, and another to write the data to a peripheral or DRAM. The two accesses are performed back-to-back, without interruption by another access. Some processor accesses to narrow memories (a narrow memory is 8 or 16 bits wide) require two or four accesses; for example, reading 32 bits from an 8-bit-wide ROM requires four reads. These accesses are also performed back-to-back, without interruption. DRAM refresh cycles are normally overlapped with other, non-DRAM accesses. Because normal refresh cycles are performed when there is no conflict with other accesses, these cycles are not prioritized in the above list. ### 7.3 SYSTEM ADDRESS PARTITION All addresses are in the microcontroller's instruction/data memory address space. The address space is partitioned as shown in Table 7-1. Table 7-1 Internal Peripheral Address Ranges | Address Dongs | Selection | Maximum Physical Size | | | |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------|--| | Address Range (hexadeximal) | Selection | Am29200 Microcontroller | Am29205 Microcontroller | | | 00000000-03FFFFFF 4000000-43FFFFFF 50000000-50FFFFFF 60000000-63FFFFFF 8000000-800000FC 9000000-90FFFFFF 91000000-91FFFFFF 92000000-92FFFFFF 93000000-93FFFFFF 9400000-94FFFFF 95000000-95FFFFFF | ROM Banks (all) DRAM Banks (all) Mapped DRAM Banks (all) VDRAM transfers Internal peripherals/controllers PIA Region 0 (PIACS0) PIA Region 1 (PIACS1) PIA Region 2 (PIACS2) PIA Region 3 (PIACS3) PIA Region 4 (PIACS4) PIA Region 5 (PIACS5) Reserved | 64 Mbytes 64 Mbytes 16 Mbytes 64 Mbytes ———————————————————————————————————— | 12 Mbytes 32 Mbytes 16 Mbytes Not Supported 4 Mbytes 4 Mbytes Not Supported Not Supported Not Supported Not Supported Not Supported | | An access to any unimplemented address or address range has an unpredictable effect on processor operation. # 7.4 INTERNAL PERIPHERALS AND CONTROLLERS Internal peripheral registers are selected by offsets from address 80000000h. The address assignment of the various internal peripherals and controllers is shown in Table 7-2. Nearly all registers are read/write and are 32 bits in length. However, a few register bits are read only, bits in the Interrupt Control Register are reset-only, and the DMA0 Address Tail Register and DMA0 Count Tail Register are both write-only. It is not possible to perform writes on individual bytes or halfwords of any register. Unimplemented register bits are read as zeros and should be written with zeros to ensure compatibility with future processor versions. Three registers have alternates, provided for backward compatibility. The following summary shows the preferred and alternate addresses for each of these registers. | Register | Preferred Address | Alternate Address | | |-------------------------------|-------------------|-------------------|--| | DMA0 Address Tail Register | 80000070h | 80000036h | | | DMA0 Count Tail Register | 8000003Ch | 8000003Ah | | | Parallel Port Status Register | 800000C8h | 800000C1h | | The alternate DMA0 Address Tail Register and the alternate DMA0 Count Tail Register allow write-only access for compatibility with earlier versions of the Am29200 and Am29205 microcontrollers. These two registers are supported for backward compatibility and should not be used for new designs. The DMA0 Address Tail Register (address 80000070h) and DMA0 Count Tail Register (address 8000003Ch) should be used instead. The alternate Parallel Port Status Register is also provided for compatibility. This register should not be used for new designs. The Parallel Port Status Register (address 800000C8h) should be used instead. Table 7-2 Internal Peripheral Address Assignments | | Address | | |------------------------------|----------------------|------------------------------------------------------------------------------| | Peripheral | (hexadecimal) | Register | | ROM Controller | 80000000 | ROM Control Register | | | 8000004 | ROM Configuration Register | | DRAM Controller | 80000008 | DRAM Control Register | | | 800000C | DRAM Configuration Register | | DRAM Mapping Unit | 8000010 | DRAM Mapping Register 0 | | | 80000014 | DRAM Mapping Register 1 | | | 80000018 | DRAM Mapping Register 2 | | | 8000001C | DRAM Mapping Register 3 | | Peripheral Interface Adapter | 80000020 | PIA Control Register 0 | | | 80000024 | PIA Control Register 1* | | Interrupt Controller | 80000028 | Interrupt Control Register | | DMA Channel 0 | 80000030 | DMA0 Control Register | | | 80000034 | DMA0 Address Register | | | 80000070 | DMA0 Address Tail Register | | | 80000038<br>8000003C | DMA0 Count Register DMA0 Count Tail Register | | | 0000000 | DIVIAO OOUTIL TAIL Flegister | | DMA Channel 1 | 80000040 | DMA1 Control Register | | | 80000044 | DMA1 Address Register | | | 80000048 | DMA1 Count Register | | Serial Port | 80000080 | Serial Port Control Register | | | 80000084 | Serial Port Status Register | | | 80000088<br>800008C | Serial Port Transmit Holding Register<br>Serial Port Receive Buffer Register | | | 80000090 | Baud Rate Divisor Register | | Davallal Davi | 9000000 | Parallal Part Control Pagistar | | Parallel Port | 800000C0<br>80000C4 | Parallel Port Control Register Parallel Port Data Register | | | 800000C8 | Parallel Port Status Register | | Programmable I/O Port | 800000D0 | PIO Control Register | | riogrammable I/O Fort | 800000D0<br>800000D4 | PIO Input Register | | | 800000D8 | PIO Output Register | | | 80000DC | PIO Output Enable Register | | Video Interface | 800000E0 | Video Control Register | | | 800000E4 | Top Margin Register | | | 800000E8 | Side Margin Register | | | 800000EC | Video Data Holding Register | | | —all others— | Reserved | | | | | Note: ◆ Reserved on the Am29205 microcontroller. # 口 # 8 # **ROM CONTROLLER** This chapter describes the operation of the ROM controller. Programmable registers and initialization are discussed, along with ROM address mapping, ROM reads and writes, burst-mode accesses, and extending ROM cycles. #### 8.1 OVERVIEW The on-chip ROM controller provides a glueless interface to static memory devices such as ROMs and EPROMs, as well as alterable devices such as SRAMs, flash EPROMs, and memory-mapped peripherals. ROM space on the Am29200 and Am29205 microcontrollers is divided into banks, each of which is individually configurable for width and access timing. Programmable registers control the location, size, width, wait-state, and burst capability of each bank. The banks can be arranged to form a contiguous memory area. The ROM interface on the Am29200 microcontroller accommodates up to four banks of ROM. These banks can be 8, 16, or 32 bits wide, with a maximum address space of 16 Mbytes per bank. The Am29205 microcontroller supports up to three ROM banks; 8- and 16-bit wide banks are supported, with a maximum address space of 4 Mbytes per bank. Burst-mode ROM access is not supported. Boot ROM width is 16 bits. The signals ROMCS3, BURST, and BOOTW are not available on the Am29205 microcontroller. # 8.2 PROGRAMMABLE REGISTERS # 8.2.1 ROM Control Register (RMCT, Address 80000000) The ROM Control Register (Figure 8-1) controls the access of ROM Banks 0 through 3 on the Am29200 microcontroller and ROM Banks 0 through 2 on the Am29205 microcontroller. # Figure 8-1 ROM Control Register = Reserved on Am29205 microcontroller **Bit 31: Burst-Mode ROM, Bank 0 (BST0), Am29200 microcontroller**—When this bit is 1, ROM Bank 0 is accessed using the burst-mode protocol, in which sequential accesses are completed at the rate of one access per cycle. When this bit is 0, the burst-mode protocol is not used. This bit is reserved on the Am29205 microcontroller. Bits 30–29: Data Width, Bank 0 (DW0)—This field indicates the width of the ROM in Bank 0, as follows: | DW0 | ROM Width | |-----|-----------------------------------------------| | 00 | 32 bits (Reserved on Am29205 microcontroller) | | 01 | 8 bits | | 10 | 16 bits | | 11 | Reserved | **Bit 28: Large Memory (LM)**—This bit controls the size of the ROM banks and the total size of the ROM address space. If the LM bit is 0 on either microcontroller, each ROM bank is up to 4 Mbytes in size, for placement within a 16 Mbyte address space. If the LM bit is 1 on the Am29200 microcontroller, each ROM bank is up to 16 Mbytes in size, for placement within a 64-Mbyte address space. If the LM bit is 1 on the Am29205 microcontroller, each ROM bank is up to 4 Mbytes in size, for placement within a 64-Mbyte address space. Bit 27: Byte Write Enable (BWE)—This bit controls whether or not the CASx signals are used as byte strobes during writes to the ROM address space. If BWE is 0, the CASx signals are not used during ROM writes (unless there is a hidden refresh at the same time). If BWE is 1, the CASx signals are used as byte strobes during a ROM write with hidden refresh prohibited during a ROM read or write. #### Bit 26: Reserved Bits 25–24: Wait States, Bank 0 (WS0)—This field specifies the number of wait states in a ROM access: that is, the number of cycles in addition to one cycle required to access the ROM. Zero-wait-state cycles are supported only for non-burst-mode ROM reads. Writes to the ROM address space and burst-mode ROMs have a minimum of one wait state, even when wait states are programmed at zero. Other bits of this register have a definition similar to BST0, DW0, and WS0 for ROM Banks 1 through 3 on the Am29200 microcontroller and ROM Banks 1 through 2 on the Am29205 microcontroller. The BSTx bits are not supported on the Am29205 microcontroller. Bits 7-0: Reserved. These bits are reserved on the Am29205 microcontroller only. # 8.2.2 ROM Configuration Register (RMCF, Address 80000004) The ROM Configuration Register (Figure 8-2) controls the selection of ROM Banks 0 through 3 on the Am29200 microcontroller and ROM Banks 0 through 2 on the Am29205 microcontroller. In most systems, this register should be set by software to cause all the banks of ROM to appear as a single, contiguous region of memory. Bits 31–27: Address Select, Bank 0 (ASEL0)—On a load, store, or instruction access, this field is compared against bits of the access address, with the comparisons possibly masked by the AMASK0 field. The unmasked bits of the ASEL0 field must match the corresponding bits of the address for ROM Bank 0 to be accessed. **Bits 26–24: Address Mask, Bank 0 (AMASK0)**—This field masks the comparison of the ASEL0 field with bits of the address on an access, to permit various sizes of memories and memory chips in ROM Bank 0 ("ad(x:y)" represents a field of address bits x through y, inclusive). Figure 8-2 ROM Configuration Register = Reserved on Am29205 microcontroller | AMASK0 | Address Comparison | Address Comparison | |--------|-------------------------|-----------------------------------------------| | Value | (LM=0) | (LM=1) | | 000 | ASEL0(4:0) to ad(23:19) | ASEL0(4:0) to ad(25:21) | | 001 | ASEL0(4:1) to ad(23:20) | ASEL0(4:1) to ad(25:22) | | 011 | ASEL0(4:2) to ad(23:21) | ASEL0(4:2) to ad(25:23) (Reserved on Am29205) | | 111 | ASEL0(4:3) to ad(23:22) | ASEL0(4:3) to ad(25:24) (Reserved on Am29205) | Only the AMASK0 values shown in the above table are valid. The AMASK0 field permits various sizes of memories and memory chips in ROM Bank 0 that are independent of the sizes in the other banks. Other bits of this register have a definition similar to ASEL0 and AMASK0 for ROM Banks 1 through 3 on the Am29200 microcontroller and ROM Banks 1 through 2 on the Am29205 microcontroller. **Bits 7–0: Reserved.** These bits are reserved on the Am29205 microcontroller only. Although ROM Bank 3 is not supported on the Am29205 microcontroller, AMASK3 and ASEL3 still exist in the ROM Configuration Register. ASEL3 must be programmed to a value that does not overlap with addresses specified for ROM Banks 2 through 0. #### 8.2.3 Initialization ROM Bank 0 is used as the boot ROM containing the initialization code for the processor and peripherals. On the Am29200 microcontroller, the width of Bank 0 is set by the BOOTW signal, which is sampled during and after a processor reset. If BOOTW is High before and after reset (tied High), the boot ROM is 32 bits wide. If BOOTW is Low before and after reset (tied Low), the boot ROM is 16 bits wide. If BOOTW is Low before reset and High after reset (tied to RESET), the boot ROM is 8 bits wide. The BOOTW signal is used to set the DW0 field before the boot ROM is accessed. On the Am29205 microcontroller, the boot ROM width in Bank 0 is 16 bits during processor reset. No 8-bit booting is possible since the BOOTW signal is not supported on the Am29205 microcontroller. The boot ROM defaults to a non-burst-mode ROM with three wait states until the ROM Control Register and ROM Configuration Register are set with the correct configuration. The LM bit is reset to 0. The ASEL0 and AMASK0 fields are both set to zero by a processor reset. To prevent bank conflicts during initialization, the ASEL and AMASK fields for ROM banks 1 through 3 are set to all 1s. The configuration of ROM banks 1 through 3, if present, must be set by software before the respective bank is accessed. ## 8.3 ROM ACCESSES # 8.3.1 ROM Address Mapping To map logical memory banks to physical addresses, each ROM bank uses two fields to determine the location of the bank in physical memory: AMASK and ASEL. AMASK selects the number of address bits decoded and thus the size of a given bank. ASEL contains the address bit values compared against the address and thus the location of a given bank. The LM bit controls the maximum size of the banks and the total size of the ROM address space as shown. | AMASK Value | Bank Size (LM=0) | Bank Size (LM=1) | |-------------|------------------|------------------| | 000 | 512 Kbytes | 2 Mbytes | | 001 | 1 Mbyte | 4 Mbytes | | 011 | 2 Mbytes | 8 Mbytes* | | 111 | 4 Mbytes | 16 Mbytes* | <sup>\*</sup> Am29200 microcontroller only The ASEL and AMASK fields allow the three or four ROM banks to appear as a contiguous region of ROM, with the restriction that a bank of a certain size must fit on the natural address boundary for that size. For example, a 2-Mbyte ROM must be placed on a 2-Mbyte address boundary. For this reason, ROM banks must appear in the address space in order of decreasing bank size if the banks are to be contiguous. Note that to achieve a contiguous memory, the various ROM banks need not appear in sequence in the address space. For example, on the Am29200 microcontroller, ROM Bank 3 may appear in an address range below the address range for ROM Bank 1 or 2. The only restriction in the placement of ROM banks is that ROM Bank 0 is used for the initial instruction fetches after a processor reset, starting at address 00000000, hexadecimal. Setting AMASK to 0 and ASEL to 1Fh reduces the probability of empty banks being inadvertently decoded. This configures the bank as small and as high in memory as possible. # 8.3.2 Simple ROM Accesses Figure 8-3 shows the timing of a simple ROM read cycle. The number of cycles is controlled by the WSx field in the ROM Control Register ("x" represents one of ROM Banks 0 through 3). The WSx field specifies the number of wait states: that is, the number of cycles beyond one cycle required to access the ROM. Figure 8-4 shows the timing of a zero-wait-state ROM read (WSx = 00). In this case, the ROMOE signal is asserted at the midpoint of the cycle rather than at the beginning of the second cycle (since there is no second cycle). #### 8.3.3 Narrow ROM Accesses A narrow ROM is one that is less than 32 bits wide. The Am29200 and Am29205 microcontrollers support 8- and 16-bit-wide ROMs in any bank, as determined by the DWx field in the ROM Control Register. Figure 8-3 Simple ROM Read Cycle An 8-bit-wide ROM must be attached to ID31–ID24. A 16-bit-wide ROM must be attached to ID31–ID16 and ignores A0. A 32-bit ROM is attached to ID31–ID0 and ignores A1–A0. A narrow ROM can respond to any read access, but the ROM must be at least 16 bits wide to respond to writes. Writes to 8-bit memories are not supported and may provide unreliable results. #### 8.3.3.1 8-Bit Narrow Accesses If the processor expects a half-word or a word on a read (that is, if the access is not a byte read), and a narrow ROM is 8 bits wide, the microcontroller generates one (for a half-word) or three (for a word) requests immediately following the first access. No other intervening accesses are performed. The address for each subsequent access is the same as the address for the first access, except that A1–A0 are incremented by one for each access. A burst-mode access may be performed for the subsequent bytes if the ROM permits such an access and if the ROM Control Register is programmed to enable burst. The microcontroller assembles the final word or half-word by placing the first received byte in the high-order byte position of the word or half-word. The second received byte is placed in the next-lower-order byte position and so on until the entire word or half-word is assembled. If the read access is a byte access, the processor performs only one access. If software generates an unaligned half-word or word read, the narrow ROM does not permit the implementation of the unaligned read. The address sequence generated to Figure 8-4 Simple ROM Read Cycle—Zero Wait States assemble the half-word or word wraps within the half-word or word. A trap on unaligned access is available and may be used to detect and correct such accesses. #### 8.3.3.2 16-Bit Narrow Accesses If the processor expects a word on a read, and a narrow ROM is 16 bits wide, the microcontroller generates one more request immediately following the first access. No other intervening accesses are performed. The address for the second access is the same as the address for the first access, except that A1–A0 are incremented by two for the second access. A burst-mode access may be performed for the second 16 bits if the ROM permits such an access. The microcontroller assembles the final word by placing the first received half-word in the high-order half-word position of the word, and the second received half-word in the low-order half-word position. If the read access is a byte or half-word access, the processor performs only one access. If software generates an unaligned word read, the narrow ROM does not permit the implementation of the unaligned read. The address sequence generated to assemble the word wraps within the word. A trap on unaligned access is available and may be used to detect and correct such accesses. # 8.3.4 Writes to the ROM Space # 8.3.4.1 Simple Writes Figure 8-5 shows the timing of a simple write to the ROM address space. This cycle is provided for alterable memories in the ROM space, such as SRAMs or Flash EPROMs. Zero-wait-state cycles are not supported for writes. Because of processor limitations, the ROM must be at least 16 bits wide to support writes (see Section 8.3.3). If 32-bit data is written into a 16-bit-wide ROM, the processor performs two back-to-back uninterrupted accesses. On the first cycle of the second write, the processor drives the data bus with the second 16 bits (that is, in the same cycle in which ROMCSx and A23–A0 are asserted). ## 8.3.4.2 Byte Writes If the BWE bit is set in the ROM Control Register, the processor uses the CASx signals as individual byte strobes, to allow byte and half-word writes to the ROM address space. Note that this reuse of the CASx signals causes CAS-only cycles to the memories in the DRAM banks (if present) during ROM writes and causes spurious write enables to non-selected memories in the ROM banks during DRAM accesses. These normally do not cause invalid operation. Furthermore, hidden refresh is disabled during ROM reads or writes if the BWE bit is set, to prevent invalid interference between simultaneous ROM and DRAM cycles. Thus, one slight disadvantage of using ROM byte writes is that there are fewer hidden refresh cycles and hence slightly degraded system performance. Figure 8-5 Simple Write to ROM Bank (for alterable memories in the ROM address space) The CAS3-CAS0 signals on the Am29200 microcontroller are used to write individual bytes for a 32-bit-wide ROM bank as follows: | Data width | A1-A0 | CAS3-CAS0 (on write) | | |------------|-------|----------------------|--| | 8 bits | 00 | 0111 | | | 8 bits | 01 | 1011 | | | 8 bits | 10 | 1101 | | | 8 bits | 11 | 1110 | | | 16 bits | . 0x | 0011 | | | 16 bits | 1x | 1100 | | | 32 bits | XX | 0000 | | The CAS3—CAS0 signals are used to write individual bytes for a 16-bit-wide bank (that is, a narrow bank) as follows: | Data width | A1-A0 | CAS3-CAS0 (on write) | |-------------------|-----------------|----------------------| | 8 bits | 00 | 0111 | | 8 bits | 01 | 1011 | | 8 bits | 10 | 0111 | | 8 bits | 11 | 1011 | | 16 bits | 0x | 0011 | | 16 bits | 1x | 0011 | | -all other writes | s (two cycles)— | 0011 | Byte writes are not supported for 8-bit-wide narrow banks. Figure 8-6 shows the timing of a write to the ROM address space. The CAS3-CAS0 signals have exactly the same timing as RSWE. # 8.3.5 Burst-Mode ROM Accesses (Am29200 Microcontroller) Figure 8-7 shows the timing of a burst-mode ROM access, for direct connection to burst-mode devices. Burst-mode accesses have a minimum of one wait state for the initial access, even when wait states are programmed as zero; sequential access after that are single cycle. Burst-mode writes are not supported. Burst-mode ROM accesses are not supported on the Am29205 microcontroller. # 8.3.6 Use of WAIT to Extend ROM Cycles If the WAIT signal is asserted two cycles before the end of a ROM access (that is, two cycles before the cycle in which ROMCSx would normally be deasserted), the processor extends the ROM access until WAIT is deasserted. This permits the system to extend the ROM access indefinitely. The access ends on the cycle after WAIT is deasserted, both for reads (Figure 8-8) and for writes (Figure 8-9). Note that the wait state counter continues to count while WAIT is active, so that the cycle is controlled by either the wait state counter or WAIT, depending on which has the longer duration. Note that WAIT will not be recognized by any bank programmed for zero wait states. The WAIT signal on the Am29200 microcontroller can also be used to extend individual accesses in a sequence of burst-mode accesses. For each access, the processor does not consider the data to be valid until the cycle after WAIT is High (Figure 8-7). Figure 8-7 Burst-Mode ROM Read (Am29200 Microcontroller) Figure 8-9 Extending a ROM Write Cycle with WAIT # 9 # DRAM CONTROLLER This chapter describes the DRAM interface on the Am29200 and Am29205 microcontrollers. The programmable registers are presented, followed by a discussion of DRAM accesses, address mapping, and address multiplexing. Mapped DRAM accesses, page-mode timing, DRAM refresh, and video DRAM transfers are also described. #### 9.1 OVERVIEW The Am29200 and Am29205 microcontrollers directly support DRAM devices without any additional components, providing RAS and CAS generation, address multiplexing, and refresh generation. The on-chip DRAM controller utilizes page-mode accesses and CAS-before-RAS refresh to extract maximum performance from DRAM devices. The DRAM interface accommodates up to four banks of DRAM that appear as a contiguous memory. Each bank on the Am29200 microcontroller is individually configurable in width; the Am29205 microcontroller supports only 16-bit wide DRAM banks . In addition, four 64-Kbyte regions of the DRAM can be mapped into a 16-Mbyte virtual address space. The DRAM controller provides a fixed access time of three cycles plus one cycle of RAS precharge after each access. Two cycle page-mode accesses are supported. To support a lower pin count, several signals used by the Am29200 microcontroller for DRAM interfacing are not available on the Am29205 microcontroller. Because the external data bus is only 16-bits wide, there need be only two CAS signals (one CAS per byte), labeled CAS3 and CAS2. The internal circuitry of the Am29205 microcontroller automatically concatenates the two 16-bit accesses, using big-endian structure for a full 32-bit word. The TR/OE signal for normal DRAM output enable and video DRAM transfer is not available on the Am29205 microcontroller. Any DRAM with an OE line should be tied to CAS for the bank, or tied to ground (asserted) as internal DRAM logic gates OE with CS. Video DRAM transfers are not supported on the Am29205 microcontroller. #### 9.2 PROGRAMMABLE REGISTERS # 9.2.1 DRAM Control Register (DRCT, Address 80000008) The DRAM Control Register (Figure 9-1) controls the access to and refresh of DRAM Banks 0 through 3. **Bit 31: Page-Mode DRAM, Bank 0 (PG0)**—When this bit is 1, burst-mode accesses to DRAM Bank 0 are performed using page-mode accesses for all but the first access. When this bit is 0, page-mode accesses are not performed. Bit 30: Data Width, Bank 0 (DW0)—This field indicates the width of the DRAM in Bank 0, as follows: | DW Value | DRAM Width | |----------|----------------------------------------------------------| | 0 | 32 bits (Reserved on Am29205 microcontroller)<br>16 bits | Since the Am29205 microcontroller supports only 16-bit DRAM, all DWx bits should be set to 1. #### Bit 29: Reserved **Bit 28: Large Memory (LM)**—This bit controls the size of the DRAM banks and the total size of the DRAM address space. If the LM bit is 0 on either microcontroller, each DRAM bank is up to 4 Mbytes in size, for placement within a 16 Mbyte address space. If the LM bit is 1 on the Am29200 microcontroller, each DRAM bank is up to 16 Mbytes in size, for placement within a 64-Mbyte address space. If the LM bit is 1 on the Am29205 microcontroller, each DRAM bank is up to 8 Mbytes in size, for placement within a 64-Mbyte address space. PG1, DW1, and so on perform functions similar to PG0 and DW0 for DRAM Banks 1 through 3. **Bit 15: Static-Column DRAM (SC)**—When this bit is 1, page-mode accesses to the DRAM are performed using static-column accesses. Static column accesses differ from page-mode cycles only in that CAS3—CAS0 are held Low throughout a read access. The timing of the access is not affected, and write accesses are not affected. When this bit is 0, normal page-mode accesses are performed, if enabled. #### Bits 14-9: Reserved Bits 8–0: Refresh Rate (REFRATE)—This field indicates the number of MEMCLK cycles between DRAM refresh intervals. A DRAM refresh interval is the time required to refresh all four DRAM banks. "CAS before RAS" cycles are performed, overlapped in the background with other non-DRAM accesses when possible. If one or more banks have not been refreshed in the background when the REFRATE interval expires, the processor forces a panic mode refresh of the unrefreshed banks. A zero in the REFRATE field disables refresh. Upon reset, this field is initialized to the value 1FFh. # 9.2.2 DRAM Configuration Register (DRCF, Address 8000000C) The DRAM Configuration Register (Figure 9-2) controls the selection of DRAM Banks 0 through 3. In most systems, this register should be set by software to cause the four banks of DRAM to appear as a single, contiguous region of memory. Bits 31–27: Address Select, Bank 0 (ASEL0)—On a load, store, or instruction access, this field is compared against bits of the access address, with the comparisons possibly masked by the AMASK0 field. The unmasked bits of the ASEL0 field must match the corresponding bits of the address for DRAM bank 0 to be accessed. Bits 26–24: Address Mask, Bank 0 (AMASK0)—This field masks the comparison of the ASEL0 field with bits of the address on an access, to permit various sizes of memories and memory chips in DRAM Bank 0 ("ad(x:y)" represents a field of address bits x through y, inclusive). | AMASK0<br>Value | Address Comparison (LM=0) | Address Comparison<br>(LM=1) | |-----------------|---------------------------|-----------------------------------------------| | 000 | ASEL0(4:0) to ad(23:19) | ASEL0(4:0) to ad(25:21) | | 001 | ASEL0(4:1) to ad(23:20) | ASEL0(4:1) to ad(25:22) | | 011 | ASEL0(4:2) to ad(23:21) | ASEL0(4:2) to ad(25:23) | | 111 | ASEL0(4:3) to ad(23:22) | ASEL0(4:3) to ad(25:24) (Reserved on Am29205) | Only the AMASKO values shown in the above table are valid. Other bits of this register have a definition similar to ASEL0 and AMASK0 for DRAM Banks 1 through 3. # 9.2.3 DRAM Mapping Register 0 (DRM0, Address 80000010) This register (Figure 9-3) specifies one of four possible mappings of a mapped DRAM access. Figure 9-3 DRAM Mapping Register 0 **Bit 31: Valid Mapping (VALID)**—This bit, when 1, indicates that the mapping specified by the VIRTBASE and PHYBASE fields is valid. #### Bits 30-24: Reserved **Bits 23–16: Virtual Base Address (VIRTBASE)**—This field specifies the virtual base address of the mapped region. On a mapped DRAM access, it is compared against bits 23-16 of the address generated by the load or store instruction. The comparison must match for the mapping to be performed. #### Bits 15-8: Reserved Bits 7–0: Physical Base Address (PHYBASE)—This field specifies the physical base address of the mapped region. On a mapped DRAM access, if the comparison of the virtual base address yields a match and the VALID bit is 1, the PHYBASE field replaces bits 23-16 of the address. # 9.2.4 DRAM Mapping Register 1 (DRM1, Address 80000014) This register is identical in layout and definition to the DRAM Mapping Register 0. It specifies the second of the four possible mappings. # 9.2.5 DRAM Mapping Register 2 (DRM2, Address 80000018) This register is identical in layout and definition to the DRAM Mapping Register 0. It specifies the third of the four possible mappings. #### 9.2.6 DRAM Mapping Register 3 (DRM3, Address 8000001C) This register is identical in layout and definition to the DRAM Mapping Register 0. It specifies the the fourth of the four possible mappings. #### 9.2.7 Initialization The configuration of DRAM banks, if present, must be set by software before normal DRAM accesses are performed (the DRAM may be accessed using default parameters that are set by software to determine the configuration of the DRAM). The DRAM Mapping registers are not initialized by a processor reset, and must be set by software before a mapped DRAM access occurs. The REFRATE field is initialized on reset to the value 1FFh. DRAM power-up requirements must be guaranteed by software. #### 9.3 DRAM ACCESSES #### 9.3.1 DRAM Address Mapping To map logical memory banks to physical addresses, each DRAM bank uses two fields to determine the location of the bank in physical memory. AMASK and ASEL. AMASK selects the number of address bits decoded and thus the size of a given bank. ASEL contains the address bit values compared against the address and thus the location of a given bank. The LM bit controls the maximum size of the banks and the total size of the DRAM address space as shown. | AMASK Value | Bank Size (LM=0) | Bank Size (LM=1) | |-------------|------------------|------------------| | 000 | 512 Kbytes | 2 Mbytes | | 001 | 1 Mbyte | 4 Mbytes | | 011 | 2 Mbytes | 8 Mbytes* | | 111 | 4 Mbytes | 16 Mbytes* | <sup>-</sup> Am29200 microcontroller only The ASEL and AMASK fields allow the four DRAM banks to appear as a contiguous region of DRAM, with the restriction that a bank of a certain size must fit on the natural address boundary for that size. For example, a 2-Mbyte DRAM must be placed on a 2-Mbyte address boundary. For this reason, DRAM banks must appear in the address space in order of decreasing bank size. Note that to achieve a contiguous memory, the various DRAM banks need not appear in sequence in the address space. For example, DRAM Bank 3 may appear in an address range below the address range for DRAM Bank 1 or 2. This provides flexibility in meeting the restriction that DRAM banks appear in the address space in order of decreasing size. Setting AMASK to 0 and ASEL to 1Fh reduces the probability of empty banks being inadvertently decoded. This configures the bank as small and as high in memory as possible. # 9.3.2 Address Multiplexing The address multiplexing for the DRAMs is performed directly by the processor on the A14–A1 pins, and no external multiplexing is required. As shown in Table 9-1 and Table 9-2, only the odd physical address pins from A9 and above (A9, A11, and A13) are used for 16-bit interfaces, while only even physical address pins above A9 (A10, A12, and A14) are used for 32-bit memories. Address bit A0 is not represented, since the Am29200 microcontroller supports only 16- and 32-bit DRAM widths. Address multiplexing for 16- and 32-bit DRAM memories is performed as shown in Table 9-1 and Table 9-2 ("ax" represents address bit x). Table 9-1 Address Multiplexing for 16-bit DRAM Memory | Address Pin | RAS Asserted | CAS Asserted | Bank Depth<br>(LM=0) (ea) | Bank Depth<br>(LM=1) (ea) | |-------------|--------------|--------------|-----------------------------------------|---------------------------| | . • | | | | | | A13 | a21 | a22 | 4 Mbyte | 8 Mbyte | | + | | | | | | A11 | a19 | a20 | 1 Mbyte | 2 Mbyte | | • | | | | | | A9 | a18 | a9 | | | | A8 | a17 | a8 | Up to<br>256 Kbyte | Up to<br>512 Kbyte | | A7 | a16 | a7 | 200 | | | A6 | a15 | a6 | | | | A5 | a14 | a5 | | | | - A4 | a13 | a4 | | | | A3 | a12 | a3 | | · | | A2 | a11 | a2 | 4 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - 1 - | | | A1 | a10 | a1 | | | Note: \* indicates signals not applicable to the bus width. Table 9-2 Address Multiplexing for 32-bit DRAM Memory (Am29200 Microcontroller) | Address Pin | RAS Asserted | CAS Asserted | Bank Depth<br>(LM=0) (ea) | Bank Depth<br>(LM=1) (ea) | |-------------|--------------|--------------|---------------------------|---------------------------| | A14 | a22 | a23 | 4 Mbyte | 16 Mbyte | | • | - | | | | | A12 | a20 | a21 | 2 Mbyte | 4 Mbyte | | • | | | <u></u> | | | A10 | a19 | a10 | | | | A9 | a18 | а9 | Up to<br>512 Kbyte | Up to<br>1 Mbyte | | A8 | a17 | a8 | · | ,.0 | | A7 | a16 | a7 | | | | A6 | a15 | а6 | | | | A5 | a14 | a5 | | | | A4 | a13 | a4 | | <b> </b> - | | A3 | a12 | a3 | | | | A2 | a11 | a2 | | | | • | | | | | Note: \* indicates signals not applicable to the bus width. Table 9-3 shows how this multiplexing of addresses supports various configurations of memory densities and memory widths, assuming the individual DRAMs are 4 bits wide. The addresses shown in Table 9-3 are the address bits for an access. Table 9-4 shows how the various memories should be connected to the processor's address pins to realize this address multiplexing, again assuming the individual DRAMs are 4 bits wide. Sequential accesses can use page-mode accesses, even though not all CAS address bits are contiguous address bits, because the processor does not generate a page-mode Table 9-3 DRAM Address Multiplexing (by-4 DRAMs) | DRAM | | | | | | | | s bits | | | | | | |---------|---------|----------|-----|-----|-----|-----|-----|--------|-------|-----|-----|-----|-----| | density | width | of cycle | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | | | 16 bits | RAS | 1 | | a18 | a17 | a16 | a15 | a14 | a13 | a12 | a11 | a10 | | 1 Mbit | 10 0113 | CAS | | | а9 | a8 | a7 | a6 | а5 | a4 | a3 | a2 | a1 | | | 32 bits | RAS | | | a19 | a18 | a17 | a16 | a15 | a14 | a13 | a12 | a11 | | | 02 bits | CAS | | | a10 | a9 | а8 | a7 | a6 | а5 | a4 | а3 | a2 | | | 16 bits | RAS | | a19 | a18 | a17 | a16 | a15 | a14 · | a13 | a12 | a11 | a10 | | 4 Mbit | 10 0115 | CAS | | a20 | а9 | a8 | a7 | а6 | a5 | a4 | аЗ | a2 | a1 | | , | 32 bits | RAS | | a20 | a19 | a18 | a17 | a16 | a15 | a14 | a13 | a12 | a11 | | | QZ DILS | CAS | | a21 | a10 | a9 | a8 | a7 | a6 | a5 | a4 | а3 | a2 | | | 16 bits | RAS | a21 | a19 | a18 | a17 | a16 | a15 | a14 | a13 | a12 | a11 | a10 | | 16 Mbit | 10 0115 | CAS | a22 | a20 | а9 | a8 | a7 | а6 | a5 | a4 | аЗ | a2 | a1 | | TOTAL | 32 bits | RAS | a22 | a20 | a19 | a18 | a17 | a16 | a15 | a14 | a13 | a12 | a11 | | | 32 DIIS | CAS | a23 | a21 | a10 | a9 | a8 | а7 | a6 | а5 | a4 | a3 | a2 | | Table 9-4 | DRAM Address | Connections | to Microcontroller | (by-4 DRAMs) | |-----------|--------------|-------------|--------------------|--------------| |-----------|--------------|-------------|--------------------|--------------| | DRAM | | | DRAM multiplexed address bits | | | | | | | | | | |---------|---------------|-----|-------------------------------|-----|----|----|----|----|----|----|----|----| | density | density width | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | | 1 Mbit | 16 bits | | | A9 | A8 | A7 | A6 | A5 | A4 | А3 | A2 | A1 | | | 32 bits | | | A10 | A9 | A8 | A7 | A6 | A5 | A4 | А3 | A2 | | 4 Mbit | 16 bits | | A11 | A9 | A8 | A7 | A6 | A5 | A4 | А3 | A2 | A1 | | | 32 bits | | A12 | A10 | A9 | A8 | A7 | A6 | A5 | A4 | А3 | A2 | | 16 | 16 bits | A13 | A11 | A9 | A8 | A7 | A6 | A5 | A4 | А3 | A2 | A1 | | Mbit | 32 bits | A14 | A12 | A10 | A9 | A8 | A7 | A6 | A5 | A4 | A3 | A2 | access across a 1-Kbyte address boundary. Thus, the processor will not change any address bits other than *a*(9:1) during a page-mode access. # 9.3.3 32-Bit DRAM Width (Am29200 Microcontroller) For a data access, the width of each DRAM bank on the Am29200 microcontroller can be programmed to be either 32 or 16 bits by the DRAM Control Register. If the DRAM is 32 bits wide, ID31–ID0 are used to transfer data to and from the processor, and the processor performs one access to read or write a byte, half-word, or word. The CAS3–CAS0 signals are asserted as follows (the value "0" is Low, "1" is High, and "x" is a don't care): | Data Width | A1-A0 | CAS3-CAS0 (on write) | |------------|----------------|----------------------| | 8 bits | 00 | 0111 | | 8 bits | 01 | 1011 | | 8 bits | 10 | 1101 | | 8 bits | 11 | 1110 | | 16 bits | 0x | 0011 | | 16 bits | 1x | 1100 | | 32 bits | 00 (one cycle) | 0000 | #### 9.3.4 16-Bit DRAM Width If the DRAM is 16 bits wide on either the Am29200 or Am29205 microcontroller, only ID31–ID16 are used to transfer data to and from the processor and the processor performs two accesses to read or write a full word. To read a 32-bit word from a 16-bit DRAM bank, the processor first reads the high-order 16 bits of the word, then generates a second access to read the low-order 16 bits of the word. The address is incremented by two for the second access. To read an 8-bit byte or 16-bit half-word from a 16-bit DRAM, the processor performs only a single access. Alignment and sign extension are performed as usual, except the required byte or half-word is received on ID31–ID16. Figure 9-4 shows the location of bytes and half-words from a 16-bit DRAM bank. In Figure 9-4, bytes and half-words are numbered as they are numbered in a word. To write a 32-bit word into a 16-bit DRAM bank, the processor first writes the high-order 16 bits of the word, then generates a second access to write the low-order 16 bits of the word. The address is incremented by two for the second access, and the low order bits of the word appear on ID31–ID16. To write an 8-bit byte or 16-bit half-word on a 16-bit bus, the processor performs only a single access. For a byte write, the appropriate byte is replicated on both ID31–ID24 and ID23–ID16. For a half-word write, the Figure 9-4 Location of Bytes and Half-Words on a 16-Bit Bus appropriate half-word appears on ID31–ID16. The CAS3–CAS0 signals are asserted as follows (the value "0" is Low, "1" is High, and "x" is a don't care): | Data width | A1-A0 | CAS3-CAS0 (on write) | |------------------|-----------------|----------------------| | 8 bits | 00 | 0111 | | 8 bits | 01 | 1011 | | 8 bits | 10 | 0111 | | 8 bits | 11 | 1011 | | 16 bits | 0x | 0011 | | 16 bits | 1x | 0011 | | -all other write | s (two cycles)— | 0011 | # 9.3.5 Mapped DRAM Accesses Processor DRAM accesses in the 16-Mbyte address range 50000000h–50FFFFFh are mapped to one of four 64-Kbyte regions of the DRAM. This provides a virtual memory region supporting functions such as image compression and decompression that yield lower overall memory requirements and thus lower system cost. Only processor DRAM accesses can be mapped. DRAM accesses by a DMA channel cannot be mapped. DRAM Mapping Registers 0 through 3 each specify a DRAM mapping. Before an access to a DRAM location having an address in the range 50000000h–50FFFFFh, bits 23-16 of the address are compared to the VIRTBASE fields in each of the DRAM Mapping registers. If the address bits match the VIRTBASE field in one of the registers, and the associated VALID bit is 1, then the PHYBASE field replaces bits 23-16 of the address before the access is performed. If more than one valid comparison occurs, the mapping specified by DRAM Mapping Register 0 has the highest priority, and the mapping specified by DRAM Mapping Register 3 has the lowest priority. If no valid comparison is detected, the processor's User- or Supervisor-mode Instruction or Data Mapping Miss occurs, depending on the program mode and type of access. # 9.3.6 Normal Access Timing Figure 9-5 shows the timing for a normal DRAM read cycle. Figure 9-6 shows the timing for a normal DRAM write cycle. DRAM cycles are fixed at four cycles including precharge and cannot be extended with WAIT. An additional cycle is taken after the data is read or written to permit time for RAS precharge. The rising edge of RAS occurs on the third rising edge of MEMCLK after the beginning of the cycle. \_\_\_\_\_ ♦ Am29200 microcontroller only # 9.3.7 Page-Mode Access Timing Page-mode accesses can be enabled for each bank to reduce the average access time for a sequence of accesses. If enabled, page-mode accesses are performed for instruction accesses and for the LOADM and STOREM instructions. Page-mode accesses permit an access time of two cycles for all but the first access. When the DRAM bank is 16 bits wide, two accesses are required to obtain a 32-bit word. Page-mode accesses are performed to access the second 16 bits in this case if page-mode accesses are enabled. Figure 9-7 shows the timing for a page-mode DRAM read cycle. Figure 9-8 shows the timing for a page-mode DRAM write cycle. Static-column accesses are performed if SC=1 in the DRAM Control Register. Static-column accesses differ from page-mode accesses only in that CASx remain Low throughout the access. #### 9.3.8 DRAM Refresh "CAS before RAS" refresh cycles are performed periodically, as determined by the REFRATE field of the DRAM Control Register. The REFRATE field specifies the number of MEMCLK cycles in a refresh interval; a zero in this field disables refresh. The microcontroller ensures that one row of each DRAM bank is refreshed in every interval. Each bank is refreshed separately to distribute the demand placed on the DRAM power supplies by the individual banks. Figure 9-9 shows the timing of a refresh cycle. Because refresh cycles use only the RASx and CASx signals, the processor attempts to perform refresh in the background, refreshing each bank in the cycles that the DRAM is not being used, possibly overlapped <sup>♦</sup> Am29200 microcontroller only with ROM and PIA accesses. Background refresh incurs very little overhead. The average penalty of background refresh is about 2 cycles per refresh interval. This penalty arises because the processor sometimes attempts to access the DRAM after a refresh cycle has been started. If one or more banks has not been refreshed by the end of a refresh interval, the DRAM controller performs "panic mode" refresh cycles to refresh the remaining banks. Panic mode refresh cycles take priority over all other processor accesses. # 9.3.9 Video DRAM Interface (Am29200 Microcontroller) A video DRAM (VDRAM) transfer cycle is performed during accesses in the range 6000000h– 63FFFFFFh. These cycles permit the transfer of data to a VDRAM shift register in graphics applications. For VDRAM transfer cycles with 16-bit memories, the DRAM bank's page-mode bit (PGx) in the DRAM Control Register must be turned off. Figure 9-10 shows the timing of a VDRAM transfer cycle. Note that the ID bus is not forced to high impedance. This cycle differs from a normal DRAM cycle because the signal TR/OE is asserted with different timing. # 口 # 10 # PERIPHERAL INTERFACE ADAPTER This chapter describes the peripheral interface (PIA) adapter on the Am29200 and Am29205 microcontrollers. Information is provided on the programmable registers, initialization, and PIA accesses, including timing. #### 10.1 OVERVIEW PIA space on the microcontroller is divided into regions, each of which can be directly attached to an off-chip peripheral device. The microcontroller's dedicated PIA chip select signals will assert a peripheral device's chip select input pin when the associated PIA region on the microcontroller is read or written. With six PIA chip select signals, PIACS5—PIACS0, the Am29200 microcontroller permits direct attachment of up to six external peripheral devices, each with its own 24-bit address space, for a maximum size of 16 Mbytes per PIA region. With two PIA chip select signals and a smaller address bus, the Am29205 microcontroller supports up to two peripheral devices, each with its own 22-bit memory space, for a maximum size of 4 Mbytes per PIA region. The 16-bit ID bus of the Am29205 microcontroller limits PIA support to 8- or 16-bit peripherals. The PIACS5-PIACS2 signals are not supported on the Am29205 microcontroller. #### 10.2 PROGRAMMABLE REGISTERS # 10.2.1 PIA Control Register 0/1 (PICT0/1, Address 80000020/24) The PIA Control Registers (Figure 10-1 and Figure 10-2) control the access to PIA Regions 0 through 5 on the Am29200 microcontroller. The PIA Control Register 1 is not available on the Am29205 microcontroller, since that product does not support PIA Regions 2, 3, 4, or 5. Figure 10-1 PIA Control Register 0 (PICTO, Address 80000020) Figure 10-2 PIA Control Register 1 (PICT1, Address 80000024) (Am29200 Microcontroller) = Reserved on Am29205 microcontroller Bit 31: Input/Output Extend, Region 0 (IOEXT0)—If this bit is one, the end of a PIA access is extended by one cycle after PIAOE is deasserted or by two cycles after PIAWE is deasserted. This provides one additional cycle of output disable time or data hold time for reads and writes, respectively. #### Bits 30-29: Reserved Bits 28–24: Input/Output Wait States, Region 0 (IOWAIT0)—This field specifies the number of wait states taken by an access to PIA Region 0. An I/O read cycle takes at least three cycles (two wait states), and an I/O write cycle takes at least four cycles (three wait states). If the IOWAIT0 field specifies an insufficient number of wait states for an access (for example, IOWAIT0 = 00010b for a write), the processor takes the required minimum number of wait states instead of the specified number. Other bits perform similar functions to IOEXT0 and IOWAIT0 for PIA Regions 1 through 5. Bits 15–0: Reserved. These bits are reserved on the Am29205 microcontroller and should be written with 0s to ensure compatibility. #### 10.2.2 Initialization The configuration of PIA regions, if present, must be set by software before PIA accesses are performed. Peripherals may be accessed using default parameters set by software to determine the presence and/or configuration of the peripherals. #### 10.3 PIA ACCESSES PIA accesses are performed as a result of load and store instructions having an address within the range of the specific PIA regions supported by the microcontroller. The Am29200 microcontroller supports PIA Regions 5 through 0; the Am29205 microcontroller supports PIA Regions 1 through 0. The PIA region number determines which PIACSx signal is asserted during the access. PIACS0 is asserted for an access to PIA Region 0, and so on. The Am29200 microcontroller supports PIA accesses within the address ranges of PIA Region 0 (addresses 90000000h–90FFFFFFh) through PIA Region 5 (addresses 95000000h–95FFFFFFh). The data width of the load or store (selected by the OPT bits of the instruction) determines the width of the access. An 8-bit device must be attached to ID7–ID0 on the Am29200 microcontroller and a 16-bit device must be attached to ID15–ID0. LOADM and STOREM instructions (possible only for 32-bit accesses) are performed as a series of simple loads and stores. The Am29205 microcontroller supports PIA accesses within the address ranges of PIA Region 0 (addresses 90000000h–90FFFFFFh) through PIA Region 1 (addresses 91000000h–91FFFFFFh). Since the instruction/data bus on the Am29205 microcontroller is 16 bits wide, only 8- or 16-bit peripherals are supported. They are attached to ID23–ID16 and ID31–ID16, respectively. Only byte or half-word accesses (specified by the OPT bits in the load and store instructions) are allowed on these peripherals. Performing a 32-bit data access on these peripherals may result in unpredictable data. When a byte access is made to the PIA region on either microcontroller, the two least significant bits of the address must be 11. When a half-word access is made, the two least significant bits must be 10. Instruction fetching from a PIA region is not supported. # 10.3.1 Normal Access Timing Figure 10-3 shows the timing of a PIA read cycle. The address is driven in the first cycle, the PIACSx signal is asserted in the second cycle to allow for address setup, and the PIAOE signal is asserted in the third cycle to allow for chip select setup. The data must be valid after the number of cycles specified by IOWAITx+1. After sampling the data, the microcontroller deasserts PIACSx and PIAOE. The interface operates such that the processor allows at least one cycle before it drives the instruction/data bus (ID31–ID0 on the Am29200 microcontroller and ID 31–16 on the Am29205 microcontroller) for a new access (though a new address may be driven on the address bus immediately), providing one cycle for the peripheral to disable its drivers. If this cycle is insufficient, setting the IOEXTx bit for the region causes the processor to insert an additional cycle after the read before starting a new access. Figure 10-4 shows timing of a PIA write cycle. The PIAOE signal is not asserted. Instead, the processor drives data in the second cycle and asserts the PIAWE signal in the third cycle to allow for address, data, and chip select setup. The PIAWE signal is deasserted one cycle before the final cycle to provide data hold time for the write. If one cycle of hold time is insufficient, setting the IOEXTx bit for the region causes the processor to insert an additional cycle of data hold time. ## 10.3.2 Use of WAIT to Extend I/O Cycles The WAIT signal is used to extend the number of wait states beyond the number determined by the IOWAITx field. WAIT can be asserted during a read at any time up until two cycles before PIAOE is deasserted, and can be asserted during a write at any time up until two cycles before PIAWE is deasserted. In response to WAIT, the processor extends the access until WAIT is deasserted. If WAIT is asserted within the appropriate amount of time, a normal read access ends on the cycle after WAIT is deasserted (Figure 10-5), and a normal write access ends on the second cycle after WAIT is deasserted, to provide data hold time (Figure 10-6). If IOEXTx=1, the processor waits one more cycle after a read access to begin a new access, and inserts one more cycle of data hold time after a write access. Figure 10-6 Extending a PIA Write Cycle with WAIT # 11 # **DMA CONTROLLER** This chapter describes the on-chip DMA controller. Programmable registers and initialization are detailed, followed by a discussion of DMA transfers, queuing, and random DMA by external devices. The DMA controller supports three types of DMA transfers: internal, external, and direct transfers. #### 11.1 OVERVIEW The on-chip DMA controller provides a means to transfer data between the DRAM and internal or external peripherals. Each supported DMA channel on the Am29200 and Am29205 microcontrollers is configurable for width, direction, address increment or decrement, external request type, and external peripheral wait states. Internal DMA transfers can be requested by the parallel port, serial port, and video interface. Each of these internal peripherals has a field in its control register for specifying which of the two DMA channels is to be used for the transfer. External DMA transfers are requested by off-chip peripherals. Direct DMA transfers transfer data between an external device and DRAM using an address supplied by the device. The GREQ and GACK signals are used to perform direct DMA. Direct DMA is not supported on the Am29205 microcontroller. The Am29200 microcontroller has two DMA channels, DMA Channel 0 and DMA Channel 1, each capable of performing either internal or external DMA transfers. One of these channels, DMA Channel 0, supports queued transfers. Using the GREQ and GACK signals, the Am29200 microcontroller also supports direct DRAM and ROM access by an external device such as an external DMA controller. The Am29205 microcontroller has one externally controlled DMA channel, DMA Channel 1, and two internally controlled channels, DMA Channels 0 and 1, available for use by the internal peripherals only. 32-bit DMA transfers between internal peripherals and DRAM are supported. However, DMA transfers between external peripherals and DRAM are limited to 8- or 16- bit data accesses. The DREQ0, DACK0, GREQ, GACK, and TDMA signals are not supported on the Am29205 microcontroller. #### 11.2 PROGRAMMABLE REGISTERS #### 11.2.1 DMA0 Control Register (DMCT0, Address 80000030) The DMA0 Control Register (Figure 11-1) controls DMA Channel 0 on the Am29200 and Am29205 microcontrollers. DMA Channel 0 on the Am29205 microcontroller is available for transfers between internal peripherals and DRAM only; external transfers are not supported. Figure 11-1 DMA0 Control Register Bit 31: DMA Extend (DMAEXT), Am29200 microcontroller—The DMAEXT bit serves a function very similar to the IOEXTx bits in the PIA Control registers. This bit is set to provide an additional cycle of output disable time for a read or an additional cycle of data hold time for a write. This bit is reserved on the Am29205 microcontroller. #### Bits 30-29: Reserved Bits 28–24: DMA Wait States (DMAWAIT), Am29200 microcontroller—This field specifies the number of wait states taken by an external access by DMA Channel 0. An external DMA read cycle takes at least three cycles (two wait states) and an external DMA write cycle takes at least four cycles (three wait states). If the DMAWAIT field specifies an insufficient number of wait states for an access (for example, DMAWAIT = 00010b for a write), the processor takes the required minimum number of wait states instead of the specified number. This field is reserved on the Am29205 microcontroller. Bits 23–22: Data Width (DW)—This field indicates the width of the data transferred by the DMA channel, as follows: | DW Value | DMA Transfer Width | |----------|------------------------------------------------------------------------------------------------------------------------------| | 00 | 32 bits (External and internal transfers on Am29200 microcontroller) 32 bits (Internal transfers on Am29205 microcontroller) | | 01 | 8 bits | | 10 | 16 bits | | 11 | 32 bits, address unchanged (Reserved on Am29205 microcontroller) | On the Am29200 microcontroller, the value DW=11 is used to repeatedly transfer a fixed pattern from a single DRAM location to a peripheral. For example, it can be used to transfer to a blank area of a printed page without requiring that a memory buffer be allocated for the blank area. Bits 21–20: DMA Request Mode (DRM), Am29200 microcontroller—This field indicates how external DMA requests are signaled by DREQ0, as follows: | DRM Value | DREQ0 Request | |-----------|--------------------------------------------------| | 00 | Active Low | | 01 | Active High | | 10 | High-to-Low transition | | 11 | High-to-Low transition<br>Low-to-High transition | The DRM field is set to 00 by a processor reset. See Section 11.3.6 for information on clearing latched DMA requests. This field is reserved on the Am29205 microcontroller. **Bit 19: Assert Chip Select (ACS), Am29200 microcontroller**—This bit controls whether DMA Channel 0 asserts PIACS0 during an external peripheral access. If the ACS bit is 1, the DMA channel asserts PIACS0; if the ACS bit is 0, the DMA channel does not assert PIACS0. This bit is reserved on the Am29205 microcontroller. #### Bits 18-10: Reserved **Bit 9: Transfer Up/Down (UD)**—This bit controls the addressing of memory for the series of DMA transfers. If the UD bit is 1, the DMA address (in the DMA0 Address Register) is incremented after each transfer. If the UD bit is 0, the DMA address is decremented after each transfer. The amount by which the address is incremented or decremented is determined by the width of the transfer, as follows: | DW Value | Address Incr/Decr | | |--------------|--------------------------------------------|--| | 00 (32 bits) | +/-4 | | | 01 (8 bits) | +/-1 | | | 10 (16 bits) | +/-2 | | | 11 (32 bits) | +/-0 (Reserved on Am29205 microcontroller) | | **Bit 8: Read/Write (RW)**—This bit controls whether the DMA transfer is to or from the DRAM. If the RW bit is 1, the DMA channel transfers data from the DRAM to the peripheral. If the RW bit is 0, the DMA channel transfers data from the peripheral to the DRAM. **Bit 7: Enable (EN)**—This bit enables the DMA channel to perform transfers. A 1 enables transfers, and a 0 disables transfers. **Bit 6: TDMA Terminate Enable (TTE), Am29200 microcontroller**—This bit, when 1, causes the DMA channel to sample the TDMA signal during an external DMA transfer and to terminate the transfer if TDMA is asserted. TDMA does not apply to an internal transfer. If this bit is 0, the TDMA signal is ignored. This bit is reserved on the Am29205 microcontroller. **Bit 5: Count Terminate Enable (CTE)**—This bit, when 1, causes the DMA channel to terminate the transfer when the DMACNT field of the DMA Count Register decrements past zero. If this bit is 0, the DMA transfer does not terminate, though the DMA channel still decrements the count after every transfer. **Bit 4: Queue Enable (QEN)**—This bit, when 1, enables the DMA queuing feature (which is implemented only on DMA Channel 0). DMA queuing allows the DMA0 Address Register and DMA0 Count Register to be reloaded automatically at the end of a DMA transfer from the DMA0 Address Tail Register and the DMA0 Count Tail Register, respectively. Queuing permits a second transfer to start immediately after a first transfer has terminated, greatly reducing the response-time requirement for software to set up and start the second transfer. When this bit is 0, DMA queuing is disabled, and the DMA0 Address Register and DMA0 Count Register are set directly to initiate a transfer. #### Bits 3-2: Reserved **Bit 1: TDMA Terminate Interrupt (TTI), Am29200 microcontroller**—The TTI bit is used to report that the DMA channel has generated an interrupt because of TDMA termination. If the TTE bit is one and the TDMA signal is asserted during an external DMA transfer, the TTI bit is set and a processor interrupt occurs. This bit is reserved on the Am29205 microcontroller. **Bit 0: Count Terminate Interrupt (CTI)**—The CTI bit is used to report that the DMA channel has generated an interrupt because of count termination. If the CTE bit is one and the DMACNT field decrements past zero, the CTI bit is set and a processor interrupt occurs. # 11.2.2 DMA0 Address Register (DMAD0, Address 80000034) The DMA0 Address Register (Figure 11-2) contains the addresses for a transfer by DMA Channel 0. Figure 11-2 DMA0 Address Register = Reserved on Am29205 microcontroller Bits 31–24: Peripheral Address (PERADDR), Am29200 microcontroller—This field specifies eight bits that are driven on A7–A0 during an external peripheral access by the DMA channel. A23–A8 are driven Low during the transfer. The peripheral address remains unchanged with each access. This field is reserved on the Am29205 microcontroller, since external DMA transfers are not supported on Channel 0. Bits 23—0: DRAM Address (DRAMADDR)—This field contains the DRAM address for the next DMA transfer to or from the DRAM. The DRAMADDR field is incremented or decremented (based on the UD bit) by an amount determined by the width of the DMA transfer. The increment or decrement amount is 1 for a byte transfer, 2 for a halfword transfer, and 4 for a word transfer. (Word transfer is not supported on the Am29205 microcontroller.) To support repeated transfers from the same word on the Am29200 microcontroller, the address can be left unchanged. The DRAMADDR field wraps from the value 000000h to FFFFFFh when decremented and from FFFFFFh to 000000h when incremented. # 11.2.3 DMA0 Address Tail Register (TAD0, Address 80000070) This write-only register (Figure 11-3) is the tail of the DMA Channel 0 address queue, and is used to write the address of a queued transfer when the QEN bit is 1. Figure 11-3 DMA0 Address Tail Register Bits 31-24: Reserved Bits 23–0: DRAM Address (DRAMADDR)—This field is written with the beginning DRAM address for a queued DMA transfer, if queuing is enabled. # 11.2.4 DMA0 Count Register (DMCN0, Address 80000038) The DMA0 Count Register (Figure 11-4) specifies the number of transfers remaining to be performed by DMA Channel 0. Figure 11-4 DMA0 Count Register #### Bits 31-24: Reserved Bit 23–0: DMA Count (DMACNT)—This field normally specifies the number of transfers remaining to be performed on the DMA channel. The count is zero-based: a count of zero indicates one transfer, a count of one indicates two transfers, and so on. The DMA channel decrements the DMACNT field after every transfer. If the CTE bit is 1, the DMA channel generates an interrupt when the DMACNT field is decremented past zero. However, if the CTE bit is not 1, the DMACNT field is still decremented after every transfer and can be used to determine how many transfers have been performed when the DMA channel terminates because of the TDMA signal. # 11.2.5 DMA0 Count Tail Register (TCN0, Address 8000003C) This write-only register (Figure 11-5) is the tail of the DMA Channel 0 count queue, and is used to write the transfer count of a queued transfer when the QEN bit is 1. Figure 11-5 DMA0 Count Tail Register #### Bits 31-24: Reserved Bits 23–0: DMA Count (DMACNT)—This field is written with the zero-based number of transfers to be performed by a queued DMA transfer, if queuing is enabled. # 11.2.6 DMA1 Control Register (DMCT1, Address 80000040) The DMA1 Control Register controls DMA Channel 1. Queuing is not implemented on DMA Channel 1. Figure 11-6 DMA1 Control Register **Bit 31: DMA Extend (DMAEXT)**—The DMAEXT bit serves a function very similar to the IOEXTx bits in the PIA Control registers. This bit is set to provide an additional cycle of output disable time for a read or an additional cycle of data hold time for a write. #### Bits 30-29: Reserved Bits 28–24: DMA Wait States (DMAWAIT)—This field specifies the number of wait states taken by an external access by DMA Channel 1. An external DMA read cycle takes at least three cycles (two wait states) and an external DMA write cycle takes at least four cycles (three wait states). If the DMAWAIT field specifies an insufficient number of wait states for an access (for example, DMAWAIT = 00010b for a write), the processor takes the required minimum number of wait states instead of the specified number. Bits 23–22: Data Width (DW)—This field indicates the width of the data transferred by the DMA channel, as follows: | DW Value | DMA Transfer Width | |----------|------------------------------------------------------------------------------------------------------------------------------| | 00 | 32 bits (External and internal transfers on Am29200 microcontroller) 32 bits (Internal transfers on Am29205 microcontroller) | | 01 | 8 bits | | 10 | 16 bits | | 11 | 32 bits, address unchanged (Reserved on Am29205 microcontroller) | On the Am29200 microcontroller, the value DW=11 is used to repeatedly transfer a fixed pattern from a single DRAM location to a peripheral. For example, it can be used to transfer to a blank area of a printed page without requiring that a memory buffer be allocated for the blank area. Bits 21–20: DMA Request Mode (DRM)—This field indicates how external DMA requests are signaled by DREQ1, as follows: | DRM Value | DREQ1 Request | | |-----------|------------------------|--| | 00 | Active Low | | | 01 | Active High | | | 10 | High-to-Low transition | | | 11 | Low-to-High transition | | The DRM field is set to 00 by a processor reset. See Section 11.3.6 for information on clearing latched DMA requests. **Bit 19: Assert Chip Select (ACS)**—This bit controls whether DMA Channel 1 asserts PIACS1 during an external peripheral access. If the ACS bit is 1, the DMA channel asserts PIACS1; if the ACS bit is 0, the DMA channel does not assert PIACS1. #### Bits 18-10: Reserved **Bit 9: Transfer Up/Down (UD)**—This bit controls the addressing of memory for the series of DMA transfers. If the UD bit is 1, the DMA address (in the DMA1 Address Register) is incremented after each transfer. If the UD bit is 0, the DMA address is decremented after each transfer. The amount by which the address is incremented or decremented is determined by the width of the transfer, as follows: | DW Value | Address Incr/Decr | | |--------------|--------------------------------------------|--| | 00 (32 bits) | +/-4 | | | 01 (8 bits) | +/-1 | | | 10 (16 bits) | +/-2 | | | 11 (32 bits) | +/-0 (Reserved on Am29205 microcontroller) | | **Bit 8: Read/Write (RW)**—This bit controls whether the DMA transfer is to or from the DRAM. If the RW bit is 1, the DMA channel transfers data from the DRAM to the peripheral. If the RW bit is 0, the DMA channel transfers data from the peripheral to the DRAM. **Bit 7: Enable (EN)**—This bit enables the DMA channel to perform transfers. A 1 enables transfers, and a 0 disables transfers. **Bit 6: TDMA Terminate Enable (TTE), Am29200 microcontroller**—This bit, when 1, causes the DMA channel to sample the TDMA signal during an external DMA transfer and to terminate the transfer if TDMA is asserted. TDMA does not apply to an internal transfer. If this bit is 0, the TDMA signal is ignored. This bit is reserved on the Am29205 microcontroller. **Bit 5: Count Terminate Enable (CTE)**—This bit, when 1, causes the DMA channel to terminate the transfer when the DMACNT field of the DMA Count Register decrements past zero. If this bit is 0, the CTE field does not terminate the DMA transfer, though the DMA channel still decrements the count after every transfer. #### Bits 4-2: Reserved **Bit 1: TDMA Terminate Interrupt (TTI), Am29200 microcontroller**—The TTI bit is used to report that the DMA channel has generated an interrupt because of TDMA termination. If the TTE bit is one and the TDMA signal is asserted during an external DMA transfer, the TTI bit is set and a processor interrupt occurs. This bit is reserved on the Am29205 microcontroller. **Bit 0: Count Terminate Interrupt (CTI)**—The CTI bit is used to report that the DMA channel has generated an interrupt because of count termination. If the CTE bit is one and the DMACNT field decrements past zero, the CTI bit is set and a processor interrupt occurs. # 11.2.7 DMA1 Address Register (DMAD1, Address 80000044) The DMA1 Address Register contains the addresses for a transfer by DMA Channel 1. It is identical in layout and definition to the DMA0 Address Register, except that the PERADDR field is not reserved on the Am29205 microcontroller for DMA Channel 1. # 11.2.8 DMA1 Count Register (DMCN1, Address 80000048) The DMA1 Count Register specifies the number of transfers remaining to be performed by DMA Channel 1. It is identical in layout and definition to the DMA0 Count Register. #### 11.2.9 Initialization The EN bits of the DMA0 and DMA1 Control registers are reset to 0 by a processor reset. The DRM fields of both registers are also reset to 0 (see Section 11.3.6). The DMA channels must be configured by software before they are used. #### 11.3 DMA TRANSFERS A DMA transfer is performed as a result of a DMA request. The DMA request can be generated either by an internal peripheral (parallel port, serial port, or video interface) or by an external device using DREQ1–DREQ0 on the Am29200 microcontroller and DREQ1 on the Am29205 microcontroller. # 11.3.1 Specifying the Direction of a DMA Transfer The direction of a DMA transfer is determined by the RW bit of the DMA Control Register. If the RW bit is 0, the DMA channel transfers data from the peripheral to the DRAM. The DMA channel first performs an access to read the data from the peripheral and then performs a DRAM write to store the data into the DRAM. Both accesses occur without interruption: there is no other intervening access. If the RW bit is 1, the DMA channel transfers data from the DRAM to the peripheral. The DMA channel first performs a DRAM read to access the data and then performs an internal or external access to write the data to the peripheral. Both accesses occur without interruption: there is no other intervening access. #### 11.3.2 Programming Internal DMA Transfers Programming an internal DMA transfer using the parallel port, serial port, or video interface involves coding the DMA controller registers along with the appropriate internal peripheral's control register, as listed in the following table. | Internal Peripheral | Control Register | DMA Enable Field | | |---------------------|--------------------------------|------------------|--| | Parallel Port | Parallel Port Control Register | MODE | | | Serial Port | Serial Port Control Register | TMODE, RMODE | | | Video Interface | Video Control Register | MODE | | Setting up an internal DMA transfer involves disabling the peripheral and the DMA controller, configuring both, and enabling both. Note that the internal peripheral and the DMA controller must be disabled before each is configured; after configuration, the peripheral must be enabled before the DMA controller is enabled. Otherwise, the steps listed below can be performed in any order. The following procedure describes how to program a simple internal transfer, using the video interface as an example. - Disable the parallel port, serial port, or video interface with the appropriate field in the peripheral's control register. For example, to disable the video interface, set the MODE field in the Video Control Register to 00. - 2. Disable the DMA channel by writing 0 to the EN bit of the DMAx Control Register. - Program the peripheral's control register with the appropriate values. For example, program the Video Control Register with values for CLKI, SDIR, VIDI, LSI, PSI, PSIO, etc., as required. - 4. Program other peripheral registers as needed. For example, program the Side Margin Register of the video interface to set the required page margins. - 5. Program the DMAx Control Register, specifying the address increment or decrement, transfer direction, and interrupt enables. - Program the DMAx Address Register by specifying the DRAM starting address to be read or written. - 7. Program the DMAx Count Register. - Enable peripheral DMA requests with the appropriate field in the peripheral's control register. For example, set the MODE field of the Video Control Register to 10 to enable DMA Channel 0. - 9. Enable the DMA channel by writing a 1 to the EN bit of the DMAx Control Register. # 11.3.3 Programming External DMA Transfers Programming an external DMA transfer is accomplished by coding the DMA controller registers. Note that the DMA controller must be disabled before being configured; otherwise, the steps listed below can be performed in any order. The following procedure describes how to program a simple external transfer. - 1. Disable the DMA channel by writing a 0 to the EN bit of the DMAx Control Register. - 2. Program the DMAx Control Register, specifying the address increment or decrement, transfer direction, interrupt enables, wait states, etc., for external peripherals. - 3. Program the DMAx Address Register by specifying the external peripheral address and the DRAM starting address to be read or written. - 4. Program the DMAx Count Register. - 5. Enable the DMA channel by writing a 1 to the EN bit of the DMAx Control Register. # 11.3.4 Generating External DMA Requests The generation of DMA requests by the DREQ1–DREQ0 signals is controlled by the DRM field of the DMA control register. The DMA requests can be programmed individually to be edge- or level-sensitive for either polarity of edge or level. If the DMA request is edge-sensitive, the DMA request signal must remain at the appropriate level for at least four cycles after the active edge to insure that the DMA channel detects the request. An active edge that occurs during an in-progress transfer (that is, while DACKx is asserted) is ignored. The DREQx signal must be Low (rising-edge-triggered) or High (falling-edge-triggered) for four cycles before a new active edge can be recognized. If the DMA request is level-sensitive, the request may be deasserted at any time while DACKx is asserted, and must be deasserted during the cycle in which DACKx is deasserted unless it is desired to generate a subsequent DMA request. #### 11.3.5 External DMA Transfers External DMA transfers appear very much like PIA accesses, except the DMA acknowledge signals (DACK1–DACK0 on the Am29200 microcontroller and DACK1 on the Am29205 microcontroller) are asserted during the transfer as well as, optionally, PIACS1–PIACS0 on the Am29200 microcontroller and PIACS1 on the Am29205 microcontroller. The address bus is driven with an address derived from the DMA Address Register. Bits 23–8 of the address are all 0s, and bits 7–0 are driven with the PERADDR field. It is possible to use the DACKx signals as chip selects to the DMA peripherals. The signals PIAOE, PIAWE, and WAIT are used as they are during a PIA access. The DMAWAIT field is used to determine the number of wait states, much as the IOWAITx field is used during a PIA access. On the Am29200 microcontroller, if the DRAM is 16 bits wide, a 32-bit DMA DRAM access appears as two 16-bit accesses on ID31–ID16. If the peripheral is 8 or 16 bits wide, a DMA peripheral access appears as a single access on ID7–ID0 or ID15–ID0, respectively. The peripheral must have the same width as the transfer. On the Am29205 microcontroller, DMA transfers between external peripherals and the DRAM are limited to 8- or 16- bit data accesses. For 8- or 16- bit wide peripherals, a DMA access appears on ID23–16 or ID31–16, respectively. The peripheral must have the same width as the transfer. Figure 11-7 shows the timing of a DMA read cycle (performed when the RW bit is 0). The DACKx signal (and, optionally, the PIACSx signal) is asserted in the second cycle, and the PIAOE signal is asserted in the third cycle. The data must be valid after the number of cycles determined by DMAWAIT. If DMAEXT=1, the processor waits one more cycle after the read access to begin a new access. The peripheral can use WAIT to extend the access. Figure 11-8 shows timing of a DMA write cycle (performed when the RW bit is 1). The PIAOE signal is not asserted. Instead, the processor drives data in the second cycle and asserts the PIAWE signal in the third cycle. The PIAWE signal is deasserted one cycle before the final cycle (the number of cycles is determined by DMAWAIT) to provide data hold time. If DMAEXT=1, the processor inserts one more cycle of data hold time after a write access. The peripheral can use WAIT to extend the access. On the Am29200 microcontroller, if the DMA channel's TTE bit is 1, an external peripheral can assert TDMA at any time while DACKx is asserted to terminate the transfer after the current access; in this case, the current access is completed as usual. As with PIA accesses, the peripheral can use $\overline{WAIT}$ to extend the access. The DMA channel continues to perform transfers until the count expires or the TDMA input is asserted (depending on the CTE and TTE bits). When the transfer terminates, the EN bit is reset unless there is an active queued transfer, as explained in Section 11.4. # 11.3.6 Latching External DMA Requests The DMA controller is designed to latch an active transition of the external DREQ line, even if such a transition occurs when the DMA is disabled. This latching occurs for both edge- and level-triggered modes. The latched transition will then be recognized when the DMA channel is enabled, assuming the DRM field has not changed. This latching avoids a problem when using edge-sensitive DMA requests. There is the potential to lose a request between the time a transfer terminates on the count going to zero (which automatically disables the channel, blocking further requests) and the time the DMA interrupt handler restarts the channel. Any programming of the DMA Control Register that changes the value of the DRM field from its previously programmed value will clear any latched request. Thus, to re-enable a DMA channel and also clear any latched request, the respective DMA Control Register must be written twice. With the first write, the DMA should remain disabled, and a value different from the desired DRM value should be set in the DRM field. On the second write, the DMA should be enabled, and the desired value should be set in the DRM field. Upon reset, the DRM field is set to 00 (active Low). Therefore, if the DMA is later enabled with DRM still at 00, any active Low transition of DREQ since reset will have been latched and will be considered an active request when the DMA is enabled. To clear any such latched request, as noted above, the DMA Control Register should be written twice, once with DMA disabled and DRM set to 11 (or 10 or 01), and finally with DMA enabled and DRM set to 00. # 11.4 DMA QUEUING (DMA CHANNEL 0) The address and count registers for DMA Channel 0 each consist of a two-entry queue, with each entry of the queue separately addressable for loading a new transfer. The DMA0 Address Register and DMA0 Count Register are at the head of the queue. The DMA0 Address Tail Register and DMA0 Count Tail Register are at the tail of the queue and are write-only registers. A DMA transfer queued behind an active transfer can start as soon as the first transfer is complete. This reduces the response-time requirement for software to load a new transfer: software has the entire transfer time of the second transfer to load the next transfer at the tail of the queue. DMA queuing is enabled by writing the appropriate address and count values at the head of the queue, then setting the DMA0 Control Register appropriately, with EN=1, QEN=0, and CTE/TTE=1. A transfer is loaded into the tail of the queue by first loading the DMA0 Count Tail Register, then loading the DMA0 Address Tail Register (note that the PERADDR field cannot be changed by a queued transfer). Writing the tail address causes the QEN bit to be set. Whenever a DMA transfer terminates at the head of the queue and the QEN bit is 1, the transfer at the tail of the queue advances to the head of the queue and begins immediately. When the queued transfer advances to the head of the queue, the QEN bit is reset, the EN bit remains set, and the CTI/TTI bit is set (note that the automatic queue advance makes it impossible to inspect the count of the former transfer after a TTI interrupt in order to discover how many transfers were performed by that transfer). The CTI/TTI interrupt handler need not clear the CTI/TTI bit: in fact, it is unsafe to write the DMA0 Control Register at this point because the termination of the current transfer (the transfer that was formerly queued) may be lost. The interrupt handler need only place the count and address of the next transfer at the tail of the queue (again, the tail address should be loaded after the count, because writing the tail address sets the QEN bit and enables the queue to advance). The CTI/TTI bit is automatically reset when the tail address is written. Queue underflow occurs if the transfer at the head of the queue terminates before the next transfer is loaded at the tail of the queue. Software can detect that underflow has occurred by examining the EN bit after setting up the next transfer. If the EN bit is 0, underflow has occurred, because a successful start of a queued transfer causes the EN bit to remain set when the termination interrupt is generated. # 11.5 RANDOM DIRECT MEMORY ACCESS BY EXTERNAL DEVICES (Am29200 MICROCONTROLLER) The Am29200 microcontroller is designed primarily for single-controller applications, and it has no provision for other bus masters to control the address and data buses in the traditional sense. However, the DMA controller does provide a mechanism for an external device to access the ROM or DRAM using addresses provided by the device rather than by a DMA channel. External devices use the GREQ and GACK signals to perform a random memory access via the Am29200 microcontroller's DRAM or ROM controller. Figure 11-9 shows the timing for a memory read using GREQ and GACK. The external device indicates that it wants to perform a memory access by asserting GREQ. As soon as the processor can perform the access, it asserts GACK. The external device can place the memory address on ID31–ID0 during any cycle following the assertion of GACK. The device indicates that the address is valid by deasserting GREQ. The processor uses this address to determine whether the access is to ROM or DRAM (according to the normal address allocation) and performs the required access. Figure 11-9 shows an access to DRAM, as an example. The processor deasserts GACK at the beginning of the cycle in which the data is valid on ID31–ID0. The deassertion of GACK completes the access. Figure 11-10 illustrates how the GREQ/GACK protocol can be used to perform a memory write. In this case, the external device supplies the address upon the deassertion of GREQ and then provides the write data on ID31–ID0. The processor does not distinguish between a read and a write, allowing the ID Bus to be available to the device for the transfer of both address and data. The distinction between reads and writes must be made by external logic (which, for example, forms the signal wenew in Figure 11-10) in a way that meets the memory timing requirements. For example, an AND gate can be GREO **GACK** Figure 11-10 External Random DRAM Write Cycle (Am29200 Microcontroller) used to form the negative OR of the processor's $\overline{\text{WE}}$ signal and the write enable from the external device. To summarize the use of GREQ and GACK: - 1. The external device asserts GREQ to request an access. - 2. Following the assertion of GACK, the device places the address on ID31–ID0 and deasserts GREQ to indicate that the address is valid. - 3. For a read, the device must be able to latch data from ID31–ID0 at the end of the cycle in which GACK is deasserted. For a write, the device must be prepared to drive data on ID31–ID0 on the second cycle following the address transfer and must hold the data valid until the cycle following the deassertion of GACK, at which time it must stop driving. The device must also supply a write enable signal that satisfies the timing requirements of the memory. In either case, the processor deasserts GACK based on the access timing of the ROM or DRAM. To further clarify the use of GREQ and GACK, Figure 11-11 shows example timing for a ROM read. Writes to the ROM space are more difficult to implement than DRAM writes because the processor always asserts the ROMOE signal. Memory accesses using GREQ and GACK are restricted to 32-bit accesses: 8- and 16-bit accesses are not supported. Zero-wait-state accesses are also not supported. Furthermore, the ROM and/or DRAM bank must be 32 bits wide. Although the GREQ/ Figure 11-11 External Random ROM Read Cycle (Am29200 Microcontroller) GACK protocol supports full 32-bit addressing, the addresses supplied must be within the range of ROM or DRAM addresses. DRAM mapping cannot be performed. During a processor reset, the GREQ input may be used by a hardware-development system to force processor outputs to the high-impedance state. To prevent driver conflicts, the system should keep GREQ in a high-impedance state during a processor reset. # 12 # PROGRAMMABLE I/O PORT This chapter discusses the programmable I/O port available on the Am29200 and Am29205 microcontrollers. Programmable registers, initialization, and operation are described. # 12.1 OVERVIEW The I/O port permits direct programmable access of up to sixteen external PIO signals, as either inputs or open-drain outputs. When used as inputs, eight of these signals, PIO15–PIO8, can be programmed to cause edge- or level-sensitive interrupts. The Am29200 microcontroller supports sixteen external PIO signals (PIO15–PIO0). The Am29205 microcontroller supports eight PIO signals, (PIO15–PIO8). #### 12.2 PROGRAMMABLE REGISTERS # 12.2.1 PIO Control Register (POCT, Address 800000D0) The PIO Control Register (Figure 12-1) controls interrupt generation and determines the polarity of PIO15–PIO0 on the Am29200 microcontroller and PIO15–PIO8 on the Am29205 microcontroller. #### Figure 12-1 PIO Control Register = Reserved on Am29205 microcontroller Bits 31–30: Interrupt Request Mode, PIO15 (IRM15)—This field enables PIO15 to generate an interrupt equivalent to a request on the processor's INTR3 input, and indicates whether PIO15 is level- or edge-sensitive in generating the interrupt. The IRM15 field controls PIO15 as follows: | IRM15 Value | PIO15 Interrupt | | |-------------|------------------------|--| | 00 | Interrupt disabled | | | 01 | Level-sensitive | | | 10 | Edge-sensitive | | | 11 | IRM15 only – see below | | The INVERT field (see below) further conditions interrupt generation. If the INVERT bit for PIO15 is 0, an interrupt, if enabled, is generated by a High level on PIO15 (level-sensitive) or on a Low-to-High transition (edge-sensitive) of PIO15. If the INVERT bit for PIO15 is 1, an interrupt, if enabled, is generated by a Low level on PIO15 (level-sensitive) or on a High-to-Low transition (edge-sensitive) of PIO15. For IRM15, the value 11 causes PIO15 to generate an edge-triggered interrupt and to also set the FBUSY bit in the Parallel Port Control Register (see Section 13.2.1), causing the PBUSY output to be asserted. This can be used to support certain system-specific features of the parallel port. Note that this value may cause a spurious setting of FBUSY during a reset, depending on the activity on PIO15 after a reset. **Bits 29–16: IRM14 through IRM8**—The IRM14–IRM8 fields enable interrupts and specify level- or edge-sensitivity for PIO14–PIO8, respectively. These fields are identical in definition to IRM15, except that the value 11 is reserved. Bits 15–0: PIO Inversion (INVERT)—This field determines how the level on each PIO signal is reflected in the PIO Input and PIO Output Registers, and how interrupts are generated. The most significant bit of the INVERT field determines the sense of PIO15, the next bit determines the sense of PIO14, and so on. A 0 in this field causes the internal and external sense of the respective PIO signal to be noninverted; a High external level is reflected as a 1 internally, and a Low is reflected as a 0 internally. A 1 in this field causes the internal and external sense of the respective PIO signal to be inverted; a High external level is reflected as a 0 internally, and a Low is reflected as a 1 internally. Bits 7–0: Reserved. These bits are reserved on the Am29205 microcontroller and should be written with 0s to ensure compatibility. # 12.2.2 PIO Input Register (PIN, Address 800000D4) The PIO Input Register (Figure 12-2) reflects the external levels of PIO15–PIO0 on the Am29200 microcontroller and PIO15–PIO8 on the Am29205 microcontroller. #### Figure 12-2 PIO Input Register = Reserved on Am29205 microcontroller #### Bits 31-16: Reserved **Bits 15–0: PIO Input (PIN)**—This field reflects the levels on each PIO signal. The most significant bit of the PIN field reflects the level on PIO15, the next bit reflects the level on PIO14, and so on. The correspondence between levels and bits in this register is controlled by the INVERT field. **Bits 7–0: Reserved.** These bits are reserved on the Am29205 microcontroller and will be read as 0s. # 12.2.3 PIO Output Register (POUT, Address 800000D8) The PIO Output Register (Figure 12-3) determines the levels driven on the PIO signals, for those signals enabled to be driven by the PIO Output Enable Register. # Figure 12-3 PIO Output Register = Reserved on Am29205 microcontroller #### Bits 31-16: Reserved Bits 15–0: PIO Output (POUT)—This field determines the levels on each PIO signal, if so enabled by the PIO Output Enable Register. The most significant bit of the POUT field determines the level on PIO15, the next bit determines the level on PIO14, and so on. The correspondence between levels and bits in this register is controlled by the INVERT field. Bits 7–0: Reserved. These bits are reserved on the Am29205 microcontroller and should be written with 0s to ensure compatibility. ## 12.2.4 PIO Output Enable Register (POEN, Address 800000DC) The PIO Output Enable Register (Figure 12-4) determines whether or not the PIO signals are driven as outputs. #### Figure 12-4 PIO Output Enable Register = Reserved on Am29205 microcontroller #### Bits 31-16: Reserved Bits 15–0: PIO Output Enable (POEN)—This field determines whether each PIO signal is driven as an output. The most significant bit of the POEN field determines whether PIO15 is driven, the next bit determines whether PIO14 is driven, and so on. A 1 in a bit position enables the respective signal to be driven according to the associated POUT and INVERT bits, and a 0 disables the signal as an output. Bits 7–0: Reserved. These bits are reserved on the Am29205 microcontroller and should be written with 0s to ensure compatibility. #### 12.2.5 Initialization During a processor reset, all bits of the PIO Output Enable Register are reset to 0, disabling all PIO signals as outputs. The I/O port must be initialized by software before the I/O port is used. #### 12.3 OPERATING THE I/O PORT The PIO signals are asynchronous to the processor. A change on any PIO signal is reflected in the PIO Input Register a maximum of four MEMCLK cycles after the change occurs. A level-sensitive interrupt occurs four cycles after the change, and an edge-sensitive interrupt occurs five cycles after the change. When driven as an output, a change to the PIO Output Register is reflected on the PIO signals a maximum of one cycle after the change occurs. All the PIO signals have additional metastable hardening, allowing them to be driven with slow-transition-time signals. The PIO Output Enable Register permits the PIO signals to be operated as open-drain outputs. This is accomplished by keeping the appropriate POUT bits constant and writing data into the POEN field, so the output is either driving Low or is disabled, depending on the data. # 13 PARALLEL PORT This chapter describes the parallel port supported on the Am29200 and Am29205 microcontrollers. Programmable registers and initialization are described, along with parallel port transfers from the host and to the host. #### 13.1 OVERVIEW The parallel port supports asynchronous bidirectional parallel data transfers. It connects a host processor to the Am29200 or Am29205 microcontroller. The parallel port supports data transfers from the host to the microcontroller or from the microcontroller to the host. Data is transferred via an 8- or 32-bit external data register. Data is transferred to and from the external data register via processor access (programmed I/O) or DMA transfers. The Am29205 microcontroller does not support full word transfers on the parallel port. ## 13.2 PROGRAMMABLE REGISTERS ## 13.2.1 Parallel Port Control Register (PPCT, Address 800000C0) The Parallel Port Control Register (Figure 13-1) controls the parallel port. = Reserved on Am29205 microcontroller #### Bit 31: Reserved **Bit 30: Full Word Transfer (FWT), Am29200 microcontroller**—This bit controls whether the parallel port generates an interrupt or DMA request every handshake or every fourth handshake. When FWT is 0, a transfer of either 8 or 32 bits occurs on every handshake. When FWT is 1, a transfer of either 8 or 32 bits occurs on every fourth handshake, reducing the demand the parallel port places on the processor. The FWT bit is reserved on the Am29205 microcontroller; it must be set to 0 to ensure proper operation. For proper transfer of data, external logic must assemble bytes from the parallel port interface into an 8- or 32-bit external latch that implements the Parallel Port Data Register. The actual size of the transfer is determined by the width of the external latch: an 8-bit latch enables 8-bit transfers, and a 32-bit latch enables full word transfers. The DMA transfer or load/store instruction that reads/writes the Parallel Port Data Register must indicate the correct data width. Full word transfers are implemented only for transfers from the host. Full word transfers are not supported on the Am29205 microcontroller. #### Bits 29-24: Reserved Bits 23–16: Transfer Delay (TDELAY)—During a transfer from the host, this field controls the duration of the assertion of PACK (and possibly PBUSY). During a transfer to the host, it controls the duration of data setup, PACK assertion, and data hold times. On transfers from the host, the TDELAY field specifies one less than the number of MEMCLK cycles in the duration interval. Setting TDELAY to 0 in this case will cause PACK to assert for one cycle. On transfers to the host, the TDELAY field specifies the number of MEMCLK cycles in the duration interval. In this case, if TDELAY is set to 0, PACK will not assert at all. Bit 15: Data Request (DRQ)—This bit is set to indicate that the parallel port is ready for data to be read from or written to the Parallel Port Data Register. If so enabled by the MODE field, this bit being 1 generates an interrupt or DMA request to read or write data. This bit is reset when the Parallel Port Data Register is read or written. The DRQ bit is read-only, allowing other bits of the Parallel Port Control Register to be set (for example, the FACK bit) without interfering with the data request. **Bit 14: Transfer Active (TRA)**—This bit is set at the beginning of a transfer on the parallel port and reset at the end of a transfer. It is read-only, so that setting other bits of the Parallel Port Control Register does not interfere with the indication of an active request. The TRA bit can be inspected by software to detect that a transfer is hung. #### Bits 13-11: Reserved **Bit 10: Data Direction (DDIR)**—This bit controls the direction of data transfer on the parallel port. If the DDIR bit is 0 (the default), data is received on the parallel port. If the DDIR bit is 1, data is transmitted on the parallel port. The MODE field must be 00 when the DDIR bit is changed. Bits 9–8: Parallel Port Mode (MODE)—This field enables the parallel port and controls the operational mode of the parallel port, as follows: | MODE Value | Effect on Parallel Port | | |------------|-----------------------------------------|--| | 00 | Disabled | | | 01 | Generate interrupt requests for service | | | 10 | Generate DMA Channel 0 requests | | | 11 | Generate DMA Channel 1 requests | | Requests for service are requests to read or write the Parallel Port Data Register. Placing the parallel port into the disabled state causes all internal state machines to be reset, forces PACK Low, and holds the parallel port in an idle state. Parallel port programmable registers are not affected when the port is disabled. Bit 7: Force Busy (FBUSY)—A 1 in this bit forces an active level on the PBUSY output. A 0 allows the PBUSY signal to operate normally. **Bit 6: Force ACK (FACK)**—A 1 in this bit forces an active level on the PACK output for one TDELAY interval. At the end of the interval, the FACK bit is reset and PACK is deasserted. Bit 5: Disable Hardware Handshake (DHH)—A 1 in this bit prevents the parallel port interface logic from controlling PACK or PBUSY. A 0 in this bit permits normal handshaking with PACK and PBUSY. FACK and FBUSY may be used by software to control PACK and PBUSY regardless of the DHH bit. #### Bits 4-3: Reserved Bit 2: BUSY Relationship to STROBE (BRS)—This bit controls the relative timing of the PBUSY and PSTROBE hardware handshaking when the parallel port is receiving data. If BRS=0, PBUSY is asserted on the Low-to-High transition (leading edge) of PSTROBE. If BRS=1, PBUSY is asserted on the High-to-Low transition (trailing edge) of PSTROBE. The parallel port does not respond to PSTROBE until PBUSY is asserted, except that the TRA bit is always set on the leading edge of PSTROBE. **Bit 1: ACK Relationship to BUSY (ARB)**—This bit controls the relative timing of the PACK and PBUSY handshaking when the parallel port is receiving data. If ARB=0, PBUSY and PACK are asserted and deasserted at the same time (except for output driver skew). Both PACK and PBUSY are asserted at either the leading or trailing edge of PSTROBE, as controlled by the BRS bit. Both are deasserted together at the end of a transfer, which is usually at the end of a TDELAY interval. If ARB=1, the PACK pulse follows the PBUSY pulse in time. PBUSY is asserted in response to an assertion of PSTROBE and is deasserted when the Parallel Port Data Register has been read and PSTROBE is Low. PACK is asserted at the same time PBUSY is deasserted and is deasserted at the end of a TDELAY interval. **Bit 0: Autofeed (AFD)**—This bit reflects the level on the PAUTOFD input. A 1 indicates PAUTOFD is active (High), and a 0 indicates PAUTOFD is inactive (Low). ## 13.2.2 Parallel Port Status Register (PPST, Address 800000C8) The Parallel Port Status Register (Figure 13-2) indicates the status of the parallel port. #### Figure 13-2 Parallel Port Status Register **Bit 31: PSTROBE Level (STB)**—This bit indicates the level on the PSTROBE signal. If PSTROBE is Low, this bit is 0; if PSTROBE is High, this bit is 1. #### Bits 30-24: Reserved **Bits 23–16: TDELAY Counter Value (TDELAYV)**—This field indicates the current value of the TDELAY counter used to time transitions of the handshaking signals. This value changes as the TDELAY interval is being timed. #### Bits 15-10: Reserved Bits 9–8: Byte Count (BCT)—When the FWT bit is 1, this field indicates the number of bytes (that is, the number of complete handshakes) received on the parallel port since the most recent data request. This information is useful for handling partial-word transfers at the end of a block transfer. Bit 7: PBUSY Level (BSY)—This bit indicates the level on the PBUSY signal. If PBUSY is Low, this bit is 0; if PBUSY is High, this bit is 1. **Bit 6: PACK Level (ACK)**—This bit indicates the level on the PACK signal. If PACK is Low, this bit is 0; if PACK is High, this bit is 1. Bits 5-0: Reserved ## 13.2.3 Parallel Port Data Register (PPDT, Address 800000C4) The Parallel Port Data Register (Figure 13-3) is used to read from and write data to the parallel port. This register is not implemented directly on the processor, but rather is implemented by an external data latch connected to the parallel port interface cable. The processor converts an access of this register into an external access of the data latch. This access is similar to a PIA access, except the timing is fixed (see Section 13.3) and the access uses the signals POE and PWE to read and write the latch. ## Bits 7-0 (8-bit transfers) or Bits 31–0 (32-bit transfers): Parallel Port Data (PDATA), Am29200 microcontroller—This field contains the data being transferred to/from the microcontroller and the host over the parallel port. For transfers from the host, the width of this field is determined by the width of the external latch that implements the Parallel Port Data Register. However, the instruction or DMA channel that reads the parallel port must also specify the correct data width to properly read the Parallel Port Data Register. Full word transfers are not supported on the Am29205 microcontroller. #### 13.2.4 Initialization During a processor reset, the MODE field of the Parallel Port Control Register is reset to 00 (disabled) and the FBUSY bit is set to 1, forcing PBUSY Low (busy). The parallel port must be configured by software before the parallel port is enabled. Writing the value 00 into the MODE field resets the parallel port, forces PACK Low, and forces PBUSY High (unless FBUSY is set). The I/O port signal PIO15 may be used by the host to signal a change in the configuration of the parallel port. If the IRM15 field of the PIO Control Register has the value 11 (see Section 12.2.1), PIO15 causes an edge-triggered interrupt and causes the FBUSY bit to be set. Setting the FBUSY bit causes the parallel port to appear busy (PBUSY=0) to the host while the port's configuration is changed. The FBUSY bit must be reset by software (if required) once configuration is complete. ## 13.3 PARALLEL PORT TRANSFERS The parallel port does not attach directly to the microcontroller, but is attached to the interface cable via buffers. Data must be latched in the interface using a three-state latch such as a 74LS374. The handshaking signals, PSTROBE, PAUTOFD, PACK, and PBUSY, are connected to the microcontroller via simple interface circuits. The inputs PSTROBE and PAUTOFD should be connected to the processor via a Schmitt-trigger inverter such as a 74HCT14, and the outputs PACK and PBUSY should be connected to the host via an open-collector inverter such as a 7406. The hardware handshaking described in this section can be disabled by setting the DHH bit. If the DHH bit is 1, handshaking can be accomplished by software using the FACK and FBUSY bits. #### 13.3.1 Transfers from the Host Figure 13-4 shows the state-transition diagram for transferring data from the host to the microcontroller over the parallel port. Figure 13-5 through Figure 13-8 show the timing diagrams for these transfers. The timing diagrams differ in the settings of the BRS and ARB bits. The timing diagrams also show the signals as they appear at the processor interface, and do not reflect the inversions in the buffers to the parallel-port connector. The host begins the transfer by placing data on the interface and asserting the PSTROBE signal. The data is latched in the interface on the rising edge of PSTROBE if BRS=0, and can be latched by either edge if BRS=1. The TRA bit is set on the leading edge of PSTROBE. The microcontroller asserts PBUSY within three MEMCLK cycles after the leading edge of PSTROBE (BRS=0) or within three MEMCLK cycles after the trailing edge of PSTROBE (BRS=1). The microcontroller asserts PACK at the same time as PBUSY if ARB=0. The parallel port then generates either an interrupt request or a DMA request, as controlled by the MODE field, so the data can be read. If ARB=0, both PBUSY and PACK are deasserted once the TDELAY interval has expired, the Parallel Port Data Register (PDR) has been read, and the host has deasserted PSTROBE. If ARB=1, PBUSY is deasserted and PACK is asserted when the PDR has been read and PSTROBE is Low. PACK remains active until the TDELAY interval has expired. In any case, the TRA bit is reset when PACK is deasserted. The PDR is mapped to the external buffer register. Figure 13-9 shows the timing of the external access. This external access is treated as either a DMA access or a processor PIA access for the purpose of prioritization with other accesses. The PAUTOFD signal is used for software control during a transfer from the host. Software can detect the level on PAUTOFD by reading the AFD bit in the Parallel Port Control Register. #### 13.3.2 Transfers to the Host Figure 13-10 shows the state transition diagram for transferring data from the microcontroller to the host over the parallel port. Figure 13-11 shows the timing for this transfer. Transfers to the host are enabled by the host, using a system-dependent software protocol. This type of transfer is enabled in the processor by setting the DDIR bit in the Parallel Port Control Register. Setting the DDIR bit forces the PBUSY output active, Figure 13-4 State Transitions for Transfers from the Host preventing the host from transferring data to the microcontroller. The MODE bit must be 00 when the DDIR bit is set or reset. The microcontroller begins the transfer by writing data to the external buffer. Figure 13-12 shows the timing for a buffer write. The buffer is written by either software writing the Parallel Port Data Register or a DMA transfer that writes the Parallel Port Data Register. The parallel port automatically generates the first DMA or interrupt request to write the data. Thereafter, the parallel port generates a DMA or interrupt request after it completes each transfer to the host. During a transfer to the host, the PAUTOFD signal is used to indicate that the host is busy and cannot accept data. PAUTOFD has the same polarity as PBUSY for this purpose. After the data buffer has been written, the parallel port waits for one TDELAY interval and then asserts PACK as soon as PAUTOFD is High and PSTROBE is Low (these signal conditions may hold before the interval expires). The TDELAY interval is Figure 13-6 Transfer from the Host on the Parallel Port (BRS=0, ARB=1) used to provide data setup time for the host. PACK is active for one TDELAY interval, then is deasserted. In response to PACK, the host acknowledges the transfer by asserting PSTROBE, which resets the TRA bit. PSTROBE has no fixed relationship to PACK. The host may also assert PAUTOFD before the end of the transfer to indicate it is not ready for a subsequent transfer. Following the deassertion of PACK or the assertion of PSTROBE (whichever is later), the parallel port waits one TDELAY interval to provide data hold time to the host. At the end of the interval, the parallel port generates a new DMA or interrupt request to have the data buffer written again, starting a new transfer. Software or the DMA channel may determine that all transfers have been made, and a new transfer does not start in this case. Figure 13-7 Transfer from the Host on the Parallel Port (BRS=1, ARB=0) Figure 13-8 Transfer from the Host on the Parallel Port (BRS=1, ARB=1) Figure 13-10 State Transitions for Transfers to the Host Figure 13-11 Transfer to the Host on the Parallel Port Figure 13-12 Parallel Port Buffer Write Cycle # 14 SERIAL PORT This chapter describes the programmable registers of the serial port on the Am29200 and Am29205 microcontrollers. #### 14.1 OVERVIEW The on-chip serial port is a UART that permits full-duplex, bidirectional data transfer using the RS-232 standard. Serial port registers provide a programmable baud rate generator, odd/even parity capability, choice of word length, a test mode, and DMA access. The operations of the serial port are similar on the Am29200 and Am29205 microcontrollers, except that the $\overline{\text{DSR}}$ and $\overline{\text{DTR}}$ handshake signals are not available on the Am29205 microcontroller. These functions, if needed, can be recreated with available PIO signals. ## 14.2 PROGRAMMABLE REGISTERS ## 14.2.1 Serial Port Control Register (SPCT, Address 80000080) The Serial Port Control Register (Figure 14-1) controls both the transmit and receive sections of the serial port. ## Figure 14-1 Serial Port Control Register = Reserved on Am29205 microcontroller #### Bits 31-27: Reserved **Bit 26: Loopback (LOOP)**—Setting this bit places the serial port in the loopback mode. In this mode, the TXD output is set High and the Transmit Shift Register is connected to the Receive Shift Register. Data transmitted by the transmit section is immediately received by the receive section. The loopback mode is provided for testing the serial port. Bit 25: Send Break (BRK)—Setting this bit causes the serial port to send a break, which is a continuous Low level on the TXD output for a duration of more than one frame transmission time. The transmitter can be used to time the frame by setting the BRK bit when the transmitter is empty (indicated by the TEMT bit of the Serial Port Status Register), writing the Serial Port Transmit Holding Register with data to be transmitted, and then waiting until the TEMT bit is set again before resetting the BRK bit. Bit 24: Data Set Ready (DSR), Am29200 microcontroller—Setting this bit causes the DSR output to be asserted. Resetting this bit causes the DSR output to be deasserted. This bit is reserved on the Am29205 microcontroller. #### Bits 23-22: Reserved Bits 21–19: Parity Mode (PMODE)—This field specifies how parity generation and checking are performed during transmission and reception (the value "x" is a don't care): | PMODE Value | Parity Generation and Checking | | |---------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------|--| | 0xx<br>100<br>101<br>110<br>111 | No parity bit in frame Odd parity (odd number of 1s in frame) Even parity (even number of 1s in frame) Parity forced/checked as 1 Parity forced/checked as 0 | | Bit 18: Stop Bits (STP)—A 0 in this bit specifies that one stop bit is used to signify the end of a frame. A 1 in this bit specifies that two stop bits are used to signify the end of a frame. Bits 17–16: Word Length (WLGN)—This field indicates the number of data bits transmitted or received in a frame, as follows: | WLGN Value | Word Length | | |------------|-------------|--| | 00 | 5 bits | | | 01 | 6 bits | | | 10 | 7 bits | | | 11 | 8 bits | | Data words of less than eight bits are right-justified in the Transmit Holding Register and Receive Buffer Register. #### Bits 15-10: Reserved **Bits 9–8: Transmit Mode (TMODE)**—This field enables data transmission and controls the operational mode of the serial port for the transmission of data, as follows: | TMODE Value | Effect on Transmit Section | | |-------------|---------------------------------------------------------------------------------------------------------------|--| | 00 | Disabled Generate interrupt requests for service | | | 10<br>11 | Generate interrupt requests for service<br>Generate DMA Channel 0 requests<br>Generate DMA Channel 1 requests | | Requests for service are requests to write the Transmit Holding Register with data to be transmitted. Placing the transmit section into the disabled state causes all internal state machines to be reset and holds the transmit section in an idle state with TXD High. Serial port programmable registers are not affected when the transmit section is disabled. #### Bits 7-3: Reserved Bit 2: Receive Status Interrupt Enable (RSIE)—This bit enables the serial port to generate an interrupt because of an exception during reception. If this bit is 1 and the serial port receives a break or experiences a framing error, parity error, or overrun error, the serial port generates a Receive Status interrupt. Bits 1–0: Receive Mode (RMODE)—This field enables data reception and controls the operational mode of the serial port for the reception of data: | RMODE Value | Effect on Receive Section | | |-------------|-----------------------------------------|--| | 00 | Disabled | | | 01 | Generate interrupt requests for service | | | 10 | Generate DMA Channel 0 requests | | | 11 | Generate DMA Channel 1 requests | | Requests for service are requests to read data from the Receive Buffer Register. Placing the receive section into the disabled state causes all internal state machines to be reset and holds the receive section in an idle state. Serial port programmable registers are not affected when the receive section is disabled. ## 14.2.2 Serial Port Status Register (SPST, Address 80000084) The Serial Port Status Register (Figure 14-2) indicates the status of the transmit and receive sections of the serial port. Figure 14-2 Serial Port Status Register = Reserved on Am29205 microcontroller ### Bits 31-11: Reserved **Bit 10: Transmitter Empty (TEMT)**—This bit is 1 when the transmitter has no data to transmit and the Transmit Shift Register is empty. This indicates to software it is safe to disable the transmit section. Bit 9: Transmit Holding Register Empty (THRE)—When the THRE bit is 1, the Transmit Holding Register does not contain valid data and can be written with data to be transmitted. When the THRE bit is 0, the Transmit Holding Register contains valid data not yet copied to the Transmit Shift Register for transmission and cannot be written. If so enabled by the TMODE field, the THRE bit causes an interrupt or DMA request when it is set. The THRE bit is reset automatically by writing the Transmit Holding Register. This bit is read-only, allowing other bits of the Serial Port Status Register to be written (for example, resetting the BRKI bit) without interfering with the data request. Bit 8: Receive Data Ready (RDR)—When the RDR bit is 1, the Receive Buffer Register contains data that has been received on the serial port, and can be read to obtain the data. When the RDR bit is 0, the Receive Buffer Register does not contain valid data. If so enabled by the RMODE field, the RDR bit causes an interrupt or DMA request when it is set. The RDR bit is reset automatically by reading the Receive Buffer Register. #### Bits 7-5: Reserved - Bit 4: Data Terminal Ready (DTR), Am29200 microcontroller—The DTR bit indicates the level on the $\overline{DTR}$ pin. The DTR bit is 1 when the $\overline{DTR}$ pin is active; the DTR bit is 0 when the $\overline{DTR}$ pin is inactive. This bit is reserved on the Am29205 microcontroller. - **Bit 3: Break Interrupt (BRKI)**—The BRKI bit is set to indicate that a break has been received. If the RSIE bit is 1, the BRKI bit being set causes a Receive Status interrupt. The BRKI bit should be reset by the Receive Status interrupt handler. - Bit 2: Framing Error (FER)—This bit is set to indicate that a framing error occurred during reception of data. If the RSIE bit is 1, the FER bit being set causes a Receive Status interrupt. The FER bit should be reset by the Receive Status interrupt handler. - **Bit 1: Parity Error (PER)**—This bit is set to indicate that a parity error occurred during reception of data. If the RSIE bit is 1, the PER bit being set causes a Receive Status interrupt. The PER bit should be reset by the Receive Status interrupt handler. - **Bit 0: Overrun Error (OER)**—This bit is set to indicate that an overrun error occurred during reception of data. If the RSIE bit is 1, the OER bit being set causes a Receive Status interrupt. The OER bit should be reset by the Receive Status interrupt handler. ## 14.2.3 Serial Port Transmit Holding Register (SPTH, Address 80000088) The processor writes this register (Figure 14-3) with data to be transmitted on the serial port. The transmitter is double-buffered, and the transmit section copies data from the Transmit Holding Register to the Transmit Shift Register (which is not accessible to software) before transmitting the data. #### Figure 14-3 Serial Port Transmit Holding Register #### Bits 31-8: Reserved Bits 7–0: Transmit Data (TDATA)—This field is written with data to be transmitted on the serial port. The THRE bit of the Serial Port Status Register should be 1 when this register is written, to avoid overwriting data already in the register. Writing this register causes the THRE bit to be reset. ## 14.2.4 Serial Port Receive Buffer Register (SPRB, Address 8000008C) This register (Figure 14-4) contains data received over the serial port. The receiver is double-buffered, and the receive section can be receiving a subsequent frame of data in the Receive Shift Register (which is not accessible to software) while the Receive Buffer is being read by software or by a DMA channel. ## Figure 14-4 Serial Port Receive Buffer Register #### Bits 31-8: Reserved **Bits 7–0: Receive Data (RDATA)**—This field contains data received on the serial port. The RDR bit of the Serial Port Status Register should be 1 when this register is read, to avoid reading invalid data. Reading this register causes the RDR bit to be reset. ## 14.2.5 Baud Rate Divisor Register (BAUD, Address 80000090) This register (Figure 14-5) specifies a clock divisor for the generation of a serial clock that controls the serial port. The UCLK (serial clock rate) is 16 times faster than the baud rate of the serial port. The Baud Rate Divisor Register specifies the zero-based number of UCLK cycles in one phase (half period) of the 16x serial clock. The formula for the baud rate is thus: Baud Rate = (Frequency of UCLK) + (BAUDDIV+1) + 32 The maximum baud rate is 1/32 of INCLK and is achieved by tying UCLK to INCLK with BAUDDIV=0000, hexadecimal. ## Figure 14-5 Baud Rate Divisor Register #### Bits 31-16: Reserved **Bit 15–1: Baud Rate Divisor (BAUDDIV)**—This field specifies the amount by which the UCLK input is divided to generate one phase of the serial clock. The serial clock operates at 16 times the rate of transmission or reception of data. The BAUDDIV value is zero-based. For example, a value of two specifies a divisor of three. #### 14.2.6 Initialization During a processor reset, both the TMODE and RMODE fields of the Serial Port Control Register are reset to 00, disabling the transmit and receive sections of the serial port. Software must initialize the serial port before it is enabled. ## 15 VIDEO INTERFACE This chapter describes the bidirectional bit serializer/deserializer (known as the video interface) on the Am29200 and Am29205 microcontrollers. First the programmable registers of the video interface are described. This is followed by a discussion of video interface operation, including transmitting and receiving data. #### 15.1 OVERVIEW The video interface provides direct connection to printer engine interfaces, raster input devices, and other serial-driven devices. The programmable interface allows direct connection to a large number of marking engines. Features of the interface include programmable image scan rates, along with programmable horizontal and vertical image margins. Video interface operations are the same on both the Am29200 and Am29205 microcontrollers. #### 15.2 PROGRAMMABLE REGISTERS ## 15.2.1 Video Control Register (VCT, Address 800000E0) This register (see Figure 15-1) controls the operation of the video interface. #### Figure 15-1 Video Control Register #### Bits 31-16: Reserved Bit 15: Data Request (DRQ)—This bit is set to indicate that the video interface is ready for data to be written to or read from the Video Data Holding Register. If so enabled by the MODE field, this bit being set generates an interrupt or DMA request to write or read data. This bit is reset when the Video Data Holding Register is read or written. This bit is read-only, to allow other bits of the Video Control Register to be set (for example, the PSL bit) without interfering with the data request. Bits 14–11: Clock Divide (CLKDIV)—This field contains the divisor of the VCLK input used to generate the internal video clock. It specifies the number of VCLK periods in one phase (half period) of the internal video clock. For example, a value of 0001 indicates that one VCLK period constitutes one phase of the internal video clock—a divide by two. A value of 0000 causes VCLK to be used directly as the video clock. At the beginning of a video raster line, the clock divider is initialized so that, in the line, the first period of the internal clock is the correct number of VCLK periods. **Bit 10: Data Direction (DDIR)**—This bit controls the direction of video data. If the DDIR bit is 0, data is transmitted on the video interface. If the DDIR bit is 1, data is received on the video interface. **Bits 9–8: Video Interface Mode (MODE)**—This field enables the video interface and controls the operational mode of the video interface, as follows: | MODE Value | Effect on Video Interface | | |------------|-----------------------------------------|--| | 00 | Disabled | | | 01 | Generate interrupt requests for service | | | 10 | Generate DMA Channel 0 requests | | | 11 | Generate DMA Channel 1 requests | | Requests for service are requests to read or write the Video Data Holding Register. Placing the video interface into the disabled state causes all internal state machines to be reset and holds the video interface in an idle state. Video interface programmable registers are not affected when the interface is disabled. **Bit 7: Clock Invert (CLKI)**—If this bit is 0, the VDAT, PSYNC, and LSYNC pins are driven or sampled on the Low-to-High transition of the VCLK input. If this bit is 1, the VDAT, PSYNC, and LSYNC pins are driven or sampled on the High-to-Low transition of the VCLK input. #### Bit 6: Reserved Bit 5: Page Sync Input/Output (PSIO)—This bit determines whether or not PSYNC is an input or output. If this bit is 0, PSYNC is an input. If this bit is 1, PSYNC is an output. **Bit 4: Page Sync Invert (PSI)**—If this bit is 0 and PSYNC is an input, a Low-to-High transition of the PSYNC input indicates the beginning of a page. If this bit is 1 and PSYNC is an input, a High-to-Low transition of the PSYNC input indicates the beginning of a page. If this bit is 0 and PSYNC is an output, PSYNC is noninverted with respect to the PSL bit. A PSL bit of 0 is reflected as a Low level, a PSL bit of 1 is reflected as a High level, and a page starts on a Low-to-High transition. If this bit is 1 and PSYNC is an output, PSYNC is inverted with respect to the PSL bit. A PSL bit of 0 is reflected as a High level, a PSL bit of 1 is reflected as a Low level, and a page starts on a High-to-Low transition. **Bit 3: Page Sync Level (PSL)**—When PSYNC is an input, this bit reflects the level on PSYNC. When PSYNC is an output, this bit determines the level on PSYNC. If PSI=0, a 0 in this bit corresponds to a Low level on PSYNC and a 1 in this bit corresponds to a High level on PSYNC and a 1 in this bit corresponds to a Low level on PSYNC and a 1 in this bit corresponds to a Low level on PSYNC. Bit 2: Line Sync Invert (LSI)—If this bit is 0, a Low-to-High transition of the LSYNC input indicates the beginning of a line. If this bit is 1, a High-to-Low transition of the LSYNC input indicates the beginning of a line. **Bit 1: Shift Direction (SDIR)**—When this bit is 0, the Video Data Shift Register is shifted right to transfer data, with video data being shifted out of the least significant bit of the register (corresponding to bit 0 of the Video Data Holding Register) or into the most significant bit (corresponding to bit 31 of the Video Data Holding Register). When this bit is 1, the Video Data Shift Register is shifted left to transfer data, with video data being shifted out of the most significant bit of the register or into the least significant bit. **Bit 0: Video Invert (VIDI)**—When this bit is 0, a 1 in the Video Data Shift Register corresponds to a High level on VDAT and a 0 in the Video Data Shift Register corresponds to a Low level on VDAT. When this bit is 1, a 1 in the Video Data Shift Register corresponds to a Low level on VDAT and a 0 in the Video Data Shift Register corresponds to a High level on VDAT. ## 15.2.2 Top Margin Register (TOP, Address 800000E4) This register (Figure 15-2) specifies the number of lines in the top margin of a page. ### Figure 15-2 Top Margin Register #### Bits 31-12: Reserved Bits 11–0: Top Margin Count (TOPCNT)—This field specifies the number of lines in the top margin. ## 15.2.3 Side Margin Register (SIDE, Address 800000E8) This register (Figure 15-3) specifies the number of data bits in the left margin of a page and the number of bits in a raster line of video data. Together, this information sets the right and left margins of a page. ## Figure 15-3 Side Margin Register #### Bits 31-28: Reserved Bits 27–16: Left Margin Count (LEFTCNT)—This field specifies the number of data bit equivalents in the left margin of a page. Bits 15–0: Line Count (LINECNT)—This field specifies the number of data bits in a raster line of video data. ## 15.2.4 Video Data Holding Register (VDT, Address 800000EC) This register (Figure 15-4) contains data to be transmitted on or received from the video interface. Video data is double-buffered so data can be written to or read from the Video Data Holding Register while other data is transmitted from or received into the Video Data Shift Register. Bits 31–0: Video Data (VDATA)—This field is written or read to transmit or receive data on the video interface. #### 15.2.5 Initialization During a processor reset, the MODE field of the Video Control Register is reset to 00. Software must configure the video interface before it is enabled. To prevent possible driver conflicts during reset, the PSIO bit is reset and the DDIR bit is set so both PSYNC and VDAT are inputs. To allow time for the interface signals to settle, the inputs and outputs should be configured before the interface is enabled. #### 15.3 VIDEO INTERFACE OPERATION The operation of the video interface is synchronous to the VCLK input (see Section 7.1.10), which clocks the video interface either directly or at a frequency multiple specified by the CLKDIV field. The CLKDIV field specifies the number of VCLK periods in one phase (half period) of the internal video clock. If the CLKDIV field has the value 0000, the VCLK input is used directly. The following equations show how the CLKDIV field determines the internal video clock. ``` If CLKDIV = 0 Internal Video Clock period = VCLK If CLKDIV > 0 Internal Video Clock period = [VCLK * CLKDIV] * 2 and Internal Video Clock frequency = [VCLK ÷ (CLKDIV * 2)] ``` For example, assuming VCLK = 16 MHz = 62.5 ns period: | CLKDIV | Internal Video Clock Period | Internal Video Clock Frequency | |--------|------------------------------|--------------------------------| | 0 | 62.5 ns | 16.0 MHz (VCLK) | | √1 | [62.5 ns * 1] * 2 = 125 ns | 8.0 MHz (1/2 of VCLK) | | 2 | [62.5 ns * 2] * 2 = 250 ns | 4.0 MHz (1/4 of VCLK) | | 3 | [62.5 ns * 3] * 2 = 375 ns | 2.67 MHz (1/6 of VCLK) | | 4 | [62.5 ns * 4] * 2 = 500 ns | 2.0 MHz (1/8 of VCLK) | The progression continues in this way up to a value of 15 for CLKDIV. The clock divider circuit is initialized when the video interface is disabled, and does not operate until the interface is enabled by the MODE field. This circuit is also initialized by the transition of LSYNC that indicates the beginning of a line. Initializing the clock divider with LSYNC insures that the first internal clock period in the line is the indicated number of VCLK periods. The maximum frequency of VCLK is the same as the maximum frequency of INCLK. The maximum operating frequency of the video interface is the frequency of INCLK if the interface is used to output data. The maximum operating frequency is one-eighth of the frequency of INCLK if the interface is used to input data. The PSYNC, LSYNC, and VDAT pins are driven and/or sampled during either the Low-to-High (CLKI=0) or High-to-Low (CLKI=1) transition of the VCLK input. The clock divider sequences on the same transition. If the clock is not divided down, new data can be driven or sampled on every active transition of VCLK. If the clock is divided down, new data can be driven or sampled on every CLKDIV-times-2 active transition of VCLK. ## 15.3.1 Transmitting Data on the Video Interface Before the video interface is enabled to transmit, the Video Control Register should be set to configure the interface, and the Top Margin and Side Margin registers should be set with the appropriate counts. When the DDIR bit is 0 (VDAT is an output) and the video interface is disabled or is not transferring data, the VDAT output is held at a level corresponding to a 0 data bit (Low if VIDI=0 or High if VIDI=1). Once the video interface has been configured, it is enabled via the MODE field. Enabling the video interface with DDIR=0 causes the interface to set the DRQ bit, thereby generating an interrupt or DMA request to write the Video Data Holding Register. Writing data into the Video Data Holding Register resets the DRQ bit. Data is transferred from the Video Data Holding Register to the Video Data Shift Register whenever the Video Data Shift Register is empty. After the transfer, the DRQ bit is set to request more data. Thus, the DRQ bit may be set very soon after the first data word is written. Thereafter, however, the DRQ bit will be set only as data is transmitted on the interface. A page cycle begins by an active transition of PSYNC, either as an input or output. At the beginning of a page cycle, three count-down registers are loaded from the TOPCNT, LEFTCNT, and LINECNT fields. The TOPCNT counter enables the transmission of the first raster line when it counts down to zero. The LEFTCNT counter enables the transmission of raster data on a line when it counts down to zero. The LINECNT counter enables the transmission of raster data as long as it is nonzero. After the page cycle begins, the counter registers are not enabled to count until the first active transition of LSYNC. An active transition of LSYNC indicates the beginning of a line. Because of internal synchronization delay, the video interface does not respond to LSYNC until five VCLK cycles have elapsed (see Figure 15-5). If the Video Data Shift Register is not empty, an active transition on LSYNC causes the TOPCNT counter to decrement by one (the TOPCNT field is unaffected). The TOPCNT counter continues to decrement by one on each active transition of LSYNC until it reaches zero. Note that if the TOPCNT field contains zero at the beginning of a page, the video interface begins transmitting on the first active transition of LSYNC. When the TOPCNT counter reaches zero, the interface is enabled to transmit the first raster line. At the beginning of the line, the LEFTCNT counter decrements on each active transition of the interface clock, beginning five VCLK cycles after the active edge of LSYNC, until the counter reaches zero. When the LEFTCNT counter reaches zero, the data in the selected end of the Video Data Shift Register is enabled to drive the VDAT output, and the LINECNT counter is enabled to count. The LEFTCNT counter is reloaded from the LEFTCNT field but does not count until the next active transition of LSYNC. If the LEFTCNT field contains zero at the beginning of a line, video data is driven and the LINECNT counter is enabled to count immediately on the fifth VCLK cycle after the first active transition of LSYNC, after the TOPCNT counter reaches zero. The first bit of video data is driven for a period of the interface clock, during the cycle in which the LEFTCNT counter reaches zero. On the next active transition of the clock, the Video Data Shift Register is shifted right (SDIR=0) or left (SDIR=1) by one bit and the new data driven on the VDAT output. Also, the LINECNT counter is decremented by one. When the last bit in the Video Data Shift Register has been transmitted, new data is for top or left margins) loaded from the Video Data Holding Register and the DRQ bit is set to request more data. Data transmission continues until the LINECNT counter reaches zero. When the LINECNT counter reaches zero, the VDAT output is driven to correspond to a 0 data bit and the Video Data Shift Register is cleared. The LINECNT counter is reloaded but is not enabled to count until a new line begins and the LEFTCNT counter reaches zero once more. The VDAT output is held at a 0 data level and the Video Shift Register does not shift until the next line is transmitted. Clearing the Video Data Shift Register at the end of a line enables it to be reloaded with new data from the Video Data Holding Register as soon as this data is available. On each subsequent active transition of LSYNC, a subsequent line of data is transmitted. Each line begins with a synchronization period of five VCLK cycles, then a countdown of the LEFTCNT counter until it reaches zero, followed by data transmission and shifting until the LINECNT counter reaches zero. On any active transition of LSYNC, if the Video Data Shift Register is empty, the page cycle ends and the video interface waits for the next active transition of PSYNC. ## 15.3.2 Receiving Data on the Video Interface When the video interface is configured to receive data, the TOPCNT and LEFTCNT fields are not used, and the PSYNC pin is not used. Data reception is controlled by LSYNC, VCLK, and the LINECNT field. On the active edge of LSYNC, the LINECNT counter is loaded with the contents of the LINECNT field. On the fifth active edge of VCLK following the active edge of LSYNC (for synchronization), data is sampled into the selected end of the Video Data Shift Register, the register is shifted in the selected direction, and the LINECNT counter is decremented by one. When the Video Data Shift Register has received 32 bits, the contents of the register are transferred into the Video Data Holding Register and the DRQ bit is set to request that the data be read. Data sampling and shifting continue until the LINECNT counter reaches zero. To clear the data at the end of a line after the LINECNT counter reaches zero, the data in the Video Data Shift Register is transferred into the Video Data Holding Register as soon as the holding register is available, and the DRQ bit is set. The interface waits for the next active transition of LSYNC before it accepts a new line of data. ## 口 ## 16 INTERRUPTS AND TRAPS This chapter describes how to use interrupts and traps to control the behavior of the Am29200 and Am29205 microcontrollers. Programmable registers are defined, as are vector numbers and interrupt and trap priorities. Interrupt and trap handling is discussed along with exception reporting and timer configuration. The chapter concludes with a description of the microcontroller's internal interrupt handler. ### 16.1 OVERVIEW The Am29200 and Am29205 microcontrollers employ a lightweight interrupt and trap facility that does not automatically save its current state in memory. Saving and restoring state information is under software control. Interrupts and traps are dispatched using a vector table that can be relocated in memory. On the Am29205 microcontroller, external traps and $\overline{\text{WARN}}$ are not supported. Only the $\overline{\text{INTR3-INTR2}}$ inputs are available. The PIO signals can be used as additional interrupts when more inputs are required. #### 16.2 INTERRUPTS AND TRAPS Interrupts and traps cause the Am29200 and Am29205 microcontrollers to suspend the execution of an instruction sequence and to begin the execution of a new sequence. The processor may or may not later resume the execution of the original instruction sequence. The distinction between interrupts and traps is largely one of causation and enabling. Interrupts allow external devices and the timer facility to control processor execution and are always asynchronous to program execution. Traps are intended to be used for certain exceptional events that occur during instruction execution and are generally synchronous to program execution. A distinction is made between the point at which an interrupt or trap occurs and the point at which it is taken. An interrupt or trap is said to occur when all conditions that define the interrupt or trap are met. However, an interrupt or trap that occurs is not necessarily recognized by the processor, either because of various enables or because of the processor's operational mode (e.g., Halt mode). An interrupt or trap is taken when the processor recognizes the interrupt or trap and alters its behavior accordingly. ## 16.2.1 Current Processor Status Register (CPS, Register 2) This protected special-purpose register (see Figure 16-1) controls the behavior of the processor and its ability to recognize exceptional events. #### Bits 31-18: Reserved Bits 17: Timer Disable (TD)—When the TD bit is 1, the Timer interrupt is disabled. When this bit is 0, the Timer interrupt depends on the value of the IE bit of the Timer Reload Register. Note that Timer interrupts may be disabled by the DA bit regardless of the value of either TD or IE. The intent of this bit is to provide a means of disabling Timer interrupts without having to perform a non-atomic read-modify-write operation on the Timer Reload Register. #### Bit 16-15: Reserved Bit 14: Interrupt Pending (IP)—This bit allows software to detect the presence of interrupts while the interrupts are disabled. The IP bit is set if an interrupt request is active, but the processor is disabled from taking the resulting interrupt due to the value of the DA, DI, or IM bits. If all interrupt requests are subsequently deactivated while still disabled, the IP bit is reset. Bits 13–12: Trace Enable, Trace Pending (TE, TP)—The TE and TP bits implement a software-controlled, instruction single-step facility. Single stepping is not implemented directly, but rather emulated by trap sequences controlled by these bits. The value of the TE bit is copied to the TP bit whenever an instruction completes execution. When the TP bit is 1, a Trace trap occurs. Section 17.2 describes the use of these bits in more detail. Bit 11: Trap Unaligned Access (TU)—The TU bit enables checking of address alignment for external data-memory accesses. When this bit is 1, an Unaligned Access trap occurs if the processor either generates an address for an external word not aligned on a word address-boundary (i.e., either of the least significant two bits is 1) or generates an address for an external half-word not aligned on a half-word address boundary (i.e., the least significant address bit is 1). When the TU bit is 0, data-memory address alignment is ignored. Alignment is ignored for input/output accesses. The alignment of instruction addresses is also ignored (unaligned instruction addresses can be generated only by indirect jumps). Interrupt/trap vector addresses always are aligned properly by the processor. **Bit 10: Freeze (FZ)**—The FZ bit prevents certain registers from being updated during interrupt and trap processing, except by explicit data movement. The affected registers are: Channel Address, Channel Data, Channel Control, Program Counter 0, Program Counter 1, Program Counter 2, and the ALU Status Register. When the FZ bit is 1, these registers hold their values. An affected register can be changed only by a Move-To-Special-Register instruction. When the FZ bit is 0, there is no effect on these registers and they are updated by processor instruction execution as described in this manual. The FZ bit is set whenever an interrupt or trap is taken, holding critical state in the processor so it is not modified unintentionally by the interrupt or trap handler. If the Freeze (FZ) bit of the Current Processor Status Register is reset from 1 to 0, two cycles are required before all program state is reflected properly in the registers affected by the FZ bit. This implies that interrupts and traps cannot be enabled until two cycles after the FZ bit is reset, for proper sequencing of program state. There is no delay associated with setting the FZ bit from 0 to 1. Bit 9-8: Reserved **Bit 7: Wait Mode (WM)**—The WM bit places the processor in the Wait mode. When this bit is 1, the processor performs no operations. The Wait mode is reset by an interrupt or trap for which the processor is enabled, or by the assertion of the RESET pin. #### Bit 6-5: Reserved Bit 4: Supervisor Mode (SM)—The SM bit protects certain processor context, such as protected special-purpose registers. When this bit is 1, the processor is in the Supervisor mode and access to all processor context is allowed. When this bit is 0, the processor is in the User mode and access to protected processor context is not allowed. An attempt to access (either read or write) protected processor context causes a Protection Violation trap. Section 6.1 describes the processor state protected from User-mode access. Bits 3–2: Interrupt Mask (IM)—The IM field is an encoding of the processor priority with respect to external interrupts. The interpretation of the interrupt mask is specified in Section 16.2.2. Bit 1: Disable Interrupts (DI)—The DI bit prevents the processor from being interrupted by internal peripheral requests and by external interrupt requests INTR3—INTR0 on the Am29200 microcontroller and INTR3—INTR2 on the Am29205 microcontroller. When this bit is 1, the processor ignores all internal and external interrupts. However, traps (both internal and external), Timer interrupts, and Trace traps may be taken. When this bit is 0, the processor takes any interrupt enabled by the IM field, unless the DA bit is 1. Bit 0: Disable All Interrupts and Traps (DA)—The DA bit prevents the processor from taking any interrupts and most traps. When this bit is 1, the processor ignores interrupts and traps, except for the WARN trap on the Am29200 microcontroller. When the DA bit is 0, all traps are taken; interrupts are taken if otherwise enabled. #### 16.2.2 Interrupts Interrupts are caused by signals applied to any of the external INTRx inputs, by the timer facility (see Section 16.8), or by internal peripherals (see Section 16.9). The processor may be disabled from taking certain interrupts by the masking capability provided by the Disable All Interrupts and Traps (DA) bit, Disable Interrupts (DI) bit, and Interrupt Mask (IM) field in the Current Processor Status Register. Timer interrupts may be disabled by the Timer Disable (TD) bit of the Current Processor Status Register. The DA bit disables all interrupts. The DI bit disables external interrupts and internal peripheral interrupts without affecting the recognition of traps and Timer interrupts. The 2-bit IM field selectively enables external interrupts as follows: | IM Value | Result | Microcontroller | |----------|---------------------------------------------------------------------------------------------------------------|--------------------| | 00<br>01 | INTR0 enabled INTR1-INTR0 enabled | Am29200<br>Am29200 | | 10 | INTR2-INTR0 enabled INTR2 enabled | Am29200<br>Am29205 | | 11 | INTR3-INTR0 and internal peripheral interrupts enabled INTR3-INTR2 and internal peripheral interrupts enabled | Am29200<br>Am29205 | Note that the INTRO interrupt cannot be disabled by the IM field. Also, no external interrupt is taken if either the DA or DI bit is 1. The Interrupt Pending bit in the Current Processor Status indicates that one or more interrupt requests is active, but the corresponding interrupt is disabled due to the value of either DA, DI, or IM. ### 16.2.3 Traps Traps are caused by signals applied to one of the Am29200 microcontroller inputs TRAP1—TRAP0 or by exceptional conditions such as protection violations. Traps are disabled by the DA bit in the Current Processor Status; a 1 in the DA bit disables traps, and a 0 enables traps. It is not possible to selectively disable individual traps. ### 16.2.4 External Interrupts and Traps An external device causes an interrupt by asserting one of the INTRx inputs, and causes a trap by asserting one of the TRAP1-TRAP0 inputs on the Am29200 microcontroller. Transitions on each of these inputs may be asynchronous to the processor clock; they are protected against metastable states. For this reason, an assertion of one of these inputs that meets the proper set-up-time criteria does not cause the corresponding interrupt or trap until the fourth following cycle. The INTRx inputs are prioritized with respect to each other and with respect to the processor. To resolve conflicts between these inputs, the inputs are prioritized in order, so the interrupt caused by INTR0 has the highest priority and the interrupt caused by INTR3 has the lowest priority. The TRAP1-TRAP0 inputs on the Am29200 microcontroller are prioritized with respect to each other, so the trap caused by TRAP0 has priority over the trap caused by TRAP1 when a conflict occurs. Both TRAP0 and TRAP1 have priority over the INTR3-INTR0 inputs. The TRAP1-TRAP0 inputs cannot be disabled selectively. Both traps, however, can be disabled by the DA bit in the Current Processor Status Register. The INTRx and TRAP1—TRAP0 inputs are level-sensitive. Once asserted, they must be held active until the corresponding interrupt or trap is acknowledged by the interrupt or trap handler. This acknowledgment is system-dependent, since there is no interrupt-acknowledge mechanism defined for the processor. If any of these inputs is asserted, then deasserted before it is acknowledged, it is not possible to predict (unless the interrupt or trap is masked) whether or not the processor has taken the corresponding interrupt or trap. During interrupt and trap processing, the vector number is determined in part by which of the INTRx and TRAP1—TRAP0 inputs is active. If the input causing an interrupt or trap is deasserted before the vector number is determined, the vector number is unpredictable and the processor operation is also unpredictable. Typically, this situation results in the processor taking an Illegal Opcode trap. There is a five-cycle latency from the deassertion of an INTRx or TRAP1—TRAP0 input to the time the corresponding interrupt or trap is no longer recognized by the processor. The latency is due to the metastability hardening that allows these signals to be driven with slow-transition-time signals. The deassertion must be timed so the processor is not recognizing the interrupt or trap by the time the corresponding mask is reset. Otherwise, a spurious interrupt or trap may occur. External traps are not supported on the Am29205 microcontroller. Only the INTR3-INTR2 inputs are available for external interrupts. #### 16.2.5 Wait Mode A wait-for-interrupt capability is provided by the Wait mode. The processor is in the Wait mode whenever the Wait Mode (WM) bit of the Current Processor Status is 1. While in Wait mode, the processor neither fetches nor executes instructions and performs no external accesses. The Wait mode is exited when an interrupt or trap is taken. The processor can take only those interrupts or traps for which it is enabled, even in the Wait mode. For example, if the processor is in the Wait mode with the DA bit set to 1, it can leave the Wait mode only via a processor reset (see Section 2.9.2) or a WARN trap (see Section 16.5). #### 16.3 VECTOR AREA Interrupt and trap processing relies on the existence of a user-managed vector area in external instruction/data memory. The vector area begins at an address specified by the Vector Area Base Address Register and provides for as many as 256 different interrupt and trap handling routines. The processor reserves 64 routines for system operation and instruction emulation. The number and definition of the remaining 192 possible routines are system dependent. The structure of the vector area is a table of vectors in instruction/data memory. The layout of a single vector is shown in Figure 16-2. Each vector gives the beginning word-address of the associated interrupt or trap handling routine. ## 16.3.1 Vector Area Base Address Register (VAB, Register 0) This protected special-purpose register (Figure 16-3) specifies the beginning address of the interrupt/trap vector area. The vector area is a table of 256 vectors that point to interrupt and trap handling routines. When an interrupt or trap is taken, the vector number for the interrupt or trap (see Section 16.3.2) replaces bits 9–2 of the value in the Vector Area Base Address Register to generate the physical address for a vector contained in instruction/data memory. #### Figure 16-3 Vector Area Base Address Register Bits 31–10: Vector Area Base (VAB)—The VAB field gives the beginning physical address of the vector area. This address is constrained to begin on a 1-Kbyte address-boundary in instruction/data memory. **Bits 9–0: Zeros**—These bits force the alignment of the vector area to a 1-Kbyte boundary. #### 16.3.2 Vector Numbers When an interrupt or trap is taken, the processor determines an 8-bit vector number associated with the interrupt or trap. The vector number gives the number of a vector table entry. The physical address of the vector table entry is generated by replacing bits 9–2 of the value in the Vector Area Base Address Register with the vector number. Vector numbers are either predefined or specified by an instruction causing the trap. The assignment of vector numbers is shown in Table 16-1 (vector numbers are in decimal notation). Vector numbers 64 to 255 are used by trapping instructions; the definition of the routines associated with these numbers is system dependent. #### 16.4 INTERRUPT AND TRAP HANDLING Interrupt and trap handling consists of two distinct operations: taking the interrupt or trap and returning from the interrupt or trap handler. If the interrupt or trap handler returns directly to the interrupted routine, the interrupt or trap handler need not save and restore processor state. ## 16.4.1 Old Processor Status Register (OPS, Register 1) This protected special-purpose register has the same format as the Current Processor Status Register. The Old Processor Status Register stores a copy of the Current Processor Status Register when an interrupt or trap is taken. This is required since the Current Processor Status Register is modified to reflect the status of the interrupt/trap handler. During an interrupt return, the Old Processor Status Register is copied into the Current Processor Status Register. This allows the Current Processor Status Register to be set as required for the routine that is the target of the interrupt return. ## 16.4.2 Program Counter Stack The program counter unit, shown in Figure 16-4, forms and sequences instruction addresses for the instruction fetch unit. It contains the program counter (PC), the program-counter multiplexer (PC MUX), the return address latch, and the program-counter buffer. The PC forms addresses for sequential instructions executed by the processor. The master of the PC Register, PC L1, contains the address of the instruction being fetched in the instruction fetch unit. The slave of the PC Register, PC L2, contains the next sequential address, which may be fetched by the instruction fetch unit in the next cycle. The return address latch passes the address of the instruction following the delayed instruction of a call to the register file. This address is the return address of the call. The PC buffer stores the addresses of instructions in various stages of execution when an interrupt or trap is taken. The registers in this buffer—Program Counters 0, 1, and 2 (PC0, PC1, and PC2)—are normally updated from the PC as instructions flow through the processor pipeline. When an interrupt or trap is taken, the Freeze (FZ) bit in the Current Processor Status Register is set, holding the quantities in the PC buffer. When the FZ bit is set, PC0, PC1, and PC2 contain the addresses of the instructions in the decode, execute, and writeback stages of the pipeline, respectively. Upon the execution of an interrupt return, the target instruction stream is restarted using the instruction addresses in PC0 and PC1. Two registers are required here because the processor implements delayed branches. An interrupt or trap may be taken when the processor is executing the delay instruction of a branch and decoding the target of the branch. This discontinuous instruction sequence must be restarted properly upon an interrupt return. Restarting the instruction pipeline using two separate registers correctly **Table 16-1** Vector Number Assignments | Number | Type of Trap or Interrupt | Cause | |------------------|----------------------------------------------------|------------------------------------------------| | 0 | Illegal Opcode | Executing undefined instruction <sup>1</sup> | | 1 | Unaligned Access | Access on unnatural boundary, TU = 1 | | 2 | Out-of-Range | · Overflow or underflow | | 3–4<br>5 | Reserved Protection Violation | Invalid User-mode operation <sup>2</sup> | | 6 <del>-</del> 7 | Reserved | mvalid Oser-mode operation- | | 8 | User Instruction Mapping Miss | No DRAM mapping for access | | 9 | User Data Mapping Miss | No DRAM mapping for access | | 10 | Supervisor Instruction Mapping Miss | No DRAM mapping for access | | 11 | Supervisor Data Mapping Miss | No DRAM mapping for access | | 12–13 | Reserved | Times Facility | | 14<br>15 | Timer<br>Trace | Timer Facility Trace Facility | | 16 | INTRO <sup>4</sup> | INTRO input | | 17 | INTR14 | INTR1 input | | 18 | INTR2 | INTR2 input | | 19 | INTR3/Internal | INTR3 input or internal peripheral | | 20 | TRAP04 | TRAP0 input | | 21<br>22 | TRAP14 | TRAP1 input | | | Floating-Point Exception | Unmasked floating-point exception <sup>3</sup> | | 23 | Reserved | | | 24–29 | Reserved for instruction emulation (opcodes D8–DD) | | | 30 | MULTM | MULTM instruction | | 31 | MULTMU | MULTMU instruction | | 32 | MULTIPLY | MULTIPLY instruction | | 33 | DIVIDE | DIVIDE instruction | | 34 | MULTIPLU | MULTIPLU instruction | | 35 | DIVIDU | DIVIDU instruction | | 36 - | CONVERT | CONVERT instruction | | 37 | SQRT | SQRT instruction | | 38 | CLASS | CLASS instruction | | 39–41 | Reserved for instruction emulation (opcode E7–E9) | | | 42 | FEQ | FEQ instruction | | 43 | DEQ | DEQ instruction | | 44 | FGT | FGT instruction | | 45 | DGT | DGT instruction | | 46 | FGE | FGE instruction | | 47 | DGE | DGE instruction | | 48 | FADD | FADD instruction | | 49 | DADD | DADD instruction | | | | FSUB instruction | | 50 | FSUB | | | 51 | DSUB | DSUB instruction | | 52 | FMUL | FMUL instruction | #### Notes: - 1. This vector number also results if an external device removes INTRx or TRAP1-TRAP0 before the corresponding interrupt or trap is taken by the processor (see Section 16.2.4). - 2. Some Supervisor-mode operations cause Protection Violations to facilitate virtualization of certain operations. - 3. The Floating-Point Exception trap is not generated by the processor hardware. It is generated by the software that implements the virtual arithmetic interface (see Section 2.8). - 4. Cannot be generated by the Am29205 microcontroller hardware. Table 16-1 Vector Number Assignments (continued) | Number | Type of Trap or Interrupt | Cause | |--------|-------------------------------------------------------------------------------|-------------------| | 53 | DMUL | DMUL instruction | | 54 | FDIV | FDIV instruction | | 55 | DDIV | DDIV instruction | | 56 | Reserved for instruction emulation (opcode F8) | | | 57 | FDMUL | FDMUL instruction | | 58–63 | Reserved for instruction emulation (opcode FA–FF) | | | 64–255 | ASSERT and EMULATE instruction traps (vector number specified by instruction) | Note <sup>5</sup> | Notes: (continued) Some of Vector Numbers 64–255 are reserved for software compatibility (see Sections 4.2.3 and 4.2.6). These are documented in Chapter 4 and in the Host Interface (HIF) Specification (included in the RISC Design-Made-Easy Application Guide, PID #16693A), available from AMD. handles this special case; in this case, PC1 points to the delay instruction of the branch, and PC0 points to its target. PC2 does not participate in the interrupt return, but is included to report the addresses of instructions causing certain exceptions. The PC is not defined as a special-purpose register. It cannot be modified or inspected by instructions. Instead, the interrupting and restarting of the pipeline is done by the PC Buffer registers PC0 and PC1. ## 16.4.2.1 Program Counter 0 Register (PC0, Register 10) This protected special-purpose register (Figure 16-5) is used on an interrupt return to restart the instruction in the decode stage when the original interrupt or trap was taken. Figure 16-5 Program Counter 0 Register Bits 31–2: Program Counter 0 (PC0)—This field captures the word-address of an instruction as it enters the decode stage of the processor pipeline, unless the Freeze (FZ) bit of the Current Processor Status Register is 1. If the FZ bit is 1, PC0 holds its value. When an interrupt or trap is taken, the PC0 field contains the word-address of the instruction in the decode stage. The interrupt or trap has prevented this instruction from executing. The processor uses the PC0 field to restart this instruction on an interrupt return. Bits 1–0: Zeros—These bits are zero since instruction addresses are always word aligned. #### 16.4.2.2 Program Counter 1 Register (PC1, Register 11) This protected special-purpose register (Figure 16-6) is used on an interrupt return to restart the instruction in the execute stage when the original interrupt or trap was taken. Figure 16-6 Program Counter 1 Register Bits 31–2: Program Counter 1 (PC1)—This field captures the word-address of an instruction as it enters the execute stage of the processor pipeline, unless the Freeze (FZ) bit of the Current Processor Status Register is 1. If the FZ bit is 1, PC1 holds its value. When an interrupt or trap is taken, the PC1 field contains the word-address of the instruction in the execute stage; the interrupt or trap has prevented this instruction from completing execution. The processor uses the PC1 field to restart this instruction on an interrupt return. **Bits 1–0: Zeros**—These bits are zero, since instruction addresses are always word aligned. ### 16.4.2.3 Program Counter 2 Register (PC2, Register 12) This protected special-purpose register (Figure 16-7) reports the address of certain instructions causing traps. ## Figure 16-7 Program Counter 2 Register Bits 31–2: Program Counter 2 (PC2)—This field captures the word address of an instruction as it enters the write-back stage of the processor pipeline, unless the Freeze (FZ) bit of the Current Processor Status Register is 1. If the FZ bit is 1, PC2 holds its value. When an interrupt or trap is taken, the PC2 field contains the word address of the instruction in the write-back stage. In certain cases PC2 contains the address of the instruction causing a trap. The PC2 field is used to report the address of this instruction and has no other use in the processor. **Bits 1–0: Zeros**—These bits are zero since instruction addresses are always word aligned. ## 16.4.3 Taking an Interrupt or Trap The following operations are performed in sequence by the processor when an interrupt or trap is taken: - 1. Instruction execution is suspended. - 2. Instruction fetching is suspended. - 3. Any in-progress load or store operation is completed. Any additional operations are canceled in the case of load multiple and store multiple. - The contents of the Current Processor Status Register are copied into the Old Processor Status Register. - 5. The Current Processor Status register is modified as shown in Figure 16-8 (the value u means unaffected). Note that setting the Freeze (FZ) bit freezes the Channel Address, Channel Data, Channel Control, Program Counter 0, Program Counter 1, Program Counter 2, and ALU Status Registers. - The address of the first instruction of the interrupt or trap handler is determined. The address is obtained by accessing a vector from instruction/data memory, using the physical address obtained from the Vector Area Base Address Register and the vector number. This is a 32-bit access. - An instruction fetch is initiated using the instruction address determined in step 6. At this point, normal instruction execution resumes. Note that the processor does not explicitly save the contents of any registers when an interrupt is taken. If register saving is required, it is the responsibility of the interrupt- or trap-handling routine. For proper operation, registers must be saved before any further interrupts or traps may be taken. The FZ bit must be reset at least two instructions before interrupts or traps are re-enabled, to allow program state to be reflected properly in processor registers if an interrupt or trap is taken. Figure 16-8 Current Processor Status After an Interrupt or Trap #### 16.4.4 Returning from an Interrupt or Trap Two instructions are used to resume the execution of an interrupted program: Interrupt Return (IRET), and Interrupt Return and Invalidate (IRETINV). These instructions are identical in the Am29200 and Am29205 microcontrollers; in other 29K Family processors, the IRETINV instruction resets all Valid bits in an instruction cache, whereas the IRET instruction does not affect the Valid bits. In some situations, the processor state must be set properly by software before the interrupt return is executed. The following is a list of operations normally performed in such cases: - 1. The Current Processor Status Register is configured as shown in Figure 16-9 (the value x is a *don't care*). Note that setting the FZ bit freezes the registers listed below so they may be set for the interrupt return. - 2. The Old Processor Status Register is set to the value of the Current Processor Status for the target routine. - 3. The Channel Address, Channel Data, and Channel Control registers are set to restart or resume uncompleted external accesses of the target routine. - 4. The Program Counter 1 and Program Counter 0 registers are set to the addresses of the first and second instructions, respectively, to be executed in the target routine. - 5. Other registers are set as required. These may include registers such as the ALU Status, Q, and so forth, depending on the particular situation. Some of these registers are unaffected by the FZ bit so they must be set in such a manner that they are not modified unintentionally before the interrupt return. Figure 16-9 Current Processor Status Before Interrupt Return Once the processor registers are configured properly, as described above, an interrupt return instruction (IRET or IRETINV) performs the remaining steps necessary to return to the target routine. The following operations are performed by the interrupt return instruction: - Any in-progress load or store operation is completed. If a load-multiple or store-multiple sequence is in progress, the interrupt return is not executed until the sequence completes. - Interrupts and traps are disabled, regardless of the settings of the DA, DI, and IM fields of the Current Processor Status, for steps 3 through 10. - 3. The contents of the Old Processor Status Register are copied into the Current Processor Status Register. This normally resets the FZ bit, allowing the Program Counter 0, 1, 2, Channel Address, Data, Control, and ALU Status registers to update normally. Since certain bits of the Current Processor Status Register always are updated by the processor, this copy operation may be irrelevant for certain bits (e.g., the Interrupt Pending bit). - 4. If the Contents Valid (CV) bit of the Channel Control Register is 1, and the Not Needed (NN) and Multiple Operation (ML) bits are both 0, an external access is started. This operation is based on the contents of the Channel Address, Channel Data, and Channel Control registers. The Current Processor Status Register conditions the access as usual. Load-multiple and store-multiple operations are not restarted at this point. - The address in Program Counter 1 is used to fetch an instruction. The Current Processor Status Register conditions the fetch. This step is treated as a branch in the processor pipeline. - 6. The instruction fetched in step 5 enters the decode stage of the pipeline. - The address in Program Counter 0 is used to fetch an instruction. The Current Processor Status Register conditions the fetch. This step is treated as a branch in the processor pipeline. - 8. The instruction fetched in step 5 enters the execute stage of the pipeline, and the instruction fetched in step 7 enters the decode stage. - 9. If the CV bit in the Channel Control Register is a 1, the NN bit is 0, and the ML bit is 1, a load-multiple or store-multiple sequence is started based on the contents of the Channel Address, Channel Data, and Channel Control registers. - 10. Interrupts and traps are enabled per the appropriate bits in the Current Processor Status Register. - 11. The processor resumes normal operation. #### 16.4.5 Lightweight Interrupt Processing The registers affected by the FZ bit of the Current Processor Status Register are those modified by almost any usual sequence of instructions. Since the FZ bit is set by an interrupt or trap, the interrupt or trap handler is able to execute while not disturbing the state of the interrupted routine, though its execution is somewhat restricted. Thus, it is not necessary in many cases for the interrupt or trap handler to save the registers affected by the FZ bit. This permits the implementation of lightweight interrupt handlers that do not have all of the overhead normally associated with interrupt handlers. The processor provides an additional benefit to lightweight interrupts if the Program Counter 0 and Program Counter 1 Registers are not modified by the interrupt or trap handler. If Program Counters 0 and 1 contain the addresses of sequential instructions when an interrupt or trap is taken, and if they are not modified before an interrupt return is executed, step 7 of the interrupt return sequence in Section 16.4.4 occurs as a sequential fetch—instead of a branch—for the interrupt return. The performance impact of a sequential fetch is normally less than that of a branch. Because the registers affected by the FZ bit are sometimes required for instruction execution, it is not possible for the lightweight interrupt or trap handler to execute all instructions, unless the required registers are first saved elsewhere (e.g., in one or more global registers). Most of the restrictions due to register dependencies are obvious (e.g., the Byte Pointer for byte extracts) and will not be discussed here. Other less obvious restrictions are listed below: - Load Multiple and Store Multiple. The Channel Address, Channel Data, and Channel Control registers are used to sequence load-multiple and store-multiple operations, so these instructions cannot be executed while the registers are frozen. However, other external accesses may occur; the Channel Address, Channel Data, and Channel Control registers are required only to restart an access after an exception, and the interrupt or trap handler is not expected to encounter any exceptions. - Loads and stores that set the Byte Pointer. If the SB bit of a load or store instruction is 1 and the FZ bit is also 1, there is no effect on the Byte Pointer. Thus, the execution of external byte and half-word accesses using this mechanism is not possible. - Extended arithmetic. The Carry bit of the ALU Status Register is not updated while the FZ bit is 1. - Divide step instructions. The Divide Flag of the ALU Status Register is not updated when the FZ bit is 1. If the interrupt or trap handler does not save the state of the interrupted routine, it cannot allow additional interrupts and traps. Also, the operation of the interrupt or trap handler cannot depend on any trapping instructions (e.g., floating-point instructions, assert instructions, illegal operation codes, arithmetic overflow, etc.), since these are disabled. There are certain cases, however, where traps are unavoidable. Special considerations for these cases are discussed in Section 16.7.6. #### 16.4.6 Simulation of Interrupts and Traps Assert instructions may be used by a Supervisor-mode program to simulate the occurrence of various interrupts and traps defined for the processor. Only an assert instruction executed in Supervisor mode can specify a vector number between 0 and 63. If this instruction causes a trap, the effect is to create an interrupt or trap similar to that associated with the specified vector number. Thus, the interrupt and trap routines defined for basic processor operation can be invoked without creating any particular hardware condition. For example, on the Am29200 microcontroller, an INTR1 interrupt may be simulated by an assert instruction that specifies a vector number of 17, without the activation of the INTR1 signal. #### 16.5 WARN TRAP (Am29200 Microcontroller) The processor recognizes a special trap, caused by the activation of the WARN input, that cannot be masked. The WARN trap is intended to be used for severe system-error or deadlock conditions. It allows the processor to be placed in a known, operable state, while preserving much of its original state for error reporting and possible recovery. Therefore, it shares some features in common with the Reset mode as well as features common to other traps described in this section. The major differences between the WARN trap and other traps are: - The processor does not wait for an in-progress external access to complete before taking the trap, since this access might not complete (for example, because WAIT is asserted). However, the information related to any outstanding access is retained by the Channel Address, Channel Data, and Channel Control registers when the trap is taken. - The vector-fetch operation is not performed when the WARN trap is taken. Instead, instruction fetching begins immediately at address 16. Note that the WARN trap may disrupt the state of the routine that is executing when it is taken, prohibiting this routine from being restarted. WARN may also reset some internal peripherals. #### 16.5.1 WARN Input (Am29200 Microcontroller) An inactive-to-active transition on the WARN input causes a WARN trap to be taken by the processor. The WARN trap cannot be disabled; the processor responds to the WARN input regardless of its internal condition unless the RESET input is also asserted. The WARN input is provided so the system can gain control of the processor in extreme situations, such as when system power is about to be removed or when a severe non-recoverable error occurs. The WARN input is edge-sensitive so an active level on the WARN input for long intervals does not cause the processor to take multiple WARN traps. However, WARN must be held active for at least four cycles in order to be properly recognized by the processor. The processor still takes the WARN trap if WARN is deasserted after four cycles. Another WARN trap occurs if WARN makes another inactive-to-active transition. The processor enters the Executing mode when the $\overline{WARN}$ input is asserted, regardless of its previous operational mode. Either seven or eight cycles after $\overline{WARN}$ is asserted (depending on internal synchronization time), the processor performs a trap-handler instruction access on the bus. This access is directed to address 16. #### 16.6 SEQUENCING OF INTERRUPTS AND TRAPS On every cycle, the processor decides either to execute instructions or to take an interrupt or trap. Since there are multiple sources of interrupts and traps, more than one interrupt or trap may be pending on a given cycle. To resolve conflicts, interrupts and traps are taken according to the priority shown in Table 16-2. In this table, interrupts and traps are listed in order of decreasing priority. This section discusses the first three columns of Table 16-2. The last two columns are discussed in Section 16.7. In Table 16-2, interrupts and traps fall into one of two categories depending on the timing of their occurrence relative to instruction execution. These categories are indicated in the third column of Table 16-2 by the labels *Inst* and *Async*. These labels have the following meaning: - Inst—Generated by the execution or attempted execution of an instruction. - Async—Generated asynchronous to and independent of the instruction being executed, although it may be a result of an instruction executed previously. **Table 16-2** Interrupt and Trap Priority Table | | | | | · | |----------------|----------------------------------------------------------------------------------------------------------------------------|--------------------------------------|----------------------------------------------|----------------------------------------| | Priority | Type of Interrupt or Trap | Inst/Async | PC1 | Channel Regs | | 1<br>(Highest) | WARN <sup>2</sup> | Async | Next | Note <sup>1</sup> | | 2 | User-Mode Data Mapping Miss<br>Supervisor-Mode Data Mapping Miss | Inst<br>Inst | Next<br>Next | All<br>All | | 3 | Unaligned Access Out-of-Range Assert Instructions Floating-Point Instructions Integer Multiply/Divide Instructions EMULATE | Inst<br>Inst<br>Inst<br>Inst<br>Inst | Next<br>Next<br>Next<br>Next<br>Next<br>Next | Ali<br>N/A<br>N/A<br>N/A<br>N/A<br>N/A | | 4 | TRAP0 <sup>2</sup> | Async | Next | Multiple | | 5 | TRAP12 | Async | Next | Multiple | | 6 | INTR0 <sup>2</sup> | Async | Next | Multiple | | 7 | INTR12 | Async | Next | Multiple | | 8 | INTR2 | Async | Next | Multiple | | 9 | INTR3<br>Internal peripheral interrupts | Async<br>Async | Next<br>Next | Multiple<br>Multiple | | 10 | Timer | Async | Next | Multiple | | 11 | Trace | Async | Next | Multiple | | 12 | User-mode Inst Mapping Miss<br>Supervisor-mode Inst Mapping Miss | Inst<br>Inst | Curr<br>Curr | N/A<br>N/A | | 13<br>(Lowest) | Illegal Opcode<br>Protection Violation | Inst<br>Inst | Curr<br>Curr | N/A<br>N/A | | | | | | | #### Notes: <sup>1.</sup> The Channel Address, Channel Data, and Channel Control registers are set for a WARN trap on the Am29200 microcontroller only if an external access is in progress when the trap is taken. <sup>2.</sup> Not supported on the Am29205 microcontroller. The principle for interrupt and trap sequencing is that the highest priority interrupt or trap is taken first. Other interrupts and traps either remain active until they can be taken or they are regenerated when they can be taken. This is accomplished depending on the type of interrupt or trap, as follows: - 1. All traps in Table 16-2 with priority 13 through 15 are regenerated by the re-execution of the causing instruction. - 2. Most of the interrupts and traps of priority 4 through 12 must be held by external hardware until they are taken. The exceptions to this are listed in item 3. - 3. The exceptions to item 2 are the Timer interrupt and the Trace trap. These are caused by bits in various registers in the processor and are held by these registers until taken or cleared. The two relevant bits are the Interrupt (IN) bit of the Timer Reload Register for Timer interrupts and the Trace Pending (TP) bit of the Current Processor Status Register for Trace traps. - 4. All traps of priority 2 and 3 in Table 16-2, except for the Unaligned Access trap, are not regenerated. These traps are mutually exclusive and are given high priority because they cannot be regenerated; they must be taken if they occur. If one of these traps occurs at the same time as a reset or WARN trap, it is not taken and its occurrence is lost. - 5. The Unaligned Access trap is regenerated internally when an external access is restarted by the Channel Address, Channel Data, and Channel Control registers. Note this trap is not necessarily exclusive to the traps discussed in item 4 above. The Channel Address, Channel Data, and Channel Control registers are set for a WARN trap only if an external access is in progress when the trap is taken. #### 16.7 EXCEPTION REPORTING AND RESTARTING When an instruction encounters an exceptional condition, the Program Counter 0, Program Counter 1, and Program Counter 2 registers report the relevant instruction address(es) and allow the instruction sequence to be restarted once the exceptional condition has been remedied (if possible). Similarly, when an external access encounters an exceptional condition, the Channel Address, Channel Data, and Channel Control registers report information on the access or transfer and allow it to be restarted. This section describes the interpretation and use of these registers. The *PC1* column in Table 16-2 describes the value held in the Program Counter 1 Register (PC1) when the interrupt or trap is taken. For traps in the *Inst* category, PC1 contains either the address of the instruction causing the trap, indicated by *Curr*, or the address of the instruction following the instruction causing the trap, indicated by *Next*. For interrupts and traps in the *Async* category, PC1 contains the address of the first instruction not executed due to the taking of the interrupt or trap. This is the next instruction to be executed upon interrupt return, as indicated by *Next* in the PC1 column. ### 16.7.1 Instruction Exceptions For traps caused by the execution of an instruction (e.g., the Out-of-Range trap), the Program Counter 2 Register contains the address of the instruction causing the trap. In all of these cases, PC1 is in the *Next* category. The traps associated with instruction fetches (i.e., those of priority 13) occur only if the processor attempts the execution of the associated instruction. An exception may be detected during an instruction prefetch, but the associated trap does not occur if the processor branches before it attempts to execute the invalid instruction. This prevents spurious instruction exceptions. #### 16.7.2 Restarting Mapped DRAM Accesses DRAM mapping is provided to support application needs such as on-the-fly data compression and decompression. In such applications, programs operate on large, compressed data structures by decompressing data into a smaller region of memory, operating on the data, and then compressing back into the large compressed structure. The ability to store the data in a compressed format reduces system memory requirements, while the ability to operate on the data in a decompressed format simplifies the application software. For generality, mapped DRAM accesses allow the mapping configuration to be changed on demand. In other words, the DRAM mapping is performed by a system routine that changes the mapping as needed by the application program. This allows applications written with no knowledge of DRAM mapping to operate in a system that uses DRAM mapping. Since the DRAM mapping trap is part of normal system operation and does not represent an error, the access that causes the trap must be restarted—once the trapping condition is remedied—in a manner that cannot be detected by the program causing the trap. The Am29200 and Am29205 microcontrollers overlap external accesses with the execution of instructions. Thus, traps caused by accesses are imprecise. The address of the instruction that initiated the access cannot be determined by the trap handler. Since the address of the initiating instruction is unknown, the access cannot be restarted by re-executing this instruction. Even if the address could be determined, the instruction might not be restartable since an instruction executed before the trap occurred, but after the access began, may have altered the conditions of the access, such as by altering the address source register. In order to provide for the restarting of loads and stores that cause exceptions, the processor saves all information required to restart these accesses in the Channel Address, Channel Data, and Channel Control registers. The Contents Valid (CV) and Not Needed (NN) bits in the Channel Control Register indicate that the information contained in these registers represents an access that must be restarted. The CV bit indicates the access did not complete, and the NN bit indicates whether or not the data from the access is required by the processor. Note that since instruction execution is overlapped with external accesses, an instruction that executes after a load may alter the destination register for the load. If a trap occurs in this situation, the access information in the Channel Address, Channel Data, and Channel Control registers is correct, but the load cannot be restarted because it will destroy the new value in the destination register. The NN bit provides correct operation in this case. When an interrupt or trap is taken, the handling routine has access to the Channel Address, Channel Data, and Channel Control registers. The contents of these registers may contain information relevant to an incomplete access and can be preserved for restarting this access. Since these registers are frozen (due to the FZ bit of the Current Processor Status) they are not available to monitor any external accesses in the interrupt or trap handler until their contents are saved and the FZ bit is reset. Upon an interrupt return (IRET or IRETINV), the processor restarts an access using the Channel Address, Channel Data, and Channel Control registers. The access is initiated if the CV bit of the Channel Control Register is 1 and the NN bit is 0. The restart cannot be detected in the logical operation of the restarted routine, although the timing of execution is altered. The mechanism used to restart trapping accesses has the additional benefit of allowing a fast interrupt-response time when the processor is performing a load-multiple or store-multiple operation. An interrupted load-multiple or store-multiple is restarted as if it had faulted. In this case, the operation resumes from the point of interruption, not from the beginning of the sequence. #### 16.7.2.1 Channel Address Register (CHA, Register 4) This protected special-purpose register (Figure 16-10) is used to report exceptions during external accesses. It is also used to restart interrupted load-multiple and store-multiple operations and to restart other external accesses when possible (e.g., after DRAM mapping misses are serviced). The Channel Address Register is updated on the execution of every load or store instruction and on every load or store in a load-multiple or store-multiple sequence, except when the Freeze (FZ) bit in the Current Processor Status Register is 1. #### Figure 16-10 Channel Address Register Bits 31–0: Channel Address (CHA)—This field contains the address of the current access (if the FZ bit of the Current Processor Status Register is 0). #### 16.7.2.2 Channel Data Register (CHD, Register 5) This protected special-purpose register (Figure 16-11) is used to report exceptions during external accesses. It is also used to restart the first store of an interrupted store-multiple operation and to restart other external accesses when possible (e.g., after DRAM mapping misses are serviced). The Channel Data Register is updated on the execution of every load or store instruction and on every load or store in a load-multiple or store-multiple sequence, except when the Freeze (FZ) bit in the Current Processor Status Register is 1. When the Channel Data Register is updated for a load operation, the resulting value is unpredictable. #### Figure 16-11 Channel Data Register Bits 31–0: Channel Data (CHD)—This field contains the data (if any) associated with the current access (if the FZ bit of the Current Processor Status Register is 0). If the current access is not a store, the value of this field is irrelevant. #### 16.7.2.3 Channel Control Register (CHC, Register 6) This protected special-purpose register (Figure 16-12) is used to report exceptions during external accesses. It is also used to restart interrupted load-multiple and store-multiple operations and to restart other external accesses when possible (e.g., after DRAM mapping misses are serviced). The Channel Control Register is updated on the execution of every load or store instruction and on every load or store in a load-multiple or store-multiple sequence, except when the Freeze (FZ) bit in the Current Processor Status Register is 1. #### Figure 16-12 Channel Control Register Bits 31–24:—These bits are a direct copy of bits 23–16 from the load or store instruction that started the current access (see Section 3.3). Bits 23–16: Load/Store Count Remaining (CR)—The CR field indicates the remaining number of transfers for a load-multiple or store-multiple operation that encountered an exception or was interrupted before completion. This number is zero-based; for example, a value of 28 in this field indicates that 29 transfers remain to be completed. Bit 15: Load/Store (LS)—The LS bit is 0 if the access is a store operation and is 1 if the access is a load operation. Bit 14: Multiple Operation (ML)—The ML bit is 1 if the current access is a partially-complete load-multiple or store-multiple operation; otherwise it is 0. Bit 13: Set (ST)—The ST bit is 1 if the current access is for a Load and Set instruction; otherwise it is 0. #### Bit 12-10: Reserved Bits 9–2: Target Register (TR)—The TR field indicates the absolute register number of the data operand for the current access (either a load target or store data source). Since the register number in this field is absolute, it reflects the Stack-Pointer addition when the indicated register is a local register. **Bit 1: Not Needed (NN)**—The NN bit indicates that even though the Channel Address, Channel Data, and Channel Control registers contain a valid representation of an incomplete load operation, the data requested is not needed. This situation arises when a load instruction is overlapped with an instruction that writes the load target register. Bit 0: Contents Valid (CV)—The CV bit indicates the contents of the Channel Address, Channel Data, and Channel Control registers are valid. #### 16.7.3 Integer Exceptions Some integer add and subtract instructions—ADDS, ADDU, ADDCS, ADDCU, SUBS, SUBU, SUBCS, SUBCU, SUBRS, SUBRU, SUBRCS, and SUBRCU—cause an Out-of-Range trap upon overflow or underflow of a 32-bit signed or unsigned result, depending on the instruction. Two integer multiply instructions—MULTIPLY and MULTIPLU—cause an Out-of-Range trap upon overflow of a 32-bit signed or unsigned result, respectively, if the MO bit of the Integer Environment Register is 0. If the MO bit is 1, these multiply instructions cannot # AMD cause an Out-of-Range trap. Since the processor does not contain hardware to directly support these instructions, the Out-of-Range trap must be generated by the software that implements the virtual arithmetic interface (see Section 2.8). Two integer divide instructions—DIVIDE and DIVIDU—take the Out-of-Range trap upon overflow of a 32-bit signed or unsigned result, respectively, if the DO bit of the Integer Environment Register is 0. If the DO bit is 1, the divide instructions cannot cause an Out-of-Range trap unless the divisor is zero. If the divisor is zero, an Out-of-Range trap always occurs, regardless of the DO bit. For the MULTIPLY, MULTIPLU, DIVIDE, and DIVIDU instructions, the destination register (or registers) is unchanged if an Out-of-Range trap is taken. #### 16.7.4 Floating-Point Exceptions A Floating-Point Exception trap occurs when an exception is detected during a floating-point operation and the exception is not masked by the corresponding bit of the Floating-Point Mask Register. In this context, a floating-point operation is defined as any operation that accepts a floating-point number as a source operand, that produces a floating-point result, or both. Thus, for example, the CONVERT instruction may create an exception while attempting to convert a floating-point value to an integer value or vice versa. In addition to the operations described in Section 16.4.3, the following operations are performed when a Floating-Point Exception trap is taken: - The status of the trapping operation is written into the trap status bits of the Floating-Point Status Register. The written status bits do not depend on the values of the corresponding mask bits in the Floating-Point Environment Register. - 2. The destination register or registers are left unchanged. #### 16.7.5 Correcting Out-of-Range Results Some Arithmetic instructions cause an Out-of-Range trap if the arithmetic operation causes an overflow or underflow. When an Out-of-Range trap occurs, the result of the operation, though incorrect, is written into the destination register. Furthermore, the Program Counter 2 Register contains the address of the trapping instruction, and the ALU Status Register contains an indication of the cause of the trap. It is possible, if required, for the trap handler to use this information to form the correct result. The ALU Status indicates the cause of the Out-of-Range trap based on the operation performed, as follows: - 1. Signed overflow. If the Out-of-Range trap is caused by signed, two's-complement overflow (this can occur for both signed adds and subtracts), the V bit is 1. - 2. Unsigned overflow. If the Out-of-Range trap is caused by unsigned overflow (this can occur only for unsigned adds), the C bit is 1. - 3. Unsigned underflow. If the Out-of-Range trap is caused by unsigned underflow (this can occur only for unsigned subtracts), the C bit is 0. The multiply instructions, MULTIPLY and MULTIPLU, can cause an Out-of-Range trap if the MO bit of the Integer Environment Register is 0 and the operation overflows. However, these instructions do not set the ALU Status Register. This exception is detected by reading the trapping instruction whose address is in the PC2 Register. #### 16.7.6 Exceptions During Interrupt and Trap Handling In most cases, interrupt and trap handling routines are executed with the DA bit in the Current Processor Status having a value of 1. It is normally assumed these routines do not create many of the exceptions possible in most other processor routines. If these assumptions are not valid for a particular interrupt or trap handler, the handler must save the state of the processor and reset the FZ bit of the Current Processor Status so the handler itself may be restarted properly. This must be accomplished before any interrupts or traps can be taken. In this case, the state (or the state of some other process) must be restored before an interrupt return is executed. #### 16.8 TIMER FACILITY The processor has a built-in timer facility that can be configured to cause periodic interrupts. The timer facility consists of two special-purpose registers—the timer counter and the timer reload registers—accessible only to Supervisor-mode programs. Also, the Current Processor Status Register contains a control bit as part of the timer facility. These registers implement timing functions independent of program execution. #### 16.8.1 Timer Facility Operation The Timer Counter Register has a 24-bit Timer Count Value (TCV) field that decrements by one on every processor cycle. If the TCV field decrements to zero, it is written with the Timer Reload Value (TRV) field of the Timer Reload Register on the next cycle; the Interrupt (IN) bit of the Timer Reload register is set at the same time. Reloading the TCV field by the TRV field maintains the accuracy of the timer facility. The Timer Reload Register contains the 24-bit TRV field and the control bits Overflow (OV), Interrupt (IN), and Interrupt Enable (IE). If the IN bit is 1 and the IE bit also 1, a Timer interrupt occurs. If the IN bit is 1 when the TCV field decrements to zero, the OV bit is also set. The OV bit indicates a Timer interrupt may have occurred before a previous interrupt was serviced. The Current Processor Status Register contains the Timer Disable (TD) control bit. If the TD bit is 1, Timer interrupts are disabled. The TD bit and the IE bit have equivalent functions; the TD bit is provided so the timer may be disabled without having to perform a non-atomic read-modify-write operation on the Timer Reload Register. There is a possibility the TCV might decrement to zero and set the IN bit as the modified value is written back to the Timer Reload Register, causing a Timer interrupt to be missed. #### 16.8.2 Timer Facility Initialization To initialize the timer facility, the following steps should be taken in the specified order (it is assumed that Timer interrupts are disabled by the DA bit of the Current Processor Status Register or the TD bit of the Current Processor Status Register during the following steps): - Set the TCV field with the desired interval count for the first timing interval. This interval must be sufficiently large to allow the execution of the next step before the TCV field decrements to zero (this normally is the case). - Set the TRV field with the desired interval count for the second timing interval. The OV and IN bits are reset and the IE bit is set as desired. The second timing interval may be equivalent to the first timing interval. #### 16.8.3 Handling Timer Interrupts The following is a suggested list of actions necessary to handle a Timer interrupt: - 1. Read the Timer Reload Register into a general-purpose register. - 2. Reset the IN bit in the general-purpose register. - Set the TRV field in the general-purpose register to the desired value for the next timing interval. Note that at this time the timer counter is timing the current interval. This step may be omitted if all intervals are equivalent. - Write the contents of the general-purpose register back into the Timer Reload Register. - 5. Test the general-purpose-register copy of the OV bit and, if it is set, report the error as appropriate. - 6. Perform any system operations required for the Timer interrupt. - 7. Execute an interrupt return. #### 16.8.4 Timer Facility Uses Since the timer facility has a resolution of a single processor cycle, it may be used to perform precise timing of system events. For example, it may be used to determine an exact measurement of the number of cycles between two events in the system or to perform precise time-critical control functions. The Timer interrupt is enabled and disabled separately from other processor interrupts so its priority can be specified. The timer facility can be shared among multiple processes. This sharing is accomplished by the implementation of a queue for timer events, which are sorted in order of increasing event time. On each occurrence of a Timer interrupt, the TRV field is set for the interval between the next two events in the queue, while the Timer Counter Register is counting the current interval (because of a previous setting of the TRV field). The event at the beginning of the queue identifies other system actions to be taken for the Timer interrupt. This event is removed from the queue after the appropriate actions are taken. # 16.8.5 Timer Counter Register (TMC, Register 8) This protected special-purpose register (Figure 16-13) contains the counter for the timer facility. #### Figure 16-13 Timer Counter Register #### Bits 31-24: Reserved Bits 23–0: Timer Count Value (TCV)—The 24-bit TCV field decrements by one on each processor clock. When the TCV field decrements to zero, it is reloaded with the content of the Timer Reload Value field in the Timer Reload Register. At this time, the Interrupt bit in the Timer Reload Register is set. The TCV field is zero-based with respect to the Timer interrupt interval; for example, a value of 28 in the TCV field causes the IN bit to be set in the 29th subsequent processor cycle. The TCV field is zero for a complete cycle before the IN bit is set. #### 16.8.6 Timer Reload Register (TMR, Register 9) This protected special-purpose register (Figure 16-14) maintains synchronization of the Timer Counter Register, enables Timer interrupts, and maintains timer facility status information. #### Figure 16-14 Timer Reload Register #### Bits 31-27: Reserved **Bit 26: Overflow (OV)**—The OV bit indicates a Timer interrupt occurred before a previous Timer interrupt was serviced. It is set if the Interrupt (IN) bit is1 when the Timer Count Value (TCV) field of the Timer Counter Register decrements to zero. In this case, a Timer interrupt caused by the IN bit has not been serviced when another interrupt is created. **Bit 25: Interrupt (IN)**—The IN bit is set whenever the TCV field decrements to zero. If this bit is 1 and the IE bit is also 1, a Timer interrupt occurs. The IN bit is set when the TCV field decrements to zero, regardless of the value of the IE bit. The IN bit is reset by software that handles the Timer interrupt. **Bit 24: Interrupt Enable (IE)**—When the IE bit is 1, the Timer interrupt is enabled and the Timer interrupt occurs whenever the IN bit is 1. When this bit is 0, the Timer interrupt is disabled. The Timer interrupt may be disabled by the DA bit of the Current Processor Status Register regardless of the value of the IE bit. The Timer interrupt can also be disabled by the TD bit of the CPS Register, regardless of the value of IE and/or DA. Bits 23—0: Timer Reload Value (TRV)—The value of this field is written into the Timer Count Value (TCV) field of the Timer Counter Register when the TCV field decrements to zero. #### 16.9 INTERNAL INTERRUPT CONTROLLER The various peripherals and controllers on the Am29200 and Am29205 microcontrollers can cause interrupts having the same effect on the processor as asserting the processor's INTR3 input. The interrupt controller provides a central location for generating interrupts, indicating which interrupts are active and permitting software to reset the interrupts independent of servicing the interrupting peripheral. #### 16.9.1 Interrupt Control Register (ICT, Address 80000028) Bits of the Interrupt Control Register (Figure 16-15) are set at the leading edge of an interrupt condition, except for the bits related to the I/O port (in the IOPI field), since I/O port signals are independently configurable to generate edge-triggered interrupts. For example, the DMA0I bit is set when the CTI bit transitions from 0 to 1 in the DMA0 Control Register. When a bit in this register is 1, it causes an internal assertion of the processor's INTR3 input (there is no external indication of this on INTR3). Software can inspect this register to determine the source of the interrupt and can reset bits in this register to clear the interrupt. Bits in the Interrupt Control Register are reset-only. Writing a 1 into a bit position causes the bit to be reset unless an interrupting condition becomes active at the same time, in which case the bit remains set. Writing a bit with 0 does not affect the bit, and the bit may be set by an interrupting condition at the same time the bit is written with 0. #### Figure 16-15 Interrupt Control Register Bits 31-28: Reserved Bit 27: Video Interrupt (VDI)—A 1 in this bit indicates the video interface has generated an interrupt request. #### Bits 26-24: Reserved Bits 23–16: I/O Port Interrupt (IOPI)—A 1 in this field indicates the respective PIO signal has generated an interrupt request. A 1 in the most significant bit of the IOPI field indicates PIO15 has caused an interrupt, the next bit indicates PIO14 has caused an interrupt, and so on. #### Bit 15: Reserved **Bit 14: DMA Channel 0 Interrupt (DMA0I)**—A 1 in this bit indicates DMA Channel 0 has generated an interrupt request. Bit 13: DMA Channel 1 Interrupt (DMA1I)—A 1 in this bit indicates DMA Channel 1 has generated an interrupt request. #### Bit 12: Reserved Bit 11: Parallel Port Interrupt (PPI)—A 1 in this bit indicates the parallel port has generated an interrupt request. #### Bits 10-8: Reserved Bit 7: Serial Port Receive Status Interrupt (RXSI)—A 1 in this bit indicates the serial port has generated an interrupt request because of the status of the receive logic. Bit 6: Serial Port Receive Data Interrupt (RXDI)—A 1 in this bit indicates the serial port has generated an interrupt request because receive data is ready. Bit 5: Serial Port Transmit Data Interrupt (TXDI)—A 1 in this bit indicates the serial port has generated an interrupt request because the Transmit Holding Register is empty. #### Bits 4-0: Reserved #### 16.9.2 Interrupt Controller Initialization Processor interrupts are disabled by a processor reset, but the Interrupt Control Register is not affected by a reset. To prevent spurious interrupts, software should reset all bits of the Interrupt Control Register to 0 before processor interrupts are enabled. #### 16.9.3 Servicing Internal Interrupts The Interrupt Control Register allows software to determine the source of an internal interrupt. Software can prioritize these interrupts using the processor's Count Leading Zeros instruction. Software clears an interrupt by writing a 1 into the bit that is causing the interrupt (normally, the leading 1-bit in the Interrupt Control Register). For level-sensitive I/O port interrupts, the interrupting condition must be cleared and the corresponding PIO signal be in an inactive state before the Interrupt Control Register bit is cleared, otherwise another interrupt will be generated. For other types of interrupts, the condition causing the interrupt can be cleared in the interrupting peripheral independent of resetting the bit in the Interrupt Control Register, because the leading edge of the condition must be detected again before another interrupt can occur. However, the interrupt should not be cleared in a way that might lose the occurrence of a newly generated interrupt. Because the Interrupt Control Register is reset-only and because resetting a bit takes lower precedence than setting a bit, bits can be reset without interfering with other interrupts or with the detection of a new interrupt of the type being cleared. # **DEBUGGING AND TESTING** This chapter details the features of the Am29200 and Am29205 microcontrollers that support debugging and testing. The chapter first describes the trace facility and instruction breakpoints that aid in software debugging. Next, the test/development interface, the test access port, and the boundary-scan architecture are discussed. A description of how to use an Am29200 microcontroller to emulate an Am29205 microcontroller concludes the chapter. #### 17.1 OVERVIEW The Am29200 microcontroller provides debugging and testing features at both the hardware and software levels. Instruction tracing and instruction breakpoints are supported. The microcontroller's test development interface is composed of pins that indicate the state of the processor and control its operation. A JTAG-compliant test access port facilitates system testing in a production environment. The Am29205 microcontroller supports software debugging only. Hardware testing and debugging can be accomplished by using an Am29200 microcontroller to emulate an Am29205 microcontroller. #### 17.2 TRACE FACILITY Software debug for the Am29200 and Am29205 microcontrollers is supported by the trace facility. The trace facility guarantees exactly one trap after the execution of any instruction in a program being tested. This allows a debug routine to follow the execution of instructions and to determine the state of the processor and system at the end of each instruction. Tracing is controlled by the Trace Enable (TE) and Trace Pending (TP) bits of the Current Processor Status Register. The value of the TE bit is always copied into the TP bit when an instruction enters the write-back stage of the processor pipeline. A Trace trap occurs whenever the TP bit is 1. As with most traps, the Trace trap can be disabled only by the DA bit of the Current Processor Status Register. In order to trace the execution of a program, the debug routine performs an interrupt return to cause the program to begin or resume execution. However, before the interrupt return is executed, the TE and TP bits of the Old Processor Status Register are set with the values 1 and 0, respectively. The interrupt return causes these bits to be copied into the TE and TP bits of the Current Processor Status Register. When the target instruction of the interrupt return (whose address is contained in the Program Counter 1 Register when the interrupt return is executed) enters the write-back stage, the processor copies the value of the TE bit into the TP bit. Since the TP bit is a 1, a Trace trap occurs. This trap prevents any further instruction execution in the target routine until the interrupt is taken and the routine is resumed with an interrupt return. When the Trace trap is taken, the TE and TP bits are both reset automatically, preventing any further Trace traps. Since the trace facility is managed by the Old and Current Processor Status registers, it operates properly in the event the processor takes an interrupt or trap—unrelated to the trace facility—before the above trace sequence completes. When the unrelated interrupt or trap is taken, the state of the trace facility (i.e., the values of the TE and TP bits) is copied into the Old Processor Status Register from the Current Processor Status Register. The trace facility then resumes operation when the interrupted routine is restarted by an interrupt return. It is possible to cause a Trace trap by directly setting the TP and/or TE bits in the Current Processor Status Register. This may be accomplished only by a Supervisor-mode program. #### 17.3 INSTRUCTION BREAKPOINTS The HALT instruction can be used as an instruction breakpoint by a hardware-development system. However, the HALT instruction normally is a privileged instruction, causing a Protection Violation trap upon attempted execution by a User-mode program. The hardware-development system can disable this Protection Violation trap as outlined in Section 17.7.5. The assert class of instructions and the Illegal Opcode trap can be used by software to implement instruction breakpoints. An instruction breakpoint is set by replacing an instruction with the assert instruction or an illegal opcode in the program under test. When the breakpoint instruction is encountered, the instruction breakpoint causes a trap. The illegal opcode is preferred since the Program Counter 1 (PC1) points to the illegal opcode when the trap is taken, whereas PC1 points to the instruction following the breakpoint if an assert instruction is used. #### 17.4 PROCESSOR STATUS OUTPUTS (Am29200 MICROCONTROLLER) The STAT2-STAT0 outputs on the Am29200 microcontroller indicate certain information about processor modes along with information about processor operation. STAT2-STAT0 may be used to provide feedback of processor behavior during normal processor operation and when the processor is under the control of a hardware-development system. The encoding of STAT2-STAT0 is as follows: | | STAT2 | STAT1 | STAT0 | Condition | | |---|-------|-------|-------|-------------------------------------------------|--| | • | 0 | 0 | 0 | Halt or Step modes | | | | 0 | 0 | 1 | Interrupt/trap vector fetch (vector valid) | | | | 0 | 1 | 0 | Load Test Instruction mode, Halt/Freeze | | | | 0 | 1 | 1 | Branch target fetch (instruction valid) | | | | 1 | 0 | 0 | External data access (data valid) | | | | 1 | 0 | 1 | External instruction access (instruction valid) | | | | 1 | 1 | 0 | Internal peripheral access (data valid) | | | | 1 | 1 | 1 | Idle or data/instruction not valid | | The STAT2–STAT0 outputs are a delayed indication of a mode or condition, so a mode or condition on a given cycle is reflected on STAT2–STAT0 on the second following cycle. For example, STAT2–STAT0=100 indicates an external data access was valid on ID31–ID0 at the end of the second previous cycle. The R/W output indicates the direction of an access, adding information about the access indicated on STAT2–STAT0 (the R/W signal appears with the access and is not delayed as are STAT2–STAT0). If an access is extended by WAIT, the appropriate status is shown for every additional cycle until the access does complete. The address always appears on A23–A0, whether the access is a read or a write and whether the access is external or internal (that is, to an internal peripheral). The data appears on ID31-ID0, except on a read of an internal peripheral. # 17.5 CONTROL FIELD IN SCAN PATH (Am29200 MICROCONTROLLER) The Am29200 microcontroller incorporates a boundary-scan interface that is compatible with the IEEE 1149.1 JTAG specification. This interface permits access to testing and debug features of the processor core not visible on the external interface (see Section 17.6.2). A two-bit CNTL field appears on the Boundary Scan Register. The CNTL field corresponds to the CNTL1–CNTL0 pins that appear on other 29K family processors. This field can be used to halt, step, and start the processor, as well as force the processor to execute instructions for the purpose of testing and debugging. The CNTL field affects processor operations as follows: | CNTL Value | Mode | |------------|-----------------------| | 00 | Load Test Instruction | | 01 | Step | | 10 | Halt | | 11 | Normal | Changes to the CNTL field are restricted so that only one bit of the CNTL field may change at any given time. The allowed transitions are shown in Figure 17-1. If these restrictions are violated, processor operation is unpredictable and a processor reset is required to resume predictable operation. Figure 17-1 Valid Transitions for CNTL Field Because of the restrictions just described, it is not possible to transition directly between all possible modes controlled by the CNTL field. For example, the processor cannot go from the Load Test Instruction mode to Normal operation without first entering the Halt or Step modes. #### 17.6 TEST ACCESS PORT (Am29200 MICROCONTROLLER) The Am29200 microcontroller implements the Standard Test Access Port (TAP) and Boundary-Scan Architecture as specified by the IEEE Specification 1149.1–1990 (JTAG), with the exception that the INCLK pin is not part of the boundary-scan register. The IEEE 1149.1–1990 Specification includes many details omitted from the discussion in this section and is included by reference. The following description discusses Am29200 microcontroller-specific considerations. #### 17.6.1 Boundary-Scan Cells The test access port can access, affect, and sample the processor inputs and outputs because a Boundary Scan Register (BSR) and Parallel Data Register (PDR) are incorporated into the design of the input and output cells. The Boundary Scan Register allows serial data to be loaded into or read out of the processor input/output boundary. The Parallel Data Register holds data stable at inputs and outputs during scanning, so system signals are not adversely affected during scanning. An input or output cell incorporating a BSR and PDR register bit is referred to as a boundary-scan cell. This section describes the implementation of the Am29200 microcontroller boundary-scan cells. Figure 17-2 shows the design of an input boundary-scan cell, and Figure 17-3 shows the design of an output boundary-scan cell. Bidirectional signals use both of these designs in the same cell. Multiplexor selects, when active, select the lower multiplexor input. Figure 17-2 Input Boundary-Scan Cell Note: For the CNTL field, the boundary-scan value is also selected by the ICTEST1 instruction. Figure 17-3 Output Boundary-Scan Cell The shift and update clocks, when used to sample or drive processor and system signals, are synchronized to the processor internal clocks so all signals (except the TAP signals) are sampled or driven synchronously to system clocks. However, the shift and update clocks still satisfy the JTAG constraints that inputs are sampled after the rising edge of TCK, outputs change after the falling edge of TCK, and TCK is the only control needed to affect sampling and driving. The IEEE 1149.1–1990 Specification requires that it be possible to force the processor three-state outputs to be enabled. This is accomplished by cells that have no associated pin. The outputs of these cells force groups of output drivers to be enabled. Some outputs can be disabled by these cells even though the outputs cannot be disabled during normal operation (for example, the A23–A0 outputs can be disabled). The boundary-scan cells for the CNTL1–CNTL0 field and STAT2–STAT0 outputs are part of the BSR and are accessible by scanning the BSR. However, they can also be scanned individually using the ICTEST1 instruction (see Section 17.6.2). If the ICTEST1 instruction is active, no other boundary-scan cell is scanned. However, the contents of the other scan cells are undefined after this operation. The INCLK input is not a boundary-scan cell. The clocks to the processor must continue to operate even if the test access port is active. However, a fault on this input is readily visible in the operation of the test access port. The MEMCLK pin has an output boundary-scan cell. The EXTEST and INTEST instructions hold the MEMCLK signal at a fixed logic level for the duration of the instruction. #### 17.6.2 Instruction Register and Implemented Instructions The Instruction Register (IREG) of the test access port is a three-bit register. The least significant bit (IREG0) is the bit nearest the TDO output. Instructions are encoded as follows: | IREG2 | IREG1 | IREG0 | Instruction | |---------------------------------|----------------------------|----------------------------|------------------------------------------------------------------------------------------------------------| | 0<br>0<br>0<br>0<br>1<br>1<br>1 | 0<br>0<br>1<br>1<br>0<br>0 | 0<br>1<br>0<br>1<br>0<br>1 | EXTEST Preloaded value (acts like BYPASS) ICTEST2 Reserved (acts like BYPASS) INTEST SAMPLE ICTEST1 BYPASS | The EXTEST, BYPASS, INTEST, and SAMPLE instructions are specified by the 1149.1–1990 Specification. Reserved instructions behave as BYPASS instructions to conform to the specification. ICTEST1 and ICTEST2 are AMD public instructions. Most of these instructions are described in detail in the IEEE 1149.1–1990 Specification. Below is a brief description of the special considerations in the Am29200 microcontroller implementation. #### 17.6.2.1 EXTEST The EXTEST instruction is provided for external continuity and logic tests. It allows the test access port to drive outputs and sample inputs. EXTEST selects the Boundary Scan Register (BSR) for scanning. During execution - 1. Processor outputs are driven from the Parallel Data Register (PDR). - 2. Processor internal output signals are sampled into the BSR. This is default behavior. - 3. Processor inputs are sampled into the BSR. - Processor internal input signals are driven from the PDR. This prevents internal logic from seeing invalid combinations of input signals possibly received from other chips during the test. #### 17.6.2.2 INTEST The INTEST instruction is provided to test the processor's internal logic. Its primary value is to allow a hardware-development system to drive the processor's test interface without a direct electrical connection to all pins of the package. INTEST selects the BSR for scanning. During execution - Processor outputs are driven from the PDR. This prevents external logic from seeing invalid combinations of output signals. - 2. Processor internal output signals are sampled into the BSR. - 3. Processor inputs are sampled into the BSR. This is default behavior. - 4. Processor internal input signals are driven from the PDR. The INTEST instruction allows the hardware-development system to alter and inspect internal registers, using processor load and store instructions, without having the external system see any bus activity. #### 17.6.2.3 SAMPLE The SAMPLE instruction is provided to inspect the processor's external signals without interfering with system operations. SAMPLE selects the BSR for scanning. During execution - 1. Processor outputs are driven by the processor. - 2. Processor internal output signals are sampled into the BSR. - 3. Processor inputs are sampled into the BSR. - 4. Processor internal input signals are driven from the processor inputs. #### 17.6.2.4 ICTEST1 The ICTEST1 instruction is defined for AMD processors using the extension mechanisms permitted by the IEEE 1149.1–1990 Specification. It is provided to drive the CNTL field and sample the STAT2–STAT0 outputs while leaving other inputs and outputs in their normal system connection. This allows a hardware-development system to control the processor and system using the test access port. ICTEST1 selects a subset of the BSR for scanning. During execution - 1. Processor outputs are driven by the processor. - 2. Processor internal output signals are sampled into the BSR. This is default behavior for most signals, but allows the sampling of STAT2–STAT0. - 3. Processor input signals are sampled into the BSR. This is default behavior. - 4. The processor CNTL field is driven by the PDR. Processor internal inputs are driven from the processor inputs. #### 17.6.2.5 ICTEST2 The ICTEST2 instruction is defined for AMD processors using the extension mechanisms permitted by IEEE 1149.1–1990. ICTEST2 is similar to EXTEST with the exception that the scan path for ICTEST2 excludes most of the processor outputs so the system is not disrupted (for example, by interfering with refresh). This allows a hardware-development system to access and modify processor internal state without disrupting the system. - Processor ID31-ID0 and STAT2-STAT0 outputs are driven from the PDR. The output enable for the ID bus is controlled by the PDR. Other processor outputs are controlled by the processor. - Processor internal output signals for ID31–ID0 and STAT2–STAT0 are sampled into the BSR. This allows a hardware-development system to sample the processor's status and data driven by the processor. - Processor internal input signals for ID31–ID0 are driven from the PDR. This allows a hardware-development system to provide data to the processor, independent of system controls. #### 17.6.2.6 BYPASS The BYPASS instruction is provided to bypass the BSR and shorten access times to other devices at the board level. BYPASS selects the Bypass Register for scanning. The processor is not otherwise affected. #### 17.6.3 Order of Scan Cells in Boundary-Scan Path This section documents the scan paths and the order of scan cells in the paths. The cells are listed in order from TDI to TDO. In the Am29200 microcontroller, there are five scan paths from TDI to TDO: 1) the instruction path, 2) the bypass path, 3) the main data path, 4) the ICTEST1 path, and 5) the ICTEST2 path. #### 17.6.3.1 Instruction Path This three-cell path outlined in Table 17-1 is used to scan into the Instruction Register. When the instruction path is selected the captured data is always IREG2–IREG0 = 001 and the instruction is set by scanning. The preloaded pattern 001 is used to test for faults in the boundary-scan connections at the board level. The instructions are specified in Section 17.6.2. #### Table 17-1 Instruction Scan Path | Bit | Cell Name | | |-------------|-------------------------|--| | 1<br>2<br>3 | IREG2<br>IREG1<br>IREG0 | | #### **17.6.3.2** Bypass Path This is a one-cell path that is used to bypass the processor and shorten access to other devices at the board level. When the bypass path is selected, the captured data is always 0 and the scan-in data has no effect on the processor. #### 17.6.3.3 Main Data Path This is a 188-cell path used to access the processor pins. This path is divided into five sets of cells. Where applicable, each set has a cell that enables the outputs of the set to be driven on the processor's pins. These cells are not connected to a processor pin. For convenience, the drive enable cells are shown in Table 17-2 in boldface. Some of these cells affect outputs not normally enabled and disabled during normal system operation. The sets of cells are divided logically as follows: 1) clocks, requests, and reset, 2) miscellaneous peripheral control signals, 3) memory and peripheral controls, 4) instruction/data bus. Table 17-2 Main Data Scan Path | Bit | Cell Name | Comments | | |------------------------------------------------------------------------------|--------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------|--| | 1<br>2<br>3<br>4<br>5<br>6<br>7<br>8<br>9<br>10<br>11<br>12<br>13<br>14 | MEMCLK RESET LSYNC VCLK WARN INTR3 INTR2 INTR1 INTR0 TRAP1 TRAP0 TDMA DREQ0 DREQ1 | The MEMCLK scan cell is an output scan cell it captures processor internal MEMCLK and substitutes the scanned value for the output. | | | 15 | GREQ | | | | 16<br>17<br>18<br>19<br>20<br>21<br>22<br>23<br>24<br>25 | TOPDRV PSYNCI PSYNCO VDATI VDATO STAT0 STAT1 STAT2 PIOI0 PIOO0 | Enables the drivers for PSYNC through PWE PSYNC input PSYNC output VDAT input VDAT output PIO0 input PIO0 output | | | 26<br>27 | PIOI1<br>PIOO1 | PIO1 input<br>PIO1 output | | | 54<br>55<br>56<br>57<br>58<br>59<br>60<br>61<br>62<br>63 | PIOI15 PIOO15 PBUSY PACK POE PWE PSTROBE PAUTOFD WAIT BOOTW | PIO15 input<br>PIO15 output | | | 64<br>65<br>66 | ABIDRV<br>A0<br>A1 | Enables the driving of the A23-A0 outputs | | | • | | | | | 88 | A23 | | | | 89<br>90<br>91<br>92<br>93<br>94<br>95<br>96<br>97<br>98<br>99<br>100<br>101 | BOTDRV DACKO DACK1 R/W PIAOE PIAWE PIACS0 PIACS1 PIACS2 PIACS3 PIACS4 PIACS5 GACK WE | Enables the drivers for DACKO through DSR | | | 103<br>104<br>105<br>106<br>107<br>108<br>109<br>110<br>111<br>112<br>113<br>114<br>115<br>116<br>117<br>118<br>119<br>120<br>121 | TR CAS0 CAS1 CAS2 CAS3 RAS0 RAS1 RAS2 RAS3 ROMOE RSWE BURST ROMCS0 ROMCS1 ROMCS2 ROMCS3 TXD DSR UCLK RXD DTR | | | |-----------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------|--| | 124<br>125<br>126<br>127<br>128 | DBIDRV<br>IDI0<br>IDO0<br>IDI1<br>IDO1 | Enables the ID bus drivers ID0 input ID0 output ID1 input ID1 output ID1 output ID1 output | | | 187<br>188 | IDI31<br>IDO31 | ID31 input<br>ID31 output | | Note: Drive-enable cells are shown in boldface. #### 17.6.3.4 ICTEST1 Path This five-bit path, outlined in Table 17-3, is used to provide quick access to the CNTL field and the STAT2–STAT0 output signals while keeping other inputs and outputs in their normal system connection. Table 17-3 ICTEST1 Scan Path | Bit | Cell Name | Comments | |-----|----------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 1 | CNTL0<br>CNTL1 | Internal control field only | | 3 | STAT0 | Outputs: These signals are scanned out and are | | 5 | STAT1<br>STAT2 | shown on the TDO pin. The scan-in values do not replace the processor output values. In ICTEST1, the processor outputs STAT2–STAT0 continue to reflect the internal processor signals. | If the ICTEST1 path is scanned, the contents of the shift register bits in the other scan cells become undefined. This occurs because all scan paths share the same shift clocks. #### 17.6.3.5 ICTEST2 Path The ICTEST2 path includes only the ID bus, the CNTL field, and the STAT2–STAT0 signals. It is provided so a hardware-development system can access the processor without disrupting the system. Table 17-4 ICTEST2 Scan Path | Bit | Cell Name | Comments | |--------|----------------|----------------------------------------------------------------------------------------------------------------------------------------| | 1 | CNTL0 | Internal control field only | | 2<br>3 | CNTL1<br>STAT0 | Outputs: These signals are scanned out and are | | 4 | STAT1 | shown on the TDO pin. The scan-in values do not | | 5 | STAT2 | replace the processor output values. In ICTEST1, the processor outputs STAT2—STAT0 continue to reflect the internal processor signals. | | 6 | DBIDRV | Enables the ID bus drivers | | 7 | IDI0 | IDO input | | 8 | IDO0 | ID0 output | | 8<br>9 | IDI1 | ID1 input | | 10 | IDO1 | ID1 output | | 69 | IDI31 | ID31 input | | 70 | IDO31 | ID31 output | # 17.7 IMPLEMENTING A HARDWARE-DEVELOPMENT SYSTEM (Am29200 MICROCONTROLLER) The Halt, Step, and Load Test Instruction modes of operation, invoked using the CNTL field in the boundary-scan path, are defined to support the debugging of the processor system by a hardware-development system (both hardware and software debug). This section describes the use of these modes during debug and describes the corresponding activity on the CNTL field and STAT2–STAT0 pins. #### 17.7.1 Halt Mode The Halt mode allows the hardware-development system to stop processor operation while preserving its internal state. The Halt mode is defined so normal operation may resume from the point the processor enters the Halt mode. All external accesses are completed before the Halt mode is entered, so a minimum amount of system logic is required to support the Halt mode. The Halt mode can be invoked by applying a value of 10 to the CNTL field. The processor enters the Halt mode within two or three cycles after the CNTL field is changed (depending on synchronization time), except it first completes any external data access in progress. The Halt mode can also be entered as the result of executing a HALT instruction. When a HALT instruction is executed, the processor enters the Halt mode on the next cycle, except it completes any external data accesses in progress. In this case, the processor remains in the Halt mode even though the CNTL field is 11. However, the processor cannot exit the Halt mode except as the result of the CNTL field or RESET input. If the instruction following a Halt instruction has an exception (e.g., instruction mapping miss), the trap associated with the exception is taken before the processor enters the Halt mode. The Halt instruction is designed as an instruction breakpoint by the hardware-development system. However, the Halt instruction is normally a privileged instruction, causing a Protection Violation trap upon attempted execution by a User-mode program. The hardware-development system can disable this Protection Violation as described in Section 17.7.5. In most cases, the STAT2–STAT0 outputs have a value of 000 whenever the processor is in the Halt mode. These outputs can be used to verify the processor is in Halt mode. However, the STAT2–STAT0 outputs have a value of 010 if the Freeze (FZ) bit of the Current Processor Status Register is 1 when the Halt mode is entered. This indicates the visible registers do not reflect the current program state. While in the Halt mode, the processor does not execute instructions and performs no external accesses. The timer facility does not operate (i.e., the Timer Counter Register does not change). The Halt mode is exited when the Reset mode is entered or the CNTL field places the processor into another mode. The only valid transitions on the CNTL field from the value of 10 are to the value 00, which places the processor into the Load Test Instruction mode, or to the value 11, which causes the processor to resume normal execution. #### 17.7.2 Step Mode The Step mode causes the Am29200 microcontroller to execute at a rate determined by the hardware-development system, allowing the hardware-development system to easily control and monitor processor operation. The Step mode is defined so normal operation may resume after stepping is complete. Since all external accesses are completed during any step, a minimum amount of system logic is required to support the slower rate of execution. The Step mode is invoked by the value of 01 in the CNTL field. The processor enters the Step mode within two or three cycles after the CNTL field is changed (depending on synchronization time), except it first completes any external data access in progress. In most cases, the STAT2–STAT0 outputs have a value of 000 whenever the processor is in the Step mode; these outputs can be used as a verification the processor is in Step mode. However, the STAT2–STAT0 outputs have a value of 010 if the Freeze (FZ) bit of the Current Processor Status Register is 1 when the Step mode is entered. This indicates the visible registers do not reflect the current program state. While in the Step mode, the processor does not execute instructions and performs no external accesses. The Timer Facility does not operate (i.e., the Timer Counter Register does not change) while the processor is in the Step mode. The Step mode is identical to the Halt mode in every respect except one. This difference is apparent on the transition of the CNTL field from the value 01 (Step mode) to the value 11 (Normal). On this transition, the processor steps. That is, the processor state advances by one pipeline stage, and it completes any external access that is initiated by this state change. If the processor immediately enters the Pipeline Hold mode on a step, the step may require multiple cycles to execute, since the processor pipeline cannot advance while the processor is in the Pipeline Hold mode. The STAT2—STAT0 lines reflect the state of the processor for every cycle of the step. The timer counter decrements by one for every cycle of the step; if the timer counter decrements to zero, the usual timer-facility actions are performed and a timer interrupt may occur. After the step is performed, the processor re-enters the Step mode and remains in the Step mode even though the CNTL field has the value 11 (this prevents the need for a time-critical transition on the CNTL field). The processor remains in this condition until the CNTL field transitions to 10 or 01 (or RESET is asserted). The transition to 10 causes the processor to enter the Halt mode and is used to clear the Step mode. The transition to 01 causes the processor to remain in the Step mode so it may perform additional steps. If the processor is placed in the Halt or Step mode while either a LOADM or STOREM instruction is being executed, the STAT2–STAT0 outputs indicate the Halt or Step mode for one cycle (STAT2–STAT0 = 000). They then indicate the Pipeline Hold mode (STAT2–STAT0 = 001) until the final access of the LOADM or STOREM is complete, at which time they return to indicating the Halt or Step mode. A hardware-development system must therefore ignore any single-cycle Halt/Step mode indication on the STAT2–STAT0 outputs as an indication the processor is halted. #### 17.7.3 Load Test Instruction Mode The processor incorporates an Instruction Register (IR) that holds instructions while they are decoded. In the Load Test Instruction mode, the IR is enabled to receive the content of the instruction bus regardless of the state of the processor's instruction fetcher. This allows the hardware-development system to provide instructions for execution directly, thereby providing means for the hardware-development system to examine and modify the internal state of the processor without altering the processor's instruction stream. The hardware-development system can place an instruction in the IR by first placing 00 in the CNTL field. The processor enters the Load Test Instruction mode within two or three cycles after the CNTL field is changed (depending on synchronization time). However, it first completes and terminates any established burst-mode instruction access. The Load Test Instruction mode can be entered only from the Halt or Step modes. When the processor enters the Load Test Instruction Mode, the processor behaves as though the Current Processor Status Register were forced to the value shown in Figure 17-4, even though the register is not changed (the value "u" means unaffected). Figure 17-4 Processor Status While in Load Test Instruction Mode The visible processor state remains unchanged while the processor is in the Load Test Instruction Mode. The processor status shown in Figure 17-4 remains in effect until the next transition to the Normal Mode via the Halt Mode. While the processor is in the Load Test Instruction mode, it ignores all interrupts and traps, except for the WARN trap. The STAT2-STAT0 lines have a value of 010 while the processor is in the Load Test Instruction mode; this may be used as a verification that the processor is loading the IR. While the processor is in the Load Test Instruction mode, the IR is continually storing the value on the instruction/data bus; any change in the value on this bus is reflected in the IR on the next cycle. The hardware-development system can place a desired instruction into the IR by driving this instruction on the Instruction/Data Bus or via the scan interface. The processor exits the Load Test Instruction mode in the second cycle following a change to the CNTL field. The only valid change here is either to the Halt mode (CNTL = 10) or the Step mode (CNTL = 01). When the Load Test Instruction mode is exited, the most recent value stored into the IR is held. If the processor is placed in the Step mode, the IR is marked as having valid content, enabling the processor to decode and execute the instruction. If the processor is placed in the Halt mode, it ignores any instruction placed in the IR by the Load Test Instruction mode and reverts to its normal instruction-fetch mechanism. Once the IR has been set by the Load Test Instruction mode, the instruction in the IR may be executed via the Step mode as discussed in the previous section. A single step is sufficient to cause the execution of this instruction. However, because of pipelining, multiple steps may be required before the instruction completes execution. If more than one step is performed, the processor executes the instruction in the IR on every step. If it is desired to step an instruction to completion without repeated execution, a NO-OP may be set into the IR (using the Load Test Instruction mode) after the first step. The Load Test Instruction mode may be used to cause the execution of most processor instructions (restrictions are discussed below). This allows inspection and modification of the processor state. Because of sequencing constraints, the Load Test Instruction mode cannot be used to cause the execution of the following instructions: conditional jumps, Load Multiple, Store Multiple, Interrupt Return, and Interrupt Return and Invalidate. Unconditional jumps and calls are permitted, but affect only the Program Counter. Instruction sequencing is not affected. The contents of the Program Counter 0, Program Counter 1, Program Counter 2, Channel Address, Channel Data, Channel Control, and ALU Status registers are not updated while instructions are executed via the Load Test Instruction mode, except explicitly by Move To Special Register instructions. Instructions executed using the Load Test Instruction mode may access the protected processor state even though the processor is in the User mode. Instructions executed via the Load Test Instruction mode may be used to access an external device or memory. Recall that the processor completes any normal data access before completing a step. This allows the processor to access devices and memories on behalf of the hardware-development system and simplifies the timing constraints on the hardware-development system. During processor execution via the Load Test Instruction mode, the processor retains the information required to resume normal operation. If any processor state is modified by the hardware-development system, this state must be restored properly for normal operation to resume properly. Once all instructions have been executed via the Load Test Instruction mode, the Halt mode (CNTL=10) prepares the processor to resume normal operation. When the CNTL field transitions to 11, the processor resumes normal operation. The sequence for the CNTL field to clear the Load Test Instruction mode and resume normal operation is thus 00/10/11. # 17.7.4 Accessing Internal State Via Boundary Scan The hardware-development system uses load and store instructions, executed via the Load Test Instruction mode, to alter and inspect the contents of general-purpose registers. The OPT field for these loads and stores have the value 110 and are directed to the ROM address space (for example, address 0): this causes the processor to prevent the resulting access from appearing in the system. The access is visible only via the boundary-scan register. Furthermore, it causes the Am29200 microcontroller to ignore the generation of wait states: the access completes at the end of the next stepped instruction. This provides a means for a hardware-development system to perform accesses. It is not possible to execute a load directly following a store, nor a store directly following a load, using the Load Test Instruction mode. At least one NO-OP (or other operation) must be executed between adjacent loads and stores, because of control conflicts that arise when these instructions are stepped in a system that performs the resulting accesses at normal speed. However, a sequence of only loads or only stores is permitted without restriction. This section describes the sequence of boundary-scan operations performed to access processor internal state. #### 17.7.4.1 Altering State Via Boundary Scan A hardware-development system uses load instructions to alter the contents of generalpurpose registers. Since the contents of general-purpose registers can be moved to special-purpose registers, this provides a means to alter other state as well as the values in general-purpose registers. With the processor in the Halt mode, the hardware-development system uses the following sequence to modify the value in a general-purpose register: - 1. Set the CNTL field to 10 (Halt) using the ICTEST1 boundary-scan instruction. - 2. Set the CNTL field to 00 (Load Test Instruction) using the ICTEST1 instruction. - Using the ICTEST2 instruction, set the IDI31-IDI0 cells with an instruction to load the desired register from the ROM address space, with OPT=110, and set the CNTL field to 01 (Step). This places the load instruction into the IR and prepares the processor to step. - Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, and 00 (Normal, Step, and Load Test Instruction). This steps the processor and prepares it to receive another instruction. - 5. Using the ICTEST2 instruction, set the IDI31–IDI0 cells to 70400101, hexadecimal (NOOP), and set the CNTL field to 01. This loads a NO-OP into the IR. - Using the ICTEST2 instruction, set the IDI31-IDI0 cells to the value to be loaded and set the CNTL field to 11. This steps the processor and applies the value to be loaded into the register. - 7. Set the CNTL field to 01 using the ICTEST1 instruction. - 8. Repeat steps 2 through 7 for the remaining registers. #### 17.7.4.2 Inspecting State Via Boundary Scan A hardware-development system uses store instructions to inspect the contents of general-purpose registers. Since the processor internal state can be moved to general-purpose registers, this provides a means to inspect other states as well as the values in general-purpose registers. With the processor in the Halt mode, the hardware-development system uses the following sequence to retrieve the value in a general-purpose register: - 1. Set the CNTL field to 10 (Halt) using the ICTEST1 boundary-scan instruction. - 2. Set the CNTL field to 00 (Load Test Instruction) using the ICTEST1 instruction. - Using the ICTEST2 instruction, set the IDI(31–0) cells with an instruction to store the desired register into the ROM address space, with OPT=110, and set the CNTL field to 01 (Step). This places the store instruction into the IR and prepares the processor to step. - 4. Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, and 00 (Normal, Step, and Load Test Instruction). This steps the processor and prepares it to receive another instruction. - 5. Using the ICTEST2 instruction, set the IDI31–IDI0 cells to 70400101, hexadecimal (NO-OP), and set the CNTL field to 01. This loads a NO-OP into the IR. - 6. Set the CNTL field to 11, then back to 01 using the ICTEST1 instruction. This steps the processor. At the end of the step, the contents of the register are on the ID bus, and may be obtained in the Capture-DR state of the TAP controller (this state is described in the IEEE 1149.1–1990 Specification). The value will be held on the ID bus until the next step. - 7. Repeat steps 2 through 6 for the remaining registers. #### 17.7.5 HALT Instructions as Breakpoints The HALT instruction can be used by a hardware-development system to implement an instruction breakpoint. To use the HALT instruction as an instruction breakpoint, the hardware-development system must disable the protection checking that normally applies to the HALT instruction so the HALT does not cause a Protection Violation trap. To accomplish this, the hardware-development system must perform a special sequence of operations on the boundary-scan interface. This sequence is similar in effect to holding the CNTL1–CNTL0 inputs at 10 during a reset on other 29K Family processors. The special sequence is needed in the Am29200 microcontroller because it has no CNTL1–CNTL0 inputs, but rather implements a CNTL field in the boundary-scan register. The following sequence disables protection checking on the HALT instruction: - 1. Set the CNTL field to 10 using the ICTEST1 JTAG instruction. - Reset the boundary-scan cells RESET, DBIDRV, BOTDRV, ABIDRV, and TOPDRV to 0 using the INTEST instruction. If the boot ROM in Bank 0 is 8 or 16 bits wide, reset the BOOTW cell to 0. If the boot ROM is 32 bits wide, set the BOOTW cell to 1. This resets the processor. - Set the RESET cell to 1 using the INTEST instruction. If the boot ROM is 8 bits wide, set the BOOTW cell to 1 (otherwise leave it at 0 or 1). This takes the processor out of reset and configures the boot ROM. - 4. Set the CNTL field to 00 using the ICTEST1 instruction. - Using the ICTEST2 instruction, set the IDI31-IDI0 cells to a1000000, hexadecimal, and set the CNTL field to 01. This loads a jump to address 0 into the IR and prepares the processor to step. - 6. Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, and 00. This steps the processor and prepares it to receive another instruction. - Using the ICTEST2 instruction, set the IDI31–IDI0 cells to 70400101, hexadecimal. This loads a NO-OP into the IR. - 8. Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, and 11. This starts the processor with protection checking disabled on the HALT instruction. The TMS input must be kept High during this operation. #### 17.7.6 Forcing Outputs to High Impedance For the Am29200 microcontroller, a hardware-development system can force processor outputs to the high-impedance state by asserting the GREQ input during a processor reset. The outputs remain in the high-impedance state until a processor reset during which GREQ is not asserted. This supports functions such as replacing chip signals by emulator signals. For the Am29205 microcontroller, a hardware-development system can force processor outputs to high impedance by asserting the WAIT/TRIST signal. The outputs remain in the high-impedance state until a processor reset during which WAIT/TRIST is not asserted. #### 17.8 EMULATING THE Am29205 MICROCONTROLLER Unlike the Am29200 microcontroller, the Am29205 microcontroller does not have a hardware-development interface for testing and debugging. Hardware development is performed by using an Am29200 microcontroller to emulate an Am29205 microcontroller. Figure 17-5 shows a typical configuration for using an Am29200 microcontroller to emulate the Am29205 microcontroller. The hardware-development system asserts the WAIT/TRIST pin during processor reset to force the outputs of the Am29205 microcontroller into high impedance state. An Am29200 microcontroller is then connected to the Am29205 microcontroller and interfaced directly to the system logic. The hardware-development system then uses the JTAG to access the CNTL field on the Am29200 microcontroller and perform the hardware debugging functions previously described in this chapter. To avoid contention on the WAIT/TRIST signal, the target system must drive the WAIT/TRIST pin with an open collector buffer (e.g., 74HC07) and a pull-up resistor. The hardware-development system must continue to drive the WAIT/TRIST pin Low for at least one MEMCLK cycle following the deassertion of RESET so the correct value on the WAIT/TRIST pin is latched into the internal logic. On the Am29200 microcontroller, the memories (ROM and DRAM) are attached to ID31–ID16 and the peripherals (PIA and Parallel Port Data Register) are attached to ID15–ID0. On the Am29205 microcontroller, the memories and peripherals are attached to ID31–ID16. For the Am29200 microcontroller to access the memories and peripherals correctly, ID31–ID16 on the Am29200 microcontroller must be connected to ID15–ID0. To avoid contentions on the ID bus, three-state buffers (e.g., 2 x 74HC244) are used to isolate ID31–ID16 from ID15–ID0. These three-state buffers will be enabled when the processor is doing a PIA or parallel port read access. Figure 17-5 Using an Am29200 Microcontroller to Emulate an Am29205 Microcontroller #### CHAPTER # 18 # **INSTRUCTION SET** This chapter provides a specification of the instruction set for the Am29200 and Am29205 microcontrollers. Sections 18.1 and 18.2 describe the terminology and the instruction formats. Section 18.3 describes each instruction in detail; instructions are presented alphabetically by assembler mnemonic. Finally, Section 18.4 gives an index of instructions by operation code. #### 18.1 INSTRUCTION-DESCRIPTION NOMENCLATURE To simplify the specification of the instruction set, special terminology is used throughout this chapter. This section defines the terminology and symbols used to describe instruction operands, operations, and the assembly-language syntax. This section does not describe all terminology used. It excludes certain descriptive terms with obvious meanings. #### 18.1.1 Operand Notation and Symbols Throughout this chapter, instruction operands are signed two's-complement word integers unless otherwise noted. The term "register" is used consistently to denote a general-purpose register. Other types of registers are described explicitly. The following notation is used in the description of instruction operands: | 0116 | 16-bit immediate data, zero-extended to 32 bits | |------|-------------------------------------------------| | 1116 | 16-bit immediate data, one-extended to 32 bits | 116 16-bit immediate data BP The Byte Pointer (BP) field of the ALU Status Register. The BP field selects a byte or half-word within a word. C The Carry (C) bit of the ALU Status Register. The C bit is logically zero-extended to 32 bits when involved in a word operation. COUNT The value of the Count Remaining field of the Channel Control Reg- ister. Note that COUNT does not refer to this field directly, but rather to the value of the field at the beginning of a LOADM or STOREM instruction. DEST The general-purpose register that is the destination of an instruction (i.e., the register used to store the result). EXTERNAL The word in an external device or memory with address *n*. WORD[*n*] FALSE The Boolean constant FALSE FC The Funnel Shift Count (FC) field of the ALU Status Register h'n' The hexadecimal constant n IPA Indirect Pointer A Register IPB Indirect Pointer B Register IPC Indirect Pointer C Register PC Program Counter Register. This register is not explicitly accessible by instruction, but does appear as an operand for certain instructions. The Program Counter always contains the word address of the instruction being executed, and is 30 bits in length. Q Q Register Register RA Register RB These designate the general-purpose registers specified by the RB instruction fields RA, RB, and RC (see Section 18.2). Register RC SPDEST The special-purpose register that is the destination of an instruction. SPECIAL The contents of a special-purpose register, used as an instruction operand. Special-purpose Register SA Designates the special-purpose register specified by the instruction field SA (see Section 18.2). SRCA **SRCB** The contents of general-purpose registers, used as instruction operands. SRCA.BYTEn SRCB.BYTEn Designate the byte numbered *n* within the SRCA or SRCB operand. TARGET The target-instruction address specified by a jump or call instruc- tion. This address is either absolute or Program-Counter relative. TRUE Boolean constant TRUE TWIN General-purpose registers are paired by absolute-register numbers, such that even-numbered registers are paired with oddnumbered registers having the next-highest register number. The twin of a given register is the other register in the pair to which the given register belongs. For example, Local Register 5 is the twin of Local Register 4, and vice versa. #### 18.1.2 Operator Symbols The following symbols are used to describe instruction operations: A << B Left shift of the A operand by the shift amount given by the B operand A >> B Right shift of the A operand by the shift amount given by the B operand A // B Concatenation. The B operand is appended to the A operand. In the resulting quantity, the A operand makes up the high-order part, and the B operand makes up the low-order part. A & B Bitwise AND A I B Bitwise OR A ^ B Bitwise exclusive-OR ~ A One's-complement $A \leftarrow \exp$ Assignment of the A location by the result of the expression on the right side A = B Equal to A <> B Not equal to | A > B | Greater than | |------------|------------------------------------------------------------------------------------------------------------------------------------| | $A \ge B$ | Greater than or equal to | | A < B | Less than | | $A \leq B$ | Less than or equal to | | A + B | Addition | | A – B | Subtraction | | A * B | Multiplication | | A/B | Division | | A B | A subrange that includes the A operand and the B operand. This symbol is used for subranges of bits as well as subranges of words. | | A OR B | Logical OR of two Boolean conditions | ## 18.1.3 Control-Flow Terminology Continue The following terminology is used to describe the control functions performed during the execution of various instructions: Continue execution of the current instruction sequence. | IF condition<br>THEN operations<br>ELSE operations | The condition following the IF is tested. If the condition holds, the operations following the THEN are performed. If the condition does not hold, the operations following the ELSE are performed. If the ELSE is not present and the condition does not hold, no operation is performed. | |----------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Signed overflow | This condition is present when the result of an add or subtract of two's-complement operands cannot be represented by a signed word integer. | | Trap( <i>n</i> ) | Specifies a trap with vector number $n$ . The vector number $n$ may be specified indirectly (e.g., Trap (VN)) or explicitly by symbolic name (e.g., Trap (Out-of-Range)). | | Unsigned overflow | This condition is present when the result of an add of unsigned operands cannot be represented by an unsigned word integer. | | Unsigned<br>underflow | This condition is present when the result of a subtract of unsigned operands cannot be represented by an unsigned integer (i.e., when the result is less than zero). | | VN | Designates the trap vector number specified by the instruction field VN (see Section 16.3.2). | ### 18.1.4 Assembler Syntax This chapter does not contain a full description of the instruction assembler, but provides a rudimentary description of the assembler syntax. The following notation is used to describe assembler tokens: | cntl | Determines the 7-bit control field in a load or store instruction. | |---------|--------------------------------------------------------------------| | const8 | Specifies a constant that can be expressed by 8 bits. | | const16 | Specifies a constant that can be expressed by 16 bits | | ra<br>rb | These tokens name general-purpose registers. In a formal sense these represent the same token since the name of a | |----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | rc | register does not depend on its instruction use. However, three distinct tokens are used to clarify the relationship between the assembler syntax, instruction operands, and instruction fields. | | spid | A symbolic identifier for a special-purpose register. | | target | A symbolic label for the target of a jump or call instruction. | | vn | Specifies a trap vector number. | ### 18.2 INSTRUCTION FORMATS All instructions for the Am29200 and Am29205 microcontrollers are 32 bits in length and are divided into four fields, as shown in Figure 18-1. These fields have several alternative definitions, as discussed below. In certain instructions, one or more fields are not used, and are reserved for future use. Even though they have no effect on processor operation, bits in reserved fields should be 0 to insure compatibility with future processor versions. Figure 18-1 Instruction Format The instruction fields are defined as follows: | Bits 31-24 | | |------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | OP | This field contains an operation code that defines the operation to be performed. In some instructions, the least significant bit of the operation code selects between two possible operands. For this reason, the least significant bit is sometimes labeled A or M with the following interpretations: | | Α | (Absolute): The A bit is used to differentiate between Program-Counter relative $(A = 0)$ and absolute $(A = 1)$ instruction addresses when these addresses appear within instructions. | | M | (Immediate): The M bit selects between a register operand $(M=0)$ and an immediate operand $(M=1)$ when the alternative is allowed by an instruction. | | Bits 23-16 | | The RC field contains a global or local register number. RC | I17 I10 | This field contains the most significant eight bits of a 16-bit instruction address. This is a word address and may be program-counter relative or absolute depending on the A bit of the operation code. | |------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | l15 l8 | This field contains the most significant eight bits of a 16-bit instruction constant. | | VN | This field contains an 8-bit trap vector number. | | CNTL | This field controls a load or store access as described in Section 3.3.1 | | Bits 15-8 | | | RA | The RA field contains a global or local register number. | | SA | The SA field contains a special-purpose register number. | | Bits 7-0 | | | RB | The RB field contains a global or local register number. | | RB or I | This field contains either a global or local register number, or an 8-bit instruction constant depending on the value of the M bit of the operation code. | | 19 I2 | This field contains the least significant eight bits of a 16-bit instruction address. This is a word address and may be program-counter relative or absolute depending on the A bit of the operation code. | | I7 IO | This field contains the least significant eight bits of a 16-bit instruction constant. | | UI //RND//FD//FS | This field controls the operation of the CONVERT instruction. | | reserved // FS | This field is the FS portion of the above field and specifies the operand format for the CLASS and SQRT instructions. | The fields described above may appear in many combinations. However, certain combinations that appear frequently are shown in Figure 18-2. ### 18.3 INSTRUCTION DESCRIPTION This section describes each microcontroller instruction in detail. Figure 18-3 illustrates the layout of the information given for each description. Figure 18-2 Frequently Occurring Instruction Field Uses Three operands with possible 8-bit constant: #### Three operands without constant: #### One register operand with 16-bit constant: #### Jumps and calls with 16-bit instruction address: #### Two operands with trap vector number: #### Loads and stores: Figure 18-3 Instruction-Description Format **ADD** ADD Add Operation: DEST ← SRCA + SRCB **Assembler** Syntax: ADD rc, ra, rb or ADD rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC **Description:** The SRCA operand is added to the SRCB operand and the result is placed into the DEST location. **ADDC** **ADDC** ## **Add with Carry** Operation: DEST ← SRCA + SRCB + C **Assembler** Syntax: ADDC rc, ra, rb or ADDC rc, ra, const8 Status: V, N, Z, C Operands: Content of register RA SRCA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC **Description:** The SRCA operand is added to the SRCB operand and the value of the ALU Status Carry bit, and the result is placed into the DEST location. ADDCS ADDCS ### Add with Carry, Signed **Operation:** DEST ← SRCA + SRCB + C IF signed overflow THEN Trap (Out of Range) **Assembler** Syntax: ADDCS rc, ra, rb or ADDCS rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 18, 19 **ADDCS** **Description:** The SRCA operand is added to the SRCB operand and the value of the ALU Status Carry bit, and the result is placed into the DEST location. If the add operation causes a two's-complement signed overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow occurs. **ADDCU** **ADDCU** ## Add with Carry, Unsigned Operation: DEST.← SRCA + SRCB + C IF unsigned overflow THEN Trap (Out of Range) **Assembler** Syntax: ADDCU rc, ra, rb or ADDCU rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DÈST Register RC OP = 1A, 1B **ADDCU** #### **Description:** The SRCA operand is added to the SRCB operand and the value of the ALU Status Carry bit, and the result is placed into the DEST location. If the add operation causes an unsigned overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow occurs. ADDS #### Add, Signed ADDS - **Operation:** DEST ← SRCA + SRCB IF signed overflow THEN Trap (Out of Range) **Assembler** Syntax: ADDS rc, ra, rb or **SRCB** ADDS rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 10, 11 **ADDS** **Description:** The SRCA operand is added to the SRCB operand and the result is placed into the DEST location. If the add operation causes a two's-complement signed overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow occurs. **ADDU** **ADDU** ## Add, Unsigned Operation: DEST ← SRCA + SRCB IF unsigned overflow THEN Trap (Out of Range) **Assembler** Syntax: ADDU rc, ra, rb ADDU rc, ra, const8 Status: V, N, Z, C Operands: **SRCA** Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 12, 13 ADDU **Description:** The SRCA operand is added to the SRCB operand and the result is placed into the DEST location. If the add operation causes an unsigned overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow occurs. AND AND ## **AND Logical** Operation: DEST ← SRCA & SRCB **Assembler** Syntax: AND rc, ra, rb or AND rc, ra, const8 Status: N, Z Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC **Description:** The SRCA operand is logically ANDed, bit-by-bit, with the SRCB operand and the result is placed into the DEST location. **ANDN** ANDN # **AND-NOT Logical** Operation: DEST ← SRCA & ~SRCB · **Assembler** Syntax: AND ANDN rc, ra, rb or ANDN rc, ra, const8 Status: N, Z Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC OP = 9C, 9D ANDN **Description:** The SRCA operand is logically ANDed, bit-by-bit, with the one's-complement of the SRCB operand and the result is placed into the DEST location. **ASEQ** **ASEQ** ### **Assert Equal To** Operation: IF SRCA = SRCB THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASEQ vn, ra, rb or ASEQ vn, ra, const8 Status: Not affected Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number OP = 70,71 **ASEQ** **Description:** If the SRCA operand is equal to the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and **ASGE** **ASGE** ### **Assert Greater Than or Equal To** Operation: IF SRCA ≥ SRCB THEN Continue ELSE Trap (VN) Assembler Syntax: : ASGE vn, ra, rb or ASGE vn, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number **Description:** If the value of the SRCA operand is greater than or equal to the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and **ASGEU** **ASGEU** #### Assert Greater Than or Equal To, Unsigned Operation: IF SRCA ≥ SRCB (unsigned) THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASGEU vn, ra, rb or ASGEU vn, ra, const8 Status: Not affected Operands: Content of register RA SRCA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number OP = 5E, 5F **ASGEU** #### **Description:** If the value of the SRCA operand is greater than or equal to the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For the comparison, both operands are treated as unsigned integers. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and **ASGT** **ASGT** #### **Assert Greater Than** Operation: IF SRCA > SRCB THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASGT vn, ra, rb or ASGT vn, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M =1: I (Zero-extended to 32 bits) VN Trap vector number Description: If the value of the SRCA operand is greater than the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and ASGTU ASGTU ### **Assert Greater Than, Unsigned** Operation: IF SRCA > SRCB (unsigned) THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASGTU vn, ra, rb 10 SRCB ASGTU vn, ra, const8 Status: Not affected Operands: SRCA Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number Description: If the value of the SRCA operand is greater than the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For the comparison, both operands are treated as unsigned integers. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and 63 is specified. ASLE **ASLE** ### Assert Less Than or Equal To Operation: IF SRCA ≤ SRCB THEN Continue ELSE Trap (VN) Assembler Syntax: ASLE vn, ra, rb OI ASLE vn, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number Description: If the value of the SRCA operand is less than or equal to the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and ASLEU ASLEU ### Assert Less Than or Equal To, Unsigned **Operation:** IF SRCA ≤ SRCB (unsigned) THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASLEU vn, ra, rb or ASLEU vn, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number **Description:** If the value of the SRCA operand is less than or equal to the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For the comparison, both operands are treated as unsigned integers. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and 63 is specified. **ASLT** **ASLT** #### **Assert Less Than** Operation: IF SRCA < SRCB THEN Continue ELSE Trap(VN) **Assembler** Syntax: ASLT vn, ra, rb Or ASLT vn, ra, const8 Status: Not affected Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number **Description:** If the value of the SRCA operand is less than the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and ASLTU ASLTU ### Assert Less Than, Unsigned Operation: IF SRCA < SRCB (unsigned) THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASLTU vn, ra, rb OI ASLTU vn, ra, const8 Status: Not affected Operands: SRCA SRCA Content of register RA SRCB M = 0: Content of regis M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number OP = 52, 53 **ASLTU** **Description:** If the value of the SRCA operand is less than the value of the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For the comparison, both operands are treated as unsigned integers. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and 63 is specified. **ASNEQ** **ASNEQ** ## **Assert Not Equal To** Operation: IF SRCA <> SRCB THEN Continue ELSE Trap (VN) **Assembler** Syntax: ASNEQ vn, ra, rb or ASNEQ vn, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) VN Trap vector number OP = 72,73 **ASNEQ** **Description:** If the SRCA operand is not equal to the SRCB operand, instruction execution continues; otherwise, a trap with the specified vector number occurs. For programs in the User mode, a Protection Violation trap occurs—instead of the assert trap—if a vector number between 0 and CALL #### **Call Subroutine** Operation: DEST ← PC // 00 + 8 PC ← TARGET Execute delay instruction **Assembler** Syntax: CALL ra, target Status: Not affected Operands: TARGET A = 0: I17 ... I10 // I9 ... I2 (sign-extended to 30 bits) + PC A = 1: I17 ... I10 // I9 ... I2 (zero-extended to 30 bits) DEST Register RA **Description:** The address of the second following instruction is placed into the DEST location and a non-sequential instruction fetch occurs to the instruction address given by the TARGET operand. The instruction following the CALL is executed before the non-sequential fetch occurs. **CALLI** **CALLI** ## **Call Subroutine, Indirect** Operation: DEST ← PC // 00 + 8 PC ← SRCB Execute delay instruction **Assembler** Syntax: CALLI ra, rb Status: Not affected Operands: SRCB Content of register RB DEST Register RA **Description:** The address of the second following instruction is placed into the DEST location and a non-sequential instruction fetch occurs to the instruction address given by the SRCB operand. The instruction following the CALLI is executed before the non-sequential fetch occurs. **CLASS** CLASS ### **Classify Floating-Point Operand** Operation: DEST ← CLASS(SRCA) **Assembler** Syntax: CLASS rc, ra, FS Status: None Operands: SRCA Content of register RA (single-precision floating-point) O Content of register RA and the twin of register RA (double-precision floating-point) **DEST** Register RC Control: FS Format of source operand SRCA 00 Reserved for future use 01 Single-precision floating-point Double-precision floating-point 10 11 Reserved for future use **Description:** A 32-bit classification code for operand SRCA is placed into the DEST location. Operand SRCA is a single- or double-precision operand, as specified by FS. The classification code has the following format: Bits 31-6: Reserved (forced to 0). Bit 5: Operand Sign (OS). The OS bit is 1 for a negative operand (including negative zero) and 0 for a non-negative operand. CLASS **CLASS** Bits 4–0: Exponent-Fraction Class (EFC). This field classifies the biased exponent and fraction fields of the source operand as follows: | EFC ' | Biased Exp (bexp) | Fraction (frac) | Comments | |-------|-------------------------|-------------------|-------------------| | 00000 | 0 | 0 | zero | | 00001 | | | unused | | 00010 | 0 | 0 < frac < .111 1 | denormalized | | 00011 | . 0 | .1111 | denormalized | | 00100 | 1 | | 0 | | 00101 | | | unused | | 00110 | 1 | | 0 < frac < .111 1 | | 00111 | 1 | .111 1 | | | 01000 | 1 < bexp < Max | 0 | | | 01001 | • | | unused | | 01010 | 1 < bexp < Max | 0 < frac < .111 1 | | | 01011 | 1 < bexp < Max | .111 1 | | | 01100 | Max | 0 % | | | 01101 | | | unused | | 01110 | Max | 0 < frac < .111 1 | • | | 01111 | Max | .111 1 | | | 10000 | Max + 1 | 0 | infinity | | 10001 | | | unuséd | | 10010 | Max + 1, frac $MSB = 0$ | <> 0 | SNaN | | 10011 | Max + 1, frac $MSB = 1$ | <> 0 | QNaN | Note: Max is the largest biased exponent used to represent a finite number in a given format. Max is 254 for single-precision and 2,046 for double-precision. This instruction is not supported directly in processor hardware. In the current implementation, this instruction causes a CLASS trap. When the trap occurs, the IPA and IPC registers are set to reference SRCA and DEST, and the IPB Register is set with the value of the FS field. CLZ CLZ ### **Count Leading Zeros** **Operation:** DEST $\leftarrow$ count of number of leading zeros in SRCB or I **Assembler** Syntax: CLZ rc, rb or CLZ rc, const8 Status: Not affected Operands: SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC Description: A count of the number of zero-bits to the first one-bit in the SRCB operand is placed into the DEST location. If the most significant bit of the SRCB operand is 1, the resulting count is zero. If the SRCB operand is zero, the resulting count is 32. CONST **CONST** #### Constant Operation: DEST ← 0I16 **Assembler** Syntax: CONST ra, const16 Status: Not affected Operands: 0116 I15 ... 8 // I7 ... I0 (Zero-extended to 32 bits) **DEST** Register RA **Description:** The 0l16 operand is placed into the DEST location. Note: To improve code readability, some assemblers implement CONST to take a 32-bit argument (rather than const16). The lower half of the argument is constructed by the CONST. CONSTH CONSTH #### Constant, High Operation: Replace high-order half-word of SRCA by I16 **Assembler** Syntax: CONSTH ra, const16 Status: Not affected Operands: Content of register RA SRCA 115 ... 18 // 17 ... 10 DEST Register RA Description: The low-order half-word of the SRCA operand is appended to the I16 operand and the result is placed into the DEST operand. Note that the destination register for this instruction is the same as the source register. Note: To improve code readability, some assemblers implement CONSTH to take a 32-bit argument (rather than const16). The upper half of the argument is constructed by the CONSTH. ## CONSTN CONSTN # **Constant, Negative** Operation: DEST ← 1I16 **Assembler** Syntax: CONSTN ra, const16 Status: N Not affected Operands: I15 ... I8 // I7 ... I0 (ones-extended to 32 bits) 1I16 DEST Register RA OP = 01 CONSTN $\textbf{Description:} \quad \text{The 1I16 operand is placed into the DEST location.}$ CONVERT CONVERT #### **Convert Data Format** Operation: DEST ← SRCA, with format modified per UI, RND, FD, FS **Assembler** CONVERT rc, ra, UI, RND, FD, FS Syntax: Status: fpX, fpU, fpV, fpR, fpN Operands: SRCA Content of register RA (single-precision floating-point) Content of register RA and the twin of register RA (double-precision floating-point) **DEST** Content of register RC (single-precision floating-point) Content of register RC and the twin of register RA (double-precision floating-point) Control: UI 0 = signed integer 1 = unsigned integer **RND** Round mode 000 Round to nearest 001 Round to minus infinity 010 Round to plus infinity Round to zero 011 Round using floating-point round mode (FRM) 100 101-111 Reserved FS,FD Format of source operand, format of destination > operand Integer 00 01 Single-precision floating-point 10 Double-precision floating-point 11 Reserved **Description:** The SRCA operand with format FS is converted to format FD and rounded according to RND, then placed into the DEST location. If the source or destination operand is an integer, it is a signed or unsigned value according to the value of UI. Note: Converting from format to like format is not supported and will produce unpredictable results. #### **CONVERT** #### **CONVERT** This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a CONVERT trap. When the trap occurs, the IPA and IPC registers are set to reference SRCA and DEST, and the IPB Register is set with the value of the UI//RND//FD//FS field. If the UI bit is 1, the contents of the IPB Register reflect the value of this field after Stack-Pointer addition. The Stack Pointer must be subtracted from the contents of the IPB Register to recover the original value of this field. CPBYTE CPBYTE ### **Compare Bytes** Operation: IF (SRCA.BYTE0 = SRCB.BYTE0) OR (SRCA.BYTE1 = SRCB.BYTE1) OR (SRCA.BYTE2 = SRCB.BYTE2) OR (SRCA.BYTE3 = SRCB.BYTE3) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPBYTE rc, ra, rb **SRCB** or CPBYTE rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 2E, 2F **CPBYTE** ### **Description:** Each byte of the SRCA operand is compared to the corresponding byte of the SRCB operand. If any corresponding bytes are equal, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. **CPEQ CPEQ** ## **Compare Equal To** Operation: IF SRCA = SRCB THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** CPEQ rc, ra, rb Syntax: or CPEQ rc, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC Description: If the SRCA operand is equal to the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. CPGE CPGE ### **Compare Greater Than or Equal To** **Operation:** IF SRCA ≥ SRCB THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPGE rc, ra, rb or CPGE rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC Description: If the value If the value of the SRCA operand is greater than or equal to the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. CPGEU CPGEU ### Compare Greater Than or Equal To, Unsigned **Operation:** IF SRCA ≥ SRCB (unsigned) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPGEU rc, ra, rb or CPGEU rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC Description: If the value of the SRCA operand is greater than or equal to the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. For the comparison, both operands are treated as unsigned integers. CPGT CPGT ### **Compare Greater Than** Operation: IF SRCA > SRCB THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPGT rc, ra, rb or CPGT rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** If the value of the SRCA operand is greater than the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. **CPGTU** **CPGTU** # **Compare Greater Than, Unsigned** Operation: IF SRCA > SRCB (unsigned) THEN DEST $\leftarrow$ TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPGTU rc, ra, rb or CPGTU rc, ra, const8 Status: Not affected Operands: Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** SRCA Register RC . **Description:** If the value of the SRCA operand is greater than the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. For the comparison, both operands are treated as unsigned integers. CPLE CPLE ### **Compare Less Than or Equal To** **Operation:** IF SRCA ≤ SRCB THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPLE rc, ra, rb or CPLE rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC Description: If the value of the SRCA operand is less than or equal to the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. CPLEU CPLEU # Compare Less Than or Equal To, Unsigned **Operation:** IF SRCA $\leq$ SRCB (unsigned) THEN DEST $\leftarrow$ TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPLEU rc, ra, rb or CPLEU rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** If the value of the SRCA operand is less than or equal to the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. For the comparison, both operands are treated as unsigned integers. CPLT CPLT # **Compare Less Than** Operation: IF SRCA < SRCB THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPLT rc, ra, rb or CPLT rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** If the value of the SRCA operand is less than the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. **CPLTU CPLTU** ### Compare Less Than, Unsigned Operation: IF SRCA < SRCB (unsigned) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPLTU rc, ra, rb CPLTU rc, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 42, 43 **Description:** If the value of the SRCA operand is less than the value of the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. For the comparison, both operands are treated as unsigned integers. CPNEQ CPNEQ ### **Compare Not Equal To** **Operation:** IF SRCA <> SRCB THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: CPNEQ rc, ra, rb or CPNEQ rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 62, 63 **CPNEQ** **Description:** If the SRCA operand is not equal to the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is Those is placed into the DEST location, otherwise, a book placed into the DEST location. DADD ### Floating-Point Add, Double-Precision Operation: DEST (double-precision) ← SRCA (double-precision) + SRCB (double-precision) **Assembler** Syntax: DADD rc, ra, rb **DEST** Status: fpX, fpU, fpV, fpR, fpN **Operands:** SRCA Content of register RA and the twin of register RA SRCB Content of register RB and the twin of register RB Register RC and the twin of register RC **Description:** The SRCA operand is added to the SRCB operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operands and the result of the addition are double-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DADD trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DDIV DDIV ### Floating-Point Divide, Double-Precision Operation: DEST (double-precision) ← SRCA (double-precision) / SRCB (double-precision) **Assembler** Syntax: DDIV rc, ra, rb Status: fpD, fpX, fpU, fpV, fpR, fpN Operands: SRCA Content of register RA and the twin of register RA SRCB Content of register RB and the twin of register RB DEST Register RC and the twin of register RC **Description:** The SRCA operand is divided by the SRCB operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operands and the result of the division are double-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DDIV trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DEQ DEQ ## Floating-Point Equal To, Double-Precision **Operation:** IF SRCA (double-precision) = SRCB (double-precision) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: DEQ rc, ra, rb Status: fpl Operands: SRCA Content of register RA and the twin of register RA SRCB Content of register RB and the twin of register RB DEST Register RC ### **Description:** If the SRCA operand is equal to the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. SRCA and SRCB are double-precision floating-point numbers. The rounding mode specified by the FRM field of the Floating-Point Environment Register has no effect on this operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DEQ trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DGE DGE ### Floating-Point Greater Than Or Equal To, Double-Precision **Operation:** IF SRCA (double-precision) ≥ SRCB (double-precision) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: DGE rc, ra, rb Status: fpl Operands: SRCA Content of register RA and the twin of register RA SRCB C Content of register RB and the twin of register RB DEST Register RC **Description:** If the SRCA operand is greater than or equal to the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. SRCA and SRCB are double-precision floating-point numbers. The rounding mode specified by the FRM field of the Floating-Point Environment Register has no effect on this operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DGE trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. **DGT** **DGT** # Floating-Point Greater Than, Double-Precision Operation: IF SRCA (double-precision) > SRCB (double-precision) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: DGT rc, ra, rb Status: fpl Operands: SRCA Content of register RA and the twin of register RA SRCB Content of register RB and the twin of register RB DEST Register RC #### **Description:** If the SRCA operand is greater than the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. SRCA and SRCB are double-precision floating-point numbers. The rounding mode specified by the FRM field of the Floating-Point Environment Register has no effect on this operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DGT trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DIV DIV ### **Divide Step** Operation: Perform one-bit step of a divide operation (unsigned) **Assembler** Syntax: DIV rc, ra, rb or DIV rc, ra, const 8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC ### **Description:** If the Divide Flag (DF) bit of the ALU Status Register is 1, the SRCB operand is subtracted from the SRCA operand. If the DF bit is 0, the SRCB operand is added to the SRCA operand. The carry-out of the add or subtract operation is exclusive-ORed with the value of the DF bit and the value of the Negative (N) bit of the ALU Status Register; the resulting value is complemented and placed into the DF bit. The sign of the result of the add or subtract is placed into the N bit. The content of the Q Register is appended to the result of the add or subtract, and the resulting 64-bit value is shifted left by one bit position; the value computed for the DF bit above fills the vacated bit position. The high-order 32 bits of the 64-bit shifted value are placed into the DEST location. The low-order 32 bits of the shifted value are placed into the Q Register. Examples of integer divide operations appear in Section 2.6.3. DIV<sub>0</sub> DIV<sub>0</sub> #### **Divide Initialize** **Operation:** Initialize for a sequence of divide steps (unsigned) Assembler Syntax: DIV0 rc, rb or DIV0 rc, const8 Status: V, N, Z, C Operands: SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC ### **Description:** The Divide Flag (DF) bit of the ALU Status Register is set. The sign of the SRCB operand is placed into the Negative bit of the ALU Status Register. The content of the Q register is appended to the SRCB operand, and the resulting 64-bit value is shifted left by one bit position; a 0 fills the vacated bit position. The high-order 32 bits of the 64-bit shifted value are placed into the DEST location. The low-order 32 bits of the shifted value are placed into the Q Register. Examples of integer divide operations appear in Section 2.6.3. DIVIDE DIVIDE ### Integer Divide, Signed Operation: DEST ← (Q // SRCA) / SRCB (signed) Remainder **Assembler** Syntax: DIVIDE rc, ra, rb Status: Not affected Operands: Q Content of the Q Register **SRCA** Content of register RA **SRCB** Content of register RB DEST Register RC **Description:** The SRCA operand is appended to the content of the Q register. The resulting 64-bit value is divided by the SRCB operand and the result is placed into the DEST location. This operation treats the operands as signed two's-complement integers and produces a signed two's-complement result. The remainder is placed into the Q register. A non-zero remainder always has the same sign as the dividend. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DIVIDE trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DIVIDU DIVIDU # Integer Divide, Unsigned Operation: DEST $\leftarrow$ (Q // SRCA) / SRCB (unsigned) Q ← Remainder **Assembler** Syntax: DIVIDU rc, ra, rb Status: 1 Not affected Operands: Content of the Q Register SRCA - Content of register RA SRCB Content of register RB DEST Register RC OP = E3 DIVIDU #### Description: The SRCA operand is appended to the content of the Q Register. The resulting 64-bit value is divided by the SRCB operand and the result is placed into the DEST location. This operation treats the operands as unsigned integers, and produces an unsigned result. The remainder is placed into the Q Register. The remainder is also unsigned. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DIVIDU trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DIVL DIVL ### **Divide Last Step** Operation: Complete a sequence of divide steps (unsigned) **Assembler** Syntax: DIVL rc, ra, rb Status: V, N, Z, C Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC ### **Description:** If the Divide Flag (DF) bit of the ALU Status Register is 1, the SRCB operand is subtracted from the SRCA operand. If the DF bit is 0, the SRCB operand is added to the SRCA operand. The result is placed into the DEST location. The carry-out of the add or subtract operation is exclusive-ORed with the value of the DF bit and the value of the Negative (N) bit of the ALU Status Register; the resulting value is complemented and placed into the DF bit. The sign of the result of the add or subtract is placed into the N bit. The content of the Q register is shifted left by one bit position; the value computed for the DF bit above fills the vacated bit position. The shifted value is placed into the Q Register. Examples of integer divide operations appear in Section 2.6.3. **DIVREM** **DIVREM** ### **Divide Remainder** Operation: Generate remainder for divide operation (unsigned) **Assembler** Syntax: DIVREM rc, ra, rb DIVREM rc, ra, const8 Status: V, N, Z, C Operands: **SRCA** Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC DIVREM **Description:** If the Divide Flag (DF) bit of the ALU Status Register is 1, the SRCA operand is placed into the DEST location. If the DF bit is 0, the SRCB operand is added to the SRCA operand and the result is placed into the DEST location. Examples of integer divide operations appear in Section 2.6.3. DMUL DMUL ### Floating-Point Multiply, Double-Precision Operation: DEST (double-precision) ← SRCA (double-precision) \* SRCB (double-precision) **Assembler** Syntax: DMUL rc, ra, rb Status: fpX, fpU, fpV, fpR, fpN Operands: SRCA Content of register RA and the twin of register RA SRCB Content of register RB and the twin of register RB DEST Register RC Description: The SRCB operand is multiplied by the SRCA operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and is placed into the DEST location. The operands and the result of the multiplication are double-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DMUL trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. DSUB DSUB ### Floating-Point Subtract, Double-Precision **Operation:** DEST (double-precision) $\leftarrow$ SRCA (double-precision) - SRCB (double-precision) Assembler Syntax: DSUB rc, ra, rb Status: fpX, fpU, fpV, fpR, fpN Operands: SRCA Content of register RA and the twin of register RA **SRCB** Content of register RB and the twin of register RB DEST Register RC **Description:** The SRCB operand is subtracted from the SRCA operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and is placed into the DEST location. The operands and the result of the subtraction are double-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a DSUB trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. EMULATE ### **Trap to Software Emulation Routine** **EMULATE** Operation: Load IPA and IPB registers with operand register numbers and Trap (VN) **Assembler** Syntax: EMULATE vn, ra, rb .Status: Not affected Operands: Absolute-register numbers for registers RA and RB VN Trap vector number **Description:** The IPA and IPB registers are set to the register numbers of registers RA and RB, respectively. A trap with the specified vector number occurs. Note that the IPC register is also affected by this instruction, but its value has no interpretation. For programs in the User mode, a Protection Violation trap occurs—instead of the EMULATE trap—if a vector number between 0 and 63 is specified. A Protection Violation trap also occurs if RA or RB specifies a register protected by the Register Bank Protect Register. **EXBYTE** **EXBYTE** ### **Extract Byte** Operation: DEST ← SRCB, with low-order byte replaced by byte in SRCA selected by BP **Assembler** Syntax: EXBYTE rc, ra, rb or EXBYTE rc, ra, const8 Status: Not affected Operands: Content of register RA SRCA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC OP = 0A, 0B **EXBYTE** **Description:** A byte in the SRCA operand is selected by the Byte Pointer (BP) field of the ALU Status Register. The selected byte replaces the low-order byte of the SRCB operand and the resulting word is placed into the DEST location. Note: The selection of bytes within words is specified in Section 3.3.5.1. EXHW EXHW ### **Extract Half-Word** **Operation:** DEST ← SRCB, with low-order half-word replaced by half-word in SRCA selected by BP **Assembler** Syntax: EXHW rc, ra, rb or EXHW rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** A half-word in the SRCA operand is selected by the Byte Pointer (BP) field of the ALU Status Register. The selected half-word replaces the low-order half-word of the SRCB operand and the resulting word is placed into the DEST location. Note: The selection of half-words within words is specified in Section 3.3.5.1. **EXHWS** **EXHWS** # Extract Half-Word, Sign-Extended Operation: DEST ← half-word in SRCA selected by BP, sign-extended to 32 bits **Assembler** Syntax: EXHWS rc, ra Status: Not affected Operands: SRCA Content of register RA **DEST** Register RC . **Description:** A half-word in the SRCA operand is selected by the Byte Pointer (BP) field of the ALU Status Register. The selected half-word is sign-extended to 32 bits and the resulting word is placed into the DEST location. Note: The selection of half-words within words is specified in Section 3.3.5.1. EXTRACT ### **Extract Word, Bit-Aligned** **Operation:** DEST ← high-order word of (SRCA // SRCB << FC) **Assembler** Syntax: EXTRACT rc, ra, rb or EXTRACT rc, ra, const8 Status: Not affected **SRCB** Operands: SRCA SRCA Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 7A, 7B **EXTRACT** **Description:** The SRCB operand is appended to the SRCA operand and the resulting 64-bit value is shifted left by the number of bit-positions specified by the Funnel Shift Count (FC) field of the ALU Status register. The high-order 32 bits of the 64-bit shifted value are placed in the DEST location. If the SRCB operand is the same as the SRCA operand, the EXTRACT instruction performs a rotate operation. **FADD** **FADD** # Floating-Point Add, Single-Precision Operation: DEST (single-precision) ← SRCA (single-precision) + SRCB (single-precision) **Assembler** Syntax: FA FADD rc, ra, rb Status: fpX, fpU, fpV, fpR, fpN Operands: Content of register RA SRCA SRCB Content of register RB DEST Register RC ### **Description:** The SRCA operand is added to the SRCB operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operands and the result of the addition are single-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FADD trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. FDIV FDIV ### Floating-Point Divide, Single-Precision **Operation:** DEST (single-precision) ← SRCA (single-precision) / SRCB (single-precision) **Assembler** Syntax: FDIV rc, ra, rb Status: fpD, fpX, fpU, fpV, fpR, fpN Operands: SRCA Content of register RA SRCB Content of register RB DEST Register RC **Description:** The SRCA operand is divided by the SRCB operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operands and the result of the division are single-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FDIV trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. FDMUL FDMUL ### Floating-Point Multiply, Single-to-Double Precision Operation: DEST (double-precision) ← SRCA (single-precision) \* SRCB (single-precision) **Assembler** Syntax: FDMUL rc, ra, rb Status: fpR, fpN Operands: SRCA SRCB Content of register RB DEST Register RC Content of register RA 4 **Description:** The SRCB operand is multiplied by the SRCA operand; the result is placed into the DEST location. SRCA and SRCB are single-precision floating-point numbers; the result is produced in double-precision format. Because the product of two single-precision operands can always be represented exactly as a double-precision number, the FDMUL result does not depend on the FRM field of the Floating-Point Environment Register. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FDMUL trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. **FEQ FEQ** ### Floating-Point Equal To, Single-Precision Operation: IF SRCA (single-precision) = SRCB (single-precision) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: FEQ rc, ra, rb Status: fpN Operands: SRCA Content of register RA **SRCB** Content of register RB DEST Register RC **Description:** If the SRCA operand is equal to the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. SRCA and SRCB are single-precision floating-point numbers. The rounding mode specified by the FRM field of the Floating-Point Environment Register has no effect on this operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FEQ trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. FGE FGE ### Floating-Point Greater Than Or Equal To, Single-Precision **Operation:** IF SRCA (single-precision) ≥ SRCB (single-precision) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: FGE rc, ra, rb **DEST** Status: fpN Operands: SRCA Content of register RA SRCB Content of register RB Register RC ### **Description:** If the SRCA operand is greater than or equal to the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. SRCA and SRCB are single-precision floating-point numbers. The rounding mode specified by the FRM field of the Floating-Point Environment Register has no effect on this operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FGE trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. FGT FGT ### Floating-Point Greater Than, Single-Precision **Operation:** IF SRCA (single-precision) > SRCB (single-precision) THEN DEST ← TRUE ELSE DEST ← FALSE **Assembler** Syntax: FGT rc, ra, rb Status: fpN Operands: SRCA Content of register RA SRCB Content of register RB DEST Register RC **Description:** If the SRCA operand is greater than the SRCB operand, a Boolean TRUE is placed into the DEST location; otherwise, a Boolean FALSE is placed into the DEST location. SRCA and SRCB are single-precision floating-point numbers. The rounding mode specified by the FRM field of the Floating-Point Environment Register has no effect on this operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FGT trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. FMUL FMUL ### Floating-Point Multiply, Single-Precision Operation: DEST (single-precision) ← SRCA (single-precision) \* SRCB (single-precision) **Assembler** Syntax: FMUL rc, ra, rb Status: fpX, fpU, fpV, fpR, fpN Operands: SRCA Content of register RA SRCB Content of register RB DEST Register RC **Description:** The SRCA operand is multiplied by the SRCB operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operands and the result of the multiplication are single-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an FMUL trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. **FSUB FSUB** ### Floating-Point Subtract, Single-Precision Operation: DEST (single-precision) ← SRCA (single-precision) – SRCB (single-precision) Assembler Syntax: FSUB rc, ra, rb Status: fpX, fpU, fpV, fpR, fpN Operands: **SRCA** Content of register RA **SRCB** Content of register RB **DEST** Register RC **FSUB** #### **Description:** The SRCB operand is subtracted from the SRCA operand; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operands and the result of the subtraction are single-precision floating-point numbers. Note: This instruction is not supported directly in processor hardware. In the current implementation, this instruction causes an FSUB trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. **HALT** **HALT** #### **Enter Halt Mode** Operation: Enter Halt mode on next cycle Assembler Syntax: HALT Status: Not affected Operands: Not applicable ### **Description:** The processor is placed into the Halt mode in the next cycle, or in the cycle after an external data access is completed if an access is in progress. This instruction may be executed only by Supervisor-mode programs. An attempted execution by a User-mode program causes a Protection Violation trap to occur unless the Protection Violation trap was disabled during reset (see Section 17.7.5). If the instruction following a Halt instruction has an exception (e.g., TLB Miss), the trap associated with this exception is taken before the processor enters the Halt mode. **INBYTE** **INBYTE** ### **Insert Byte** Operation: DEST ←SRCA, with byte selected by BP replaced by low-order byte of SRCB **Assembler** Syntax: INBYTE rc, ra, rb INBYTE rc, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC OP =0C, 0D **INBYTE** #### **Description:** A byte in the SRCA operand is selected by the Byte Pointer (BP) field of the ALU Status Register. The selected byte is replaced by the low-order byte of the SRCB operand and the resulting word is placed into the DEST location. Note: The selection of bytes within words is specified in Section 3.3.5.1. INHW **INHW** ### **Insert Half-Word** **Operation:** DEST ←SRCA, with half-word selected by BP replaced by low-order half-word of SRCB **Assembler** Syntax: INHW rc, ra, rb or **SRCA** INHW rc, ra, const8 Status: Not affected Operands: Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC Description: A half-word in the SRCA operand is selected by the Byte Pointer (BP) field of the ALU Status Register. The selected half-word is replaced by the low-order half-word of the SRCB operand and the resulting word is placed into the DEST location. Note: The selection of half-words within words is specified in Section 3.3.5.1. INV INV ### Invalidate Operation: None **Assembler** Syntax: INV Status: Not affected Operands: Not applicable OP = 9F **Description:** In 29K Family processors with instruction caches, this instruction causes all cache valid bits to be reset. In the Am29200 and Am29205 microcontrollers, this instruction performs no operation, except it is a privileged instruction. Attempted execution by a User-mode program causes a Protection Violation trap to occur. **IRET** **IRET** ## Interrupt Return Operation: Perform an interrupt return sequence Assembler Syntax: **IRET** Status: Not affected Operands: Not applicable **Description:** This instruction performs the interrupt return sequence described in Section 16.4.4. This instruction may be executed only by Supervisor-mode programs. An attempted execution by a User-mode program causes a Protection Violation trap to occur. IRETINV IRETINV ### Interrupt Return and Invalidate Operation: Perform an interrupt return sequence **Assembler** Syntax: IRETINV Status: Not affected Operands: Not applicable Description: This instruction performs the interrupt return sequence described in Section 16.4.4. In 29K Family processors with an instruction cache, this instruction also resets the valid bits in the cache. In the Am29200 and Am29205 microcontrollers, this instruction is identical to the IRET instruction. This instruction may be executed only by Supervisor-mode programs. An attempted execution by a User-mode program causes a Protection Violation trap to occur. **JMP** **JMP** Jump Operation: PC ← TARGET Execute delay instruction Assembler Syntax: JMP target Status: Not affected Operands: **TARGET** A = 0: I17 ... I10 // I9 ... I2 (sign-extended to 30 bits) + PC A = 1: I17 ... I10//I9 ... I2 (zero-extended to 30 bits) **Description:** A non-sequential instruction fetch occurs to the instruction address given by the TARGET operand. The instruction following the JMP is executed before the non-sequential fetch occurs. **JMPF** **JMPF** ### **Jump False** Operation: IF SRCA = FALSE THEN PC ← TARGET Execute delay instruction **Assembler** Syntax: JMPF ra, target Status: Not affected Operands: SRCA Content of register RA **TARGET** A = 0: I17 ... I10 // I9 ... I2 (sign-extended to 30 bits) + PC A = 1: I17 ... I10 // I9 ... I2 (zero-extended to 30 bits) Description: If SRCA is a Boolean FALSE, a non-sequential instruction fetch occurs to the instruction address given by the TARGET operand. If SRCA is a Boolean TRUE, this instruction has no effect. The instruction following the JMPF is executed regardless of the value of SRCA. JMPFDEC JMPFDEC ### **Jump False and Decrement** **Operation:** IF SRCA = FALSE THEN SRCA ← SRCA – 1 PC ← TARGET **ELSE** SRCA ← SRCA – 1 Execute delay instruction **Assembler** Syntax: JMPFDEC ra, target Status: Not affected Operands: SRCA Content of register RA TARGET A = 0: I17 ... I10 // I9 ... I2 (sign-extended to 30 bits) + PC A = 1: 117 ... 110 // 19 ... 12 (zero-extended to 30 bits) OP = B4, B5 **JMPFDEC** #### **Description:** If SRCA is a Boolean FALSE, a non-sequential instruction fetch occurs to the instruction address given by the TARGET operand. If SRCA is a Boolean TRUE, this instruction has no effect on the instruction-execution sequence. The SRCA operand is decremented by one, regardless of whether or not the non-sequential instruction fetch occurs. Note that a negative number for the SRCA operand is a Boolean TRUE. The instruction following the JMPFDEC is executed regardless of the value of SRCA. JMPFI JMPFI ## **Jump False Indirect** Operation: IF SRCA = FALSE THEN PC ← SRCB Execute delay instruction **Assembler** Syntax: JMPFI ra, rb Status: Not affected Operands: SRCA Content of register RA **SRCB** Content of register RB **Description:** If the SRCA is a Boolean FALSE, a non-sequential instruction fetch occurs to the instruction address given by the SRCB operand. If SRCA is a Boolean TRUE, this instruction has no effect. The instruction following the JMPFI is executed regardless of the value of SRCA. **JMPI** **JMPI** # Jump Indirect Operation: PC ← SRCB Execute delay instruction **Assembler** Syntax: JMPI rb Status: Not affected Operands: **SRCB** Content of register RB Description: A non-sequential instruction fetch occurs to the instruction address given by the SRCB operand. The instruction following the JMPI is executed before the non-sequential fetch occurs. **JMPT** **JMPT** ### **Jump True** Operation: IF SRCA = TRUE THEN PC ← TARGET Execute delay instruction **Assembler** Syntax: JMPT ra, target Status: Not affected Operands: SRCA Content of register RA TARGET A = 0: I17 ... I10 // I9 ... I2 (sign-extended to 30 bits) + PC A = 1: I17 ... I10 // I9 ... I2 (zero-extended to 30 bits) **Description:** If SRCA is a Boolean TRUE, a non-sequential instruction fetch occurs to the instruction address given by the TARGET operand. If SRCA is a Boolean FALSE, this instruction has no effect. The instruction following the JMPT is executed regardless of the value of SRCA. JMPTI JMPTI ## **Jump True Indirect** **Operation:** IF SRCA = TRUE THEN PC $\leftarrow$ SRCB Execute delay instruction **Assembler** Syntax: JMPTI ra, rb Status: Not affected Operands: SF SRCA Content of register RA SRCB Content of register RB **Description:** If the SRCA is a Boolean TRUE, a non-sequential instruction fetch occurs to the instruction address given by the SRCB operand. If SRCA is a Boolean FALSE, this instruction has no effect. The instruction following the JMPTI is executed regardless of the value of SRCA. LOAD LOAD Load Operation: DEST ← EXTERNAL WORD [SRCB] **Assembler** Syntax: LOAD 0, cntl, ra, rb or LOAD 0, cntl, ra, const8 Status: Not affected Operands: SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RA **Description:** The external word addressed by the SRCB operand is placed into the DEST location. The CNTL field of the LOAD instruction affects the bus access as described in Section 3.3.1. LOADL LOADL ### Load and Lock Operation: DEST ← EXTERNAL WORD [SRCB] **Assembler** Syntax: LOADL 0, cntl, ra, rb or LOADL 0, cntl, ra, const8 Status: Not affected Operands: SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RA **Description:** The external word addressed by the SRCB operand is placed into the DEST location. The CNTL field of the LOADL instruction affects the bus access as described in Section 3.3.1. In other 29K Family processors, this instruction is provided for the implementation of interlock protocols. In the Am29200 and Am29205 microcontrollers, the LOADL instruction is identical to the LOAD instruction. LOADM ### **Load Multiple** Operation: DEST... DEST+COUNT ← EXTERNAL WORD [SRCB] ... EXTERNAL WORD [SRCB + (COUNT \* 4)] **Assembler** Syntax: LOADM 0, cntl, ra, rb or LOADM 0, cntl, ra, const8 Status: Not affected Operands: SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST register RA **Description:** External words at consecutive word addresses beginning with the word addressed by the SRCB operand, are placed into consecutive registers beginning with the DEST location. The total number of words accessed in the sequence is specified by the Count Remaining (CR) field of the Channel Control Register (which also appears in the Load/Store Count Remaining Register) at the beginning of the bus access. The total number of words is the value of the CR field plus one. The CNTL field of the LOADM instruction affects the access as described in Section 3.3.1. Note: The address and register-number sequences for the LOADM instruction are specified in Section 3.3.4. Because this instruction uses the Channel Address and Control Registers, it should not be executed when the FZ bit is 1. LOADSET LOADSET ### Load and Set Operation: DEST ← EXTERNAL WORD [SRCB] EXTERNAL WORD [SRCB] ← h'FFFFFFF' **Assembler** Syntax: LOADSET 0, cntl, ra, rb or LOADSET 0, cntl, ra, const8 Status: Not affected Operands: SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RA ### **Description:** The external word addressed by the SRCB operand is placed into the DEST location. After the DEST location is altered, the external word addressed by the SRCB operand is written, atomically, with a word consisting of a 1 in every bit position. The CNTL field of the LOADSET instruction affects the bus access as described in Section 3.3.1. **MFSR** **MFSR** # **Move from Special Register** Operation: DEST ← SPECIAL **Assembler** Syntax: MFSR rc, spid Status: Not affected SPECIAL Operands: Content of special-purpose register SA DEST Register RC Description: The SPECIAL operand is placed into the DEST location. For programs in the User mode, a Protection Violation trap occurs if SA specifies a protected special-purpose register. If a trap occurs, the DEST location is not altered. MFTLB MFTLB ### Move from Translation Look-Aside Buffer Register Operation: None **Assembler** Syntax: MFTLB rc, ra Status: Not affected Operands: SRCA Content of register RA, bits 6 ... 0 DEST Register RC **Description:** In 29K Family processors with an MMU, this instruction reads TLB entries. In the Am29200 and Am29205 microcontrollers, this instruction performs no operation except it is a privileged instruction. Attempted execution by a User-mode program causes a Protection Violation trap to occur. **MTSR** **MTSR** ## Move to Special Register Operation: SPDEST ← SRCB **Assembler** Syntax: MTSR spid, rb Status: Not affected unless the destination is the ALU Status Register Operands: Content of register RB SRCB SPDEST Special-purpose register SA **Description:** The SRCB operand is placed into the SPECIAL location. For programs in the User mode, a Protection Violation trap occurs if SA specifies a protected special-purpose register. If a trap occurs, the SPDEST location is not altered. MTSRIM MTSRIM ## Move to Special Register Immediate Operation: SPDEST ← 0l16 **Assembler** Syntax: MTSRIM spid, const16 Status: Not affected unless the destination is the ALU Status Register SPDEST Special-purpose register SA **Description:** The 0l16 operand is placed into the SPECIAL location. For programs in the User mode, a Protection Violation trap occurs if SA specifies a protected special-purpose register. If a trap occurs, the SPDEST location is not altered. **MTTLB** **MTTLB** Move to Translation Look-Aside Buffer Register Operation: None **Assembler** Syntax: MTTLB ra, rb Status: Not affected Operands: Content of register RA, bits 6...0 SRCA SRCB Content of register RB **Description:** In 29K Family processors with an MMU, this instruction modifies TLB entries. In the Am29200 and Am29205 microcontrollers, this instruction performs no operation except it is a privileged instruction. Attempted execution by a User-mode program causes a Protection Violation trap to occur. MUL MUL ## **Multiply Step** **Operation:** Perform one-bit step of a multiply operation **Assembler** Syntax: MUL rc, ra, rb **SRCB** or MUL rc, ra, const 8 Status: V, N, Z, C Operands: SRCA SRCA Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) ... (=5.5 5...5... DEST Register RC **Description:** If the least significant bit of the Q Register is 1, the SRCA operand is added to the SRCB operand. If the least significant bit of the Q register is 0, a zero word is added to the SRCB operand. The content of the Q Register is appended to the result of the add and the resulting 64-bit value is shifted right by one bit position; the true sign of the result of the add fills the vacated bit position (i.e., the sign of the result is complemented if an overflow occurred during the add operation). The high-order 32 bits of the 64-bit shifted value are placed into the DEST location. The low-order 32 bits of the shifted value are placed into the Q Register. Examples of integer multiply operations appear in Section 2.6.2. MULL MULL #### **Multiply Last Step** Operation: Complete a sequence of multiply steps (for signed multiply) **Assembler** Syntax: MU MULL rc, ra, rb or MULL rc, ra, const 8 Status: V, N, Z, C SRCA Operands: Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC #### Description: If the least significant bit of the Q Register is 1, the SRCA operand is subtracted from the SRCB operand. If the least significant bit of the Q register is 0, a zero word is subtracted from the SRCB operand. The content of the Q Register is appended to the result of the subtract and the resulting 64-bit value is shifted right by one bit position; the true sign of the result of the subtract fills the vacated bit position (i.e., the sign of the result is complemented if an overflow occurred during the subtract operation). The high-order 32 bits of the 64-bit shifted value are placed into the DEST location. The low-order 32 bits of the shifted value are placed into the Q Register. Examples of integer multiply operations appear in Section 2.6.2. **MULTIPLU** MULTIPLU ## Integer Multiply, Unsigned Operation: DEST ← SRCA \* SRCB **Assembler** Syntax: MULTIPLU rc, ra, rb Status: None Operands: **SRCA** Content of register RA **SRCB** Content of register RB DEST Register RC OP = E2 **MULTIPLU** #### **Description:** The SRCA operand is multiplied by the SRCB operand. The low-order 32 bits of the 64-bit result are placed into the DEST location. This operation treats the SRCA and SRCB operands as unsigned integers and produces an unsigned result. The contents of the Q register are undefined after a MULTIPLU operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an MULTIPLU trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. **MULTIPLY** **MULTIPLY** ### Integer Multiply, Signed Operation: DEST ← SRCA \* SRCB **Assembler** Syntax: MULTIPLY rc, ra, rb Status: None Operands: **SRCA** Content of register RA Content of register RB **SRCB DEST** Register RC **Description:** The SRCA operand is multiplied by the SRCB operand. The low-order 32 bits of the 64-bit result are placed into the DEST location. This operation treats the SRCA and SRCB operands as two's-complement integers and produces a two's-complement result. The contents of the Q register are undefined after a MULTIPLY operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a MULTIPLY trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. MULTM **MULTM** ## Integer Multiply most significant Bits, Signed Operation: DEST ← SRCA \* SRCB **Assembler** Syntax: MULTM rc, ra, rb Status: None Operands: SRCA Content of register RA SRCB Content of register RB DEST Register RC Description: The SRCA operand is multiplied by the SRCB operand. The high-order 32 bits of the 64-bit result are placed into the DEST location. This operation treats the SRCA and SRCB operands as two's-complement integers and produces a two's-complement result. The contents of the Q register are undefined after a MULTM operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes a MULTM trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. MULTMU MULTMU Integer Multiply most significant Bits, Unsigned Operation: DEST ← SRCA \* SRCB Assembler Syntax: MULTMU rc, ra, rb Status: None Operands: **SRCA** Content of register RA **SRCB** Content of register RB **DEST** Register RC MULTMU #### **Description:** The SRCA operand is multiplied by the SRCB operand. The high-order 32 bits of the 64-bit result are placed into the DEST location. This operation treats the SRCA and SRCB operands as unsigned integers and produces an unsigned result. The contents of the Q register are undefined after a MULTMU operation. Note: This instruction is not supported directly in processor hardware. In the current implementation this instruction causes an MULTMU trap. When the trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, SRCB, and DEST. MULU MULU ## Multiply Step, Unsigned Operation: Perform one-bit step of a multiply operation (unsigned) **Assembler** Syntax: MULU rc, ra, rb or MULU rc, ra, const 8 Status: V, N, Z, C Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC #### **Description:** If the least significant bit of the Q Register is 1, the SRCA operand is added to the SRCB operand. If the least significant bit of the Q register is 0, a zero word is added to the SRCB operand. The content of the Q register is appended to the result of the add and the resulting 64-bit value is shifted right by one bit position; the carry-out of the add fills the vacated bit position. The high-order 32 bits of the 64-bit shifted value are placed into the DEST location. The low-order 32 bits of the shifted value are placed into the Q Register. Examples of integer multiply operations appear in Section 2.6.2. NAND **NAND** # **NAND Logical** Operation: DEST ← ~(SRCA & SRCB) **Assembler** Syntax: NAND rc, ra, rb 10 NAND rc, ra, const8 Status: N, Z Operands: Content of register RA SRCA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCA operand is logically ANDed, bit-by-bit, with the SRCB operand. The one's-complement of the result is placed into the DEST location. NOR NOR ## **NOR Logical** Operation: DEST ← ~(SRCA I SRCB) **Assembler** Syntax: NOR rc, ra, rb or NOR rc, ra, const8 Status: N, Z Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC **Description:** The SRCA operand is logically ORed, bit-by-bit, with the SRCB operand. The one's-complement of the result is placed into the DEST location. OR OR # **OR Logical** Operation: DEST ← SRCA | SRCB Assembler<sup>®</sup> Syntax: OR rc, ra, rb or OR rc, ra, const8 Status: N, Z Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCA operand is logically ORed, bit-by-bit, with the SRCB operand, and the result is placed into the DEST location. **SETIP** **SETIP** #### **Set Indirect Pointers** Operation: Load IPA, IPB, and IPC registers with operand-register numbers **Assembler** Syntax: SETIP rc, ra, rb Status: Not affected Operands: Absolute-register numbers for registers RA, RB, and RC **Description:** The IPA, IPB, and IPC registers are set to the register numbers of registers RA, RB, and RC, respectively. For programs in the User mode, a Protection Violation trap occurs if RA, RB, or RC specifies a register protected by the Register Bank Protect Register. Note: This instruction has a delayed effect on the indirect pointer registers as discussed in Section 5.6. SLL SLL ### **Shift Left Logical** Operation: DEST ← SRCA << SRCB (zero fill) **Assembler** Syntax: SLL rc, ra, rb or SLL rc, ra, const8 Status: Not affected Operands: SRCA SRCB Content of register RA . M = 0: Content of register RB, bits 4 ... 0 M = 1: I, bits 4...0 DEST Register RC **Description:** The SRCA operand is shifted left by the number of bit positions specified by the SRCB operand; zeros fill vacated bit positions. The result is placed into the DEST location. SQRT ### **Floating-Point Square Root** **Operation:** DEST ← SQRT(SRCA) **Assembler** Syntax: SQRT rc, ra, FS Status: fpX, fpR, fpN Operands: SRCA Content of register RA (single-precision floating-point) or Content of register RA and the twin of register RA (double-precision floating-point) DEST Register RC (single-precision floating-point) or Register RC and twin of Register RC (double-precision floating-point) Control: FS Format of source operand SRCA 00 Reserved for future use 01 Single-precision floating-point 10 Double-precision floating-point 11 Reserved for future use Description: This operation computes the square root of floating-point operand SRCA; the result is rounded according to the FRM field of the Floating-Point Environment Register and placed into the DEST location. The operand and result are single- or double-precision floating-point numbers as specified by FS. Note: This instruction is not supported directly in processor hardware. In the current implementation, this instruction causes an SQRT trap. When the trap occurs, the IPA and IPC registers are set to reference SRCA and DEST, and the IPB Register is set with the value of the FS field. SRA SRA ## **Shift Right Arithmetic** Operation: DEST ← SRCA >> SRCB (sign fill) **Assembler** Syntax: SRA rc, ra, rb or SRA rc, ra, const8 Status: Not affected Operands: Content of register RA SRCA SRCB M = 0: Content of register RB, bits 4.... 0 M = 1: I, bits 4 ... 0 DEST Register RC Description: The SRCA operand is shifted right by the number of bit positions specified by the SRCB operand; the sign of the SRCA operand fills vacated bit positions. The result is placed into the DEST location. SRL SRL # **Shift Right Logical** Operation: DEST ← SRCA >> SRCB (zero fill) **Assembler** Syntax: SRL rc, ra, rb or SRL rc, ra, const8 Status: Not affected Operands: SRCA Content of register RA SRCB M = 0: Content of register RB, bits 4 ... 0 M = 1: I, bits 4 ... 0 DEST Register RC **Description:** The SRCA operand is shifted right by the number of bit positions specified by the SRCB operand; zeros fill vacated bit positions. The result is placed into the DEST location. STORE STORE #### Store Operation: EXTERNAL WORD [SRCB] ← SRCA **Assembler** Syntax: STORE 0, cntl, ra, rb or STORE 0, cntl, ra, const8 Status: Not affected Operands: Content of register RA SRCA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) Description: The SRCA operand is placed into the external word addressed by the SRCB operand. The CNTL field of the STORE instruction affects the bus access as described in Section 3.3.1. **STOREL** **STOREL** #### Store and Lock Operation: EXTERNAL WORD [SRCB] ← SRCA **Assembler** Syntax: STOREL 0, cntl, ra, rb STOREL 0, cntl, ra, const8 Status: Not affected Operands: Content of register RA **SRCA SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) Description: The SRCA operand is placed into the external word addressed by the SRCB operand. The CNTL field of the STOREL instruction affects the bus access as described in Section 3.3.1. In other 29K Family processors, this instruction is provided for the implementation of interlock protocols. In the Am29200 and Am29205 microcontrollers, the STOREL instruction is identical to the STORE instruction. **STOREM STOREM** #### Store Multiple Operation: EXTERNAL WORD [SRCB] ... EXTERNAL WORD [SRCB + (COUNT \* 4)] ← SRCA ... SRCA + COUNT **Assembler** Syntax: STOREM 0, cntl, ra, rb **SRCB** STOREM 0, cntl, ra, const8 Status: Not affected Operands: **SRCA** Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **Description:** The contents of consecutive registers, beginning with the SRCA operand, are placed into external words at consecutive word addresses, beginning with the word addressed by the SRCB operand. The total number of words accessed in the sequence is specified by the Count Remaining (CR) field of the Channel Control Register (which also appears in the Load/Store Count Remaining Register) at the beginning of the bus access. The total number of words is the value of the CR field plus one. The CNTL field of the STOREM instruction affects the access as described in Section 3.3.1. Note: The address and register-number sequences for the STOREM instruction are specified in Section 3.3.4. Because this instruction uses the Channel Address, Data, and Control Registers, it should not be executed when the FZ bit is 1. SUB SUB ### **Subtract** Operation: DEST ← SRCA – SRCB **Assembler** Syntax: SUB rc, ra, rb or SUB rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST Register RC** Description: The SRCA operand is added to the two's-complement of the SRCB operand and the result is placed into the DEST location. SUBC SUBC ## **Subtract with Carry** Operation: DEST ← SRCA - SRCB - 1 + C **Assembler** Syntax: SUBC rc, ra, rb or SUBC rc, ra, const8 Status: \ V, N, Z, C Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCA operand is added to the one's-complement of the SRCB operand and the value of the ALU Status Carry bit. The result is placed into the DEST location. SUBCS SUBCS ## Subtract with Carry, Signed **Operation:** DEST $\leftarrow$ SRCA - SRCB - 1 + C IF signed overflow THEN Trap (Out of Range) **Assembler** Syntax: SUBCS rc, ra, rb or SUBCS rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 28.29 **SUBCS** #### **Description:** The SRCA operand is added to the one's-complement of the SRCB operand and the value of the ALU Status Carry bit. The result is placed into the DEST location. If the add operation causes a two's-complement signed overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow SUBCU SUBCU ### Subtract with Carry, Unsigned **Operation:** DEST $\leftarrow$ SRCA - SRCB - 1 + C IF unsigned underflow THEN Trap (Out of Range) **Assembler** Syntax: SUBCU rc, ra, rb or SUBCU rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC **Description:** The SRCA operand is added to the one's-complement of the SRCB operand and the value of the ALU Status Carry bit. The result is placed into the DEST location. If the add operation causes an unsigned underflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an underflow **SUBR** **SUBR** #### **Subtract Reverse** Operation: DEST ← SRCB - SRCA Assembler Syntax: SUBR rc, ra, rb or SUBR rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCB operand is added to the two's-complement of the SRCA operand and the result is placed into the DEST location. **SUBRC** **SUBRC** # **Subtract Reverse with Carry** Operation: DEST ← SRCB - SRCA - 1 + C **Assembler** Syntax: SUBF SUBRC rc, ra, rb or SUBRC rc, ra, const8 Status: V, N, Z, C Operands: Content of register RA SRCA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC Description: The SRCB operand is added to the one's-complement of the SRCA operand and the value of the ALU Status Carry bit. The result is placed into the DEST location. **SUBRCS** **SUBRCS** ## Subtract Reverse with Carry, Signed Operation: DEST ← SRCB - SRCA - 1 + C IF signed overflow THEN Trap (Out of Range) Assembler Syntax: SUBRCS rc, ra, rb SRCB SUBRCS rc, ra, const8 Status: V, N, Z, C Operands: **SRCA** Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 38, 39 **SUBRCS** #### **Description:** The SRCB operand is added to the one's-complement of the SRCA operand and the value of the ALU Status Carry bit. The result is placed into the DEST location. If the add operation causes a two's-complement signed overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow SUBRCU SUBRCU ## **Subtract Reverse with Carry, Unsigned** **Operation:** DEST $\leftarrow$ SRCB - SRCA - 1 + C IF unsigned underflow THEN Trap (Out of Range) **Assembler** Syntax: SUBRCU rc, ra, rb or SUBRCU rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCB operand is added to the one's-complement of the SRCA operand and the value of the ALU Status Carry bit. The result is placed into the DEST location. If the add operation causes an unsigned underflow, an Out-of-Range $\,$ trap occurs. Note that the DEST location is altered whether or not an underflow **SUBRS** **SUBRS** ## Subtract Reverse, Signed Operation: DEST ← SRCB - SRCA IF signed overflow THEN Trap (Out of Range) **Assembler** Syntax: SUBRS rc, ra, rb SUBRS rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 30, 31 **SUBRS** **Description:** The SRCB operand is added to the two's-complement of the SRCA operand and the result is placed into the DEST location. If the add operation causes a two's-complement signed overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow occurs. SUBRU SUBRU ## **Subtract Reverse, Unsigned** Operation: DEST ← SRCB - SRCA IF unsigned underflow THEN Trap (Out of Range) **Assembler** Syntax: SUBRU rc, ra, rb or SUBRU rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA **SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCB operand is added to the two's-complement of the SRCA operand and the result is placed into the DEST location. If the add operation causes an unsigned underflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an underflow SUBS SUBS ## Subtract, Signed **Operation:** DEST ← SRCA – SRCB IF signed overflow THEN Trap (Out of Range) **Assembler** Syntax: SUBS rc, ra, rb or SUBS rc, ra, const8 Status: V, N, Z, C Operands: SRCA Co SRCB M= Content of register RA M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC OP = 20, 21 **SUBS** Description: The SRCA operand is added to the two's-complement of the SRCB operand and the result is placed into the DEST location. If the add operation causes a two's-complement signed overflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an overflow SUBU SUBU #### Subtract, Unsigned **Operation:** DEST ← SRCA – SRCB IF unsigned underflow THEN Trap (Out of Range) **Assembler** Syntax: SUBU rc, ra, rb or SUBU rc, ra, const8 Status: V, N, Z, C Operands: SRCA Content of register RA SRCB M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCA operand is added to the two's-complement of the SRCB operand and the result is placed into the DEST location. If the add operation causes an unsigned underflow, an Out-of-Range trap occurs. Note that the DEST location is altered whether or not an underflow **XNOR** **XNOR** ## **Exclusive-NOR Logical** Operation: DEST ← ~ (SRCA ^ SRCB) **Assembler** Syntax: XNOR rc, ra, rb XNOR rc, ra, const8 Status: N, Z Operands: Content of register RA **SRCA SRCB** M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) **DEST** Register RC **Description:** The SRCA operand is logically exclusive-ORed, bit-by-bit, with the SRCB operand. The one's-complement of the result is placed into the **DEST** location. **XOR** **XOR** ## **Exclusive-OR Logical** Operation: DEST ← SRCA ^ SRCB **Assembler** Syntax: XOR rc, ra, rb or XOR rc, ra, const8 Status: N, Z Operands: SRCA Content of register RA SRCB M M = 0: Content of register RB M = 1: I (Zero-extended to 32 bits) DEST Register RC **Description:** The SRCA operand is logically exclusive-ORed, bit-by-bit, with the SRCB operand, and the result is placed into the DEST location. | 40.4 | | | | AIVID | |------|----------------|---------|-------------------------------------------------|-------| | 18.4 | | | BY OPERATION CODE | | | | 01 | CONSTN | Constant, Negative | | | | 02 | CONSTH | Constant, High | | | | 03 | CONST | Constant | | | | 04 | MTSRIM | Move to Special Register Immediate | · | | | 06,07 | LOADL | Load and Lock | | | | 08,09 | CLZ | Count Leading Zeros | | | | 0A,0B | EXBYTE | Extract Byte | | | | 0C,0D | INBYTE | Insert Byte | | | | 0E,0F | STOREL | Store and Lock | | | | 10,11<br>12,13 | ADDU | Add, Signed | | | | 12,13 | ADDO | Add, Unsigned<br>Add | | | | | LOAD | Load | | | 4 | 16,17<br>18,19 | ADDCS | Add with Carry, Signed | | | | 16,19<br>1A,1B | ADDCU | Add with Carry, Signed Add with Carry, Unsigned | | | | 1C,1D | ADDCO | Add with Carry | | | | 16,16<br>1E,1F | STORE | Store | | | | 20,21 | SUBS | Subtract, Signed | | | | 22,23 | SUBU | Subtract, Unsigned | | | | 24,25 | SUB | Subtract | | | | 26,27 | LOADSET | Load and Set | | | | 28,29 | SUBCS | Subtract with Carry, Signed | | | | 2A,2B | SUBCU | Subtract with Carry, Unsigned | | | | 2C,2D | SUBC | Subtract with Carry | | | | 2E,2F | CPBYTE | Compare Bytes | | | | 30,31 | SUBRS | Subtract Reverse, Signed | | | | 32,33 | SUBRU | Subtract Reverse, Unsigned | | | | 34,35 | SUBR | Subtract Reverse | | | | 36,37 | LOADM | Load Multiple | | | | 38,39 | SUBRCS | Subtract Reverse with Carry, Signed | | | | 3A,3B | SUBRCU | Subtract Reverse with Carry, Unsigned | | | | 3C,3D | SUBRC | Subtract Reverse with Carry | | | | 3E,3F | STOREM | Store Multiple | | | | 40,41 | CPLT | Compare Less Than | | | | 42,43 | CPLTU | Compare Less Than, Unsigned | | | | 44,45 | CPLE | Compare Less Than or Equal To | | | | 46,47 | CPLEU | Compare Less Than or Equal To, Unsigned | | | | 48,49 | CPGT | Compare Greater Than | | | | 4A,4B | CPGTU | Compare Greater Than, Unsigned | | | | 4C,4D | CPGE | Compare Greater Than or Equal To | | | | 4E,4F | CPGEU | Compare Greater Than or Equal To, Unsigne | d | | | 50,51 | ASLT | Assert Less Than | | | | 52,53 | ASLTU | Assert Less Than, Unsigned | | | | 54,55 | ASLE | Assert Less Than or Equal To | | | | 56,57 | ASLEU | Assert Less Than or Equal To, Unsigned | | | 58,59 | ASGT | Assert Greater Than | |------------|---------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 5A,5B | ASGTU | Assert Greater Than, Unsigned | | 5C,5D | ASGE | Assert Greater Than or Equal To | | 5E,5F | ASGEU | Assert Greater Than or Equal To, Unsigned | | 60,61 | CPEQ | Compare Equal To | | 62,63 | CPNEQ | Compare Not Equal To | | 64,65 | MUL | Multiply Step | | 66,67 | MULL | Multiply Last Step | | 68,69 | DIV0 | Divide Initialize | | 6A,6B | DIV | Divide Step | | 6C,6D | DIVL | Divide Last Step | | 6E,6F | DIVREM | Divide Remainder | | | | | | 70,71 | ASEQ | Assert Equal To | | 72,73 | ASNEQ | Assert Not Equal To | | 74,75 | MULU | Multiply Step, Unsigned | | 78,79 | INHW | Insert Half-Word | | 7A,7B | EXTRACT | Extract Word, Bit-Aligned | | 7C,7D | EXHW | Extract Half-Word | | 7E | EXHWS | Extract Half-Word, Sign-Extended | | 80,81 | SLL | Shift Left Logical | | 82,83 | SRL | Shift Right Logical | | 86,87 | SRA | Shift Right Arithmetic | | 88 | IRET | Interrupt Return | | 89 | HALT | Enter HALT Mode | | 8C | IRETINV | Interrupt Return and Invalidate | | 90,91 | AND | AND Logical | | 92,93 | OR | OR Logical | | 94,95 | XOR | Exclusive-OR Logical | | 96,97 | XNOR | Exclusive-NOR Logical | | 98,99 | NOR | NOR Logical | | 9A,9B | NAND | NAND Logical | | 9C,9D | ANDN | AND-NOT Logical | | 9E | SETIP | Set Indirect Pointers | | 9F | INV | Invalidate | | A0,A1 | JMP | Jump | | A4,A5 | JMPF | Jump False | | A8,A9 | CALL | Call Subroutine | | AC,AD | JMPT | Jump True | | B4,B5 | JMPFDEC | Jump False and Decrement | | B6 | MFTLB | Move from Translation Look-Aside Buffer Register | | BE | MTTLB | Move to Translation Look-Aside Buffer Register | | C0 | JMPI | Jump Indirect | | C4 | JMPFI | Jump False Indirect | | C6 | MFSR | Move from Special Register | | C8 | CALLI | Call Subroutine, Indirect | | CC | JMPTI | Jump True Indirect | | J <b>.</b> | | and the second of o | | CE | MTSR | Move to Special Register | |-------|-----------------|--------------------------------------------------------------| | D7 | EMULATE | Trap to Software Emulation Routine | | D8-DD | | ulation (trap vector numbers 24–29) | | DE | MULTM | Integer Multiply Most Significant Bits, Signed | | DF | MULTMU | Integer Multiply Most Significant Bits, Unsigned | | E0 | MULTIPLY | Integer Multiply, Signed | | E1 | DIVIDE | Integer Divide, Signed | | E2 | MULTIPLU | Integer Multiply, Unsigned | | E3 | DIVIDU | Integer Divide, Unsigned | | E4 | CONVERT | Convert Data Format | | E5 | SQRT | Square Root | | E6 | CLASS | Classify Floating-Point Operand | | E7-E9 | Reserved for em | ulation (trap vector number 39-41) | | EA | FEQ | Floating-Point Equal To, Single-Precision | | EB | DEQ | Floating-Point Equal To, Double-Precision | | EC | FGT | Floating-Point Greater Than, Single-Precision | | ED | DGT | Floating-Point Greater Than, Double-Precision | | EE | FGE | Floating-Point Greater Than or Equal To,<br>Single-Precision | | EF | DGE | Floating-Point Greater Than or Equal To, Double-Precision | | F0 | FADD | Floating-Point Add, Single-Precision | | F1 | DADD | Floating-Point Add, Double-Precision | | F2 | FSUB | Floating-Point Subtract, Single-Precision | | F3 | DSUB | Floating-Point Subtract, Double-Precision | | F4 | FMUL | Floating-Point Multiply, Single-Precision | | F5 | DMUL | Floating-Point Multiply, Double-Precision | | F6 | FDIV | Floating-Point Divide, Single-Precision | | F7 | DDIV | Floating-Point Divide, Double-Precision | | F8 | Reserved for em | ulation (trap vector number 56) | | F9 | FDMUL | Floating-Point Multiply, Single-to-Double-Precision | | FA-FF | Reserved for em | ulation (trap vector numbers 58-63) | • # SPECIAL SETTINGS FOR THE Am29200 AND Am29205 MICROCONTROLLERS #### **Am29200 MICROCONTROLLER** Before using the Am29200 microcontroller product, the user should prepare the microcontroller by setting the following signals as shown. - Tie the TRST signal to RESET, whether or not the JTAG port will be used. - If the JTAG port will not be used, drive TCK, TMS, and TDI to known states, preferably through pull-up resistors. Although these signals have weak internal active pull-ups, tying them to a known signal level will eliminate any system noise from being coupled into the JTAG interface. - If the serial port will not be used, tie the UCLK signal High. #### **Am29205 MICROCONTROLLER** Before using the Am29205 microcontroller product, the user should prepare the microcontroller by setting the following signals and fields as shown. - Pull the WAIT/TRIST signal High. - If the serial port will not be used, tie the UCLK signal High. - Program the ASEL3 field in the ROM Configuration Register with a value that does not overlap with addresses specified for ROM Banks 0 through 2. - In the PIA Control Register 0, write bits 15–0 with 0s. - In the PIO Control Register, write bits 7–0 with 0s. - In the PIO Output Register, write bits 7–0 with 0s. - In the PIO Output Enable Register, write bits 7–0 with 0s. - In the Parallel Port Control Register, set the FWT bit to 0. Note that all registers with fields designated as "reserved" should be programmed with 0s to ensure compatibility. # B # PROCESSOR REGISTER SUMMARY Figure B-1 General-Purpose Register Organization Figure B-2 Register Bank Organization | Register<br>Bank Protect<br>Register Bit | Absolute-Register<br>Numbers | General-Purpose<br>Registers | |------------------------------------------|------------------------------|------------------------------| | 0 | 2 through 15 | Bank 0<br>(not implemented) | | 1 | 16 through 31 | Bank 1<br>(not implemented) | | 2 | 32 through 47 | Bank 2<br>(not implemented) | | 3 | 48 through 63 | Bank 3<br>(not implemented) | | 4 | 64 through 79 | Bank 4 | | 5 | 80 through 95 | Bank 5 | | 6 | 96 through 111 | Bank 6 | | 7 | 112 through 127 | Bank 7 | | 8 | 128 through 143 | Bank 8 | | 9 | 144 through 159 | Bank 9 | | 10 | 160 through 175 | Bank 10 | | 11 | 176 through 191 | Bank 11 | | 12 | 192 through 207 | Bank 12 | | 13 | 208 through 223 | Bank 13 | | -14 | 224 through 239 | Bank 14 | | 15 | 240 through 255 | Bank 15 | Figure B-3 Special-Purpose Registers Figure B-3 Special-Purpose Registers (continued) REG# Figure B-3 Special-Purpose Registers (continued) REG# Figure B-3 Special-Purpose Registers (continued) Table B-1 Processor Register Field Summary | Label | Field Name | Register | Bit | |-------|--------------------------------------|--------------------------------------------------|--------------| | В0 | Bank 0 Protection Bit | Register Bank Protect | 0 | | B1 | Bank 1 Protection Bit | Register Bank Protect | 1 | | B2 | Bank 2 Protection Bit | Register Bank Protect | 2 | | В3 | Bank 3 Protection Bit | Register Bank Protect | 3 | | B4 | Bank 4 Protection Bit | Register Bank Protect | 4 | | B5 | Bank 5 Protection Bit | Register Bank Protect | 5 | | B6 | Bank 6 Protection Bit | Register Bank Protect | 6 | | B7 | Bank 7 Protection Bit | Register Bank Protect | 7 | | B8 | Bank 8 Protection Bit | Register Bank Protect | 8 | | B9 | Bank 9 Protection Bit | Register Bank Protect | 9 | | B10 | Bank 10 Protection Bit | Register Bank Protect | 10 | | B11 | Bank 11 Protection Bit | Register Bank Protect | 11 | | B12 | Bank 12 Protection Bit | Register Bank Protect | 12 | | B13 | Bank 13 Protection Bit | Register Bank Protect | 13 | | B14 | Bank 14 Protection Bit | Register Bank Protect | 14 | | B15 | Bank 15 Protection Bit | Register Bank Protect | 15 | | BP | Byte Pointer | ALU Status<br>Byte Pointer | 6–5<br>1–0 | | С | Carry | ALU Status | . 7 | | CHA | Channel Address | Channel Address | 31–0 | | CHD | Channel Data | Channel Data | 31–0 | | CNTL | Control | Channel Control | 30–24 | | CR | Load/Store Count Remaining | Channel Control<br>Load/Store Count Remaining | 23–16<br>7–0 | | cv | Contents Valid | Channel Control | 0 | | DA | Disable All Interrupts and Traps | Current Processor Status<br>Old Processor Status | 0 | | DF | Divide Flag | ALU Status | 11, | | DI | Disable Interrupts | Current Processor Status Old Processor Status | 1<br>1 | | DM | Floating-Point Divide By Zero Mask | Floating-Point Environment | 5 | | DO | Integer Division Overflow Mask | Integer Environment | 1 | | DS | Floating-Point Divide By Zero Sticky | Floating-Point Status | 5 | | DT | Floating-Point Divide By Zero Trap | ALU Status | 13 | | FF | Fast Floating-Point Select | Floating-Point Environment | 8 | | FC | Funnel Shift Count | ALU Status | 4–0 | | | Electing Boint Pound Made | Funnel Shift Count | 4–0 | | FRM | Floating-Point Round Mode | Floating-Point Environment | 7–6 | | FZ | Freeze | Current Processor Status<br>Old Processor Status | 10<br>10 | Table B-1 Register Field Summary (continued) | Label | Field Name | Register | Bit | |-------|-----------------------------------------|--------------------------------------------------|------------| | IE. | Interrupt Enable | Timer Reload | 24 | | IM | Interrupt Mask | Old Processor Status<br>Current Processor Status | 3–2<br>3–2 | | IN | Interrupt | Timer Reload | 25 | | IP | Interrupt Pending | Current Processor Status<br>Old Processor Status | 14<br>14 | | IPA | Indirect Pointer A | Indirect Pointer A | 9–2 | | IPB | Indirect Pointer B | Indirect Pointer B | 9–2 | | IPC | Indirect Pointer C | Indirect Pointer C | 9–2 | | LS | Load/Store | Channel Control | 15 | | ML . | Multiple Operation | Channel Control | 14 | | МО | Integer Multiplication Overflow Mask | Integer Environment | 0 | | N | Negative | ALU Status | 9 | | NM | Floating-Point Invalid Operation Mask | Floating-Point Environment | 0 | | NN | Not Needed | Channel Control | 1 | | NS | Floating-Point Invalid Operation Sticky | Floating-Point Status | 0 | | NT | Floating-Point Invalid Operation Trap | Floating-Point Status | 8 | | OV | Overflow | Timer Reload | 26 | | PC0 | Program Counter 0 | Program Counter 0 | 31–2 | | PC1 | Program Counter 1 | Program Counter 1 | 31–2 | | PC2 | Program Counter 2 | Program Counter 2 | 31–2 | | PRL | Processor Release Level | Configuration | 31–24 | | Q | Quotient/Multiplier | Q Register | 31–0 | | RM . | Floating-Point Reserved Operand Mask | Floating-Point Environment | 1 | | RS | Floating-Point Reserved Operand Sticky | Floating-Point Status | 1 | | RT | Floating-Point Reserved Operand Trap | Floating-Point Status | 9 | | SM | Supervisor Mode | Current Processor Status<br>Old Processor Status | 4 4 | | ST | Set | Channel Control | - 13 | | TCV | Timer Count Value | Timer Counter | 23–0 | | TD | Timer Disable | Current Processor Status<br>Old Processor Status | 17<br>17 | | TE | Trace Enable | Current Processor Status<br>Old Processor Status | 13<br>13 | Table B-1 Register Field Summary (continued) | Label | Field Name | Register | Bit | |-------|--------------------------------------|--------------------------------------------------|----------| | TP | Trace Pending | Current Processor Status<br>Old Processor Status | 12<br>12 | | TR | Target Register | Channel Control | 9–2 | | TRV | Timer Reload Value | Timer Reload | 23–0 | | TU | Trap Unaligned Access | Current Processor Status<br>Old Processor Status | 11<br>11 | | UM | Floating-Point Underflow Mask | Floating-Point Environment | 3 | | US | Floating-Point Underflow Sticky | Floating-Point Status | 3 | | UT | Floating-Point Underflow Trap | Floating-Point Status | 11 | | V | Overflow | ALU Status | 10 | | VAB | Vector Area Base | Vector Area Base Address | 31–10 | | VM | Floating-Point Overflow Mask | Floating-Point Environment | 2 | | VS | Floating-Point Overflow Sticky | Floating-Point Status | 2 | | VT | Floating-Point Overflow Trap | Floating-Point Status | 10 | | WM | Wait Mode | Current Processor Status<br>Old Processor Status | 7<br>7 | | XM | Floating-Point Inexact Result Mask | Floating-Point Environment | 4 | | XS | Floating-Point Inexact Result Sticky | Floating-Point Status | 4 | | XT | Floating-Point Inexact Result Trap | Floating-Point Status | 12 | | Z | Zero | ALU Status | 8 | # PERIPHERAL REGISTER SUMMARY ### Figure C-1 On-Chip Peripheral Registers <sup>=</sup> Reserved on Am29205 microcontroller Figure C-1 On-Chip Peripheral Registers (continued) Figure C-1 On-Chip Peripheral Registers (continued) Figure C-1 On-Chip Peripheral Registers (continued) Figure C-1 On-Chip Peripheral Registers (continued) Table C-1 Peripheral Register Field Summary | Label | Field Name | Register | Bit | |---------|-----------------------------|-----------------------------------------|----------------| | ACK | PACK Level | Parallel Port Status | 6 | | ACS | Assert Chip Select | DMA0 Control<br>DMA1 Control | 19<br>19 | | AFD | Autofeed | Parallel Port Control | 0 | | AMASK0 | Address Mask, Bank 0 | ROM Configuration DRAM Configuration | 26-24<br>26-24 | | AMASK1 | Address Mask, Bank 1 | ROM Configuration<br>DRAM Configuration | 18–16<br>18–16 | | AMASK2 | Address Mask, Bank 2 | ROM Configuration<br>DRAM Configuration | 10–8<br>10–8 | | AMASK3 | Address Mask, Bank 3 | ROM Configuration<br>DRAM Configuration | 2–0<br>2–0 | | ARB | ACK Relationship to BUSY | Parallel Port Control | 1 | | ASEL0 | Address Select, Bank 0 | ROM Configuration<br>DRAM Configuration | 31–27<br>31–27 | | ASEL1 | Address Select, Bank 1 | ROM Configuration DRAM Configuration | 23–19<br>23–19 | | ASEL2 | Address Select, Bank 2 | ROM Configuration DRAM Configuration | 15–11<br>15–11 | | ASEL3 | Address Select, Bank 3 | ROM Configuration DRAM Configuration | 7–3<br>7–3 | | BAUDDIV | Baud Rate Divisor | Baud Rate Divisor | 15–0 | | BCT | Byte Count | Parallel Port Status | 9–8 | | BRK | Send Break | Serial Port Control | 25 . | | BRKI | Break Interrupt | Serial Port Status | 3 | | BRS | BUSY Relationship to STROBE | Parallel Port Control | 2 | | BST0 | Burst-Mode ROM, Bank 0 | ROM Control | 31 | | BST1 | Burst-Mode ROM, Bank 1 | ROM Control | 23 | | BST2 | Burst-Mode ROM, Bank 2 | ROM Control | 15 | | BST3 | Burst-Mode ROM, Bank 3 | ROM Control | 7 | | BSY | PBUSY Level | Parallel Port Status | . 7 | | BWE | Byte Write Enable | ROM Control | 27 | | CLKDIV | Clock Divide | Video Control | 14–11 | | CLKI | Clock Invert | Video Control | 7 | | CTE | Count Terminate Enable | DMA0 Control<br>DMA1 Control | 5<br>5 | | CTI | Count Terminate Interrupt | DMA0 Control<br>DMA1 Control | 0<br>0 | Table C-1 Peripheral Register Field Summary (continued) | Label | Field Name | Register | Bit | |----------|----------------------------|---------------------------------------------------|----------------------| | DDIR | Data Direction | Parallel Port Control<br>Video Control | 10<br>10 | | DHH | Disable Hardware Handshake | Parallel Port Control | 5 | | DMA0I | DMA Channel 0 Interrupt | Interrupt Control | 14 | | DMA1I | DMA Channel 1 Interrupt | Interrupt Control | 13 | | DMACNT | DMA Count | DMA0 Count<br>DMA0 Count Tail<br>DMA1 Count | 23–0<br>23–0<br>23–0 | | DMAEXT | DMA Extend | DMA0 Control<br>DMA1 Control | 31<br>31 | | DMAWAIT | DMA Wait States | DMA0 Control<br>DMA1 Control | 28–24<br>28–24 | | DRAMADDR | DRAM Address | DMAO Address<br>DMAO Address Tail<br>DMA1 Address | 23–0<br>23–0<br>23–0 | | DRM | DMA Request Mode | DMA0 Control<br>DMA1 Control | 21–20<br>21–20 | | DRQ | Data Request | Parallel Port Control<br>Video Control | 15<br>15 | | DSR | Data Set Ready | Serial Port Control | 24 | | DTR | Data Terminal Ready | Serial Port Status | 4 | | DW | Data Width | DMA0 Control<br>DMA1 Control | 22-23<br>22-23 | | DW0 | Data Width, Bank 0 | ROM Control<br>DRAM Control | 30–29<br>30 | | DW1 | Data Width, Bank 1 | ROM Control<br>DRAM Control | 22–21<br>26 | | DW2 | Data Width, Bank 2 | ROM Control DRAM Control | 14–13<br>22 | | DW3 | Data Width, Bank 3 | ROM Control DRAM Control | 6–5<br>18 | | EN | Enable | DMA0 Control<br>DMA1 Control | 7<br>7 | | FACK | Force ACK | Parallel Port Control | 6 | | FBUSY | Force Busy | Parallel Port Control | 7 | | FER | Framing Error | Serial Port Status | 2 | | FWT | Full Word Transfer | Parallel Port Control | 30 | | INVERT | PIO Inversion | PIO Control | 15-0 | Table C-1 Peripheral Register Field Summary (continued) | Label | Field Name | Register | Bit | |---------|--------------------------------------------|----------------------------------------|----------------| | IOEXT0 | Input/Output Extend, Region 0 | PIA Control 0 | 31 | | IOEXT1 | Input/Output Extend, Region 1 | PIA Control 0 | 23 | | IOEXT2 | Input/Output Extend, Region 2 | PIA Control 0 | 15 | | IOEXT3 | Input/Output Extend, Region 3 | PIA Control 0 | 7 | | IOEXT4 | Input/Output Extend, Region 4 | PIA Control 1 | 31 | | IOEXT5 | Input/Output Extend, Region 5 | PIA Control 1 | 23 | | IOPI | I/O Port Interrupt | Interrupt Control | 23–16 | | IOWAIT0 | Input/Output Wait States, Region 0 | PIA Control 0 | 28–24 | | IOWAIT1 | Input/Output Wait States, Region 1 | PIA Control 0 | 20–16 | | IOWAIT2 | Input/Output Wait States, Region 2 | PIA Control 0 | 12–8 | | IOWAIT3 | Input/Output Wait States, Region 3 | PIA Control 0 | 4–0 | | IOWAIT4 | Input/Output Wait States, Region 4 | PIA Control 1 | 28-24 | | IOWAIT5 | Input/Output Wait States, Region 5 | PIA Control 1 | 20-16 | | IRM8 | Interrupt Request Mode, PIO8 | PIO Control | 17–16 | | IRM9 | Interrupt Request Mode, PIO9 | PIO Control | 19–18 | | IRM10 | Interrupt Request Mode, PIO10 | PIO Control | 21–20 | | IRM11 | Interrupt Request Mode, PIO11 | PIO Control | 23-22 | | IRM12 | Interrupt Request Mode, PIO12 | PIO Control | 25–24 | | IRM13 | Interrupt Request Mode, PIO13 | PIO Control | 27-26 | | IRM14 | Interrupt Request Mode, PIO14 | PIO Control | 29–28 | | IRM15 | Interrupt Request Mode, PIO15 | PIO Control | 31–30 | | LEFTCNT | Left Margin Count | Side Margin | 27–16 | | LINECNT | Line Count | Side Margin | 15–0 | | LM | Large Memory | ROM Control<br>DRAM Control | 28<br>28 | | LOOP | Loopback | Serial Port Control | 26 | | LSI | Line Sync Invert | Video Control | 2 | | MODE | Parallel Port Mode<br>Video Interface Mode | Parallel Port Control<br>Video Control | 9–8<br>9–8 | | OER | Overrun Error | Serial Port Status | 0 | | PDATA | Parallel Port Data | Parallel Port Data | 7–0<br>31–0 | | PER | Parity Error | Serial Port Status | 1 | | PERADDR | Peripheral Address | DMA0 Address<br>DMA1 Address | 31-24<br>31-24 | Table C-1 Peripheral Register Field Summary (continued) | Label | Field Name | Register | Bit | |---------|--------------------------------------|----------------------------------------------------------------------|--------------------------| | PG0 | Page-Mode DRAM, Bank 0 | DRAM Control | 31 | | PG1 | Page-Mode DRAM, Bank 1 | DRAM Control | 27 | | PG2 | Page-Mode DRAM, Bank 2 | DRAM Control | 23 | | PG3 | Page-Mode DRAM, Bank 3 | DRAM Control | 19 | | PHYBASE | Physical Base Address | DRAM Mapping 0<br>DRAM Mapping 1<br>DRAM Mapping 2<br>DRAM Mapping 3 | 7-0<br>7-0<br>7-0<br>7-0 | | PIN | PIO Input | PIO Input | 15–0 | | PMODE | Parity Mode | Serial Port Control | 21-19 | | POEN | PIO Output Enable | PIO Output Enable | 15–0 | | POUT | PIO Output | PIO Output | 15–0 | | PPI | Parallel Port Interrupt | Interrupt Control | 11 | | PSI | Page Sync Invert | Video Control | 4 | | PSIO | Page Sync Input/Output | Video Control | 5 | | PSL | Page Sync Level | Video Control | 3 | | QEN | Queue Enable | DMA0 Control | 4 | | RDATA | Receive Data | Serial Port Receive Buffer | 7-0 | | RDR | Receive Data Ready | Serial Port Status | 8 | | REFRATE | Refresh Rate | DRAM Control | 8-0 | | RMODE | Receive Mode | Serial Port Control | 1–0 | | RSIE | Receive Status Interrupt Enable | Serial Port Control | 2 | | RW | Read/Write | DMA0 Control<br>DMA1 Control | 8<br>8 | | RXDI | Serial Port Receive Data Interrupt | Interrupt Control | 6 | | RXSI | Serial Port Receive Status Interrupt | Interrupt Control | 7 | | sc · | Static-Column DRAM | DRAM Control | 15 | | SDIR | Shift Direction | Video Control | 1 | | STB | PSTROBE Level | Parallel Port Status | 31 | | STP | Stop Bits | Serial Port Control | 18 | | TDATA | Transmit Data | Serial Port Transmit Holding | 7–0 | | TDELAY | Transfer Delay | Parallel Port Control | 23-16 | | TDELAYV | TDELAY Counter Value | Parallel Port Status | 23–16 | | TEMT | Transmitter Empty | Serial Port Status | 10 | | THRE | Transmit Holding Register Empty | Serial Port Status | 9 | ### Table C-1 Peripheral Register Field Summary (continued) | Label | Field Name | Register | Bit | |----------|-------------------------------------|----------------------------------------------------------------------|----------------------------------| | TMODE | Transmit Mode | Serial Port Control | 9–8 | | TOPCNT | Top Margin Count | Top Margin | 11–0 | | TRA | Transfer Active | Parallel Port Control | 14 | | TTE | TDMA Terminate Enable | DMA0 Control<br>DMA1 Control | 6<br>6 | | TTI - | TDMA Terminate Interrupt | DMA0 Control<br>DMA1 Control | 1<br>1 | | TXDI | Serial Port Transmit Data Interrupt | Interrupt Control | 5 | | UD | Transfer Up/Down | DMA0 Control<br>DMA1 Control | 9<br>9 | | VALID | Valid Mapping | DRAM Mapping 0<br>DRAM Mapping 1<br>DRAM Mapping 2<br>DRAM Mapping 3 | 31<br>31<br>31<br>31 | | VIDI | Video Invert | Video Control | 0 | | VDATA | Video Data | Video Data Holding | 31–0 | | VDI | Video Interrupt | Interrupt Control | 27 | | VIRTBASE | Virtual Base Address | DRAM Mapping 0<br>DRAM Mapping 1<br>DRAM Mapping 2<br>DRAM Mapping 3 | 23–16<br>23–16<br>23–16<br>23–16 | | WLGN | Word Length | Serial Port Control | 17–16 | | WS0 | Wait States, Bank 0 | ROM Control | 25–24 | | WS1 | Wait States, Bank 1 | ROM Control | 17–16 | | WS2 | Wait States, Bank 2 | ROM Control | 9-8 | | WS3 | Wait States, Bank 3 | ROM Control | 1–0 | ## **INDEX** | A | ALU Status Register | |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | A23-A0 signals | arithmetic instructions, 2-1<br>description, 2-16–2-17 | | definition, 7-1 | logical instructions, 2-4 | | external DMA transfers, 11-4 | Am29200 microcontroller family | | absolute-register number, 2-10 | development tools, 1-7 | | access priority, 7-7-7-8 | overview, xv, 1-1 product support, 1-7 | | ACK bit (PACK Level), 13-4 | , , , | | ACS bit (Assert Chip Select), 11-3, 11-6 | Am29200 microcontroller block diagram, 1-3 | | activation records | design philosophy, xv-xvii | | allocation, 4-1-4-2, 4-4 | distinctive characteristics, 1-2-1-3 | | definition, 4-1 | overview burst-mode memory support, 1-8 | | ADD (Add) instruction, description, 18-8 | bus-compatibility, 1-7 | | Add Wait States signal. See WAIT signal; WAIT/TRIST signal | data formats, 1-8-1-9<br>debugging and testing, 1-9-1-10 | | ADDC (Add with Carry) instruction, description, 18-9 | DRAM mapping, 1-9 | | ADDCS (Add with Carry, Signed) instruction, description, 18-10 | instruction set, 1-8 instruction timing, 1-7–1-8 interfaces, 1-6 | | ADDCU (Add with Carry, Unsigned) instruction, description, 18-11 | interrupts and traps, 1-9-1-10 page-mode memory support, 1-8 | | addition instructions ADD (Add), 18-8 ADDC (Add with Carry), 18-9 ADDCS (Add with Carry, Signed), 18-10 ADDCU (Add with Carry, Unsigned), 18-11 ADDS (Add, Signed), 18-12 ADDU (Add, Unsigned), 18-13 DADD (Floating-Point Add, Double-Precision), | peripherals on-chip, 1-5–1-6 pipelining, 1-8 price/performance, 1-6 protection, 1-9 software-compatibility, 1-7 performance overview, 1-7–1-9 product support, iii special settings, A-1 | | 18-47 | Am29205 microcontroller | | FADD (Floating-Point Add, Single-Precision), 18-65<br>Address Bus signals. See A23–A0 signals | block diagram, 1-4<br>design philosophy, xv–xvii | | | distinctive characteristics, 1-4-1-5 | | addressing<br>byte and half-word addressing, 3-11–3-12<br>indirect register addressing, 2-12–2-14<br>internal peripheral address assignments, 7-8–7-10<br>registers, 2-10 | emulating the Am29205 microcontroller, 17-17 overview bus-compatibility, 1-7 data formats, 1-8-1-9 debugging and testing, 1-9-1-10 | | ADDS (Add, Signed) instruction, description, 18-12 | DRAM mapping, 1-9 | | ADDU (Add, Unsigned) instruction, description, 18-13 | instruction set, 1-8 instruction timing, 1-7–1-8 | | AFD bit (Autofeed), 13-3 | interfaces, 1-6 | | alignment of bytes within words, 3-4 of instructions, 3-13 of words and half-words, 3-13 Unaligned Access trap, 16-2 | interrupts and traps, 1-9–1-10 page-mode memory support, 1-8 peripherals on-chip, 1-5–1-6 pipelining, 1-8 price/performance, 1-6 | Index I-1 protection, 1-9 software-compatibility, 1-7 performance overview, 1-7–1-9 pin changes, 7-7 product support, iii special settings, A-1 AMASK0 field (Address Mask, Bank 0) DRAM Configuration Register, 9-3 ROM Configuration Register, 8-2–8-3 AMASK1 field (Address Mask, Bank 1) DRAM Configuration Register, 9-3 ROM Configuration Register, 8-3 AMASK2 field (Address Mask, Bank 2) DRAM Configuration Register, 9-3 ROM Configuration Register, 8-3 AMASK3 field (Address Mask, Bank 3) Am29205 microcontroller, 8-3 DRAM Configuration Register, 9-3 ROM Configuration Register, 8-3 AND (AND logical) instruction, description, 18-14 ANDN (AND-NOT logical) instruction, description, 18-15 ARB bit (ACK Relationship to BUSY), 13-3 argcount value, 4-15 argument passing, 4-7-4-8 arithmetic instructions See also specific types of arithmetic instructions ALU Status Register, 2-1 multiprecision integer operations, 2-25 overview, 2-1-2-3 status results, 2-17 table, 2-2 trapping, 2-26 virtual arithmetic processor, 2-26 ASEL0 field (Address Select, Bank 0) DRAM Configuration Register, 9-3 ROM Configuration Register, 8-2 ASEL1 field (Address Select, Bank 1) DRAM Configuration Register, 9-3 ROM Configuration Register, 8-3 ASEL2 field (Address Select, Bank 2) DRAM Configuration Register, 9-3 ROM Configuration Register, 8-3 ASEL3 field (Address Select, Bank 3) Am29205 microcontroller, 8-3 DRAM Configuration Register, 9-3 required setting for Am29205 microcontroller, A-1 ROM Configuration Register, 8-3 ASEQ (Assert Equal To) instruction description, 18-16 NO-OPs, 2-25–2-26 ASGE (Assert Greater Than or Equal To) instruction, description, 18-17 ASGEU (Assert Greater Than or Equal To, Unsigned) instruction, description, 18-18 ASGT (Assert Greater Than) instruction, description, 18-19 ASGTU (Assert Greater Than, Unsigned) instruction, description, 18-20 ASLE (Assert Less Than or Equal To) instruction, description, 18-21 ASLEU (Assert Less Than or Equal To, Unsigned) instruction, description, 18-22 ASLT (Assert Less Than) instruction, description, 18-23 ASLTU (Assert Less Than, Unsigned) instruction, description, 18-24 ASNEQ (Assert Not Equal To) instruction description, 18-25 operating system calls, 2-24 assert instructions run-time checking, 2-24 setting instruction breakpoints, 17-2 simulating interrupts and traps, 16-13–16-14 trapping, 2-24 ### B B15–B0 field (Bank 15–Bank 0 Protection Bits), 6-3 Baud Rate Divisor Register, description, 14-5 BAUDDIV field (Baud Rate Divisor), 14-5 BCT field (Byte Count), 13-3–13-4 big endian, 3-1, 3-2, 3-12 bit strings Funnel Shift Count Register, 3-3-3-4 overview, 3-3-3-4 bits ACK (PACK Level), 13-4 ACS (Assert Chip Select), 11-3, 11-6 AFD (Autofeed), 13-3 AMASKO (Address Mask, Bank 0), 8-2-8-3, 9-3 AMASK1 (Address Mask, Bank 1), 8-3, 9-3 AMASK2 (Address Mask, Bank 2), 8-3, 9-3 AMASK3 (Address Mask, Bank 3), 8-3, 9-3 ARB (ACK Relationship to BUSY), 13-3 ASEL0 (Address Select, Bank 0), 8-2, 9-3 ASEL1 (Address Select, Bank 1), 8-3, 9-3 ASEL2 (Address Select, Bank 2), 8-3, 9-3 ASEL3 (Address Select, Bank 3), 8-3, 9-3 B15-B0 (Bank 15-Bank 0 Protection Bits), 6-3 BAUDDIV (Baud Rate Divisor), 14-5 BCT (Byte Count), 13-3-13-4 BP (Byte Pointer), 2-16-2-17, 3-3 BRK (Send Break), 14-1 IOEXT3 (Input/Output Extend, Region 3), 10-2 BRKI (Break Interrupt), 14-4 IOEXT4 (Input/Output Extend, Region 4), 10-2 BRS (BUSY Relationship to STROBE), 13-3 IOEXT5 (Input/Output Extend, Region 5), 10-2 BST0 (Burst-Mode ROM, Bank 0), 8-1 IOPI (I/O Port Interrupt), 16-24 BST1 (Burst-Mode ROM, Bank 1), 8-2 IOWAIT0 (Input/Output Wait States, Region 0), 10-2 BST2 (Burst-Mode ROM, Bank 2), 8-2 IOWAIT1 (Input/Output Wait States, Region 1), 10-2 BST3 (Burst-Mode ROM, Bank 3), 8-2 IOWAIT2 (Input/Output Wait States, Region 2), 10-2 BSY (PBUSY Level), 13-4 IOWAIT3 (Input/Output Wait States, Region 3), 10-2 BWE (Byte Write Enable), 8-2 IOWAIT4 (Input/Output Wait States, Region 4), 10-2 C (Carry), 2-16 IOWAIT5 (Input/Output Wait States, Region 5), 10-2 CHA (Channel Address), 16-18 IP (Interrupt Pending), 16-2 CHD (Channel Data), 16-18 IPA (Indirect Pointer A), 2-13 CLKDIV (Clock Divide), 15-1 IPB (Indirect Pointer B), 2-14 CLKI (Clock Invert), 15-2 IPC (Indirect Pointer C), 2-13 CR (Load/Store Count Remaining), 3-11, 16-19 IRM14-IRM8, 12-2 CTE (Count Terminate Enable), 11-3, 11-7 IRM15 (Interrupt Request Mode, PIO15), 12-1-12-2 CTI (Count Terminate Interrupt), 11-4, 11-7 LEFTCNT (Left Margin Count), 15-3 CV (Contents Valid), 16-19 LINECNT (Line Count), 15-3 DA (Disable All Interrupts and Traps), 16-3 LM (Large Memory), 8-2, 9-2 DDIR (Data Direction), 13-2, 15-2 LOOP (Loopback), 14-1 DF (Divide Flag), 2-16 LS (Load/Store), 16-19 DHH (Disable Hardware Handshake), 13-2-13-3 LSI (Line Sync Invert), 15-2 DI (Disable Interrupts), 16-3 ML (Multiple Operation), 16-19 DM (Floating-Point Divide-By-Zero Mask), 2-15 MO (Integer Multiplication Overflow Exception DMA0I (DMA Channel 0 Interrupt), 16-24 Mask), 2-16 DMA1I (DMA Channel 1 Interrupt), 16-24 MODE (Parallel Port Mode), 13-2 MODE (Video Interface Mode), 15-2 DMACNT (DMA Count), 11-5 N (Negative), 2-16 DMAEXT (DMA Extend), 11-2, 11-6 NM (Floating-Point Invalid Operation Mask), 2-15 DMAWAIT (DMA Wait States), 11-2, 11-6 NN (Not Needed), 16-19 DO (Integer Division Overflow Mask), 2-15 DRAMADDR (DRAM Address), 11-4 NS (Floating-Point Invalid Operation Sticky), 2-19 DRM (DMA Request Mode), 11-2, 11-6 NT (Floating-Point Invalid Operation Trap), 2-19 DRQ (Data Request), 13-2, 15-1 OER (Overrun Error), 14-4 DS (Floating-Point Divide-By-Zero Sticky), 2-19 OPT (Option), 3-8 OV (Overflow), 16-23 DSR (Data Set Ready), 14-1 DT (Floating-Point Divide-By-Zero Trap), 2-18-2-19 PC0 (Program Counter 0), 16-9 PC1 (Program Counter 1), 16-9 DTR (Data Terminal Ready), 14-4 DW (Data Width), 11-2, 11-6 PC2 (Program Counter 2), 16-10 DW0 (Data Width, Bank 0), 8-2, 9-2 PDATA (Parallel Port Data), 13-4 DW1 (Data Width, Bank 1), 8-2, 9-2 PER (Parity Error), 14-4 DW2 (Data Width, Bank 2), 8-2, 9-2 PERADDR (Peripheral Address), 11-4 DW3 (Data Width, Bank 3), 8-2, 9-2 peripheral registers (table), C-6-C-11 EN (Enable), 11-3, 11-7 PG0 (Page-Mode DRAM, Bank 0), 9-2 FACK (Force ACK), 13-2 PG1 (Page-Mode DRAM, Bank 1), 9-2 PG2 (Page-Mode DRAM, Bank 2), 9-2 FBUSY (Force Busy), 13-2 PG3 (Page-Mode DRAM, Bank 3), 9-2 FC (Funnel Shift Count), 2-17, 3-3-3-4 FER (Framing Error), 14-4 PHYBASE (Physical Base Address), 9-4 FF (Fast Float Select), 2-14 PIN (PIO Input), 12-2 PMODE (Parity Mode), 14-2 FRM (Floating-Point Round Mode), 2-14 FWT (Full Word Transfer), 13-1 POEN (PIO Output Enable), 12-3 POUT (PIO Output), 12-3 FZ (Freeze), 16-2 1, 3-8 PPI (Parallel Port Interrupt), 16-24 IE (Interrupt Enable), 16-23 PRL (Processor Release Level), 2-27 processor registers (table), B-7-B-9 IM (Interrupt Mask), 16-3 IN (Interrupt), 16-23 PSI (Page Sync Invert), 15-2 INVERT (PIO Inversion), 12-2 PSIO (Page Sync Input/Output), 15-2 IOEXT0 (Input/Output Extend, Region 0), 10-2 PSL (Page Sync Level), 15-2 IOEXT1 (Input/Output Extend, Region 1), 10-2 Q (Quotient/Multiplier), 2-20 IOEXT2 (Input/Output Extend, Region 2), 10-2 QEN (Queue Enable), 11-3 | RA, 3-8 RB, 3-8 RDATA (Receive Data), 14-5 RDR (Receive Data Ready), 14-3 REFRATE (Refresh Rate), 9-2 reserved fields, A-1 RM (Floating-Point Invalid Operand Mask), 2-15 RMODE (Receive Mode), 14-3 RS (Floating-Point Reserved Operand Sticky), 2-19 | XM (Floating-Point Inexact Result Mask), 2-15 XS (Floating-Point Inexact Result Sticky), 2-19 XT (Floating-Point Inexact Result Trap), 2-19 Z (Zero), 2-16 Boolean data, 3-5 BOOTW signal definition, 7-3 setting width of boot ROM, 8-3 | |-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | RSIE (Receive Status Interrupt Enable), 14-2 RT (Floating-Point Reserved Operand Trap), 2-19 RW (Read/Write), 11-3, 11-7 RXDI (Serial Port Receive Data Interrupt), 16-24 RXSI (Serial Port Receive Status Interrupt), 16-24 SB (Set Byte Pointer/Sign Bit), 3-8 SC (Static-Column DRAM), 9-2 SDIR (Shift Direction), 15-2 SM (Supervisor Mode), 16-3 ST (Set), 16-19 | boundary-scan cells bypass scan path, 17-8 description, 17-4–17-5 ICTEST1 scan path, 17-10 ICTEST2 scan path, 17-10–17-11 instruction scan path, 17-8 main data scan path, 17-8–17-10 Boundary-Scan Register (BSR), 17-4–17-5 CNTL field, 17-3–17-4 | | STB (PSTROBE Level), 13-3<br>STP (Stop Bits), 14-2<br>TCV (Timer Count Value), 16-22-16-23 | BP field (Byte Pointer) ALU Status Register, 2-16–2-17 Byte Pointer Register, 3-3 | | TD (Timer Disable), 16-2 TDATA (Transmit Data), 14-4 TDELAY (Transfer Delay), 13-2 TDELAYV (TDELAY Counter Value), 13-3 TE (Trace Enable), 16-2 TEMT (Transmitter Empty), 14-3 THRE (Transmit Holding Register Empty), 14-3 TMODE (Transmit Mode), 14-2 TOPCNT (Top Margin Count), 15-3 TP (Trace Pending), 16-2 TR (Target Register), 16-19 TRA (Transfer Active), 13-2 TPV (Timer Pelacet Value), 16-22 | branch instructions CALL (Call Subroutine), 18-26 CALLI (Call Subroutine, Indirect), 18-27 JMP (Jump), 18-79 JMPF (Jump False), 18-80 JMPFDEC (Jump False and Decrement), 18-81 JMPFI (Jump False Indirect), 18-82 JMPI (Jump Indirect), 18-83 JMPT (Jump True), 18-84 JMPTI (Jump True Indirect), 18-85 overview, 2-7 table, 2-7 | | TRV (Timer Reload Value), 16-23 TTE (TDMA Terminate Enable), 11-3, 11-7 TTI (TDMA Terminate Interrupt), 11-3, 11-7 TU (Trap Unaligned Access), 16-2 | breakpoints using assert instructions, 17-2 using the HALT instruction, 17-2 | | TXDI (Serial Port Transmit Data Interrupt), 16-24 | BRK bit (Send Break), 14-1 | | UD (Transfer Up/Down), 11-3, 11-6–11-7<br>UM (Floating-Point Underflow Mask), 2-15 | BRKI bit (Break Interrupt), 14-4 | | US (Floating-Point Underflow Sticky), 2-19 | BRS bit (BUSY Relationship to STROBE), 13-3 | | UT (Floating-Point Underflow Trap), 2-19 | BSR. See Boundary-Scan Register (BSR) | | V (Overflow), 2-16 | | | VAB (Vector Area Base), 16-5 | BST0 bit (Burst-Mode ROM, Bank 0), 8-1 | | VALID (Valid Mapping), 9-3<br>VDATA (Video Data), 15-4 | BST1 bit (Burst-Mode ROM, Bank 1), 8-2 | | VDI (Video Interrupt), 16-24 | BST2 bit (Burst-Mode ROM, Bank 2), 8-2 | | VIDI (Video Invert), 15-3 | BST3 bit (Burst-Mode ROM, Bank 3), 8-2 | | VIRTBASE (Virtual Base Address), 9-3 | BSY bit (PBUSY Level), 13-4 | | VM (Floating-Point Overflow Mask), 2-15<br>VS (Floating-Point Overflow Sticky), 2-19 | BURST signal, definition, 7-3 | | VT (Floating-Point Overflow Trap), 2-19 | | | WLGN (Word Length), 14-2 | burst-mode<br>boot ROM state, 8-3 | | WM (Wait Mode), 16-3 | DRAM page-mode accesses, 9-2, 9-6–9-7, 9-10 | | WS0 (Wait States, Bank 0), 8-2 | multiple data accesses, 3-11 | | WS1 (Wait States, Bank 1), 8-2<br>WS2 (Wait States, Bank 2), 8-2 | ROM accesses, 8-1, 8-2, 8-5, 8-6, 8-8, 8-10 | | WS3 (Wait States, Bank 3), 8-2 | Burst-Mode Access signal. See BURST signal | | · · · · · · · · · · · · · · · · · · · | | | BWE bit (Byte Write Enable), 8-2 BYPASS instruction, 17-8 bypass scan path, 17-8 Byte Pointer Register, description, 3-2-3-3 byte writes, ROM space, 8-7-8-8 | boundary-scan cells, 17-5 Halt mode, 17-11–17-12 ICTEST1 scan path, 17-10 ICTEST2 scan path, 17-10–17-11 Load Test Instruction mode, 17-13–17-14 Step mode, 17-12–17-13 | |--------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | | Column Address Strobes, Banks 3–0 signals. See CAS3–CAS0 signals | | C | compare instructions | | C bit (Carry) ALU Status Register, 2-16 arithmetic operation status results, 2-17 multiprecision integer operations, 2-25 | ASEQ (Assert Equal To), 18-16 ASGE (Assert Greater Than or Equal To), 18-17 ASGEU (Assert Greater Than or Equal To, Unsigned), 18-18 ASGT (Assert Greater Than), 18-19 | | CALL (Call Subroutine) instruction, description, 18-26 | ASGTU (Assert Greater Than, Unsigned), 18-20<br>ASLE (Assert Less Than or Equal To), 18-21 | | CALLI (Call Subroutine, Indirect) instruction, description, 18-27 | ASLEU (Assert Less Than or Équal To, Unsigned),<br>18-22 | | calling conventions, 4-13-4-14 | ASLT (Assert Less Than), 18-23 | | CAS3—CAS0 signals definition, 7-4 during static-column accesses, 9-2 using as byte stobes, 8-2 | ASLTU (Assert Less Than, Unsigned), 18-24 ASNEQ (Assert Not Equal To), 18-25 CPBYTE (Compare Bytes), 18-36 CPEQ (Compare Equal To), 18-37 CPGE (Compare Greater Than or Equal To), 18-38 | | CAS-before-RAS refresh cycles<br>DRAM refresh, 9-10<br>REFRATE field (Refresh Rate), 9-2<br>timing (diagram), 9-11 | CPGEU (Compare Greater Than or Equal To,<br>Unsigned), 18-39<br>CPGT (Compare Greater Than), 18-40<br>CPGTU (Compare Greater Than, Unsigned), 18-41 | | CHA field (Channel Address), 16-18 | CPLE (Compare Less Than or Equal To), 18-42<br>CPLEU (Compare Less Than or Equal To, | | Channel Address Register description, 16-18 multiple data accesses, 3-10 | Unsigned), 18-43<br>CPLT (Compare Less Than), 18-44<br>CPLTU (Compare Less Than, Unsigned), 18-45 | | Channel Control Register description, 16-18–16-19 multiple data accesses, 3-10–3-11 | CPNEQ (Compare Not Equal To), 18-46 overview, 2-1–2-3 table, 2-3 | | Channel Data Register, description, 16-18 | compiler<br>delayed branches, 5-3-5-4 | | character data, format, 3-1-3-2 | High C 29K optimizing C compiler, 1-7 | | character strings, overview, 3-4 | run-time stack organization, 4-1-4-6 | | CHD field (Channel Data), 16-18 | temporary registers, 4-13<br>transparent procedures, 4-13-4-15 | | CLASS (Classify Floating-Point Operand) instruction, description, 18-28–18-29 | complementing a Boolean, 2-25 | | CLKDIV field (Clock Divide), 15-1 | Configuration Register, description, 2-26–2-27 | | CLKI bit (Clock Invert), 15-2 | CONST (Constant) instruction description, 18-31 | | clock signals<br>INCLK, 7-1<br>MEMCLK, 7-1 | generation of large constants, 3-5<br>large jump and call ranges, 2-25 | | TCK, 7-6<br>UCLK, 7-6<br>VCLK, 7-6 | constant instructions CONST (Constant), 18-31 CONSTH (Constant, High), 18-32 CONSTN (Constant, Negative), 18-33 | | CLZ (Count Leading Zeros) instruction, description, 18-30 | overview, 2-5 table, 2-5 | | CNTL field<br>Boundary Scan Register, 17-3–17-4 | CONSTH (Constant, High) instruction description, 18-32 | generation of large constants, 3-5 large jump and call ranges, 2-25 CONSTN (Constant, Negative) instruction description, 18-33 generation of large constants, 3-5 CONVERT (Convert Data Format) instruction, description, 18-34-18-35 CPBYTE (Compare Bytes) instruction character data, 3-2 description, 18-36 detection of characters within words, 3-4 CPEQ (Compare Equal To) instruction, description, 18-37 CPGE (Compare Greater Than or Equal To) instruction complementing a Boolean, 2-25 description, 18-38 CPGEU (Compare Greater Than or Equal To, Unsigned) instruction, description, 18-39 CPGT (Compare Greater Than) instruction, description, 18-40 CPGTU (Compare Greater Than, Unsigned) instruction, description, 18-41 CPLE (Compare Less Than or Equal To) instruction, description, 18-42 CPLEU (Compare Less Than or Equal To, Unsigned) instruction, description, 18-43 CPLT (Compare Less Than) instruction, description, 18-44 CPLTU (Compare Less Than, Unsigned) instruction, description, 18-45 CPNEQ (Compare Not Equal To) instruction, description, 18-46 CPU Status signals. See STAT2-STAT0 signals CR field (Load/Store Count Remaining) Channel Control Register, 16-19 Load/Store Count Remaining Register, 3-11 multiple access operations, 3-9-3-10 CTE bit (Count Terminate Enable), 11-3, 11-7 CTI bit (Count Terminate Interrupt), 11-4, 11-7 **Current Processor Status Register** after an interrupt or trap, 16-11 before interrupt return, 16-11 control of tracing, 17-1-17-2 delayed effects of registers, 5-5, 16-2 description, 16-1-16-3 Reset mode, 2-27 CV bit (Contents Valid), 16-19 multiple access operations, 3-10 restarting faulting accesses, 16-17-16-18 returning from interrupts or traps, 16-12 ### D DA bit (Disable All Interrupts and Traps) Current Processor Status Register, 16-3 disabling interrupts, 16-3 exceptions during interrupt and trap handling, 16-21 DACK1-DACK0 signals, definition, 7-4-7-5 DADD (Floating-Point Add, Double-Precision) instruction, description, 18-47 data movement instructions EXBYTE (Extract Byte), 18-61 EXHW (Extract Half-Word), 18-62 EXHWS (Extract Half-Word, Sign-Extended), 18-63 INBYTE (Insert Byte), 18-74 INHW (Insert Half-Word), 18-75 LOAD (Load), 18-86 LOADL (Load and Lock), 18-87 LOADM (Load Multiple), 18-88 LOADSET (Load and Set), 18-89 MFSR (Move from Special Register), 18-90 MFTLB (Move from Translation Look-Aside Buffer Register), 18-91 movement of large data blocks, 3-11 MTSR (Move to Special Register), 18-92 MTSRIM (Move to Special Register Immediate), MTTLB (Move to Translation Look-Aside Buffer Register), 18-94 overview, 2-4-2-6 STORE (Store), 18-110 STOREL (Store and Lock), 18-111 STOREM (Store Multiple), 18-112 table, 2-5 Data Set Ready signal. See DSR signal Data Terminal Ready signal. See DTR signal data types floating-point data types, 3-5-3-7 denormalized numbers, 3-7 double-precision floating-point values, 3-6 infinity, 3-7 Not-a-Number, 3-6-3-7 single-precision floating-point values, 3-5 special floating-point values, 3-6-3-7 zero. 3-7 integer data types, 3-1-3-5 bit strings, 3-3-3-4 Boolean data, 3-5 character data, 3-1-3-2 character string operations, 3-4 half-word operations, 3-2 instruction constants, 3-5 DDIR bit (Data Direction) Parallel Port Control Register, 13-2 Video Control Register, 15-2 DDIV (Floating-Point Divide, Double-Precision) DIVL (Divide Last Step), 18-56 instruction, description, 18-48 DIVREM (Divide Remainder), 18-57 FDIV (Floating-Point Divide, Single-Precision), debugging and testing accessing internal state via boundary-scan, 17-14-17-16 DIVL (Divide Last Step) instruction, description, 18-56 boundary-scan cells, 17-4-17-5 DIVREM (Divide Remainder) instruction, description, control field in scan path, 17-3-17-4 18-57 emulating the Am29205 microcontroller, 17-17 DM bit (Floating-Point Divide-By-Zero Mask), 2-15 forcing outputs to high impedance, 17-17-18 Halt mode, 17-11-17-12 DMA Acknowledge 1 through 0 signals. See implementing a hardware-development system, DACK1-DACK0 signals 17-11-17-17 DMA controller instruction breakpoints, 17-2 DMA queuing (DMA Channel 0), 11-12 Load Test Instruction mode, 17-13-17-14 DMA transfers, 11-8-11-12 overview, 17-1 direct transfers, 11-1, 11-12-11-15 processor status outputs, 17-2-17-3 external peripherals, 11-1 Step mode, 17-12-17-13 external transfers, 11-1, 11-9-11-11 Test Access Port, 17-4-17-11 generating external DMA requests, 11-9 tracing, 17-1-17-2 internal peripherals, 11-1 delayed branches, 5-2-5-4 internal transfers, 11-1, 11-8-11-9 latching external requests, 11-11-11-12 delayed effects of registers, 5-5 parallel port, 13-2 DEQ (Floating-Point Equal To, Double-Precision) serial port, 14-2, 14-3 instruction, description, 18-49 specifying direction, 11-8 development tools video interface, 15-2 AMD products, xx, 1-7 initialization, 11-7 compiler, xx, 1-7 overview, 11-1 debugger, xx, 1-7 programmable registers, 11-1-11-7 development boards, xx, 1-7 signals monitor, xx, 1-7 DACK1-DACK0, 7-4-7-5 third-party products, iii, xix-xx, 1-7 DREQ1-DREQ0, 7-4 **GACK**, 7-5 DF bit (Divide Flag), 2-16 **GREQ. 7-5** DGE (Floating-Point Greater Than or Equal To. TDMA, 7-5 Double-Precision) instruction, description, 18-50 DMA Request 1 through 0 signals. See DGT (Floating-Point Greater Than, Double-Precision) DREQ1-DREQ0 signals instruction, description, 18-51 DMA0 Address Register, description, 11-4 DHH bit (Disable Hardware Handshake), 13-2-13-3 **DMA0 Address Tail Register** DI bit (Disable Interrupts), 16-3 address assignments, 7-8-7-10 disabling interrupts, 16-3 description, 11-4 DIV (Divide Step) instruction, description, 18-52 DMA0 Control Register, description, 11-1-11-4 DIVO (Divide Initialize) instruction, description, 18-53 DMA0 Count Register, description, 11-5 DIVIDE (Integer Divide, Signed) instruction, DMA0 Count Tail Register description, 18-54 address assignments, 7-8-7-10 DIVIDU (Integer Divide, Unsigned) instruction, description, 11-5 description, 18-55 DMA01 bit (DMA Channel 0 Interrupt), 16-24 division, routines for performing, 2-19-2-20, DMA1 Address Register, description, 11-7 2-22-2-24 DMA1 Control Register, description, 11-5–11-7 division instructions DMA1 Count Register, description, 11-7 DDIV (Floating-Point Divide, Double-Precision), 18-48 DMA11 bit (DMA Channel 1 Interrupt), 16-24 DIV (Divide Step), 18-52 DMACNT field (DMA Count), 11-5 DIVO (Divide Initialize), 18-53 DIVIDE (Integer Divide, Signed), 18-54 DMAEXT bit (DMA Extend), 11-2, 11-6 DIVIDU (Integer Divide, Unsigned), 18-55 | DMAWAIT field (DMA Wait States), 11-2, 11-6 | DT bit (Floating-Point Divide-By-Zero Trap),<br>2-18-2-19 | |----------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------| | DMUL (Floating-Point Multiply, Double-Precision) instruction, description, 18-58 | DTR bit (Data Terminal Ready), Serial Port Status<br>Register, 14-4 | | DO bit (Integer Division Overflow Mask), 2-15 | DTR signal | | DRAM accesses<br>16-bit DRAM, 9-7<br>32-bit DRAM, 9-7 | activating, 14-4<br>definition, 7-6 | | address multiplexing, 9-5–9-7 | DW field (Data Width), 11-2, 11-6 | | DRAM address mapping, 9-4–9-5<br>DRAM refresh, 7-7, 9-10–9-12 | DW0 bit (Data Width, DRAM Bank 0), 9-2 | | mapped accesses, 9-8 | DW0 field (Data Width, ROM Bank 0), 8-2 | | normal access timing, 9-8 | DW1 bit (Data Width, DRAM Bank 1), 9-2 | | page-mode access timing, 9-10 restarting mapped DRAM accesses, 16-17–16-19 | DW1 field (Data Width, ROM Bank 1), 8-2 | | video DRAM interface, 9-12 | DW2 bit (Data Width, DRAM Bank 2), 9-2 | | DRAM Configuration Register, description, 9-2-9-3 | DW2 field (Data Width, ROM Bank 2), 8-2 | | DRAM Control Register, description, 9-1-9-2 | DW3 bit (Data Width, DRAM Bank 3), 9-2 | | DRAM controller | DW3 field (Data Width, ROM Bank 3), 8-2 | | See also DRAM accesses address mapping, 8-4, 9-4-9-5 initialization, 9-4 | dynamic parent, 4-12-4-13 | | overview, 9-1 | E | | programmable registers DRAM Configuration Register, 9-2–9-3 DRAM Control Register, 9-1–9-2 signals CAS3–CAS0, 7-4 RAS3–RAS0, 7-3 | EMULATE (Trap to Software Emulation Routine) instruction description, 18-60 operating-system calls, 2-24-2-25 | | TR/OE, 7-4<br>WE, 7-4 | EN bit (Enable), 11-3, 11-7<br>DMA initialization, 11-7 | | DRAM Mapping Register 0, description, 9-3-9-4 | endian. <i>See</i> big endian | | DRAM Mapping Register 1, description, 9-4 | EXBYTE (Extract Byte) instruction | | DRAM Mapping Register 2, description, 9-4 | BP field (Byte Pointer), 2-16–2-17 Byte Pointer Register, 3-2 | | DRAM Mapping Register 3, description, 9-4 | character data, 3-1 | | DRAM refresh, panic mode, 7-7 | description, 18-61 | | DRAMADDR field (DRAM Address), 11-4 | EXHW (Extract Half-Word) instruction BP field (Byte Pointer), 2-17 | | DREQ1-DREQ0 signals definition, 7-4 signaling external DMA requests, 11-2, 11-6 | Byte Pointer Register, 3-2 description, 18-62 | | signaling external DMA requests, 11-2, 11-6 DRM field (DMA Request Mode), 11-2, 11-6 | half-word operations, 3-2 | | DRQ bit (Data Request) | EXHWS (Extract Half-Word, Sign-Extended) instruction | | Parallel Port Control Register, 13-2 Video Control Register, 15-1 | BP field (Byte Pointer), 2-17 Byte Pointer Register, 3-2 | | DS bit (Floating-Point Divide-By-Zero Sticky), 2-19 | description, 18-63<br>half-word operations, 3-2 | | DSR bit (Data Set Ready), Serial Port Control<br>Register, 14-1 | External Memory Grant Acknowledge signal. See<br>GACK signal | | DSR signal | External Memory Grant Request signal. See GREQ | | activating, 14-1<br>definition, 7-6 | signal | | DSUB (Floating-Point Subtract, Double-Precision) | EXTEST instruction, 17-6 | | instruction, description, 18-59 | EXTRACT (Extract Word, Bit-Aligned) instruction bit strings, 3-3 | description, 18-64 FC field (Funnel Shift Count), 2-17 operating on double-word data, 2-4 #### F FACK bit (Force ACK), 13-2 FADD (Floating-Point Add, Single-Precision) instruction, description, 18-65 FBUSY bit (Force Busy), 13-2 FC field (Funnel Shift Count), 3-3-3-4 ALU Status Register, 2-17 FDIV (Floating-Point Divide, Single-Precision) instruction, description, 18-66 FDMUL (Floating-Point Multiply, Single-to-Double Precision) instruction, description, 18-67 FEQ (Floating-Point Equal To, Single-Precision) instruction, description, 18-68 FER bit (Framing Error), 14-4 FF bit (Fast Float Select), 2-14 FGE (Floating-Point Greater Than or Equal To, Single-Precision) instruction, description, 18-69 FGT (Floating-Point Greater Than, Single-Precision) instruction, description, 18-70 field summary peripheral registers (table), C-6–C-11 processor registers (table), B-7–B-9 fields. See bits fill handlers, 4-11 floating-point data types denormalized numbers, 3-7 double-precision floating-point values, 3-6 infinity, 3-7 Not-a-Number, 3-6–3-7 single-precision floating-point values, 3-5 special floating-point values, 3-6–3-7 zero, 3-7 Floating-Point Environment Register description, 2-14–2-15 not implemented in processor hardware, 2-11 Floating-Point Exception trap Floating-Point Environment Register, 2-15 Floating-Point Status Register, 2-18–2-19 trap status bits. 2-18–2-19 floating-point instructions CLASS (Classify Floating-Point Operand), 18-28-18-29 CONVERT (Convert Data Format), 18-34-18-35 DADD (Floating-Point Add, Double-Precision), 18-47 DDIV (Floating-Point Divide, Double-Precision), 18-48 DEQ (Floating-Point Equal To, Double-Precision), 18-49 DGE (Floating-Point Greater Than or Equal To, Double-Precision), 18-50 DGT (Floating-Point Greater Than, Double-Precision), 18-51 DMUL (Floating-Point Multiply, Double-Precision), 18-58 DSUB (Floating-Point Subtract, Double-Precision), 18-59 FADD (Floating-Point Add, Single-Precision), 18-65 FDIV (Floating-Point Divide, Single-Precision), 18-66 FDMUL (Floating-Point Multiply, Single-to-Double Precision), 18-67 FEQ (Floating-Point Equal To, Single-Precision), 18-68 FGE (Floating-Point Greater Than or Equal To, Single-Precision), 18-69 FGT (Floating-Point Greater Than, Single-Precision), 18-70 FMUL (Floating-Point Multiply, Single-Precision), 18-71 FSUB (Floating-Point Subtract, Single-Precision), 18-72 overview, 2-6 SQRT (Floating-Point Square Root), 18-107 status results, 2-18 table, 2-6 Floating-Point Status Register description, 2-18–2-19 not implemented in processor hardware, 2-11 Protection Violation trap, 2-26 sticky status bits, 2-18–2-19 trap status bits, 2-18–2-19 FMUL (Floating-Point Multiply, Single-Precision) instruction, description, 18-71 fp. See frame pointer (fp) frame, definition, 4-7 frame pointer (fp) definition, 4-5 register conventions, 4-14 Freeze bit. See FZ bit (Freeze) FRM field (Floating-Point Round Mode), 2-14 FSUB (Floating-Point Subtract, Single-Precision) instruction, description, 18-72 Funnel Shift Count Register, description, 3-3-3-4 FWT bit (Full Word Transfer), 13-1 required setting for Am29205 microcontroller, A-1 FZ bit (Freeze) Current Processor Status Register, 16-2 delayed effects of registers, 5-5, 16-2 Halt mode, 17-12 interrupt and trap handling, 16-6–16-13 lightweight interrupt processing, 16-12–16-13 Program Counter registers, 16-6–16-9 registers affected by, 16-10, 16-11–16-12 restarting the interrupt or trap handler, 16-21 Step mode, 17-12 #### G GACK signal definition, 7-5 direct DMA access by external devices, 11-12-11-15 general-purpose registers addressing terminology, 2-10 operands held by, 2-8-2-10 organization, 2-9, B-1 overview, 2-8-2-10 global registers global-register number, 2-10 overview, 2-10 GREQ signal definition, 7-5 direct DMA access by external devices, 11-12-11-15 ## H half-word data, format, 3-2 HALT (Enter Halt Mode) instruction description, 18-73 instruction breakpoints, 17-2, 17-16–17-17 Halt mode, 17-11–17-12 hardware-development system. See debugging and testing host interface (HIF) specification. See operating system ## I I/O port. See Programmable I/O Port (PIO) ICTEST1 instruction, 17-7 ICTEST1 scan path, 17-10 ICTEST2 instruction, 17-7 ICTEST2 scan path, 17-10-17-11 ID31-ID0 signals, definition, 7-1 IE bit (Interrupt Enable), 16-23 IEEE floating-point standard implementation, 3-5–3-7 improving performance of, 2-14 Illegal Opcode trap instruction breakpoints, 17-2 unimplemented instructions, 2-1 unpredictable vector number, 16-4 IM field (Interrupt Mask), 16-3 enabling interrupts, 16-3 IN bit (Interrupt), 16-23 INBYTE (Insert Byte) instruction BP field (Byte Pointer), 2-16–2-17 Byte Pointer Register, 3-2 character data, 3-2 description, 18-74 INCLK signal, definition, 7-1 Indirect Pointer A Register, description, 2-13 Indirect Pointer B Register, description, 2-14 Indirect Pointer C Register, description, 2-13 indirect pointers delayed effects of registers, 5-5 set by certain instructions, 2-13 INHW (Insert Half-Word) instruction BP field (Byte Pointer), 2-17 Byte Pointer Register, 3-2 description, 18-75 half-word operations, 3-2 #### initialization See also processor initialization DMA controller, 11-7 DRAM controller, 9-4 internal interrupt controller, 16-23–16-25 parallel port, 13-4–13-5 Peripheral Interface Adapter (PIA), 10-2 programmable I/O port, 12-3 ROM controller, 8-3–8-4 serial port, 14-5 timer facility, 16-21 video interface, 15-4 Input Clock signal. See INCLK Instruction Bus signals. See ID31-ID0 signals instruction constants, 3-5 instruction scan path, 17-8 instruction scheduling. See pipelining instruction set ADD (Add), 18-8 ADDC (Add with Carry), 18-9 ADDCS (Add with Carry, Signed), 18-10 ADDCU (Add with Carry, Unsigned), 18-11 ADDS (Add, Signed), 18-12 ADDU (Add, Unsigned), 18-13 AND (AND logical), 18-14 ANDN (AND-NOT logical), 18-15 arithmetic operation status results, 2-17 ASEQ (Assert Equal To), 18-16 ASGE (Assert Greater Than or Equal To), 18-17 ASGEU (Assert Greater Than or Equal To, Unsigned), 18-18 ASGT (Assert Greater Than), 18-19 ASGTU (Assert Greater Than, Unsigned), 18-20 ASLE (Assert Less Than or Equal To), 18-21 ASLEU (Assert Less Than or Equal To, Unsigned), 18-22 ASLT (Assert Less Than), 18-23 ASLTU (Assert Less Than, Unsigned), 18-24 ASNEQ (Assert Not Equal To), 18-25 assembler syntax, 18-3-18-64 branch instructions, 2-7 CALL (Call Subroutine), 18-26 CALLI (Call Subroutine, Indirect), 18-27 CLASS (Classify Floating-Point Operand), 18-28-18-29 CLZ (Count Leading Zeros), 18-30 compare instructions, 2-1-2-3 CONST (Constant), 18-31 constant instructions, 2-5 CONSTH (Constant, High), 18-32 CONSTN (Constant, Negative), 18-33 control-flow terminology, 18-3 CONVERT (Convert Data Format), 18-34-18-35 CPBYTE (Compare Bytes), 18-36 CPEQ (Compare Equal To), 18-37 CPGE (Compare Greater Than or Equal To), 18-38 CPGEU (Compare Greater Than or Equal To, Unsigned), 18-39 CPGT (Compare Greater Than), 18-40 CPGTU (Compare Greater Than, Unsigned), 18-41 CPLE (Compare Less Than or Equal To), 18-42 CPLEU (Compare Less Than or Equal To, Unsigned), 18-43 CPLT (Compare Less Than), 18-44 CPLTU (Compare Less Than, Unsigned), 18-45 CPNEQ (Compare Not Equal To), 18-46 DADD (Floating-Point Add, Double-Precision), 18-47 data movement instructions, 2-4-2-6 DDIV (Floating-Point Divide, Double-Precision), DEQ (Floating-Point Equal To, Double-Precision), 18-49 description format, 18-7 descriptions, 18-8-18-126 DGE (Floating-Point Greater Than or Equal To, Double-Precision), 18-50 DGT (Floating-Point Greater Than, Double-Precision), 18-51 DIV (Divide Step), 18-52 DIVO (Divide Initialize), 18-53 DIVIDE (Integer Divide, Signed), 18-54 DIVIDU (Integer Divide, Unsigned), 18-55 DIVL (Divide Last Step), 18-56 DIVREM (Divide Remainder), 18-57 DMUL (Floating-Point Multiply, Double-Precision), 18-58 DSUB (Floating-Point Subtract, Double-Precision), 18-59 EMULATE (Trap to Software Emulation Routine), 18-60 EXBYTE (Extract Byte), 18-61 EXHW (Extract Half-Word), 18-62 EXHWS (Extract Half-Word, Sign-Extended), 18-63 EXTRACT (Extract Word, Bit-Aligned), 18-64 FADD (Floating-Point Add, Single-Precision), 18-65 FDIV (Floating-Point Divide, Single-Precision), 18-66 FDMUL (Floating-Point Multiply, Single-to-Double Precision), 18-67 FEQ (Floating-Point Equal To, Single-Precision), 18-68 FGE (Floating-Point Greater Than or Equal To, Single-Precision), 18-69 FGT (Floating-Point Greater Than, Single-Precision), 18-70 floating-point instructions, 2-6 floating-point operation status results, 2-18 FMUL (Floating-Point Multiply, Single-Precision), FSUB (Floating-Point Subtract, Single-Precision), HALT (Enter Halt Mode), 18-73 INBYTE (Insert Byte), 18-74 INHW (Insert Half-Word), 18-75 instruction formats, 18-4-18-5 integer arithmetic instructions, 2-1-2-3 INV (Invalidate), 18-76 IRET (Interrupt Return), 18-77 IRETINV (Interrupt Return and Invalidate), 18-78 JMP (Jump), 18-79 JMPF (Jump False), 18-80 JMPFDEC (Jump False and Decrement), 18-81 JMPFI (Jump False Indirect), 18-82 JMPI (Jump Indirect), 18-83 JMPT (Jump True), 18-84 JMPTI (Jump True Indirect), 18-85 LOAD (Load), 18-86 load and store instructions, 3-7-3-9 LOADL (Load and Lock), 18-87 LOADM (Load Multiple), 18-88 LOADSET (Load and Set), 18-89 logical instructions, 2-4 logical operation status results, 2-17-2-18 MFSR (Move from Special Register), 18-90 MFTLB (Move from Translation Look-Aside Buffer Register), 18-91 miscellaneous instructions, 2-7-2-9 MTSR (Move to Special Register), 18-92 MTSRIM (Move to Special Register Immediate), 18-93 MTTLB (Move to Translation Look-Aside Buffer Register), 18-94 MUL (Multiply Step), 18-95 MULL (Multiply Last Step), 18-96 MULTIPLU (Integer Multiply, Unsigned), 18-97 MULTIPLY (Integer Multiply, Signed), 18-98 | MULTM (Integer Multiply Most Significant Blts, | Channel Control Register, 16-18-16-19 | |------------------------------------------------------|---------------------------------------------------| | Signed), 18-99 | Channel Data Register, 16-18 | | MULTMU (Integer Multiply Most Significant Blts, | correcting out-of-range results, 16-20 | | Unsigned), 18-100 | exceptions during interrupt and trap handling, | | MULU (Multiply Step, Unsigned), 18-101 | 16-21 | | NAND (NAND Logical), 18-102 | floating-point exceptions, 16-20 | | NOR (NOR Logical), 18-103 | instruction exceptions, 16-16 | | operand notation and symbols, 18-1-18-2 | integer exceptions, 16-19-16-20 | | operation code index, 18-127-18-129 | restarting faulting accesses, 16-17-16-19 | | operator symbols, 18-2–18-3 | external interrupts and traps, 16-4 | | OR (OR Logical), 18-104 | interrupt controller | | overview, 2-1-2-8 | initialization, 16-25 | | reserved instructions, 2-8 | Interrupt Control Register, 16-25 | | SETIP (Set Indirect Pointers), 18-105 | overview, 16-23 | | shift instructions, 2-4 | servicing internal interrupts, 16-25 | | SLL (Shift Left Logical), 18-106 | interrupts, 16-3 | | SQRT (Floating-Point Square Root), 18-107 | latency, 16-4 | | SRA (Shift Right Arithmetic), 18-108 | lightweight interrupt processing, 16-12–16-13 | | SRL (Shift Right Logical), 18-109 | Old Processor Status Register, description, 16-6 | | STORE (Store), 18-110 | overview, 16-1 | | STOREL (Store and Lock), 18-111 | priority (table), 16-15 | | STOREM (Store Multiple), 18-112 | Program Counter stack, 16-6–16-10 | | SUB (Subtract), 18-113 | Program Counter 0 Register, 16-9 | | SUBC (Subtract with Carry), 18-114 | Program Counter 1 Register, 16-9 | | SUBCS (Subtract with Carry, Signed), 18-115 | Program Counter 2 Register, 16-10 | | SUBCU (Subtract with Carry, Unsigned), 18-116 | returning from an interrupt or trap, 16-11-16-12 | | SUBR (Subtract Reverse), 18-117 | sequencing, 16-14-16-16 | | SUBRC (Subtract Reverse with Carry), 18-118 | simulation of interrupts and traps, 16-13-16-14 | | SUBRCS (Subtract Reverse with Carry, Signed), | taking an interrupt or trap, 16-10 | | 18-119 | Timer Facility | | SUBRCU (Subtract Reverse with Carry, Unsigned), | handling timer interrupts, 16-22 | | 18-120 | initialization, 16-21 | | SUBRS (Subtract Reverse, Signed), 18-121 | overview, 16-21 | | SUBRU (Subtract Reverse, Unsigned), 18-122 | Timer Counter Register, 16-22–16-23 | | SUBS (Subtract, Signed), 18-123 | Timer Reload Register, 16-23 | | SUBU (Subtract, Unsigned), 18-124 | uses, 16-22 | | terminology, 18-1–18-4 | traps, 16-4 | | XNOR (Exclusive-NOR Logical), 18-125 | vector area, 16-5-16-6 | | XOR (Exclusive-OR Logical), 18-126 | Vector Area Base Address Register, description, | | integer arithmetic instructions. See arithmetic | 16-5 | | instructions | vector numbers | | | assignments (table), 16-7-16-9 | | integer data types, 3-1–3-5 | definition, 16-5-16-6 | | Integer Environment Register, description, 2-15–2-16 | Wait mode, 16-4-16-5 | | internal peripherals | WARN input, 16-14-16-16 | | address assignments (table), 7-10 | WARN trap, 16-13-16-14 | | alternate register addresses, 7-8-7-9 | INTEST instruction, 17-6-17-7 | | DMA transfers, 11-1 | | | | INTR3-INTR0 signals | | Interrupt Control Register, description, 16-23–16-24 | definition, 7-2 | | Interrupt Requests 3–0 signals. See INTR3-INTR0 | interrupts, 16-3, 16-4 | | signals | INTR3, internal interrupt controller, 16-23-16-25 | | - | INV (Invalidate) instruction, description, 18-76 | | interrupts, enabling and disabling, 16-3 | • | | interrupts and traps | INVERT field (PIO Inversion), 12-2 | | Current Processor Status Register, description, | IOEXT0 bit (Input/Output Extend, Region 0), 10-2 | | 16-1–16-3 | IOEXT1 bit (Input/Output Extend, Region 1), 10-2 | | exception reporting and restarting, 16-16-16-21 | | | Channel Address Register, 16-18 | IOEXT2 bit (Input/Output Extend, Region 2), 10-2 | IOEXT3 bit (Input/Output Extend, Region 3), 10-2 TDI, 7-6 TDO, 7-7 IOEXT4 bit (Input/Output Extend, Region 4), 10-2 TMS, 7-6 IOEXT5 bit (Input/Output Extend, Region 5), 10-2 **TRST, 7-7** IOPI field (I/O Port Interrupt), 16-24 jump instructions JMP (Jump), 18-79 IOWAIT0 field (Input/Output Wait States, Region 0), JMPF (Jump False), 18-80 JMPFDEC (Jump False and Decrement), 18-81 IOWAIT1 field (Input/Output Wait States, Region 1), JMPFI (Jump False Indirect), 18-82 10-2 JMPI (Jump Indirect), 18-83 IOWAIT2 field (Input/Output Wait States, Region 2), JMPT (Jump True), 18-84 10-2 JMPTI (Jump True Indirect), 18-85 IOWAIT3 field (Input/Output Wait States, Region 3), jumps 10-2 delayed branches, 5-2-5-4 large jump and call ranges, 2-25 IOWAIT4 field (Input/Output Wait States, Region 4), IOWAIT5 field (Input/Output Wait States, Region 5), IP bit (Interrupt Pending), 16-2 large return pointer (Irp) description, 4-10 IPA field (Indirect Pointer A), 2-13 register conventions, 4-13 IPB field (Indirect Pointer B), 2-14 leaf procedures IPC field (Indirect Pointer C), 2-13 calling other procedures, 4-8 IRET (Interrupt Return) instruction register stack leaf frames, 4-11 description, 18-77 LEFTCNT field (Left Margin Count), 15-3 restarting mapped DRAM accesses, 16-17-16-18 Line Synchronization signal. See LSYNC signal returning from interrupts and traps, 16-11-16-12 LINECNT field (Line Count), 15-3 IRETINV (Interrupt Return and Invalidate) instruction description, 18-78 LM bit (Large Memory), 8-2, 9-2 restarting mapped DRAM accesses, 16-17-16-18 LOAD (Load) instruction, description, 18-86 returning from interrupts and traps, 16-11-16-12 load and store instructions IRM14-IRM8 fields, 12-2 BP field (Byte Pointer), 2-17 IRM15 field (Interrupt Request Mode, PIO15), format, 3-7-3-9 12-1-12-2 OPT field (Option), 3-8 RA, 3-8 RB or I, 3-8 SB bit (Set Byte Pointer/Sign Bit), 3-8 load operations, 3-9 JMP (Jump) instruction, description, 18-79 multiple accesses, 3-9-3-11 overlapped loads and stores, 5-4-5-5 JMPF (Jump False) instruction, description, 18-80 store operations, 3-9 JMPFDEC (Jump False and Decrement) instruction. Load Test Instruction mode, 17-13-17-14 description, 18-81 Load/Store Count Remaining Register, description, JMPFI (Jump False Indirect) instruction, description, 3-11 18-82 LOADL (Load and Lock) instruction, description, JMPI (Jump Indirect) instruction, description, 18-83 18-87 JMPT (Jump True) instruction, description, 18-84 LOADM (Load Multiple) instruction JMPTI (Jump True Indirect) instruction, description, description, 18-88 18-85 multiple data accesses, 3-9-3-11 JTAG 1149.1 boundary-scan interface LOADSET (Load and Set) instruction, description, See also Test Access Port 18-89 IEEE standard document, xx local registers signals TCK, 7-6 local-register number, 2-10 overview, 2-10-2-11 SETIP (Set Indirect Pointers), 18-105 table, 2-8 logical instructions AND (AND logical), 18-14 ML bit (Multiple Operation) ANDN (AND-NOT logical), 18-15 Channel Control Register, 16-19 multiple data accesses, 3-10 NAND (NAND Logical), 18-102 NOR (NOR Logical), 18-103 returning from interrupts or traps, 16-12 OR (OR Logical), 18-104 MO bit (Integer Multiplication Overflow Exception overview, 2-4 Mask), 2-16 SLL (Shift Left Logical), 18-106 MODE field (Parallel Port Mode), Parallel Port Control SRL (Shift Right Logical), 18-109 Register, 13-2 status results, 2-17-2-18 table, 2-4 MODE field (Video Interface Mode), Video Control XNOR (Exclusive-NOR Logical), 18-125 Register, 15-2 XOR (Exclusive-OR Logical), 18-126 msize value, 4-15 LOOP bit (Loopback), 14-1 msp. See memory stack pointer (msp) Irp. See large return pointer (Irp) MTSR (Move to Special Register) instruction LS bit (Load/Store), 16-19 accessing special-purpose registers, 2-8 BP field (Byte Pointer), 2-17 LSI bit (Line Sync Invert), 15-2 description, 18-92 LSYNC signal, definition, 7-6 FC field (Funnel Shift Count), 2-17 MTSRIM (Move to Special Register Immediate) instruction М accessing special-purpose registers, 2-8 description, 18-93 main data scan path, 17-8-17-10 MTTLB (Move to Translation Look-Aside Buffer Regis-MEMCLK signal, definition, 7-1 ter) instruction, description, 18-94 Memory Clock signal. See MEMCLK signal MUL (Multiply Step) instruction, description, 18-95 memory frame pointer (mfp), description, 4-12 MULL (Multiply Last Step) instruction, description, memory map, 7-8-7-10 18-96 multiple data accesses Memory Stack description, 3-9-3-11 description, 4-5-4-6 Load/Store Count Remaining Register, 3-11 local variables and memory-stack frames, movement of large data blocks, 3-11 4-11-4-12 prologue and epilogue routines for allocation, 4-12 multiplication, routines for performing, 2-19-2-20 storage allocation, 4-2 multiplication instructions memory stack pointer (msp) DMUL (Floating-Point Multiply, Double-Precision), description, 4-6, 4-12 18-58 register conventions, 4-14 FDMUL (Floating-Point Multiply, Single-to-Double Precision), 18-67 memory-stack frame, 4-11-4-12 FMUL (Floating-Point Multiply, Single-Precision), MFSR (Move from Special Register) instruction 18-71 accessing special-purpose registers, 2-8 MUL (Multiply Step), 18-95 description, 18-90 MULL (Multiply Last Step), 18-96 MFTLB (Move from Translation Look-Aside Buffer MULTIPLU (Integer Multiply, Unsigned), 18-97 Register) instruction, description, 18-91 MULTIPLY (Integer Multiply, Signed), 18-98 MULTM (Integer Multiply Most Significant Blts, miscellaneous instructions Signed), 18-99 CLZ (Count Leading Zeros), 18-30 MULTMU (Integer Multiply Most Significant Blts, EMULATE (Trap to Software Emulation Routine), Unsigned), 18-100 MULU (Multiply Step, Unsigned), 18-101 HALT (Enter Halt Mode), 18-73 INV (Invalidate), 18-76 MULTIPLU (Integer Multiply, Unsigned) instruction, IRET (Interrupt Return), 18-77 description, 18-97 IRETINV (Interrupt Return and Invalidate), 18-78 MULTIPLY (Integer Multiply, Signed) instruction, overview, 2-7-2-9 description, 18-98 MULTM (Integer Multiply Most Significant Bits, P Signed) instruction, description, 18-99 PACK signal MULTMU (Integer Multiply Most Significant Bits, assertion duration, 13-2 Unsigned) instruction, description, 18-100 definition, 7-5 MULU (Multiply Step, Unsigned) instruction, disabling hardware handshakes, 13-2-13-3 description, 18-101 forcing active level, 13-2 signal level, 13-4 Page Synchronization signal. See PSYNC signal N page-mode N bit (Negative) DRAM accesses, 9-10 ALU Status Register, 2-16 DRAM page-mode read cycle (diagram), 9-10 arithmetic operation status results, 2-17 DRAM page-mode write cycle (diagram), 9-11 logical operation status results, 2-17-2-18 sequential DRAM accesses, 9-6-9-7 NAND (NAND Logical) instruction, description, 18-102 specifying, 9-2 static-column accesses, 9-2 NM bit (Floating-Point Invalid Operation Mask), 2-15 Parallel Data Register (PDR), 17-4-17-5 NN bit (Not Needed) parallel port Channel Control Register, 16-19 restarting faulting accesses, 16-17-16-18 initialization, 13-4-13-5 returning from interrupts or traps, 16-12 internal DMA transfers, 11-1, 11-8-11-12, 13-2 overview, 13-1 NO-OPs, 2-25-2-26 programmable registers NOR (NOR Logical) instruction, description, 18-103 Parallel Port Control Register, 13-1-13-3 Parallel Port Data Register, 13-4 Not-a-Number Parallel Port Status Register, 13-3-13-4 definition, 3-6-3-7 signals Quiet NaNs (QNaNs), 3-6-3-7 **PACK, 7-5** Signaling NaNs (SNaNs), 3-6-3-7 PAUTOFD, 7-5 NS bit (Floating-Point Invalid Operation Sticky), 2-19 **PBUSY**, 7-5 NT bit (Floating-Point Invalid Operation Trap), 2-19 **POE**, 7-5 PSTROBE, 7-5 **PWE. 7-5** transfers from the host, 13-5 transfers to the host, 13-5-13-7 OER bit (Overrun Error), 14-4 Parallel Port Acknowledge signal. See PACK signal Old Processor Status Register Parallel Port Autofeed signal. See PAUTOFD signal control of tracing, 17-1-17-2 description, 16-6 Parallel Port Busy signal. See PBUSY signal operating system services, host interface (HIF) Parallel Port Control Register, description, 13-1-13-3 specification, xx, 1-7, 16-8 Parallel Port Data Register, description, 13-4 operating-system calls, 2-24-2-25 Parallel Port Output Enable signal. See POE signal OPT field (Option) Parallel Port Status Register alignment of words and half-words, 3-13 address assignments, 7-9-7-11 byte and half-word accesses, 3-12-3-13 description, 13-3-13-4 load and store instruction format, 3-8 Parallel Port Strobe signal. See PSTROBE signal OR (OR Logical) instruction, description, 18-104 Parallel Port Write Enable signal. See PWE signal Out-of-Range trap PAUTOFD signal correcting out-of-range results, 16-20 definition, 7-5 Integer Environment Register, 2-15-2-16 signal level, 13-3 integer exceptions, 16-19-16-20 **PBUSY** signal OV bit (Overflow), 16-23 assertion duration, 13-2 overflow. See spill handler definition, 7-5 disabling hardware handshakes, 13-2-13-3 forcing active level, 13-2 | signal level, 13-4 | PIAWE signal | |--------------------------------------------------------------------|--------------------------------------------------------------------------| | PC0 field (Program Counter 0), 16-9 | definition, 7-4 | | PC1 field (Program Counter 1), 16-9 | extending PIA accesses, 10-2<br>external DMA transfers, 11-9-11-11 | | PC2 field (Program Counter 2), 16-10 | pin changes, Am29205 microcontroller, 7-7 | | PDATA field (Parallel Port Data), 13-4 | PIN field (PIO Input), 12-2 | | PDR. See Parallel Data Register (PDR) | PIO Control Register, description, 12-1–12-2 | | PER bit (Parity Error), 14-4 | PIO Input Register, description, 12-2 | | PERADDR field (Peripheral Address), 11-4 | | | Peripheral Chip Selects, Regions 5–0 signals. See | PIO Output Enable Register, description, 12-3 | | PIACS5—PIACS0 signals | PIO Output Register, description, 12-2–12-3 | | Peripheral Interface Adapter (PIA) | PIO15–PIO0 signals, definition, 7-5 | | See also PIA | Pipeline Hold mode, multiple data accesses, 3-10 | | initialization, 10-2 | pipelining | | overview, 10-1<br>PIA accesses, 10-2–10-4 | delayed branch, 5-2-5-4<br>delayed effects of registers, 5-5 | | extending a PIA read cycle with WAIT (diagram), | four-stage instruction execution, 5-1 | | 10-5 | overlapped loads and stores, 5-4-5-5 | | extending a PIA write cycle with WAIT (diagram), | overview, 5-1 | | 10-5 | Pipeline Hold mode, 5-1-5-2 | | extending I/O cycles, 10-3–10-4<br>normal access timing, 10-2–10-3 | serialization, 5-2 | | PIA read cycle (diagram), 10-3 | PMODE field (Parity Mode), 14-2 | | PIA write cycle (diagram), 10-4 | POE signal, definition, 7-5 | | programmable registers, 10-1-10-2 | POEN field (PIO Output Enable), 12-3 | | signals PIACS5-PIACS0, 7-4 | pointers | | PIAOE, 7-4 | See also indirect pointers | | PIAWE, 7-4 | frame pointer (fp), 4-5 | | Peripheral Output Enable signal. See PIAOE signal | register allocate bound pointer (rab), 4-5 | | peripheral registers | register free bound pointer (rfb), 4-5 register stack pointer (rsp), 4-5 | | See also registers | POUT field (PIO Output), 12-3 | | address assignments, 7-8–7-10 | PPI bit (Parallel Port Interrupt), 16-24 | | field summary (table), C-6–C-11 register summary, C-1–C-10 | | | Peripheral Write Enable signal. See PIAWE signal | PRL field (Processor Release Level), 2-27 | | | procedure linkage<br>argument passing, 4-7-4-8 | | PG0 (Page-Mode DRAM, Bank 0), 9-2 | complex procedure call example, 4-14 | | PG1 (Page-Mode DRAM, Bank 1), 9-2 | conventions, 4-6-4-13 | | PG2 (Page-Mode DRAM, Bank 2), 9-2 | fill handlers, 4-11 local variables and memory-stack frames, | | PG3 (Page-Mode DRAM, Bank 3), 9-2 | 4-11-4-12 | | PHYBASE field (Physical Base Address), 9-4 | Memory Stack, 4-5-4-6 | | PIA Control Register 0, description, 10-1–10-2 | overview, 4-1 procedure epilogue, 4-10-4-11 | | PIA Control Register 1, description, 10-1–10-2 | procedure prologue | | PIACS5-PIACS0 signals | allocation of memory-stack frames, 4-11-4-12 | | definition, 7-4 | definition, 4-8 | | external DMA transfers, 11-3, 11-6 | frame allocation on Register Stack, 4-8–4-9 Register Stack, 4-3 | | PIAOE signal | register stack leaf frame, 4-11 | | definition, 7-4 | register usage convention, 4-13-4-14 | | extending PIA accesses, 10-2 external DMA transfers, 11-9-11-11 | return values, 4-10 | | CALCITICIDATA HANSICIS, 11-3-11-11 | run-time stack, 4-1-4-6 | | | | static link pointer, 4-12-4-13 floating-point instructions, 2-6 trace-back tags, 4-15-4-17 Floating-Point Status Register, 2-18-2-20 transparent procedures, 4-13 floating-point status results, 2-18 general-purpose registers, 2-8-2-11 processor initialization, 2-26-2-28 global registers, 2-10 See also initialization Indirect Pointer A Register, 2-13 Configuration Register, 2-26-2-27 Indirect Pointer B Register, 2-14 Current Processor Status Register, 2-27 Indirect Pointer C Register, 2-13 overview, 2-26 indirect register addressing, 2-12-2-14 Reset mode, 2-27-2-28 instruction environment, 2-14-2-16 processor registers instruction scheduling, 5-1-5-6 field summary (table), B-7-B-9 instruction set, 2-1-2-8 register summary, B-1-B-9 integer arithmetic instructions, 2-1 Integer Environment Register, 2-15-2-16 processor signals integer multiplication and division, 2-19-2-24 A23-A0, 7-1 large jump and call ranges, 2-25 ID31-ID0, 7-1 local registers, 2-10-2-11 **INTR3-INTR0, 7-2** local-register stack pointer, 2-11-2-28 R/W, 7-2 logical instructions, 2-4 RESET, 7-2 logical operation status results, 2-17-2-18 STAT2-STAT0, 7-2 miscellaneous instructions, 2-7 TRAP1-TRAP0, 7-3 multiplication, 2-20-2-22 **WAIT**, 7-1 multiprecision integer operations, 2-25 WAIT/TRIST, 7-2 NO-OPs, 2-25 **WARN, 7-2** operating-system calls, 2-24-2-25 product support pipelining, 5-1-5-6 bulletin board service, iii procedure linkage, 4-1-4-17 documentation and literature, iii, xix-xx processor initialization, 2-26-2-28 technical support hotline, iii Q Register, 2-20 Program Counter 0 Register, description, 16-9 register addressing, 2-10 register model, 2-8-2-12 Program Counter 1 Register, description, 16-9 register usage convention, 4-13-4-14 Program Counter 2 Register, description, 16-10 reserved instructions, 2-8 reset mode, 2-27-2-28 Programmable I/O Port (PIO) run-time checking, 2-24 See also PIO run-time stack organization, 4-1-4-6 initialization, 12-3 shift instructions, 2-4 operating the I/O port, 12-3 special-purpose registers, 2-11-2-13 overview, 12-1 status results of instructions, 2-16-2-19 programmable registers trapping arithmetic instructions, 2-26 PIO Control Register, 12-1-12-2 virtual arithmetic processor, 2-26 PIO Input Register, 12-2 virtual registers, 2-26 PIO Output Enable Register, 12-3 PIO Output Register, 12-2-12-3 protection of registers. See system protection signals, PIO15-PIO0, 7-5 Protection Violation trap Programmable Input/Output signals. See PIO15-PIO0 assert instructions, 2-1 signals protected special-purpose registers, 2-12 Supervisor mode, 6-1 programming User mode, 6-1 activation records, 4-1-4-6 virtual registers, 2-26 ALU Status Register, 2-16-2-17 arithmetic operation status results, 2-17 PSI bit (Page Sync Invert), 15-2 branch instructions, 2-7 PSIO bit (Page Sync Input/Output), 15-2 compare instructions, 2-1 complementing a Boolean, 2-25 PSL bit (Page Sync Level), 15-2 Configuration Register, 2-26-2-27 PSTROBE signal constant instructions, 2-5 definition, 7-5 data movement instructions, 2-4 signal level, 13-3 division, 2-22-2-24 timing hardware handshakes, 13-3 Floating-Point Environment Register, 2-14-2-15 PSYNC signal definition, 7-6 signal level, 15-2 PWE signal, definition, 7-5 ### a Q field (Quotient/Multiplier), 2-20 Q Register, description, 2-20 QEN bit (Queue Enable), 11-3 ### R R/W signal, definition, 7-2 rab. See register allocate bound (rab) RAS3-RAS0 signals, definition, 7-3 RDATA field (Receive Data), 14-5 RDR bit (Receive Data Ready), 14-3 Read/Write signal. See R/W signal Receive Data signal. See RXD signal REFRATE field (Refresh Rate), 9-2 register allocate bound pointer (rab) definition, 4-5 register conventions, 4-14 Register Bank Protect Register description, 6-2-6-3 protecting general-purpose registers, 6-1-6-3 register free bound pointer (rfb) definition, 4-5 register conventions, 4-14 register number, 2-10 Register Stack activation record, 4-3 description, 4-3 local registers as a stack cache, 4-4-4-5 local registers for caching, 4-4-4-5 local variables and memory-stack frames, 4-11-4-12 procedure prologue for frame allocation, 4-8-4-9 relationship to stack cache (figure), 4-4 storage allocation, 4-2 register stack pointer (rsp) definition, 4-5 register conventions, 4-13 register summary, special-purpose registers, B-3-B-6, C-1-C-5 bank organization, B-2 Baud Rate Divisor (BAUD, Address 80000090), Byte Pointer (BP, Register 133), 3-2-3-3 Channel Address (CHA, Register 4), 16-18 Channel Control (CHC, Register 6), 16-18-16-19 Channel Data (CHD, Register 5), 16-18 Configuration (CFG, Register 3), 2-26-2-27 Current Processor Status (CPS, Register 2), 16-1-16-3 delayed effects, 5-5 DMA0 Address (DMAD0, Address 80000034), 11-4 DMA0 Address Tail (TAD0, Address 80000070), 11-4 DMA0 Control (DMCT0, Address 80000030), 11-1-11-4 DMA0 Count (DMCN0, Address 80000038), 11-5 DMA0 Count Tail (TCN0, Address 8000003C), 11-5 DMA1 Address (DMAD1, Address 80000044), 11-7 DMA1 Control (DMCT1, Address 80000040), 11-5-11-7 DMA1 Count Register (DMCN1, Address 80000048), 11-7 DRAM Configuration (DRCF, Address 8000000C), 9-2-9-3 DRAM Control (DRCT, Address 80000008), 9-1-9-2 DRAM Mapping 0 (DRM0, Address 80000010), 9-3-9-4 DRAM Mapping 1 (DRM1, Address 80000014), 9-4 DRAM Mapping 2 (DRM2, Address 80000018), 9-4 DRAM Mapping 3 (DRM3, Address 8000001C), 9-4 Floating-Point Environment (FPE, Register 160), 2-14-2-15 Floating-Point Status (FPS, Register 162), 2-18-2-19 Funnel Shift Count (FC, Register 134), 3-3-3-4 general-purpose, 2-8-2-11 global, 2-10 Indirect Pointer A (IPA, Register 129), 2-13 Indirect Pointer B (IPB, Register 130), 2-14 Indirect Pointer C (IPC, Register 128), 2-13 Integer Environment (INTE, Register 161), 2-15-2-16 Interrupt Control (ICT, Address 80000028), 16-23-16-24 Load/Store Count Remaining (CR, Register 135), 3-11 local, 2-10-2-11 Old Processor Status (OPS, Register 1), 16-6 Parallel Port Control (PPCT, Address 800000C0), 13-1-13-3 Parallel Port Data (PPDT, Address 800000C4), 13-4 Parallel Port Status (PPST, Address 800000C8), 13-3-13-4 peripheral register address assignments, 7-8-7-10 peripheral register summary, C-1-C-10 PIA Control 0 (PICT0, Address 80000020), 10-1-10-2 PIA Control 1 (PICT1, Address 80000024), 10-1-10-2 registers addressing, 2-10 addressing indirectly, 2-12-2-14 ALU Status (ALU, Register 132), 2-16-2-17 PIO Control (POCT, Address 800000D0), 12-1-12-2 ROM Chip Selects, Banks 3-0 signals. See PIO Input (PIN, Address 800000D4), 12-2 ROMCS3-ROMCS0 signals PIO Output (POUT, Address 800000D8), 12-2-12-3 ROM Configuration Register, description, 8-2-8-3 PIO Output Enable (POEN, Address 800000DC), ROM Control Register, description, 8-1-8-2 12-3 processor register summary, B-1-B-9 ROM controller Program Counter 0 (PC0, Register 10), 16-9 See also ROM accesses Program Counter 1 (PC1, Register 11), 16-9 address mapping, 8-4 Program Counter 2 (PC2, Register 12), 16-10 initialization, 8-3-8-4 protection, 6-1-6-3 overview, 8-1 Q (Q, Register 131), 2-20 programmable registers Register Bank Protect (RBP, Register 7), 6-2-6-3 ROM Configuration Register, 8-2-8-3 register usage conventions, 4-13-4-14 ROM Control Register, 8-1-8-2 reserved fields. A-1 signals ROM Configuration (RMCF, Address 80000004), **BOOTW, 7-3** 8-2-8-3 **BURST**, 7-3 ROM Control (RMCT, Address 80000000), 8-1-8-2 ROMCS3-ROMCS0, 7-3 Serial Port Control (SPCT, Address 80000080), ROMOE, 7-3 14-1-14-3 **RSWE**, 7-3 Serial Port Receive Buffer (SPRB, Address ROM Output Enable signal. See ROMOE signal 8000008C), 14-4-14-5 Serial Port Status (SPST, Address 80000084), ROMCS3-ROMCS0 signals, definition, 7-3 14-3-14-4 ROMOE signal, definition, 7-3 Serial Port Transmit Holding (SPTH, Address 80000088), 14-4 round mode, 2-14 Side Margin (SIDE, Address 800000E8), 15-3 Row Address Strobe, Banks 3-0 signals. See special-purpose, 2-11-2-13, B-3-B-6 RAS3-RAS0 signals Timer Counter (TMC, Register 8), 16-22-16-23 RS bit (Floating-Point Reserved Operand Sticky), Timer Reload (TMR, Register 9), 16-23 2-19 Top Margin (TOP, Address 800000E4), 15-3 Vector Area Base Address (VAB, Register 0), 16-5 RSIE bit (Receive Status Interrupt Enable), Serial Port Video Control (VCT, Address 800000E0), 15-1-15-3 Control Register, 14-2 Video Data Holding (VDT, Address 800000EC), rsize value, 4-8 15-3-15-4 virtual, 2-26 rsp. See register stack pointer (rsp) reserved instructions, table, 2-8 RSWE signal, definition, 7-3 Reset mode, 2-27-2-29 RT bit (Floating-Point Reserved Operand Trap), 2-19 RESET signal run-time checking, 2-24 definition, 7-2 run-time organization, register usage conventions, invoking Reset mode, 2-27-2-28 4-13-4-14 Reset signal. See RESET signal run-time stack activation records, 4-1 rfb. See register free bound pointer (rfb) allocation of storage locations, 4-2 RM bit (Floating-Point Invalid Operand Mask), 2-15 definition, 4-1-4-6 RMODE field (Receive Mode) local registers as a stack cache, 4-4-4-5 Serial Port Control Register, 14-3 management, 4-1-4-2 serial port initialization, 14-5 memory stack, 4-5-4-7 Register Stack, 4-3 ROM accesses stack cache, 4-4-4-6 burst-mode accesses, 8-8 stack overflow, 4-6 byte writes, 8-7-8-8 extending ROM cycles, 8-8 RW bit (Read/Write), 11-3, 11-7 narrow ROM accesses, 8-4-8-6 RXD signal, definition, 7-6 ROM address mapping, 8-4 simple ROM accesses, 8-4 RXDI bit (Serial Port Receive Data Interrupt), 16-24 simple writes, 8-7 RXSI bit (Serial Port Receive Status Interrupt), 16-24 | S | ID31-ID0, 7-1<br>INCLK, 7-1 | |---------------------------------------------------------|-----------------------------------------------------------------------------------------------------| | SAMPLE instruction, 17-7 | INTR3-INTR0, 7-2 | | SB bit (Set Byte Pointer/Sign Bit), 3-8 | LSYNC, 7-6 | | | MEMCLK, 7-1<br>PACK, 7-5 | | SC bit (Static-Column DRAM), 9-2 | PAUTOFD, 7-5 | | SDIR bit (Shift Direction), 15-2 | PBUSY, 7-5 | | serial port | PIACS3-PIACS0, 7-4 | | clock specification, 14-5 | PIAOE, 7-4 | | initialization, 14-5 | PIAWE, 7-4 | | internal DMA transfers, 11-1, 11-8–11-12, 14-2, 14-3 | PIO15–PIO0, 7-5 | | overview, 14-1 programmable registers | POE, 7-5 | | Baud Rate Divisor Register, 14-5 | PSTROBE, 7-5<br>PSYNC, 7-6 | | Serial Port Control Register, 14-1–14-3 | PWE, 7-5 | | Serial Port Receive Buffer Register, 14-4–14-5 | R/W, 7-2 | | Serial Port Status Register, 14-3-14-4 | RAS3-RAS0, 7-3 | | Serial Port Transmit Holding Register, 14-4 | RESET, 7-2 | | signals | ROMCS3-ROMCS0, 7-3 | | DSR, 7-6 | ROMOE, 7-3 | | DTR, 7-6<br>RXD, 7-6 | RSWE, 7-3 | | TXD, 7-6 | RXD, 7-6<br>STAT2-STAT0, 7-2 | | UCLK, 7-6 | TCK, 7-6 | | Serial Port Control Register, description, 14-1-14-3 | TDI, 7-6 | | | TDMA, 7-5 | | Serial Port Receive Buffer Register, description, | TDO, 7-7 | | 14-4-14-5 | TMS, 7-6 | | Serial Port Status Register, description, 14-3–14-4 | TR/OE, 7-4 | | Serial Port Transmit Holding Register, description, | TRAP1-TRAP0, 7-3 | | 14-4 | TRIST, 7-2<br>TRST, 7-7 | | serializer/deserializer. See video interface | TXD, 7-6 | | SETIP (Set Indirect Pointers) instruction, description, | UCLK, 7-6 | | 18-105 | VCLK, 7-6 | | shift instructions | VDAT, 7-6 | | EXTRACT (Extract Word, Bit-Aligned), 18-64 | WAIT, 7-1 | | overview, 2-4 | WAIT/TRIST, 7-2<br>WARN, 7-2 | | SLL (Shift Left Logical), 18-106 | WE, 7-4 | | SRA (Shift Right Arithmetic), 18-108 | size value, 4-8 | | SRL (Shift Right Logical), 18-109 table, 2-4 | | | Side Margin Register, description, 15-3 | SLL (Shift Left Logical) instruction, description, 18-106 slp. <i>See</i> static link pointer (slp) | | signal description, 7-1-7-7 | SM bit (Supervisor Mode), 16-3 | | signals | special-purpose registers. See registers | | A23–A0, 7-1 | special-purpose registers | | access priority, 7-7–7-8 | organization, 2-12 | | BOOTW, 7-3<br>BURST, 7-3 | overview, 2-11–2-12 | | CAS3-CAS0, 7-4 | spill handler, 4-10 | | DACK1-DACK0, 7-4-7-5 | · · | | DREQ1-DREQ0, 7-4 | SQRT (Floating-Point Square Root) instruction, description, 18-107 | | DSR, 7-6 | • | | DTR, 7-6 | SRA (Shift Right Arithmetic) instruction, description, | | GACK, 7-5 | 18-108 | | GREQ, 7-5 | | SRL (Shift Right Logical) instruction, description, SUBRCS (Subtract Reverse with Carry, Signed) 18-109 instruction, description, 18-119 ST bit (Set), 16-19 SUBRCU (Subtract Reverse with Carry, Unsigned) instruction, description, 18-120 stack. See run-time stack SUBRS (Subtract Reverse, Signed) instruction, stack cache description, 18-121 definition, 4-2 SUBRU (Subtract Reverse, Unsigned) instruction, relationship to Register Stack (figure), 4-4 description, 18-122 stack overflow, 4-5 SUBS (Subtract, Signed) instruction, description, Stack Pointer 18-123 allocating activation records, 4-4 subtraction instructions definition, 2-11 DSUB (Floating-Point Subtract, Double-Precision), delayed effects of registers, 5-5 protection, 2-25 FSUB (Floating-Point Subtract, Single-Precision), stack underflow, 4-5 18-72 SUB (Subtract), 18-113 STAT2-STAT0 signals SUBC (Subtract with Carry), 18-114 boundary-scan cells, 17-5 SUBCS (Subtract with Carry, Signed), 18-115 definition, 7-2 SUBCU (Subtract with Carry, Unsigned), 18-116 Halt mode, 17-11-17-12 SUBR (Subtract Reverse), 18-117 ICTEST1 scan path, 17-10 SUBRC (Subtract Reverse with Carry), 18-118 ICTEST2 scan path, 17-10-17-11 SUBRCS (Subtract Reverse with Carry, Signed), Load Test Instruction mode, 17-13-17-14 18-119 processor status outputs, 17-2-17-3 SUBRCU (Subtract Reverse with Carry, Unsigned), Step mode, 17-12-17-13 18-120 static link pointer (slp) SUBRS (Subtract Reverse, Signed), 18-121 description, 4-12-4-13 SUBRU (Subtract Reverse, Unsigned), 18-122 register conventions, 4-13 SUBS (Subtract, Signed), 18-123 static parent, 4-12-4-13 SUBU (Subtract, Unsigned), 18-124 STB bit (PSTROBE Level), 13-3 SUBU (Subtract, Unsigned) instruction, description, 18-124 Step mode, 17-12-17-13 Supervisor mode, overview, 6-1 STORE (Store) instruction, description, 18-110 support. See product support store instructions. See load and store instructions system address partition, 7-8 STOREL (Store and Lock) instruction, description, system overview 18-111 access priority, 7-7-7-8 STOREM (Store Multiple) instruction internal peripheral address assignments, 7-8-7-10 description, 18-112 internal peripherals and controllers, 7-8-7-9 multiple data accesses, 3-9-3-11 pin changes, 7-7 STP bit (Stop Bits), 14-2 signal description, 7-1-7-7 clocks, 7-1 SUB (Subtract) instruction, description, 18-113 DMA controller, 7-4-7-5 SUBC (Subtract with Carry) instruction, description, DRAM interface, 7-3-7-4 18-114 I/O port, 7-5 JTAG 1149.1 boundary scan interface, 7-6-7-7 SUBCS (Subtract with Carry, Signed) instruction, description, 18-115 parallel port, 7-5 Peripheral Interface Adapter (PIA), 7-4 SUBCU (Subtract with Carry, Unsigned) instruction, processor signals, 7-1-7-3 description, 18-116 ROM interface, 7-3 SUBR (Subtract Reverse) instruction, description, serial port, 7-6 18-117 video interface, 7-6 system address partition, 7-8 SUBRC (Subtract Reverse with Carry) instruction, description, 18-118 system protection general-purpose registers, 6-1-6-3 overview. 6-1 initialization, 16-21 special-purpose registers, 2-11 operation, 16-21 overview, 16-21 Timer Counter Register, 16-22-16-23 Timer Reload Register, description, 16-23 uses, 16-22 tav. See trap handler argument (tav) Timer interrupt, 16-22 TCK signal Timer Reload Register, description, 16-23 definition, 7-6 TMODE field (Transmit Mode) required setting, A-1 Serial Port Control Register, 14-2 TCV field (Timer Count Value), 16-22-16-23 serial port initialization, 14-5 TD bit (Timer Disable), 16-2 TMS signal TDATA field (Transmit Data), 14-4 definition, 7-6 required setting, A-1 TDELAY field (Transfer Delay), 13-2 Top Margin Register, description, 15-3 TDELAYV field (TDELAY Counter Value), 13-3 TOPCNT field (Top Margin Count), 15-3 TDI signal TP bit (Trace Pending) definition, 7-6 control of tracing, 17-1-17-2 required setting, A-1 Current Processor Status Register, 16-2 TDMA signal definition, 7-5 tpc. See trap handler return address (tpc) DMA transfer count, 11-5 TR field (Target Register), 16-19 processor interrupt, 11-3, 11-7 TR/OE signal, definition, 7-4 terminating external DMA transfers, 11-3, 11-7 TRA bit (Transfer Active), 13-2 TDO signal, definition, 7-7 Trace Facility, 17-1-17-2 TE bit (Trace Enable) control of tracing, 17-1-17-2 trace-back tags, 4-15-4-17 Current Processor Status Register, 16-2 Transmit Data signal. See TXD signal TEMT bit (Transmitter Empty), 14-3 trap handler argument (tav) Terminate DMA signal. See TDMA signal description, 4-10 register conventions, 4-13 Test Access Port, 17-4-17-11 boundary-scan cells, 17-4-17-5 trap handler return address (tpc) BYPASS instruction, 17-8 description, 4-10 EXTEST instruction, 17-6 register conventions, 4-13 ICTEST1 instruction, 17-7 Trap Requests 1-0 signals. See TRAP1-TRAP0 ICTEST2 instruction, 17-7 signals implemented instructions, 17-6-17-8 trap status bits, Floating-Point Exception trap, instruction register, 17-6-17-8 2-18-2-19 INTEST instruction, 17-6-17-7 SAMPLE instruction, 17-7 TRAP1-TRAP0 signals scan paths, 17-8-17-11 definition, 7-3 traps, 16-4 Test Clock Input signal. See TCK signal traps Test Data Input signal. See TDI signal See also interrupts and traps; specific trap names Test Data Output signal. See TDO signal EMULATE (Trap to Software Emulation Routine), 18-60 Test Mode Select signal. See TMS signal enabling and disabling, 16-4 Test Reset Input signal. See TRST signal external traps, 16-4 THRE bit (Transmit Holding Register Empty), 14-3 Floating-Point Exception trap, 16-20 Illegal Opcode trap, 16-4, 17-2 Three-State Control signal. See WAIT/TRIST signal Out-of-Range trap, 16-16, 16-19-16-20 Timer Counter Register, description, 16-22-16-23 priority table, 16-15 Protection Violation trap, 6-1 Timer Facility disabling Timer interrupts, 16-2 TRAP1-TRAP0 signals, 16-4 trapping Arithmetic instructions, 2-26 assignments (table), 16-7-16-9 Unaligned Access trap, 3-13, 16-2 specifying, 2-24 WARN trap, 16-13-16-14 Video Clock signal. See VCLK signal TRST signal Video Control Register, description, 15-1-15-3 definition, 7-7 Video Data Holding Register, description, 15-3-15-4 required setting, A-1 Video Data signal. See VDAT signal TRV field (Timer Reload Value), 16-23 Video DRAM Transfer/Output Enable signal. See TTE bit (TDMA Terminate Enable), 11-3, 11-7 TR/OE signal TTI bit (TDMA Terminate Interrupt), 11-3, 11-7 video DRAM transfers, 9-12 TU bit (Trap Unaligned Access), 16-2 video interface TXD signal, definition, 7-6 clock specification, 15-4-15-5 initialization, 15-4-15-5 TXDI bit (Serial Port Transmit Data Interrupt), 16-24 internal DMA transfers, 11-1, 11-8-11-12, 15-2 operation, 15-4-15-7 overview, 15-1 programmable registers Side Margin Register, 15-3 UART Clock signal. See UCLK signal Top Margin Register, 15-3 UCLK signal Video Control Register, 15-1-15-3 definition, 7-6 Video Data Holding Register, 15-3-15-4 required setting, A-1 receiving data, 15-6-15-7 serial clock divisor, 14-5 signals UD bit (Transfer Up/Down), 11-3, 11-6-11-7 LSYNC, 7-6 PSYNC, 7-6 UM bit (Floating-Point Underflow Mask), 2-15 VCLK, 7-6 Unaligned Access trap, 16-2 VDAT, 7-6 OPT field values, 3-13 transmitting data, 15-5-15-6 underflow. See fill handlers VIDI bit (Video Invert), 15-3 Universal Debug Interface (UDI), 1-7 VIRTBASE field (Virtual Base Address), 9-3 UNIX common object file format (COFF), extensions, VM bit (Floating-Point Overflow Mask), 2-15 VS bit (Floating-Point Overflow Sticky), 2-19 US bit (Floating-Point Underflow Sticky), 2-19 VT bit (Floating-Point Overflow Trap), 2-19 User mode, overview, 6-1-6-2 UT bit (Floating-Point Underflow Trap), 2-19 W Wait mode, 16-4-16-5 WAIT signal V bit (Overflow) definition, 7-1 ALU Status Register, 2-16 extending PIA I/O cycles, 10-3-10-4 arithmetic operation status results, 2-17 figures, 10-5 extending ROM cycles, 8-8 VAB field (Vector Area Base), 16-5 figures, 8-11 VALID bit (Valid Mapping), 9-3 external DMA transfers, 11-9-11-11 VCLK signal WAIT/TRIST signal definition, 7-6 See also WAIT signal video clock divisor, 15-1 definition, 7-2 VDAT signal, definition, 7-6 required setting, A-1 VDATA field (Video Data), 15-4 WARN signal definition, 7-2 VDI bit (Video Interrupt), 16-24 description, 16-14 Vector Area Base Address Register, description, 16-5 WARN trap, 16-13-16-14 vector numbers WE signal, definition, 7-4 WLGN field (Word Length), 14-2 WM bit (Wait Mode), 16-3 Write Enable signal. See WE signal WS0 field (Wait States, Bank 0), 8-2 WS1 field (Wait States, Bank 1), 8-2 WS2 field (Wait States, Bank 2), 8-2 WS3 field (Wait States, Bank 3), 8-2 # X XM bit (Floating-Point Inexact Result Mask), 2-15 XNOR (Exclusive-NOR Logical) instruction, description, 18-125 XOR (Exclusive-OR Logical) instruction, description, 18-126 XS bit (Floating-Point Inexact Result Sticky), 2-19 XT bit (Floating-Point Inexact Result Trap), 2-19 ### Z Z bit (Zero) ALU Status Register, 2-16 arithmetic operation status results, 2-17 logical operation status results, 2-17–2-18 • #### Sales Offices | Chicago (Itasca) | North America | | | International (Continued) | |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------|--------|------------------|----------------------------------------------------| | ARIZONA CALIFORNIA CAL | ALABAMA | | .(205) 882-9122 | KOREA, SeoulTEL(82) 2-784-0030 | | CALIFORNIAN | | | .(602) 242-4400 | FAX(82) 2-784-8014 | | Newport Seach | | | · | I ATIN AMERICA | | Newport Seach | | | | Ft. LauderdaleTELTEL (305) 484-8600 | | San Diego (1619) 580-7030 San Jose (408) 922-0500 Woodland Hills (918) 878-9988 CANADA, Ortario, Kanata. (613) 592-0060 Kanata. (613) 592-0060 Kanata. (614) 522-7800 COLORADO (203) 247-800 COLORADO (203) 247-800 COLORADO (203) 247-800 FLORIDA (203) 247-800 COLORADO (203) 247-800 FLORIDA (203) 247-800 Colorado (Longwood) (407) 81-000 81 | | | | FAX(305) 485-9736 | | San Jose (408) 922-0500 Woodland Hills (918) 879-9988 | | | | SINGAPORETEL(65) 3481188 | | Woodland Hills | | | | FAX(65) 3480161 | | CANADA, Ontario, Kanata. (613) 592-0060 Willowdale (1416) 222-7900 CONNECTICUT (203) 264-7900 CONNECTI | | | | SWEDEN. | | Kanata. (613) 592-0500 Willowdale (416) 222-7800 Willowdale (416) 222-7800 Willowdale (416) 222-7800 Willowdale (416) 222-7800 COLORADO (303) 741-2900 COLORADO (303) 741-2900 FLORIDA, (203) 264-7800 HLORIDA, | | ••••• | .(818) 878-9988 | Stockholm areaTEL(08) 98 61 80 | | Willowadis | CANADA, Ontario, | | | (Bromma) FAX(08) 98 09 06 | | COLORADO (303) 741-2900 (CONNECTICUT (202) 264-7800 FLORIDA (203) 264-7800 FLORIDA (203) 264-7800 FAX (0925) 8303 (Warrington) (0948) 7404 (Woking) FAX (0948) 7404 (Woking) FAX (0048) 7404 (Woking) FAX (00443) 7404 (Woking) FAX (00443) 7404 (Woking) FAX (00443) 7404 (Woking) FAX (0404) 747-440 (Wok | | | | TAIWAN, TaipeiTEL(886) 2-7153536 | | CONNECTICUT (203) 264-7800 Connecticut (205) 830-25 Connecticut (205) 830-25 Connecticut (207) 810-005 Conne | | | | FAX(886) 2-7122183 | | FLORIDA Clearwater (813) 530-9971 Clearwater (814) 530-9971 Clearwater (814) 530-9971 Clearwater (407) 861-0050 Clearwater (407) 862-9292 CEORGIA (404) 449-7920 Clearwater (407) 862-9292 CEORGIA (404) 449-7920 Clearwater (407) 862-9292 CEORGIA (404) 449-7920 Clearwater (708) 773-4422 Naperville (708) 505-9517 Clearwater Cle | | | | UNITED KINGDOM, | | Clearwater | CONNECTICUT | | .(203) 264-7800 | | | Boca Raton | FLORIDA, | | | (Warrington) FAX(0925) 830204 | | Color Colo | | | | London area | | CECRGIA (404) 449-7920 CANADA C | | | | (Woking) FAX(0483) 756196 | | IDAHO (208) 377-0393 | Orlando (Longwood) | | . (407) 862-9292 | North American Depresentatives | | ILLINOIS | GEORGIA | | .(404) 449-7920 | North American Representatives | | ILLINOIS Chicago (Itasca) | IDAHO | | .(208) 377-0393 | | | Chicago (Itasca) | ILLINOIS. | | | Burnaby, B.C DAVETEK MARKETING(604) 430-3680 | | Naperville | Chicago (Itasca) | | .(708) 773-4422 | Kanata, Ontario - VITEL ELECTRONICS (613) 592-0060 | | MARYLAND | Naperville | | .(708) 505-9517 | | | MASSACHUSETTS | | | | | | MINNESOTA (612) 938-0001 Skokie - INDUSTRIAL REPRESENTATIVES, INC (708) 967-84 (704) 875-835 Representatives, Inc (708) 967-84 97-85 Representatives, Inc (708) 97-85 Representatives, Inc (708) 97-85 Representatives, Inc (708) 97-85 Representatives, Inc | | | | | | NEW JERSEY, Cherry Hill | | | | | | Cherry Hill | NEW JERSEY, | | , , | | | Parsippany | Cherry Hill | | .(609) 662-2900 | IOWA | | NEW YORK Brewster (914) 279-8323 Rochester (716) 425-8050 Merriam - LORENZ SALES (913) 469-13 Michita - LORENZ SALES (316) 721-05 MEXICO Chuld Vista (CA) - SONIKA ELECTRONICA (523) 647-42 Mexico (5 | Parsippany | | .(201) 299-0002 | LOBENZ SALES (319) 377-4666 | | Brewster | | | • | | | Rochester | | | (914) 279-8323 | | | NORTH CAROLINA | | | | | | Charlotte | | | | | | Raleigh | Charlotte | | (704) 875-3091 | | | OHIO. Columbus (Westerville) (614) 891-6455 Dayton (513) 439-0268 OREGON (503) 245-0080 PENNSYLVANIA (215) 398-8006 TEXAS, Austin (512) 346-7830 Dallas (214) 934-9099 Houston (713) 376-8084 International | Ralainh | •••••• | (919) 878-8111 | Conditions CONICA ELECTRONICA(619) 490-0340 | | Columbus (Westerville) | | | .(313) 070-0111 | | | Dayton | | ie) | (614) 891-6455 | | | OREGON | | | | | | PENNSYLVANIA | • | | · · · | | | TEXAS, | | | | | | Austin | | ••••• | .(213) 390-0000 | | | Dallas | | | /E10\ 246 7000 | | | Houston | | | | | | NEBRASKA | Houston | •••••• | (713) 376-8084 | | | Description | | | | | | FAX (03) 248 46 42 FRANCE, Paris TEL (1) 49-75-10-10 FAX (1) 49-75-10-13 49- | International _ | | | NEBRASKA | | FRANCE, Paris TEL (03) 248 46 42 TEL (1) 49-75-10-10 GERMANY, Bad Homburg TEL (06172)-24061 FAX (06172)-23195 München TEL (069) 45053-0 FAX (089) 45053-0 Wanchai FAX (852) 865-4355 Wanchai FAX (852) 865-4355 ITALY, Milano TEL (02) 3390541 FAX (0462) 29-8460 Atsugi TEL (0462) 29-8460 FAX (0462) 29-8458 Kanagawa TEL (0462) 47-2911 Tokyo TEL (03) 3346-7550 Osaka TEL (06) 243-3250 Osaka TEL (06) 243-3250 FAX (03) 3342-5196 Osaka TEL (06) 243-3250 FAX (03) 3342-5196 Osaka TEL (06) 243-3250 FAX (060) | RELGILIM Antworpen | TEI | (03) 348 43 00 | | | FRANCE, Paris TEL (1) 49-75-10-10 FAX (1) 49-75-10-13 GERMANY, Bad Homburg TEL (06172)-24061 FAX (06172)-23195 München TEL (089) 45053-0 FAX (088) 406490 FAX (852) 865-435 Wanchai FAX (852) 865-435 ITALY, Milano TEL (02) 3390541 FAX (02) 33103458 JAPAN, Atsugi TEL (0462) 29-8460 Kanagawa TEL (0462) 47-1729 Tokyo TEL (03) 3346-7550 Osaka TEL (06) 243-3250 Milo 243-240 Milo TEL (06) 243-240 Milo TEL (06 | | | | | | FAX | EDANCE Porio | TEI | (1) 40 75 10 10 | | | TEL | FRANCE, Falls | | | NEW YORK | | Bad Homburg TEL | GERMANY | FAA | .(1) 49-75-10-13 | | | München | Rad Hambura | TEI | (06172),24061 | Hauppauge – COMPONENT | | München TEL (089) 45053-0 Centerville - DOLFUSS ROOT & CO (513) 433-67 | Dad Holliburg | FAY | (06172)-24001 | | | FAX | München | TEI | (080) 45053-0 | | | HONG KONG, TEL (852) 865-4525 Wanchai FAX (852) 865-4335 ITALY, Milano TEL (02) 3390541 FAX (02) 38103458 JAPAN, Atsugi TEL (0462) 29-8460 Kanagawa TEL (0462) 47-1729 Tokyo TEL (03) 3346-7550 FAX (03) 3342-5196 Osaka TEL (06) 243-3250 Osaka TEL (06) 243-3250 Osaka TEL (06) 243-3250 Columbus – DOLFUSS ROOT & CO. (614) 885-48 Westlake – DOLFUSS ROOT & CO. (216) 899-93 PENNSYLVANIA RUSSELL F. CLARK CO.,INC. (412) 242-95 PUERTO RICO COMP REP ASSOC, INC (809) 746-65 UTAH FRONT RANGE MARKETING (801) 288-25 WASHINGTON ELECTRA TECHNICAL SALES (206) 821-74 WISCONSIN Brookfield – INDUSTRIAL | Mulicieli | FAY | 0089145055-0 | Centerville – DOLFUSS ROOT & CO(513) 433-6776 | | Wanchai FAX (852) 865-4335 Westiake - DOLFUSS ROOT & CO | HONG KONG | TEI | (852) 965 4525 | | | TALY, Milano | Wanchai | EAV | (032) 003-4325 | Westlake - DOLFUSS ROOT & CO(216) 899-9370 | | FAX (02) 38103458 | ITAL V Milano | TEI | (002) 000-4000 | PENNSYLVANIA | | JAPAN, PUERTO RICO Atsugi TEL (0462) 29-8460 COMP REP ASSOC, INC (809) 746-65 Kanagawa TEL (0462) 47-2911 FRONT RANGE MARKETING (801) 288-25 FAX (0462) 47-1729 WASHINGTON ELECTRA TECHNICAL SALES (206) 821-74 Tokyo FAX (03) 3342-5196 WISCONSIN WISCONSIN Brookfield – INDUSTRIAL | TIALI, Willand | | | RUSSELL F. CLARK COINC(412) 242-9500 | | Atsugi TEL (0462) 29-8460 COMP REP ASSOC, INC (809) 746-65 FAX (0462) 29-8458 Kanagawa TEL (0462) 47-2911 FRONT RANGE MARKETING (801) 288-25 FAX (0462) 47-1729 WASHINGTON Tokyo TEL (03) 3346-7550 ELECTRA TECHNICAL SALES (206) 821-74 FAX (03) 3342-5196 WISCONSIN Osaka TEL (06) 243-3250 Brookfield – INDUSTRIAL | IADAN | | (02) 30103436 | | | FAX (0462) 29-8458 UTAH Kanagawa TEL (0462) 47-2911 FRONT RANGE MARKETING (801) 288-25 FAX (0462) 47-1729 WASHINGTON Tokyo TEL (03) 3346-7550 ELECTRA TECHNICAL SALES (206) 821-74 FAX (03) 3342-5196 WISCONSIN Osaka TEL (06) 243-3250 Brookfield – INDUSTRIAL | | TEI | (0463) 30 9460 | | | Kanagawa TEL (0462) 47-2911 FRONT RANGE MARKETING (801) 288-25 FAX (0462) 47-1729 WASHINGTON WASHINGTON 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 <td< td=""><td>Alsugi</td><td></td><td></td><td></td></td<> | Alsugi | | | | | FAX | Kananawa | TFI | (0462) 47-0438 | FRONT RANGE MARKETING (801) 288-2500 | | Tokyo | Nanayawa | | | | | FAX(03) 3342-5196 WISCONSIN OsakaTEL(06) 243-3250 Brookfield - INDUSTRIAL | Tokyo | | | | | OsakaTEL(06) 243-3250 Brookfield INDUSTRIAL | 10ky0 | | | | | | Ocaka | | | | | 1 AA | Joana | | | | | | | I 77 | (00) 240-0200 | NEFREGENTATIVES,ING(414) 5/4-9393 | | | | | | | Advanced Micro Devices reserves the right to make changes in its product without notice in order to improve design or performance characteristics. The performance characteristics listed in this document are guaranteed by specific tests, guard banding, design and other practices common to the industry. For specific testing details, contact your local AMD sales representative. The company assumes no responsibility for the use of any circuits described herein. ADVANCED MICRO DEVICES, INC. 901 Thompson Place P.O. Box 3453 Sunnyvale, California 94088-3453 (408) 732-2400 (800) 538-8450 TWX: 910-339-9280 TELEX: 34-6306 APPLICATIONS HOTLINE & LITERATURE ORDERING USA (408) 749-5703 JAPAN 3346-7550 UK & EUROPE 44-(0)256-811101 TOLL FREE USA (800) 222-9323 FRANCE 0590-8621 GERMANY 0130-813875 ITALY 1678-77224 EMBEDDED PROCESSOR DIVISION (EPD) TECHNICAL SUPPORT HOTLINE USA (512) 602-4118 TOLL FREE USA (800) 2929-AMD JAPAN 0031-11-1163 UK 0-800-89-1455 RECYCLED & RECYCLABLE Printed in USA Ban-15.5M-1/94-0 16362C