





# TOSCA Multi-platform FPGA Design Kit

# Data Sheet TOSCA FDK DS 092001 10

#### **Features**

- FPGA Design Kit for high-end applications development
- · Hardware platform independent
- · Network on Chip (NoC) solution
- · VHDL source code fully available
- Full support provided at Hardware, Firmware and Software levels
- Drastically reduce development time, focusing on the user application

## **PCI Express-Centric Architecture**

- Transaction Layer Packet (TLP) data packets routed through configurable full mesh switches
- Segregated I/O Space (CONTROL Plane) and Memory Space (DATA Plane)

#### **Hardware Environment**

- · Targets Xilinx Virtex-5T device family
- Full simulation environment and Bus Functional Models (BFM) provided
- Extensive procedures library along with tutorials and reference designs
- Implementation based on standard Xilinx design tools (XST & ISE 10.x)
- Architecture ready for migration to next generation FPGA
- Other FPGA family support planned
- PEV\_1100 Evaluation Hardware Platform available

#### **Software Environment**

- · Device Drivers and User Library
- XprsMON Monitor
- · FPGA bit stream remote update
- Built In Test capability

#### **IOXOS IP Core Library**

- TOSCA NoC Full Mesh Central Switch (x4)
- PCI Express EP Wrapper
- · DDR2 Multi-port Memory Controller
- Intelligent DMA Controller
- · I2C, SPI and SRAM Controllers
- VME64x Master/Slave (+VXI add-on)



## **Overview**

Field Programmable Gate Arrays (FPGAs) are gaining acceptance as a powerful platform for many applications in today's industry due to their speed, embedded features and flexibility. The continuous increase in terms of complexity of these applications combined with shortening time to market requirements have lead to structural optimization in FPGA design and the use of intellectual property (IP) cores.

The most challenging issue FPGA designers face is the integration of their application with all the IP cores necessary to implement a fully functional FPGA.

To overcome this challenge, IOxOS Technologies SA releases the TOSCA FPGA Design Kit solution, a comprehensive FPGA design environment which provides full visibility and facilitates the integration of the user application within a high-performance PCI Express switch-centric architecture. This approach makes possible the implementation of a complete Network on Chip (NoC) solution which includes a PCI Express endpoint, DDR2 memory controllers, intelligent DMA engines, Multi Root I/O Virtualization and embedded user areas tightly coupled with these built-in resources.

The TOSCA FPGA Design Kit provides RTL and behavioral VHDL source code together with reference designs, access to the IOxOS IP core library, a set of test-benches and Bus Functional Models. IOxOS Technologies SA also provides an evaluation platform to dramatically reduce the development time.

## Introduction

Conventional design kits offer a set of IP cores along with implementation examples. IOxOS Technologies SA goes one step further releasing the TOSCA FPGA Design Kit, a comprehensive system oriented design environment that covers all the path, from the software application to the FPGA user code.

Every Software element (OS device drivers, utilities and debugging tools) is fully integrated within the Hardware environment (bus interfaces, I/O & Configuration space, embedded DDR2 controllers, intelligent DMA engines and interrupt managers inter alia).



The allocated FPGA user area can be implemented with VHDL or the following high-level tools methodologies in order to fulfill the most demanding applications such as signal processing or high performance computing. Soft IP microprocessors, such as Xilinx MicroBlaze, can be easily integrated in the FPGA user area with full remote control over the PCI Express infrastructure.

# **FPGA Technology Trends**

Today's high-performance 65nm FPGA devices such as Xilinx Virtex-5T family, provide a very high density of logic resources. Even the smaller part (XC5SX30T) can host a complete NoC system. Next generation of programmable logic, based on 40nm FPGA devices will double the logic resources density without increasing the cost. The system clock frequency will be also improved by 10%.

All these facts along with FPGA inherent parallelization capabilities make these devices a very powerful and attractive platform to implement DSP or mathematical algorithms.

To make the most of FPGA current and future features, TOSCA FPGA Design Kit is ready for the next FPGA generation, offering the TOSCA architecture, a fully scalable NoC solution with parameterizable IP cores and based on well established technologies with longterm life cycles, such as PCI Express and Ethernet.

#### **NoC Architecture**

The Network on Chip (NoC) solution is an alternative to the traditional bus-based architectures widely used in System on Chip (SoC) implementations. This alternative has proven to be plenty of benefits:

- Scalability and growing capabilities
- Increased point to point bandwidth ( > GBytes/s )
- Controlled latency with best in class Quality of Service (OoS)
- Easier verification due to modular approach
- Clocking strategy depends on physical layer
- Easier silicon mapping

FPGAs are suitable platforms for NoC architectures, because of their high density of DPRAM resources, widely used for local packet buffering.

The TOSCA architecture is based on a NoC approach which uses PCI Express as legacy network, routing PCI Express Transaction Layer Packet (TLP) data through configurable full mesh switches. Each TLP is composed of a TLP Header, a data payload and an optional TLP digest field for ECRC purposes.

This PCI Express centric implementation provides a de-facto standard packet structure in order to support Single or Multi Root I/O Virtualization (SR\_IOV and MR\_IOV capabilities referenced in the new PCI Express specification released by the PCISIG), which is a key element for the upcoming generation of multi-core microprocessors and embedded systems.

## **Reference Design Example**

The following diagram provides an example of a TOSCA architecture based application which can be implemented using TOSCA FPGA Design Kit available resources. A dual multi-core CPU system on a VME64x Single Board Computer (SBC) with a Virtex-5T FPGA on board. The FPGA interfaces both multi-core CPUs with the VME64x backplane, providing user specific I/O control and local DDR2 shared memory access with DMA capability.



## **TOSCA Architecture Overview**

The TOSCA architecture is based on a PCI Express switch centric structure which implements a memory mapped model with segregated I/O Space (CONTROL Plane) and Memory Space (DATA Plane).

The CONTROL Plane implements the I/O Space Read/Write transactions, initialization tasks and interrupt handling. The IO\_Bus, a 32-bit Master/Slave IOxOS proprietary bus, supports these facilities across the different modules (referenced as Agent\_SW) connected to the central switch. The DATA plane implements the Memory Space Read/Write transactions with optimal data rate performance across a switched NoC infrastructure.

### **TOSCA Central Switch**

The TOSCA architecture main core is the TOSCA Central Switch (referenced as Central\_SW), a N-port dual full mesh non blocking switch. Both Request and Response switches integrate INgress and OUTgress TLP buffer queues (FIFO) providing an optimal data transfer bandwidth.

Thanks to its full mesh interconnection, each TOSCA Central\_SW port can provide the following four independent back-end interfaces, able to run simultaneously at full speed (i.e 64-bit @ 200MHz  $\sim$  1.6 GBvtes/s):

- Master Write Request Read Request
- Master Read Response
- Slave Write\_Request Read\_Request
- Slave Read Response



The TOSCA Central\_SW is fully configurable and can be customized from 2 up to 8 ports. A 64-bit 4-port TOSCA Central\_SW (default configuration) running at 200 MHz provides an aggregate bandwidth of 2 x 4 x 1.6 GBytes/s =  $\sim$  12.8 Gbytes/s. Configurations of 128-bit running at 250 MHz are ready to be supported.

The routed data packets are directly based on PCI Express TLP format. Additional 16-bit information is added to the TLP Header providing the switch with self-routing information (Source &

Destination Port Number) and flushing management. Based on the Source information, TLP are sorted in separate queues, preventing the Slave to be blocked by a single Master.

The INgress and OUTgress buffer queues, implemented with true DPRAM on every back-end interface, allow the TOSCA Central\_SW back-end interfaces to run in their own time domain.

## Agent\_SW

The following TOSCA Agent\_SW IP cores are supplied:

- PCI Express EP Wrapper
- DDR2 Multi-port & IDMA Controllers
- VMF64x Master/Slave
- USER Block



VXI and PXI Agent\_SW IP cores are still under development and will be released soon.

The Agent\_SW CONTROL plane back-end is handled by the following two (2) IO\_Bus interfaces:

- Master IO\_Bus Read/Write, used by local intelligent resources such as embedded processors (i.e MicroBlaze)
- Slave IO\_Bus Read/Write used to map I/O Space resources

The Agent\_SW DATA plane back-end is handled by the following four (4) interfaces, all of them implemented through a 64-bit back-end running at 100-250 MHz:

- 1. Master Tx Read Request and Write Request
- Master Rx Read\_Response
- 3. Slave Rx Read\_Request and Write\_Request
- 4. Slave Tx Read\_Response

Each interface can be active concurrently, providing a global datarate of 6.4 GBytes/s per Agent\_SW, and interfaced to the TOSCA Central SW with its own time domain, simplifying the IP integration.

Thanks to its memory mapped model, two Agent\_SW connected through the TOSCA Central\_SW can deploy a transparent Bus Bridge function as long as an address remapping capability is provided. As an example, when both PCI Express EP Wrapper and VME64x Agent\_SW are instantiated, transparent bridging between PCI Express and VME64x is supported.

Both PCI Express EP Wrapper and VME64x Agent\_SW integrate programmable INgress MMU supporting address remapping and physical switch port allocation based on source address information.

A fully functional PCI Express to VME64x Transparent Bridge function is implemented on the PEV\_1100 Evaluation Hardware Platform using the TOSCA FPGA Design Kit.

# PCI Express EP Wrapper Agent\_SW

The PCI Express EP Wrapper Agent\_SW, built on top of the Virtex-5T embedded PCI Express EP LogiCORE IP, is the default Host Interface for the TOSCA FPGA Design Kit. All software utilities, running under PC or Workstation deploy this interface. This Agent\_SW provides a PCI Express Endpoint solution with both x1 and x4 lanes configuration.



The PCI Express Controller module interfaces the PCI Express EP LogiCORE IP Transaction Layer Interface (TLI) protocol and the onchip TOSCA infrastructure. The TLP packets decoded in I/O Space are sent to an IO\_Bus Master controller, providing a direct mapped access to all Agent\_SW I/O Space area. The TLP packets decoded in Memory Space are reformatted to TOSCA NoC format and sent across the NoC infrastructure. Address remapping and NoC routing are supported through a programmable INgress MMU.

The Central Interrupt Manager (CIM) module handles the on chip system interrupts and dispatch them through the PCI Express Message Signaled Interrupt (MSI) interface.

The TOSCA architecture can support up to two (2) PCI Express EP Wrapper Agent\_SW in Virtex-5T devices hosting more than one PCI Express EP LogiCORE IP (XC5VFX70T, XC5VFX100T and XC5VFX200T). This mode of operation can be used in applications where the TOSCA architecture is shared with an embedded CPU.

## VME64x Agent\_SW

The VME64x Agent\_SW implements a fully functional VME64x Master/Slave Interface with Slot-1 function capability. It handles all data transfer types defined by the VME64x VITA standard (SBT, BLT32, MBLT64, 2eVME & 2eSST) and integrates configurable byte swapper capability (hardware)



The VME64x Master is a memory mapped Interface acting as transparent bridge, directly accessible from the following Agent SW:

- PCI Express EP Wrapper
- DDR2 Multi-port & IDMA Controllers
- USER Block

The VME64x Slave module allows to map internal resources over the VME A32 addressing space. A programmable INgress MMU, divided in pages of 1 MBytes, allows to remap the VME addressing space to the internal TOSCA address mapping.

The VME64x Agent\_SW also integrates some specific functions:

- VME Error handling support
- Programmable Interrupt Generator
- Global 48-bit Timer with external synchronization through IRIG-B or direct VME signaling

# DDR2 Multi-port Memory Agent\_SW

The DDR2 Multi-port Memory Agent\_SW implements a complete DDR2 controller with segregated Read and Write data accesses. The multi-port arbitration algorithms as well as dynamic port enabling are selectable, providing best in class QoS. The four ports are assigned by default as follows:

Port 1 : TOSCA Central SW Slave port (NoC)

• Port 2: IDMA Controller Descriptor Fetch & Update

Port 3: IDMA Controller data

Port 4 : Private USER Block Agent SW



The DDR2 memory array is accessible through the Port\_1 (NoC) for every Agent\_SW connected to the TOSCA Central\_SW .

The DDR2 Multi-port Memory Agent\_SW supports up to 4 DDR400 devices (256MB, 512MB, 1GB and 2GB) providing up to 1 GBytes of shared memory.

- DDR500 DDR667 (technology dependent)
- 128-bit data interface ( → 3.2 GBytes/s )
- RLDRAM II protocol
- Additional arbitrated Ports (up to 8) dedicated to user specific purpose

The DDR2 Memory Controller is tightly coupled through Port\_2 and

Port\_3 to the embedded IDMA Controller Agent\_SW to optimize DMA accesses to the DDR2 array.

# IDMA Controller Agent\_SW

The IDMA Controller Agent\_SW implements a multi-channel chained DMA controller. Data transfer is handled autonomously between the DDR2 memory array and the NoC infrastructure. The IDMA Controller Agent\_SW uses two independent engines (Read\_Engine and Write\_Engine).

The Read\_Engine reads data from the DDR2 array memory and send them to the NoC infrastructure (to any of the Agent\_SW) or to its Direct\_OUT port. The Write\_Engine reads data from the NoC infrastructure (from any of the Agent\_SW) or from its Direct\_OUT port and send them to the DDR2 array memory.

The Direct\_OUT and Direct\_IN ports are both connected to the USER Block Agent\_SW to provide the user with a fast access to this resource.



Each DMA channel has chaining capability (up to 16 MBytes per chain-descriptor) and is associated to an "End of Transfer" Interrupt, coupled to status and time-stamp information. The DMA engines are controlled by a set of registers mapped into the I/O space, the chain descriptors are located in the DDR2 memory. The DMA engines implement an inter-engine triggering capability which allows to start a 2<sup>nd</sup> engine on condition occurred on a 1<sup>st</sup> engine. The DMA Write\_Engines are able to issue multiple outstanding Read\_Request to support optimal PCI Express read performance.

# USER Block Agent\_SW

The FPGA area and logic resources allocated to the user application are embedded in the USER Block Agent\_SW. Multiple USER Block Agent\_SW can be instantiated in a TOSCA architecture. As any other Agent\_SW, the USER Block Agent\_SW has access to the predefined interfaces with the NoC infrastructure but also integrates a direct connection with the DDR2 and IDMA Controller Agent\_SW.



Each USER Block Agent SW is provided with the following ports:

- IO\_Bus Slave port to interface the USER Block Agent\_SW with the I/O Space infrastructure (1KByte per Agent\_SW).
- IO\_Bus Master port to support local intelligent modules such as embedded processors (i.e. MicroBlaze).
- NoC Slave port to map the USER Block Agent\_SW into the TOSCA MEMORY Space. It provides support for Write\_Request, Read\_Request and Read Response.
- NoC Master port to provide the USER Block Agent SW with full access to the NOC infrastructure.
- DDR2 direct path to provide a simple Read/Write interface to the DDR2 Multi-port Memory Agent\_SW. It allows the USER Block Agent\_SW to access the DDR2 array without implementing the complete NoC Master controller.
- IDMA Direct\_IN/Direct\_OUT ports to support simple DMA driven data channels.

The USER Block Agent\_SW logic resource availability is limited by the targeted FPGA device. Associated timing and physical constraints must be defined by the user in the constraints file. There is no limitation in terms of I/O technology (logical and electrical).

The USER Block Agent\_SW functionality can also be simulated in its own environment, supported with specific Bus Functional Models (BFM) delivered with the TOSCA FPGA Design Kit, to emulate the TOSCA NoC infrastructure back-end interfaces.

The allocated FPGA user area can be implemented with VHDL or following high-level tools methodologies to design:

- Avionics interfaces such as ARINC429, MIL1553, AFDX-ARINC-664 from VHDL hardware descriptions,
- DSP applications generated with high-level tools such as Xilinx AccelDSP & XtremeDSP or SynplifyDSP
- C/C++ based applications from companies as Mitrionics or languages like Impulse C, among other applications

## Interrupt Support

Each Agent\_SW integrates a 16-input Interrupt Controller (ITC16). When an Interrupt is locally detected, the event is sent to the Central Interrupt Manager (CIM) which manages all ITC16 instantiated. The CIM controller is usually integrated in the PCI Express EP Wrapper Agent\_SW and generates PCI Express MSI Messages. The MSI parameters (MSI Address / MSI Data) are initialized by the Host Controller.



## Other IP Cores Provided

To make the system integration faster and easier, the TOSCA FPGA Design Kit includes a VHDL IP core library with the following complementary functions:

- I2C Master Controller
- Virtex-5T System Monitor interfaces to on-chip IO Bus
- Virtex-5T PCI Express LogiCORE interfaces to on-chip IO Bus
- External FPGA configuration from SPI Serial Flash
- Power On Finite State Machine (PON\_FSM) microprogrammed controller to manage system initialization from external SPI Flash EPROM
- Virtex-5T System Monitor controller

## **FPGA Simulation Environment**

The TOSCA FPGA Design Kit is delivered with full VHDL source code together with a set of test-benches and Bus Functional Models (BFM) to set up a complete VHDL simulation environment for functional verification purposes. This simulation environment is supported by the main HDL simulation tool vendors (Mentor Graphics ModelSim and Aldec Riviera).

The following BFMs, driven by VHDL Procedure Packages, are available:

- PCI Express LogiCORE TLI Target/Initiator
- VME64x Master/Slave with direct support of SBT, BLT32, MBLT64, 2eVME & 2eSST



## **FPGA Physical Implementation**

The FPGA physical implementation is based on the following standard Xilinx design tools (no 3<sup>rd</sup> party VHDL synthesis tool required):

- Synthesis: Xilinx XST (included in Xilinx ISE 10.x)
- Place & Route: Xilinx ISE 10.x
- PROM file generation: Xilinx iMPACT

A complete set of tutorials and reference designs are also supplied to guide the designer through all the implementation and verification flow.

#### Software and Services

The TOSCA FPGA Design Kit is supported with the following software items:

- LINUX device drivers (PCI Express & VME64x)
- User library with examples
- · XprsMON and FPGA Built utilities

XprsMON is a Linux application linked with the TOSCA user library which allows the user to interactively perform a set of commands to operate the TOSCA infrastructure:

- Patch, Examine, Deposit & Display operations on Memory and I/O Space references
- Patch, Examine, Deposit & Display operations on VME64x references
- Exercise Interrupts, Global\_Time and DMA capabilities
- Built In Test (BIST) and Configuration support
- Flash Memory programming (FPGA bit stream download)

# Licensing

The TOSCA FPGA Design Kit is delivered with full RTL and behavioral VHDL source code. This source code is provided under NDA and can not be supplied to 3<sup>rd</sup> party end users. The TOSCA FPGA Design Kit basic package includes a 5 days training and 100 runtime (RT) licenses. Additional RT licenses are available upon request.

Learn more: info@ioxos.ch



4, chemin de Fontenailles 1196 Gland SWITZERLAND

tel: +41 22 364 76 90 fax:+41 22 364 76 94 Email: info@ioxos.ch

## Ordering information

| Product       | Description                         |
|---------------|-------------------------------------|
| TOSCA1100-BAS | TOSCA FPGA Design Kit Basic Package |
| TOSCA1100-RT1 | 200 FPGA RT licenses                |
| TOSCA1100-RT2 | 500 FPGA RT licenses                |
| TOSCA1100-RT3 | 1K FPGA RT licenses                 |

©IOxOS Technologies SA All rights reserved All other brands or names are the property of their respective holders