NEW BOT Телеграм, страница - 703912492

fpga_news

687 subscribers

28 photos

4 videos

17 files

477 links

FPGA/ASIC + Machine Learning: Data Centers, Self-driving cars and Edge devices
@nslmike

Download Telegram

About

Blog

Apps

Platform

687 subscribers

Next gen facebook accelerator (MTIA v2)
https://ai.meta.com/blog/next-generation-meta-training-inference-accelerator-AI-MTIA/

Our next generation Meta Training and Inference Accelerator

We are sharing details of our next generation chip in our Meta Training and Inference Accelerator (MTIA) family. MTIA is a long-term bet to provide the most efficient architecture for Meta’s unique workloads.

1.03K viewsedited 16:45

Intel Agilex 5 AI Tensor Block
https://www.youtube.com/watch?v=BrfwvLqxpPk

1.06K views16:03

FPGA Startup offers LLM performance better than Nvidia A100
https://hc2023.hotchips.org/assets/program/posters/HC2023.hyperaccel.ai.Moon.Poster.pdf

1.45K views17:01

for #math researchers
Numerical behavior of NVIDIA tensor cores @ PubMed:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959640/
source thread: https://twitter.com/rzidane360/status/1786958225419706683

PubMed Central (PMC)

Numerical behavior of NVIDIA tensor cores

We explore the floating-point arithmetic implemented in the NVIDIA tensor cores, which are hardware accelerators for mixed-precision matrix multiplication available on the Volta, Turing, and Ampere microarchitectures. Using Volta V100, Turing T4, and ...

1.16K viewsedited 09:07

Google TPUv6 Trillium
https://cloud.google.com/blog/products/compute/introducing-trillium-6th-gen-tpus/

Google Cloud Blog

Introducing Trillium, sixth-generation TPUs | Google Cloud Blog

The new sixth-generation Trillium Tensor Processing Unit (TPU) makes it possible to train and serve the next generation of AI foundation models.

1.29K views17:40

Groq's ready for 2nd chip tape-out

1.23K views09:29

New Startup #Furiosa and a paper
https://furiosa.ai/

1.13K views17:35

https://dli5ezlttyahz.cloudfront.net/FuriosaAI-tensor-contraction-processor-isca24.pdf?p=download/FuriosaAI-tensor-contraction-processor-isca24

1.21K views17:35

Education of Chip Designers at a Large Scale: A Proposal
https://ieeexplore.ieee.org/document/10584365

982 views15:33

https://thecharlieblake.co.uk/visualising-ml-number-formats

Charlie Blake: Blog

Visualising ML number formats

A visualisation of number formats for machine learning --- I couldn’t find any good visualisations of machine learning number formats online, so I decided to make one. It’s interactive, and hopefully gives a sense of the trade-offs between using different…

1.25K views15:45

https://www.anandtech.com/show/21482/tenstorrent-launches-wormhole-ai-processors-466-fp8-tflops-at-300w

Tenstorrent Launches Wormhole AI Processors: 466 FP8 TFLOPS at 300W

Tenstorrent has unveiled its next-generation Wormhole processor for AI workloads that promises to offer decent performance at a low price. The company currently offers two add-on PCIe cards carrying one or two Wormhole processors as well as TT-LoudBox, and…

1.46K views20:51

Exploring logic synthesis with Yosys
https://www.linkedin.com/posts/ashwinrajesh_a-guide-to-logic-synthesis-using-yosys-ugcPost-7221574339165396993-6sN5
56-pages doc: https://drive.google.com/file/d/13ER2Jb7fj6pUIeCzoba837SHPWG-xX-Y/view

As a digital design student, have you ever wondered how the RTL code we write magically gets transformed into circuits? | Ashwin…

As a digital design student, have you ever wondered how the RTL code we write magically gets transformed into circuits?

Have you ever thought how those always blocks were synthesized into gates and LUTs, or even special blocks like BRAM and DSP blocks?
…

1.12K views21:09

RISC-V Ecosystem Panel | Open Source is Transforming AI and Hardware

https://www.youtube.com/watch?v=hQfmT_LM-zY

RISC-V Ecosystem Panel | Open Source is Transforming AI and Hardware

【2024 ANDES RISC-V CON Silicon Valley】
DEEP DIVE INTO AUTOMOTIVE / AI / APPLICATION PROCESSORS AND SECURITY TRENDS
📍About The Event
Recently, RISC-V, with its open, streamlined, and scalable configuration, has become the mainstream solution adopted by leading…

1.23K views21:22

#BlockFP #FPGA #LLM
https://myrtle.ai/resources/llama3-blockfloat16-quantization/

Optimizing Llama3: Leveraging Blockfloat16 for Weights and Activations

Explore how Block Floating Point 16 (BFP16) with an effective 9-bit rate enhances Llama3 for up to 8x faster inference on FPGAs. Learn why BFP16 is a compelling alternative to FP8 for optimizing large language models without sacrificing accuracy.

1.53K views10:55

https://www.cnbc.com/video/2024/08/23/how-google-makes-custom-cloud-chips-that-power-apple-ai-and-gemini.html

Inside Google's chip lab, where it makes custom silicon to train Gemini and Apple AI models

Google was the first cloud provider to make its own custom AI chips, called TPUs, when they first came out in 2015 - a trend both Amazon and Microsoft followed years later. Now, Apple has revealed it uses TPUs to train its AI models, positioning Google chips…

1.39K views17:02

#MTIA
https://engineering.fb.com/2024/08/22/ml-applications/meta-mtia-hardware-co-design/

Engineering at Meta

Inside the hardware and co-design of MTIA

In this talk from AI Infra @ Scale 2024, Joel Colburn, a software engineer at Meta, technical lead Junqiang Lan, and software engineer Jack Montgomery discuss the second generation of MTIA, Meta’s …

1.75K views14:27

https://arxiv.org/abs/2409.03384

Hardware Acceleration of LLMs: A comprehensive survey and comparison

Large Language Models (LLMs) have emerged as powerful tools for natural language processing tasks, revolutionizing the field with their ability to understand and generate human-like text. In this...

2.05K views11:45

Let it be here. Life lessons from one of the greatest computer scientists.
https://cacm.acm.org/opinion/life-lessons-from-the-first-half-century-of-my-career/

1.31K views19:49

Tenstorrent Wormhole Series

Part 1: Physicalities
Part 2: Which disabled rows?
Part 3: NoC propagation delay
Part 4: A touch of Ethernet
Part 5: Taking apart T tiles
Part 6: Vector instruction set
Part 7: Bits of the MatMul

https://tenstorrent.com/vision/community-highlight-tenstorrent-wormhole-series-part-1-physicalities

Community Highlight: Tenstorrent Wormhole Series Part 1: Physicalities | Tenstorrent

An in depth look at Tenstorrent Wormhole, originally posted on corsix.org

5.54K views17:20

I'm excited to share a sneak peek of our latest work at Intel Corporation —
a groundbreaking approach to Electronic Design Automation (EDA) that integrates intelligent design agents into the engineering workflow.

Our agent, trained on massive datasets from the best of our engineers, provides real-time insights and solutions within the EDA tools, enabling the solving of tasks ranging from simple to complex multi-iteration challenges, making the design process more efficient and innovative.

*This demo utilizes The OpenROAD Project, an open-source EDA tool developed by The Regents of the University of California.

url: https://www.linkedin.com/posts/itai-yeshurun_intel-eda-llm-ugcPost-7267911435886682112-IIhh

1.47K viewsedited 16:49

This media is not supported in your browser

VIEW IN TELEGRAM

1.55K views16:49