-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU component manufacture impacts #65
Comments
ECODIAG APPROACHLink : https://ecoinfo.cnrs.fr/ecodiag-calcul/
|
First approximation would be to consider a GPU as "a compute chip" + "a memory bank" + "a board" and apply adapted formulas from CPU, RAM and a fix value for the board. Compute chip partThe original CPU formula to compute manufacturing impact is: We can rewrite it for GPU components: Memory partWe can keep the same formula (using highest density value if unknown): Board partFor the board part I guess we can take constants from the motherboard and maybe divide them by 2 (or 3)?. I guess that given the precision of the above formulas it would be fair to say that a GPU board is around half (or one-third) the size of a motherboard and contains half (or one-third) the number of electronic components. Total manufacturing impactLet me know what you think on this approach. 🤗 Edit: Change latex formula because GitHub can't render them properly... |
After a great discussion with @ThibaultPirson we will make some updates to the proposed formulas above and we will start with the "board part". The objective here is to quantify the contribution of the PCB manufacturing for the GPU. Board manufacturing formulaHypothesis:
So the final GPU board manufacturing formula will only depend on the With the followings:
Estimate impacts factorsLiu et al., 2014For a 4-layered PCB epoxy based substrate (FR-4) we can retain the GWP impact factor per square meter: Ozkan et al., 2018The study compares there results with Liu et al., 2014. Still with a 4-layered FR-4 PCB. Nassajfar et al., 2021Still with a 4-layered FR-4 PCB. Conclusion and criticsWe can take the following impact factors for now:
We can consider these impact factors as lower bounds because, from what I have seen, GPU boards tend to be from 8 to 14 layers PCB. Here we have impact factors for 4 layered PCB only. Any input on this part to estimate for PCBs with more layers will be a great help. Let me know if you have any comments, I'll try to do the same work for memory and chip parts soon! Thanks again @ThibaultPirson for sharing his knowledge on this subject ! 🤗| Edit: Change latex formula because GitHub can't render them properly... |
Thank you for this work ! I was wondering if we could open 1 issue per component, since all features of the API will benefit from these improvements ? It would also be easier to list all available impact factors for each component. I think we could also open one for SSD and HDD sine we have new data sources that could be used (even if it is not related to GPU). For the motherboard, we were wondering how could we use ADPf (given in MJ) to retrieve an impact factor for PE ? I guess that the main difference is that PE will also account for the primary energy produce by renewable sources. I'll try to dig on this subject. Maybe I could ask the LCA experts on Boavizta chat. We could capitalize on this aggregation of impact factors sources to:
I will try to dig down myself and ping some LCA experts to get these data sources. |
Still in the process of untanggling all impact factors of component formulas. Here is the actual state for CPUs. For Boavizta members only, the original document is here. CPU Manufacturing FormulaDieScope and hypotheses:
IMP_CPU_DIE:
SocketScope and hypotheses:
IMP_CPU_SOCKET:
TransportationScope and hypotheses:
IMP_CPU_TRANSPORTATION:
Packaging??? No mention of a packaging impact. Heat sinkScope and hypotheses:
IMP_CPU_HEAT_SINK:
Aggregated impactsSource: Green Cloud Computing I feel like all the sub impacts listed for CPU manufacturing also apply for GPU "compute chip" manufacturing. Does anyone have an opinion on that? |
Correction from the previous message. When computing the impacts for the compute chip in a GPU, we should not include the "socket" part as it is soldered on the board directly. Thus, the formula is: |
RAM Manufacturing FormulaDieScope and hypotheses:
With the IMP_RAM_DIE:
TransportationScope and hypotheses:
IMP_RAM_TRANSPORTATION:
PCBScope and hypotheses:
IMP_RAM_PCB:
Gold connectorsScope and hypotheses:
IMP_RAM_GOLD:
Aggregated impactsSource: Green Cloud Computing For the GPU memory part we can take the impacts of the memory die and transportation. The PCB part is already done above and the gold part can be inferred from the size of the connector (next step). |
Interesting observation, we clearly don't have the same values concerning the PCB manufacturing for RAM modules in Green Cloud Computing compared to other studies above.
I guess the assumption of a 4-layer PCB is too light for RAM modules and thus GPU board as well? We also need to check where Green Cloud Computing values come from. |
Concerning the amount of gold in the connector, I think we can estimate based on a comparison of RAM DDR4 connector from Green Cloud Computing and PCIe 4.0 connectors. DDR4 connectorSpecifications:
Pin area estimationDDR4 RAM modules have pins of different height, but constant width. Using the schematics from Micron we can get the following dimensions :
A small portion of the pins also have intermediate heights between 1.9 and 2.4 mm. I assume that we have 50/50 big pins and small pins. This is an upper bound estimation, there is a little more of small pins compared to big pins. So, the average pin dimensions are 2.15 mm [height] x 0.6 mm [width]. Thus, the average pin area is 2.15 x 0.6 = 1.29 mm2 Density of goldTotal area of pins: 288 * 1.29 = 371.52 mm2 PCIe connectorSpecifications:
Today, the majority of GPUs use PCIe x16, and smaller ones use PCIe x8. From the specs, we can deduce the following characteristics:
Some pins have a smaller height, but we will not consider it. So, we will get an upper bound estimation. Amount of goldFrom the previous estimations, we can compute the amount of gold for PCIe connectors based on the size of the pins.
Manufacturing impactsBased on the impacts reported for DDR4 RAM in Green Cloud Computing, we can estimate them for a PCIe connector.
|
Nice work ! Thanks for given that mutch detail. As far as I understand, you got all the piece of the GPU puzzle ? Have you been able to compute it for some GPU ? I would be interested to see and to share it with some “experts”. |
Thanks @da-ekchajzer I'll make a complete summary and compute for some GPUs this week! |
Summary of GPU manufacturing impactsEdits:
Compute chipScope: Die + Transportation Hypotheses and limitations:
Memory chipsScope: Die + Transportation Hypotheses and limitations:
Board and otherScope: PCB + PCIe connector + Heat Sink Hypotheses and limitations:
Impact factors
Discussions
ExamplesNVIDIA GeForce RTX 4090Characteristics (from techpowerup)
ImpactsCompute chip
Memory chips
Board
Total
NVIDIA A100 80GBCharacteristics (from techpowerup)
ImpactsCompute chip
Memory chips
Board
Total
|
Follow-up meeting with UCL researchers
|
One quick explanation on why the impacts are low (especially for the memory part) is the calculation of density. In the old (pre-v1) API version, we use the value that maximizes the impact. The density is thus 0.625 GB/cm2 compared to the value taken here of 1.875 GB/cm2. The first value corresponds to a Samsung DRAM with a technology node of 30 nm (Green Cloud Computing). 30 nm technology node for DRAM seems to be pre 2014 value. The second value is the average for 20 nm. Data from Green Cloud Computing ( I feel like for the case of GPUs and especially recent ones like the NVIDIA GeForce RTX 4090 and NVIDIA A100 80 GB from above, it is fair to take the value from 20 nm technology node instead of the "old" 30 nm, even though it reduces the impact. We can also consider from Pirson & Bol 2021 a calculated average over ~12 years of data from 2010 to 2022 density for DRAM of 1.625 GB/cm2. Here are the different values of memory chips impacts of an NVIDIA A100 80 GB with difference hypotheses:
Do you have an opinion on this @ThibaultPirson @blubrom @da-ekchajzer ? |
Corrections on gold quantity of DDR4 and PCIe connectorsLooking back at the Green Cloud Computing study hypotheses for gold quantity estimation on a DDR4 connector, I have noticed some issues both in the study and in our estimations of pin area above. Estimation of the gold area in GCC studyFrom this table, we have an estimation of the pin area (thus area of gold) on the connector of 760 mm2. I have not found the precise calculation in the study, but I believe it is done the following way:
Manual estimation of the pin size (2nd try)Using another data sheet from Samsung, we have the precise quotations of the pin dimensions. The pin width is still 0.60 mm, but the height ranges from 2.25 to 1.75 mm. We can repeat previous calculations to estimate the total area of pins.
Quantity of goldUsing the density of gold at 19.32 g/cm3 and with the hypothesis that the gold layer thickness is 2 µm, we can re-estimate the quantity of gold needed for a connector based on the total area of the pins.
Update on PCIeRecall pin size/area from the specs:
We can update accordingly the quantity of gold and impact factors:
|
🔴 Draft Corrections on die manufacturing impactsAnother feedback from UCL researchers is that our impact factor for die manufacturing is probably a lower band compared to other studies. A comparative analysis in Pirson et al. 2023 draw the following graphs: Primary Energy: (in red the current value we are using)Global Warming Potential: (in red the current value we are using)Data from imec.netzeroExample of data we can get from imec.netzero [public] database for a A100-like GPU die: Parameters:
|
Hello, Your approach looks promising. I have several questions, though:
Can you elaborate more on the details of these 2 factors, please? |
Hello @sbaudoin, Some quick answers below. Know that this is in pause for now, I have not had enough time to work on it recently.
If you are interested in the subject and a Boavizta contributor, I can link you to more resources in the internal wiki. In that case, just ping on Mattermost |
Problem
No GPU components are implemented. It makes it to impossible to evaluate high performance compute server. Some services cannot be assessed :
Solution
We need resources on scope 3 impacts of GPUs. Are GPUs manufacture impacts depend on their die size ?
Additional context or elements
Linked with this issue : Boavizta/environmental-footprint-data#50
The text was updated successfully, but these errors were encountered: