Author Topic: Very slow geometry phase 10x slower than old pc  (Read 901 times)

2024-03-11, 13:23:52

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
Hi, i have a new pc and the geometry phase is very very slow, mostly 10x slower than my old pc & and the scene parsing is mostly twice slower???

The new pc is an AMD EPYC 9754 (9004 Genoa) (singel cpu) on a Supermicro board with 384 GB DDR5 and the old pc is an AMD Ryzen 5900X with 128 GB DDR4.

I testted it with 20+ scenes all the same result, and is see the following times:
new pc(max 2021 and corona 11):
scene parsing 52 sec, scene geometry 97 sec, render time 12 minutes @ 850,000 samples/s actual average

old pc (max 2021 and corona 11)
scene parsing 30 sec, scene geometry 13 sec, render time 95 minutes @ 115,000 samples/s actual average

I also tested with the corona benchmark at 31 832 636 (9th place in the ranking) and with Vray benchmark 132950 (5 th place) so both very very fast, I am lost what to do?
« Last Edit: 2024-03-12, 08:42:35 by webuilddreams »

2024-03-11, 14:34:08
Reply #1

pokoy

  • Active Users
  • **
  • Posts: 1865
    • View Profile
As far as I know the info displayed is not actually only scene/geometry parsing but also other things like like loading of maps, so it might be that your new PC is slower at loading maps from disk or through network. These are also single threaded so you may have a mix of different reasons - single thread speed of your new CPU might be slower than of the CPU of the old machine, disk/network access speed, and your new CPU might suffer from under-utilization of CPU threads (where only half of the threads are working on a task etc).

I'd look into other means of benchmarking (not 3dsmax) to measure different tasks (CPU single threaded vs multi-threaded, RAM speed, disk/network I/O) and compare how each of the PCs performs. This will probably give you a better picture of what exactly is different between the two PCs.

2024-03-11, 14:48:53
Reply #2

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
Thanks Pokoy, we load the 3dsmax scene on both pc's local from de D SSD M.2 drives, so that is not an issue, the new pc has a single core speed of 3.08 Ghz and the old pc 4.5 Ghz, that could explane the scene parsing perhaps (but still the old pc single core isnt twice faster only 50%).

I benchmarked all different computer parts (memory , cpu ssd, etc etc) and on all benchmarks the new pc is much faster than the 4 year old cheap pc,

But thanks for thinking with me!

2024-03-11, 14:53:56
Reply #3

arqrenderz

  • Active Users
  • **
  • Posts: 996
  • https://www.behance.net/Arqrenderz1
    • View Profile
    • arqrenderz
The AMD EPYC™ 9754 goes to 3.1GHZ, and the 5900x turbo up to 4.8 GHZ, in my experience thats were that time is, the rendering is so much faster because its using all the cores, not all the parsing is multithreaded (and cant be )

2024-03-11, 17:12:44
Reply #4

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
thanks, i start to meassure this:  in the scene parsing corona does in fact uses only a few cores, but in the geometry phase corona uses arround 70% of the cores. The screenshot was taken in the geometry phase.

2024-04-12, 13:15:43
Reply #5

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
Hi all, we are a step closer to identifing the problem, the AMD Epyc 9754 has a whopping 128 cores and so 256 threads, in the render phase it is fast but in the geometrie phase it is a lot slower compared to our oldest pc (all the scenes have the same issue so it's not scene releated bort pc corone 11.1 and max 2021.3 and max 2024.2):

old pc (ryzen 5900x) win 11 pro (128 gb), geometry phase takes 15 secs
amd epyc 9754 win 11 pro(384 gb), geomtery phase takes 60 sec!!

The difference is in the Embree calculation time (see corona log file in C:\Users\USERNAME\AppData\Local\Autodesk\3dsMax\2021 - 64bit\ENU\en-US\plugcfg\corona) and this does not make any sence : this cpu is one of the fastets where it comes to calculating Embree (https://openbenchmarking.org/test/pts/embree)
So we tested this (the Embree benchmark from github) and we got a nice fast score arround 130 frames per sec: conclusion our pc is fine


So is it possible that one of the core componens of corona render is a bit old?
And does any one have the same problem?

2024-04-12, 15:51:33
Reply #6

arqrenderz

  • Active Users
  • **
  • Posts: 996
  • https://www.behance.net/Arqrenderz1
    • View Profile
    • arqrenderz
Really hoping one of the devs can check this out !

2024-04-12, 16:16:12
Reply #7

TomG

  • Administrator
  • Active Users
  • *****
  • Posts: 5468
    • View Profile
We are checking this out :) In fact we just took delivery of a new super high core/thread count dual EPYC machine this week so that we can better investigate any reported issues along these lines.
Tom Grimes | chaos-corona.com
Product Manager | contact us

2024-04-16, 11:45:45
Reply #8

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
We are checking this out :) In fact we just took delivery of a new super high core/thread count dual EPYC machine this week so that we can better investigate any reported issues along these lines.

Great, that would be great, it is also very frustrating to work in interactive rendering(IR): before i can make any change (for example the color materials or move an object) i have to wait until IR finish rendering (max passes) so we put max passes at 5 (btw the thread override IR is set to -16)
We still can use Photoshop and other programs, just max hangs (test this with HF11.1 and 2021.3 and 2024.2)

2024-04-16, 12:42:14
Reply #9

Juraj

  • Active Users
  • **
  • Posts: 4761
    • View Profile
    • studio website
I hope solution for your problem is found, but I will like to stress that if in some case you have non-retail stepping (ES/QS/etc..), the issue can be with on-chip memory controller which creates certain spiral of firmware/OS issues that can manifest under certain workloads (whether it's scheduler issues, CPU groups, NUMA nodes, etc..).

So if Corona team received fully retail unit (which in 2024 means all-in-one system for most Epyc stuff), they might not be able to replicate your issue.

It's bit risky proposition to build custom multi-CPU servers today, too many variables with zero support. Super hard to investigate and find what can be culprit.

Not saying that's the case, just be prepared.

Nice to know that Corona devs got the machine though.
Please follow my new Instagram for latest projects, tips&tricks, short video tutorials and free models
Behance  Probably best updated portfolio of my work
lysfaere.com Please check the new stuff!

2024-04-18, 13:24:26
Reply #10

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
I hope solution for your problem is found, but I will like to stress that if in some case you have non-retail stepping (ES/QS/etc..), the issue can be with on-chip memory controller which creates certain spiral of firmware/OS issues that can manifest under certain workloads (whether it's scheduler issues, CPU groups, NUMA nodes, etc..).

So if Corona team received fully retail unit (which in 2024 means all-in-one system for most Epyc stuff), they might not be able to replicate your issue.

It's bit risky proposition to build custom multi-CPU servers today, too many variables with zero support. Super hard to investigate and find what can be culprit.

Not saying that's the case, just be prepared.

Nice to know that Corona devs got the machine though.

Hi we have a single retail Epyc 9754 (so no Numa nodes), we ruled out that the processor (and the whole system) is the culprit:

the slow geometry phase is caused by a particular step 'Embree scene calculation'.

The epyc is in the calculation of the 'Embree scene building' step very slow, 10x slower than for example a ryzen 5900x (12 cores)!

So to rule out that the Epyc cpu is slower when it comes to Embree calculation we want to test this slow Embree scene calculation and we tested it with Embree benchmarked software : phoronix-test-suite benchmark embree, the result was :

AMD Epyc 9754                              : 127 frames/sec
 
Compared to
 
AMD Ryzen Threadripper Pro 7995WX  : 116 frames/sec
 


Conclusion the Epyc is a bit faster in Embree calculations compared to a 7995WX (96 cores) so our system is not the issue, could it be the Embree version inside corona ?( Corona using old embree version?)Intel Embree is currently at version 4.3 do you know what version currently is used in Corona 11.1?

« Last Edit: 2024-04-18, 13:33:59 by webuilddreams »

2024-04-18, 14:54:11
Reply #11

Juraj

  • Active Users
  • **
  • Posts: 4761
    • View Profile
    • studio website
That is not what I meant, there is nothing faulty with CPU. But that doesn't mean all software can utilize it in same way. How Embree can utilize the hardware depends on many aspects of software stack, from motherboard firmware to OS scheduler. If those things are written for retail SKU, tiny discrepancy can cause misutilization.

Years ago, one of such solutions was upgrading Visual Studio redistributable used for compiling. Just saying it isn't as easy as updating libraries often, though I do hope it is for this particular issue.
Please follow my new Instagram for latest projects, tips&tricks, short video tutorials and free models
Behance  Probably best updated portfolio of my work
lysfaere.com Please check the new stuff!

2024-04-18, 15:30:21
Reply #12

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
That is not what I meant, there is nothing faulty with CPU. But that doesn't mean all software can utilize it in same way. How Embree can utilize the hardware depends on many aspects of software stack, from motherboard firmware to OS scheduler. If those things are written for retail SKU, tiny discrepancy can cause misutilization.

Years ago, one of such solutions was upgrading Visual Studio redistributable used for compiling. Just saying it isn't as easy as updating libraries often, though I do hope it is for this particular issue.
Hi Juraj, thanks for your input, we or I am a bit frustrated , spend a small fortune and now this shit, hope the devs solves this soon.

2024-04-18, 15:57:43
Reply #13

Juraj

  • Active Users
  • **
  • Posts: 4761
    • View Profile
    • studio website
Yeah I can understand, but now that devs have latest high-end dual-Epyc, let's see what happens :- ). There are lot of after-market Epycs on market so that would be quite good for studios if their performance becomes bulletproof in Corona.
Please follow my new Instagram for latest projects, tips&tricks, short video tutorials and free models
Behance  Probably best updated portfolio of my work
lysfaere.com Please check the new stuff!

2024-04-25, 11:27:22
Reply #14

webuilddreams

  • Active Users
  • **
  • Posts: 23
    • View Profile
Any news or findings from the dev team?