Nvidia

As we all know, a large number of GPUs were sent to the “gym” for a workout (cryptocurrency mining) some time ago. During that period, graphics card prices rose sharply. As the tide ebbed, GPU prices began to plummet, and some force still seems to be pushing them toward the bottom. Naturally, YDJSIR also picked up some cards retired from the gym at low tide to compare them.

As the DAG size grows, batch after batch of cards whose VRAM can no longer hold it is left without a workout and must step down. Of course, the retired cards covered here are not only those worn out by workouts. AI computing is also hard on graphics cards, right? It’s just that those cards work in a slightly better and more stable environment. Therefore, second-hand professional computing cards are also covered in this article. Newly purchased graphics cards are not.

Nvidia RTX 2080Ti (Huaqiang North powered water cooling, with 22G VRAM)

YDJSIR had been torn over which graphics card to buy for a long time. After spending a small fortune on the AMD Ryzen 9 5950X and the Asus TUF X570-PLUS WIFI, the originally planned purchase of the 4060Ti 16G and 3080Ti 20G obviously fell through. However, there is no turning back once the decision has been made, so one must persevere to the end. During this period, YDJSIR also considered the modded RTX 5000M 16G option (1600, said to use the official version of the core), but the seller admitted that even at that price the card would still be second-hand, and YDJSIR was not satisfied. In the end, YDJSIR chose the simplest and most straightforward solution: a 2080Ti 22G. On Xianyu (闲鱼), someone happened to be selling a water-cooled 2080Ti for a low 2400, so YDJSIR of course rushed to buy it. The seller said he had used the card for “alchemy” (model training) for half a year, until he replaced it with a 3090. And before that? Who knows where it came from. Whether it was previously used to mine artificial intelligence in a dust-free computer room or to mine some kind of coin in an inexplicable rendering farm, as long as it works in YDJSIR’s hands, it’s fine.

When YDJSIR got it, YDJSIR could really tell that it was full of the aura of alchemy. There was so much dust… The mounting was quite a modification job, with four screws on the diagonal, but it wasn’t unusable in the end. YDJSIR was slightly worried because the backplate screws were loose. The seller did not provide the bezel, and YDJSIR had to search for a long time before finding the right screws.

image-20231026105119823

Obviously, screw holes are provided for the rear half of the graphics card, but the seller felt that the memory and hot spot would only reach a maximum of 88°C anyway, so he didn’t bother fitting anything there. YDJSIR really doesn’t understand this, but screws are indeed missing, and YDJSIR really doesn’t want to disassemble this set of fasteners, so let’s leave it at that. The bottom fan is far away, so the problem should not be too big. During installation, the water cooling tubes turned out to be too short, so the fan could only be installed between the case and the radiator, which was a very tight stretch, because the card happens to sit in this motherboard’s second main PCIe x16 slot… The power supply is the ROG-THOR-850P. There are indeed ROG parts in this case, so YDJSIR’s first build roughly lives up to the label. TUF can barely be considered a small ROG, right?

image-20231026110116829

A combined stress test (CPU and GPU at the same time) is used. The chassis is a JONSBO D41, with a Scythe 12015 fan added to the front; the bottom and rear use 12025 fans taken from an EK AIO BASIC 240 water cooler. The CPU heatsink is a Thermalright PA120, not a PA120SE. The water cooling for the graphics card is from Cooler Master, model unknown. The room temperature is 16°C. The side panels are closed and the case sits in its usual spot for a dormitory computer, with about one or two spaces of clearance at the front, back, left and right. The JONSBO D41 case is really nice to look at!

image-20231026110434023

The following data was obtained after two hours of stress testing. 160W CPU + 250W GPU, not bad at all. Water cooling keeps the graphics card firmly under control and extremely stable. When the ambient temperature rises, though, the fans will struggle no matter how hard they spin. If this continues, it can really roast your legs.

image-20231026105853587

The GPU-Z results are as follows.

image-20231026110619476

However, it broke down within a year, and YDJSIR sadly sold it.

Nvidia Tesla M4 (Founders Edition with modified fans, with 4GB VRAM)

You can’t lose if you spend 200 RMB! Only 200 RMB! Free shipping by SF Express!

YDJSIR modified the cooling. Yes, it’s a fixed-speed fan with only positive and negative poles, blowing in one direction.

image-20230326210626267

On a PCIe 3.0 x4 link, with the dual small fans at 6000 rpm, the card reaches 65W under full load and the temperature climbs to 67°C (the room temperature is about 18°C). FurMark runs normally, but the performance is still too low. Just treat it as a pure CUDA card. There is essentially no difference from integrated graphics… at most it is twice as fast, with its own video memory. It is also possible that the 8G memory graphics card simply cannot run at all?

image-20230326210826368

image-20230326210849935

Benchmark results can be retrieved here.

https://oss.ydjsir.com.cn/GitPages/X79andP106/tsresult-m4.3dmark-result

Follow-up: YDJSIR sold this card for 130 RMB with SF Express shipping included. The buyer seems to be a researcher at the Institute of Geographic Sciences and Natural Resources Research of the Chinese Academy of Sciences. He said he wanted to use the graphics card to edit videos of simulation software. YDJSIR couldn’t figure out what use this broken card could be. But since he said it would work, it must work.

Later, YDJSIR also sold him a Samsung PM953 960GB SSD for 168 RMB.

Nvidia Tesla K20 (Huaqiang North powered modification to Titan, with 5GB VRAM)

image-20230326211316656

At a price of 233.33 RMB, if you want performance, you can’t go wrong with the AMD RX580 8G. The AMD RX580 8G is probably the GPU with the biggest price difference between cards. After all, this thing is obviously ridiculous. But this is the Titan of modding! The price of the original Titan tripled. Thanks to the cocoon, YDJSIR has at least played with the reference Titan shroud and experienced the performance of the 1065. This also belongs to YDJSIR’s cracky series of builds, the “not unusable” series. Once this thing is paired with a laptop CPU, there is nothing serious about the whole machine.

In the end, this thing would not light up. The stupid case damaged the PCIe slot a bit, so that is one more collector’s item.

Nvidia Tesla P40 (Founders Edition with turbo fans attached outside, with 24GB VRAM)

Except for the lack of half-precision support, the AI computing power is the same as the 3060 12G, but the video memory is doubled, and it’s cheap! Cheap! Cheap!

There are probably some problems with the QVYE’s UHD750 (the QVYE is an engineering sample of the Intel Core i9-11900), so YDJSIR had to use the card together with the K620 in Grid mode, following https://blog.csdn.net/weixin_44503976/article/details/127942918 . YDJSIR did not overclock the card but used the default settings with an output card like the K620, which does add a lot of latency and is not friendly to high resolutions. If only YDJSIR had used a QV1K or a better-quality QVYE. The card is very quiet with its centrifugal turbo cooling, staying below 75℃ at full load with an indoor temperature of around 18℃.

Nvidia Tesla M40 (Founders Edition with modified fans, with 12GB VRAM)

In the 50HX group, watching group members struggle with the X99 and P40 24G scared YDJSIR so much that he didn’t dare to get the picky 24G version, and instead bought a modded M40 12G. It uses a dual-fan 1080Ti cooler from XFX, which is very quiet. It’s really not noisy even at a full load of 200W. YDJSIR bought it on Xianyu (闲鱼), so let’s just use the seller’s picture as a placeholder.

image-20230307000320505

image-20230304095404360

To be honest, YDJSIR didn’t know that such a legacy GPU could even support Resizable BAR… But YDJSIR didn’t expect much from it anyway.

At this point, YDJSIR has to complain about the outrageous Gigabyte Z490UD. For a Z-series board, it doesn’t have a Type-C port. There’s no 4K60 output (only one HDMI, up to 4K30). These are minor issues that can be tolerated for the time being. But after updating to the latest BIOS (F21), YDJSIR found that the BIOS option for selecting which port outputs video had disappeared. As a result, even if YDJSIR gives up 4K60 and settles for rendering on the M40 with output through the iGPU, the screen goes black as soon as the iGPU driver loads. YDJSIR has cleared the CMOS countless times; after all, this board has been bricked more than once. It doesn’t work! It doesn’t work! It doesn’t work!

Below is a screenshot from the manual.

image-20230306223321225

Here’s an image after clearing CMOS.

image-20230306223346862

In fact, there are indeed some minor problems with the QVYE’s UHD750, such as the North Bridge inexplicably dropping the frequency of high-frequency memory when the UHD750 is active. But it’s not unusable. Since there are other solutions, it’s better to turn off the iGPU. Okay, the iGPU is disabled, so let’s use the AMD card for output. That’s right, YDJSIR is talking about YDJSIR’s Fatcat 56. This method is indeed good if you only want to use the graphics card to run CUDA, but it falls a bit short on graphics rendering power and is not very suitable for playing games. YDJSIR has been at this for so long. YDJSIR tried to follow the method below, which is supposed to let an AMD card coexist with the special “professional” cards of the P10X series for rendering games, but it does not seem to have worked. For example, at the step where the black card is disabled in Device Manager, YDJSIR was stuck with a black screen and could not get out of it. This is a loss. Although it was unsuccessful, the tutorial is still included below for reference.

https://www.bilibili.com/video/BV1Nd4y1277n/

Undaunted, YDJSIR finally decided to splurge and spent almost 200 RMB on an Nvidia Quadro K620, which is recognized as being useful in this scenario. The K420 is a lost cause anyway: it also has both DP and DVI, which should theoretically make it just as useful, but for some reason no one recommends it. Why didn’t YDJSIR go for the solution that pairs a gaming card with the same modified driver?

Thanks to the superb compatibility Nvidia left in the T4 driver, using the gaming driver may make the recording function work much better, and it also provides better support for some new games. It seems the famous expert from “图拉丁吧” (the Tualatin forum on Baidu Tieba) is still needed to produce a magical driver that lets a Tesla, a special “professional” card, a modded mobile card and a normal gaming card, and even specific professional cards from different architectures, coexist.

Gaming card solutions require at least an Nvidia gaming GPU equal to or better than a GTX1050 to work well. (However, YDJSIR found out that he was being naïve, as using a modded card is not out of the question.) In fact, the idea behind these solutions is often to get a large amount of video memory specifically for AI applications. The new features alone would justify using a newer modded card. After all, new things are new things. Even a 2060M is not weaker than a P40/1080Ti, and it also eliminates the latency caused by splitting rendering and output. But no matter what, these modded cards are not cheap, which goes against YDJSIR’s idea of entrusting the main work to the M40. So just buy the amazing K620. For a detailed tutorial, see the videos below. YDJSIR went with installing the Quadro driver and then modifying the driver, since the driver version obtained this way is higher. The rest of the modifications, to the registry and the high-performance settings, are exactly as described in the videos.

https://www.bilibili.com/video/BV1d3411a7x8/

https://www.bilibili.com/video/BV1c24y1m7eH/

In fact, the K600 costs almost half as much, even though it is only a year older, and the performance gap between the two cards is not that big apart from the VRAM. Put the other way, the K620 costs almost twice as much. The reason? The K600 only has 1GB of video memory, which is probably not enough even for output. In YDJSIR’s actual use, even the K620’s video memory usage exceeds 1GB on a daily basis. There are always some applications that insist on using the output card.

Let’s take a look at 3D Mark. Compare it to the 10700K+ almighty 1060Ti. YDJSIR’s solution uses DX12 to outperform the latter (the graphics card part), but it can’t beat it under DX11. What’s going on? Theoretically, the 980Ti should actually be able to compete with the 1660.

What YDJSIR didn’t expect was that the ancient M40 actually performed better in Time Spy, a DX12 test. What’s going on?

image-20230306234729153

image-20230306234111524

image-20230306234612046

image-20230306234914879

In terms of usage, there is no difference between AI and gaming applications. Horizon 4 at high settings locked at 60 fps is no problem. Horizon 5 at medium settings locked at 60 fps is also fine. It’s not like you can’t play it anyway. The resolution is 1080p. YDJSIR gives the M40 PCIe 3.0 x16, while the K600 gets PCIe 3.0 x4, and as we all know, the K600 only supports PCIe 2.0. As a result, its bandwidth is very limited, leaving only PCIe 2.0 x4, which is a huge disadvantage at high resolutions. Although YDJSIR has a 4K60 screen, it is not unusable when scaled to 1080p, right? A 27-inch 1080p picture is indeed a bit too big. Running the PCIe bandwidth test in 3DMark, YDJSIR burst out laughing. One moment it was 40GB/s, the next it was 6GB/s. The speed difference between the two cards was too great, the ghosting was extremely bad, and the frame rate also fluctuated, making it basically unusable.
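To put rough numbers on that bottleneck, here is a minimal Python sketch, not a measurement. It assumes the publicly documented PCIe signalling rates (5 GT/s with 8b/10b encoding for PCIe 2.0, 8 GT/s with 128b/130b encoding for PCIe 3.0) and assumes that every rendered frame crosses the bus to the output card uncompressed at 4 bytes per pixel. The function names are made up for illustration.

```python
# Per-lane raw rate in GT/s and encoding efficiency for each PCIe generation.
PCIE_GEN = {
    2: (5.0, 8 / 10),      # PCIe 2.0: 5 GT/s, 8b/10b encoding
    3: (8.0, 128 / 130),   # PCIe 3.0: 8 GT/s, 128b/130b encoding
}


def link_bandwidth_gbs(gen, lanes):
    """Theoretical one-way bandwidth of a PCIe link in GB/s (10^9 bytes)."""
    rate_gt, efficiency = PCIE_GEN[gen]
    return rate_gt * efficiency * lanes / 8  # 8 bits per byte


def frame_stream_gbs(width, height, fps, bytes_per_pixel=4):
    """Bandwidth needed to copy uncompressed frames across the bus, in GB/s."""
    return width * height * bytes_per_pixel * fps / 1e9


if __name__ == "__main__":
    print(f"PCIe 3.0 x16: {link_bandwidth_gbs(3, 16):.2f} GB/s")
    print(f"PCIe 2.0 x4 : {link_bandwidth_gbs(2, 4):.2f} GB/s")
    print(f"1080p60 copy: {frame_stream_gbs(1920, 1080, 60):.2f} GB/s")
    print(f"4K60 copy   : {frame_stream_gbs(3840, 2160, 60):.2f} GB/s")
```

Under these assumptions, a 4K60 stream needs about 1.99 GB/s, which sits right at the 2.0 GB/s theoretical ceiling of a PCIe 2.0 x4 link, while 1080p60 needs only about 0.5 GB/s. That is at least consistent with 1080p being usable and higher resolutions not.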

However, when it comes to VR, this problem is solved extremely well. The setup can drive a Quest 1 without any pressure. After all, for VR the M40 just does the calculation in the background, and you don’t need to consider the feelings of the K620 at all. This is a win.

image-20230306235106743

As for AI applications? There are already so many tests on Bilibili that YDJSIR doesn’t want to test it again. The 12G version and the 24G version only differ in capacity, and in this respect both are still more widely applicable than most gaming cards on the market. Slow, but not unusable. YDJSIR has not yet tried a model that cannot run in 12G, so the difference is not very noticeable, while the larger version increases power consumption. However, with the development of SD, 24G will probably become a necessity.
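As a rough way to reason about which models fit in 12G or 24G, here is a back-of-the-envelope Python sketch. It only counts the model weights and assumes full 32-bit precision (4 bytes per parameter), on the assumption that half precision is not practical on these older cards. Activations, framework overhead and image size are ignored, so real usage is higher. The function names and the parameter count in the example are hypothetical.

```python
GIB = 1024 ** 3  # bytes in one GiB


def weights_gib(n_params, bytes_per_param=4):
    """Memory taken by the weights alone, in GiB (FP32 by default)."""
    return n_params * bytes_per_param / GIB


def max_params(vram_gib, bytes_per_param=4):
    """Largest parameter count whose weights alone fit in the given VRAM."""
    return int(vram_gib * GIB // bytes_per_param)


if __name__ == "__main__":
    example = 1_000_000_000  # hypothetical 1-billion-parameter model
    print(f"{example:,} params in FP32: {weights_gib(example):.2f} GiB")
    for vram in (12, 24):
        print(f"{vram} GiB holds at most {max_params(vram):,} FP32 params (weights only)")
```

The point of the sketch is only that doubling the VRAM doubles the weight budget; whether a particular model actually runs still depends on everything the estimate leaves out.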

So what’s the performance of the K620? Who cares about the K620? The K620 is still a good GPU card that can handle UHD content. The UHD 770 can’t even handle it, so what’s there to say?

image-20230306233826298

Nvidia CMP 50HX (MSI turbo fan card at dual PCIE slots, with 10GB VRAM)

Unless you are running integer arithmetic and double-precision floating-point, never select 50HX!!!

YDJSIR spent 700 RMB on this piece of junk just to be the first to review it, so he basically lost money. As of November 5, 2022, a 50HX costs around 550 RMB with shipping, mainly for cards from MSI and Gigabyte. YDJSIR bought the MSI version because it’s so beautiful! It has the same look as the MSI 2080Ti Turbo.

1 2 3
image-20221105193733507 image-20221105193810244 image-20221105193837492

Even the PCB is the same, although some components are missing.

image-20221105194135009

The owner of the P106 forum (Baidu Tieba) released driver version 526 with support for the entire series of mining cards. However, modding the driver for the 50HX is also useless. Below are two GPU-Z images. Unlike the 40HX, which can be brought up to PCIe x16 by adding the missing capacitors, the 50HX can only be brought up to x8. It is said that a resistor is missing. But given its extremely weak single-precision floating point, what difference does it make whether the capacitors are added or not?

Normal Driver Modified Driver
image-20221105194247112 image-20221105194227794

Let’s take a look at its miserable single-precision floating-point performance.

image-20221105194403326

Single precision floating-point and double precision floating-point are identical. Meanwhile, this card has a static power consumption of 68W, and the video encoding/decoding engine is always at full capacity. Some speculate that this is because the video encoding engine consumes single precision floating-point, which seems somewhat reasonable. What does 400GFLOPS of single precision floating-point mean? Take a look at the numbers for the GT720 and GT730 below. With such a trivial single precision floating-point, what value does modding it bring?

GT720 GT730
image-20221105194830077 image-20221105194946179
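For scale, a card’s theoretical single-precision peak is usually estimated as CUDA cores × 2 (one fused multiply-add counts as two operations) × clock. The Python sketch below applies that formula to the reference RTX 2080 Ti, whose look and PCB the 50HX shares. The 4352 cores and 1545 MHz boost clock are Nvidia’s published reference figures, not numbers from this article, and the 400 GFLOPS is the figure quoted above. The function name is made up for illustration.

```python
def peak_fp32_gflops(cuda_cores, clock_mhz):
    """Theoretical FP32 peak: cores * 2 FLOPs per FMA * clock (MHz) / 1000."""
    return cuda_cores * 2 * clock_mhz / 1000


# Assumption: public reference specs of the RTX 2080 Ti (not from the article).
reference_2080ti = peak_fp32_gflops(4352, 1545)

# Single-precision figure for the 50HX as quoted in the text.
measured_50hx = 400.0

if __name__ == "__main__":
    print(f"Reference 2080 Ti peak: {reference_2080ti:.0f} GFLOPS")
    print(f"50HX figure           : {measured_50hx:.0f} GFLOPS")
    print(f"Ratio                 : {reference_2080ti / measured_50hx:.1f}x")
```

Under these assumptions the reference 2080 Ti peaks at roughly 13.4 TFLOPS, more than thirty times the 50HX figure. The 50HX’s own core count and clocks may differ, so this is only an order-of-magnitude comparison.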

This card has high static power consumption, but when running CUDA (everything mentioned below is about rendering) it actually draws only 120W, and the fan noise is bearable at around 18°C room temperature. Of course, when performing integer operations, it can really take off like a turbine engine. YDJSIR usually keeps it disabled in Device Manager and only enables it when needed.

You can just install CUDA 11.8 directly. The CUDA installation package can include the driver as well. Forget about Geforce Experience; it’s of no use in this scenario.

image-20221105195255462

image-20221105200716053
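After installing, one way to confirm that the driver sees the card is to query `nvidia-smi`, which ships with the Nvidia driver. The Python sketch below is only an illustration: it assumes `nvidia-smi` is on the PATH, and the helper names are made up. The sample line in the docstring is an invented example of the CSV format, not real output from the 50HX.

```python
import shutil
import subprocess

# Ask nvidia-smi for name, total memory (MiB) and driver version as bare CSV.
QUERY = [
    "nvidia-smi",
    "--query-gpu=name,memory.total,driver_version",
    "--format=csv,noheader,nounits",
]


def parse_gpu_line(line):
    """Split one CSV line such as 'Some GPU, 10240, 526.00' into a dict.

    Assumes the GPU name itself contains no comma.
    """
    name, mem_mib, driver = [field.strip() for field in line.split(",")]
    return {"name": name, "memory_mib": int(mem_mib), "driver": driver}


def list_gpus():
    """Return one dict per GPU, or an empty list if nvidia-smi is unusable."""
    if shutil.which("nvidia-smi") is None:
        return []
    try:
        result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    except subprocess.CalledProcessError:
        return []
    return [parse_gpu_line(l) for l in result.stdout.splitlines() if l.strip()]


if __name__ == "__main__":
    gpus = list_gpus()
    if not gpus:
        print("No Nvidia GPU visible (or nvidia-smi not found).")
    for gpu in gpus:
        print(f"{gpu['name']}: {gpu['memory_mib']} MiB, driver {gpu['driver']}")
```

If the card is disabled in Device Manager, as described above, it would presumably not show up in this list until it is enabled again.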

Running Stable Diffusion has its value; the speed is comparable to the 1650S. The most crucial aspect is that this thing has enough VRAM, allowing for larger images and bigger batch sizes. YDJSIR enjoys it quite a bit. Of course, YDJSIR is just in it for the fun, as learning the incantations for magic is still necessary.

Here are a few images as examples. It’s evident that they can also be processed with super-resolution tools.

image-20221105201508185

image-20221105233128900

image-20221105201910624

YDJSIR still feels that the details from this thing are quite absurd, but the general direction is about right. On the other hand, when it comes to drawing people without focusing on the hands, it genuinely feels quite good.

At present, it can only generate small images to be placed in corners or reduce the resolution to play a supporting role; it still can’t take on a leading role.

The bandwidth and single-precision performance are quite poor, which limits the speed. It requires a lot of patience, or just letting it run continuously. It does add some fun, though.

Of course, YDJSIR has also tried running it on the XPS with the 1650. That one is even slower and less interesting, so YDJSIR won’t mention it.

Nvidia P106-090 (Zotac with single fan, with 3G VRAM)

This card was acquired during the last downturn. At that time, the P106-100 was still over 500, while the P106-90 had already dropped to 98.

From this experience, YDJSIR has drawn the following conclusions:

  1. Running power cables along the back is extremely elegant.
  2. M2 NVMe drives using PCIe may cause boot checks to fail.
  3. Your VGA cable might be faulty (a huge disappointment).
  4. The P106-90 is indeed a bit lacking… it’s not recommended to buy without an integrated graphics platform.
  5. Building a system with a half-white and a pure white component is very painful.
  6. It’s best not to attempt things that aren’t fully hands-on… (due to differences in perspective and the difficulties of solving problems from a distance).

The following configuration list has been modified according to YDJSIR’s preferences and is not the actual build version!

Purpose: Long-term stable home server + light gaming (focusing on RTS and Minecraft) + deep learning and media rendering.

The prices listed below are from around the summer of 2020.


| Item | Details | Price | Notes |
| --- | --- | --- | --- |
| Power Supply | Second-hand 700W dual-card power supply (preferably from Great Wall or XinGu, modular is best) | ¥150 | Market price on Xianyu |
| Case | EATX case | ¥150 | Taobao |
| Motherboard | Huainanzhi X79 Flame God | ¥900 | Link (Huainanzhi official store) |
| CPU | E5 2650 v2 | Included above | |
| CPU Cooler | Arctic A600 | Included above | |
| RAM | Samsung RECC DDR3 8G 1600MHz | ¥85 × 4 = ¥340 | Link |
| Dummy GPU | Brand-name 1050 2G | ¥360 | Market price on Xianyu |
| CUDA Card | Relatively new orange P106-90 | ¥100 | Market price on Xianyu |
| Wi-Fi Card | + Wi-Fi (7) Intel 3600AC-PCIE: Integrated Bluetooth Card | ¥48 | Link (Recommended by YDJSIR!) |
| SSD | SM961-256G-MLC | ¥319 | Link |
| Monitor | Not included in budget considerations | ¥0 | |
| Keyboard & Mouse | Not included in budget considerations | ¥0 | |
| HDD | Random quantity, up to 2 | ¥120 | |
| Stickers for Aesthetics | NY Fashion Line | ¥4 | For CPU, GPU, and system, etc. |
| Case Fans | RGB fans 12cm | ¥10 × 3 | It’s recommended to confirm the cooling situation before adding fans. |
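Adding up the list above gives a rough idea of the budget. The Python sketch below simply sums the prices as listed. It assumes the ¥120 for HDDs is a total rather than a per-drive price, and it folds the CPU and CPU cooler into the motherboard price because they are listed as included.

```python
# (item, price in RMB) taken from the configuration list above.
parts = [
    ("Power Supply", 150),
    ("Case", 150),
    ("Motherboard + CPU + CPU Cooler", 900),  # CPU and cooler are bundled
    ("RAM", 85 * 4),
    ("Dummy GPU", 360),
    ("CUDA Card", 100),
    ("Wi-Fi Card", 48),
    ("SSD", 319),
    ("Monitor", 0),
    ("Keyboard & Mouse", 0),
    ("HDD", 120),  # assumption: total for the drives, not per drive
    ("Stickers for Aesthetics", 4),
    ("Case Fans", 10 * 3),
]

total = sum(price for _, price in parts)

if __name__ == "__main__":
    for item, price in parts:
        print(f"{item:<32} {price:>5} RMB")
    print(f"{'Total':<32} {total:>5} RMB")
```

With those assumptions the list comes to 2521 RMB, of which the P106-90 itself is only 100 RMB.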

image-20200725233911443

image-20200725234223457

The actual rendering performance of the P106 is similar to that of the GTX1050Ti. Its owner later got a proper GTX1050Ti as a dummy GPU. Considering the card isn’t too expensive, that’s how it ended up.

AMD

The AMD cards YDJSIR bought, whether second-hand or brand new, are just normal GPUs with no significant modifications. Perhaps when AMD’s compute cards become more powerful in the future, YDJSIR will consider getting one to play around with.