Has anyone tried benchmarking to see if processor optomisations like SSE2 make any difference to WinVNC?
I compile my own UltraVNC, and have played around with the compiler switches to optomise them for Pentium 4 SSE2, but I wondered if it makes any difference?? ( I also did ZLib and JPegLib, and some other critical stuff)
Is there a good way to benchmark this stuff?
Anyone want to test my VC++ 7.1 (.NET2003) compiled SSE2 Winvnc?
Update: UltraVNC 1.4.3.6 and UltraVNC SC 1.4.3.6: https://forum.uvnc.com/viewtopic.php?t=37885
Important: Please update to latest version before to create a reply, a topic or an issue: https://forum.uvnc.com/viewtopic.php?t=37864
Join us on social networks and share our announcements:
- Website: https://uvnc.com/
- GitHub: https://github.com/ultravnc
- Mastodon: https://mastodon.social/@ultravnc
- Facebook: https://www.facebook.com/ultravnc1
- X/Twitter: https://x.com/ultravnc1
- Reddit community: https://www.reddit.com/r/ultravnc
- OpenHub: https://openhub.net/p/ultravnc
Important: Please update to latest version before to create a reply, a topic or an issue: https://forum.uvnc.com/viewtopic.php?t=37864
Join us on social networks and share our announcements:
- Website: https://uvnc.com/
- GitHub: https://github.com/ultravnc
- Mastodon: https://mastodon.social/@ultravnc
- Facebook: https://www.facebook.com/ultravnc1
- X/Twitter: https://x.com/ultravnc1
- Reddit community: https://www.reddit.com/r/ultravnc
- OpenHub: https://openhub.net/p/ultravnc
Processor Optomisations...
-
- 8
- Posts: 17
- Joined: 2004-06-22 08:50
- Contact:
- Rudi De Vos
- Admin & Developer
- Posts: 6862
- Joined: 2004-04-23 10:21
- Contact:
Doesn't make a big difference (5%)...
Tested with intel compiler and cpu opts...
(jpeg is a difference, used in tight, but the we better use the
jpeg_mmx library instead of the normal.
mmx and sse acceleration require that certain functions
need to be rewritten ( 32 or 64 ) blocks.
Acceleration make a big difference when compression is rewritten in asm.
Did some test, and certain functions get a 1/8 "self time".
Tested with intel compiler and cpu opts...
(jpeg is a difference, used in tight, but the we better use the
jpeg_mmx library instead of the normal.
mmx and sse acceleration require that certain functions
need to be rewritten ( 32 or 64 ) blocks.
Acceleration make a big difference when compression is rewritten in asm.
Did some test, and certain functions get a 1/8 "self time".