The stedrr output is normal and showing reality time:
Sieve started: 58408512000000000 <= p < 58408518000000000
Thread 0 starting
Detected 32 multiprocessors (160 SPUs) on device 0.
Thread 0 completed
Sieve complete: 58408512000000000 <= p < 58408518000000000
Elapsed time: 32991.39 sec. (1.43 init + 32989.96 sieve) at 181880 p/sec.
Processor time: 1084.49 sec. (1.40 init + 1083.08 sieve) at 5539935 p/sec.
Average processor utilization: 0.99 (init), 0.03 (sieve)
04:43:30 (4944): called boinc_finish
It does however show a lower end 5xxx card ("Detected 32 multiprocessors (160 SPUs) on device 0."), and that together with the fact that the AMD application is on OpenCL, not any native ATI language means it will be markedly slower than the CUDA cards - fact of life with this application and current stage of development of OpenCL. Slower 5xxx + OpanCL is going to mean the timing is about right.
(Very much worth updating your 6.10.60 BOINC Client to the current 6.12.33 if multiple GPU on a card, and 6.12.34 if single GPU on a card. A lot of work was done between 6.10.XX and 6.12.XX on working with GPUs)
What is the actual card you are using ?