although I don´t quite understand why it is neccesary to wait for the rising flange; wouldn´t be enought to just wait for the falling flange?
No, because if the polling starts while RC2 is already 1, you would miss the edge and shift the strobe
I suppossed you were using some kind of interrupt, but I see it is just a polling/loop.
With interrupts you wouldn’t be able to achieve the strict timings.
Btw.: interrupts should be disabled while the polling loop is processed.
I cannot find information in the datasheet about this, but I will keep up searching.
Just study the pictures 
There is a D FF between the input pin and the data bus. I believe, that these are in fact two FFs (common design rule to prevent metastabilities), because I measured a delay of 3 clock cycles. The third FF stage is in the CPU pipeline
I have modified siddump to extract the SID registers states from PSID files, and my aim is to play this states using the PIC; the project is a PIC-PSID-SID player.
So, you’ve an huge external memory which contains the SID register traces. I guess that preloading this memory will take very long… so, why not emulating a 6502 with the PIC, and process the player code (which is embedded in the .sid file) directly? You would need an external RAM (64k), so that the whole address space of a C64 is available, and a CPU emulation routine which processes the SID player code. Since this routine is normaly clocked at 50 Hz, you’ve 20 mS to decode the CPU instructions - this should be possible.
I would also propose to connect the SID pins directly to the PIC, and not via serial interface, because your application doesn’t require so many different external components like a MBSID. This would give you more CPU time for the emulation.
Best Regards, Thorsten.