So far only the Kirsch mask part is done; human detection has not been added yet.
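For reference, here is a minimal sketch of a Kirsch edge operator in plain Python. It assumes a grayscale frame stored as a list of rows of numbers; the function names `kirsch_kernels` and `kirsch_magnitude` are my own placeholders, not from the reference paper. How the edge map feeds into the frame similarity formula is defined in the paper and is not reproduced here.

```python
def kirsch_kernels():
    """Build the 8 Kirsch compass kernels by rotating the border ring."""
    # Border positions of a 3x3 kernel, listed clockwise from the top-left.
    ring = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
    # Three 5s followed by five -3s; the centre weight is always 0.
    base = [5, 5, 5, -3, -3, -3, -3, -3]
    kernels = []
    for shift in range(8):
        kernel = [[0] * 3 for _ in range(3)]
        for i, (r, c) in enumerate(ring):
            kernel[r][c] = base[(i - shift) % 8]
        kernels.append(kernel)
    return kernels


def kirsch_magnitude(image):
    """Return the maximum response over the 8 kernels for each interior pixel.

    Border pixels are left at 0 (an assumption made to keep the sketch short).
    """
    height, width = len(image), len(image[0])
    kernels = kirsch_kernels()
    out = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            out[y][x] = max(
                sum(
                    k[dy][dx] * image[y + dy - 1][x + dx - 1]
                    for dy in range(3)
                    for dx in range(3)
                )
                for k in kernels
            )
    return out
```

A flat region gives a response of 0 because each kernel's weights sum to zero, while a sharp intensity step gives a large positive response in the kernel facing the step.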
I implemented the frame similarity formula from the reference paper.
The FCM (fuzzy c-means) method is used to decide the threshold.
However, the preliminary result is not very good.
The three lines in the plot represent the three FCM cluster centers.
We use the value of the middle (median) cluster center (red line) as the threshold.
The X axis is the similarity of two successive frames, and the Y axis is the frame index.
From observation, I guess that some noise at the end of the lecture video may be affecting the threshold computed by FCM.
Therefore, I'll try removing the noise frames at the end of the video data.
The following is the result after removing the noise frames (starting from the 2192nd frame).