Attention-Driven CNN-LSTM Fusion for Robust Deepfake Detection in Digital Media