Text this: Video Super‐Resolution via Effective Spatio‐Temporal Alignment Network