Adaptive Multilevel Feature Fusion and Dynamic Model Updating for Robust Visual Object Tracking