Upload the main video first. Preview uses a quick FFmpeg render. Final render uses the same settings.
Use up to 3 text layers. A text layer is automatically shown when it contains text.
Keep banners available, but separate from the main text workflow.