Well, the logic is simple:
1. The modal purpose is to set a link.
2. You would love to write the URL in order to set the link.
3. You don’t have to click the URL input if it’s empty because it’s already autofocused.
Intuition supports eliminating user steps for executing the purpose of the modal.