# How do large language models predict the next word?