We should be able to get the task pointer faster by storing it on the current stack segment instead of looking it up in TLS.