Temporal-consistent Segmentation of Echocardiography with Co-learning from Appearance and Shape