Abstract
). One hundred and ninety-two full-length HCV genomes were amplified from plasma of incident infections and subjected to next generation sequencing to establish the largest cross-continental, full-length acute HCV genomic data set available to date. Genomes from the most common subtypes (1a: n=94, 2b: n=15 and 3a: n=68) were used in phylogenetic analysis. Using full genome trees, 78 sequences (44%) were found to lie within 29 phylogenetic clusters/pairs defined on the basis of molecular similarity of consensus sequences. Of these, 26 each had exclusively Australian or North American sequences indicating a strong geographical bias for molecular similarity. On further analysis of behavioural and demographic associations, binary logistic regression analysis showed that older age and non-Caucasian ethnicity were significantly associated with clustering. HCV probably evolves in micro-epidemics within geographically isolated communities.