Gene EcHS_A3110 details


Gene Information

Locus tag: EcHS_A3110
Symbol:
ID: 5593907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3123018
End bp: 3123998
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 55%
IMG OID: 640922229
Product: twitching motility family protein
Protein accession: YP_001459729
Protein GI: 157162411
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT
TIGRFAM ID: [TIGR01420] pilus retraction protein PilT
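
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which the following minimal Python sketch illustrates. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; neither assumption is stated in the record itself. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3123018, 3123998)
    print(length, protein_length(length))  # prints: 981 326
```

Under these assumptions the computed values match the 981 bp and 326 aa reported in the record.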


Plasmid Coverage information

Num covering plasmid clones: 59
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATATGG AAGAAATTGT GGCCCTTAGT GTAAAGCATA ACGTCTCGGA TCTACACCTG 
TGCAGCGCCT GGCCCGCACG ATGGCGCATT CGCGGGCTAA TGGAAGCTGC GCCGTTTGAT
GCGCCGGACG TCGAAGAGCT ACTGCGGGAG TGGCTGGATG ACGATCAGCG GGCAATATTG
CTGGAAAATG GCCAGCTGGA TTTTGCCGTG TCGCTGGCGG AAAACCAGCG GTTGCGTGGC
AGTGCGTTCG CGCAACGGCA AGGTATTTCT CTGGCATTAC GGTTGTTACC TTCGCACTGT
CCACAGCTCG AACAGCTTGG TGCGCCACCG GTATTGCCGG AATTACTCAA GAGCGAGAAT
GGCATGATTC TGGTGACGGG GGCGACGGGG AGTGGCAAAT CTACCACGCT GGCGGCGATG
GTTGGCTATC TTAATCAACA TGCCGATGCG CATATTCTGA CGCTGGAAGA TCCTGTGGAA
TATCTCTATG CCAGCCAGCG ATGTTTGATC CAGCAGCGGG AAATTGGTTT GCACTGTATG
ACGTTCGCAT CGGGGTTGCG GGCCGCATTG CGGGAAGATC CTGATGTGAT TTTGCTCGGA
GAGCTACGTG ACAGTGAGAC AATCCGTCTG GCGCTGACGG CGGCAGAAAC CGGGCATCTG
GTGCTGGCAA CCTTACATAC ACGTGGTGCC GCGCAGGCAG TTGAGCGACT GGTGGATTCA
TTTCCGGCGC AGGAAAAAGA CCCCGTGCGT AATCAACTGG CAGGTAGTTT ACGGGCAGTG
CTGTCACAAA AGCTGGAAGT GGATAAACAG GAAGGACGCG TGGCGCTATT TGAATTGCTG
ATTAACACAC CCGCGGTGGG GAATTTGATT CGTGAAGGGA AAACCCACCA GTTACCGCAT
GTTATTCAAA CCGGGCAGCA GGTGGGGATG ATAACGTTTC AGCAGAGTTA TCAGCACCGG
GTGGGGGAAG GGCGTTTGTG A
 
Protein sequence
MNMEEIVALS VKHNVSDLHL CSAWPARWRI RGLMEAAPFD APDVEELLRE WLDDDQRAIL 
LENGQLDFAV SLAENQRLRG SAFAQRQGIS LALRLLPSHC PQLEQLGAPP VLPELLKSEN
GMILVTGATG SGKSTTLAAM VGYLNQHADA HILTLEDPVE YLYASQRCLI QQREIGLHCM
TFASGLRAAL REDPDVILLG ELRDSETIRL ALTAAETGHL VLATLHTRGA AQAVERLVDS
FPAQEKDPVR NQLAGSLRAV LSQKLEVDKQ EGRVALFELL INTPAVGNLI REGKTHQLPH
VIQTGQQVGM ITFQQSYQHR VGEGRL
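
The gene and protein sequences above are shown in blocks of ten characters, so they need to be cleaned before use. The sketch below is one possible way to strip the block spacing, translate the gene sequence, and compute GC content in Python. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `clean_sequence`, `translate`, and `gc_percent` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a cleaned DNA string codon by codon, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = clean_sequence("ATGAATATGG AAGAAATTGT GGCCCTTAGT")
    print(translate(first_blocks))  # prints: MNMEEIVALS
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first block of the protein sequence. Running the same functions over the full cleaned gene sequence would be the way to check the reported protein sequence and GC content.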