Gene ECH74115_5288 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5288 
SymboltrkH 
ID6971834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4932380 
End bp4933831 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID643388952 
Productpotassium transporter 
Protein accessionYP_002273366 
Protein GI209397794 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0168] Trk-type K+ transport systems, membrane components 
TIGRFAM ID[TIGR00933] potassium uptake protein, TrkH family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000758223 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.0471596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATTTTC GCGCCATTAC CCGAATCGTT GGACTACTGG TCATCTTATT TTCAGGGACC 
ATGATTATCC CTGGGCTGGT AGCACTCATC TACCGGGATG GAGCGGGCCG CGCTTTTACC
CAGACCTTTT TTGTCGCCCT CGCCATTGGC TCTATGCTGT GGTGGCCGAA CCGCAAAGAG
AAGGGCGAAC TGAAATCCCG TGAGGGGTTT CTGATAGTGG TGCTGTTCTG GACCGTGCTG
GGTAGTGTCG GTGCGCTCCC TTTTATCTTC TCGGAAAGCC CGAACCTCAC GATTACCGAT
GCGTTTTTTG AATCTTTCTC TGGCCTGACC ACCACCGGGG CCACTACGCT GGTGGGGCTG
GATTCGCTCC CTCATGCCAT CCTCTTTTAT CGCCAGATGC TGCAATGGTT TGGCGGGATG
GGGATCATCG TGTTAGCGGT TGCGATACTG CCTATCCTCG GCGTGGGTGG GATGCAGCTC
TATCGCGCAG AAATGCCCGG CCCGCTGAAA GATAACAAAA TGCGCCCGCG AATTGCGGAA
ACGGCGAAAA CCCTGTGGTT GATTTATGTC TTGCTGACCG TCGCCTGTGC GCTGGCGTTG
TGGTTTGCCG GAATGGATGC CTTTGATGCC ATCGGCCATA GCTTTGCGAC TATCGCTATT
GGCGGCTTCT CGACACATGA TGCCAGTATC GGTTATTTCG ATAGCCCGAC TATTAACACT
ATCATTGCTA TCTTCCTGCT GATCTCCGGC TGTAACTACG GTCTGCACTT TTCACTGTTA
AGTGGGCGTA GTCTGAAGGT TTATTGGCGC GATCCGGAAT TTCGCATGTT TATCGGCGTA
CAGTTTACGC TGGTGGTTAT TTGTACCCTC GTACTGTGGT TTCATAATGT CTACAGTTCG
GCGCTGATGA CAATTAACCA GGCGTTTTTC CAGGTGGTGT CGATGGCGAC AACCGCCGGG
TTTACGACTG ACAGCATTGC CCGCTGGCCG CTCTTTTTGC CGGTACTGCT TTTATGTTCA
GCTTTTATCG GCGGTTGTGC CGGGTCAACG GGCGGTGGCC TGAAAGTGAT CCGCATCCTG
CTGCTGTTTA AGCAGGGGAA CCGTGAACTG AAACGACTGG TGCATCCGAA CGCCGTCTAT
AGCATTAAGC TGGGGAATCG CGCACTGCCG GAACGTATCC TCGAAGCCGT TTGGGGATTT
TTCTCCGCCT ATGCATTGGT GTTTATTGTC AGTATGCTGG CGATTATCGC CACGGGCGTG
GATGACTTTT CTGCCTTTGC CTCGGTTGTT GCGACATTGA ATAACCTGGG GCCAGGGCTT
GGCGTGGTTG CTGATAACTT TACCAGTATG AACCCGGTGG CTAAATGGAT CCTGATTGCC
AACATGCTGT TTGGTCGTCT CGAGGTCTTT ACATTGCTGG TGCTCTTTAC CCCGACTTTC
TGGCGTGAAT GA
 
Protein sequence
MHFRAITRIV GLLVILFSGT MIIPGLVALI YRDGAGRAFT QTFFVALAIG SMLWWPNRKE 
KGELKSREGF LIVVLFWTVL GSVGALPFIF SESPNLTITD AFFESFSGLT TTGATTLVGL
DSLPHAILFY RQMLQWFGGM GIIVLAVAIL PILGVGGMQL YRAEMPGPLK DNKMRPRIAE
TAKTLWLIYV LLTVACALAL WFAGMDAFDA IGHSFATIAI GGFSTHDASI GYFDSPTINT
IIAIFLLISG CNYGLHFSLL SGRSLKVYWR DPEFRMFIGV QFTLVVICTL VLWFHNVYSS
ALMTINQAFF QVVSMATTAG FTTDSIARWP LFLPVLLLCS AFIGGCAGST GGGLKVIRIL
LLFKQGNREL KRLVHPNAVY SIKLGNRALP ERILEAVWGF FSAYALVFIV SMLAIIATGV
DDFSAFASVV ATLNNLGPGL GVVADNFTSM NPVAKWILIA NMLFGRLEVF TLLVLFTPTF
WRE