Gene EcHS_A4072 details


Gene Information

Locus tag: EcHS_A4072
Symbol: trkH
ID: 5594754
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4063405
End bp: 4064856
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 53%
IMG OID: 640923175
Product: potassium transporter
Protein accession: YP_001460641
Protein GI: 157163323
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0168] Trk-type K+ transport systems, membrane components
TIGRFAM ID: [TIGR00933] potassium uptake protein, TrkH family
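
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4063405
end_bp = 4064856

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the final codon is assumed to be a stop codon,
# which does not contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1452 483
```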


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000000258715
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCATTTTC GCGCCATTAC CCGAATCGTT GGACTACTGG TCATCTTATT TTCAGGGACC 
ATGATTATCC CTGGGCTGGT AGCACTCATC TACCGGGATG GAGCGGGCCG CGCTTTTACC
CAGACCTTTT TTGTCGCCCT CGCCATTGGC TCTATGCTGT GGTGGCCGAA CCGCAAAGAG
AAGGGCGAAC TGAAATCCCG TGAGGGGTTT CTGATAGTGG TGCTGTTTTG GACCGTGCTG
GGTAGTGTCG GTGCGCTCCC TTTTATCTTC TCGGAAAGCC CGAACCTCAC GATTACCGAT
GCGTTTTTTG AATCTTTCTC TGGCCTGACC ACCACCGGGG CCACTACGCT GGTGGGGCTG
GATTCGCTCC CTCATGCCAT CCTCTTTTAT CGCCAGATGC TGCAATGGTT TGGCGGGATG
GGGATCATCG TGTTAGCGGT TGCGATACTG CCTATCCTCG GCGTGGGTGG GATGCAGCTC
TATCGCGCAG AAATGCCCGG CCCGCTGAAA GATAACAAAA TGCGCCCGCG AATTGCGGAA
ACGGCGAAAA CCCTGTGGTT GATTTATGTC TTGCTGACCG TCGCCTGTGC GCTGGCGTTG
TGGTTTGCCG GAATGGATGC CTTTGATGCC ATCGGCCATA GCTTTGCGAC TATCGCTATT
GGCGGCTTCT CGACACATGA TGCCAGTATC GGTTATTTCG ACAGCCCGAC TATTAACACT
ATCATTGCTA TCTTCCTGCT GATCTCCGGC TGTAACTACG GTCTGCACTT TTCACTGTTA
AGTGGGCGTA GTCTGAAGGT TTATTGGCGC GATCCGGAAT TTCGCATGTT TATCGGCGTA
CAGTTTACGC TGGTGGTTAT TTGTACACTC GTACTGTGGT TTCATAATGT CTACAGTTCG
GCGCTGATGA CAATTAACCA GGCGTTTTTC CAGGTGGTAT CGATGGCGAC AACCGCCGGG
TTTACGACTG ACAGCATTGC CCGCTGGCCG CTCTTTTTGC CGGTACTGCT TTTATGTTCA
GCTTTTATCG GCGGTTGTGC CGGGTCAACG GGCGGTGGCC TGAAAGTGAT CCGCATCCTG
CTGCTGTTTA AGCAGGGGAA CCGTGAGCTG AAACGACTGG TGCATCCGAA CGCCGTCTAT
AGCATTAAGC TGGGGAATCG CGCACTGCCG GAACGTATCC TCGAAGCCGT TTGGGGATTT
TTCTCCGCCT ATGCATTGGT GTTTATTGTC AGTATGCTGG CGATTATCGC CACGGGCGTG
GATGACTTTT CTGCTTTTGC CTCGGTTGTT GCGACATTGA ATAACCTGGG GCCGGGGCTT
GGCGTGGTTG CTGATAACTT TACCAGTATG AACCCGGTGG CTAAATGGAT CCTGATTGCC
AACATGCTGT TTGGTCGTCT CGAGGTCTTT ACATTGCTGG TGCTCTTTAC CCCGACTTTC
TGGCGTGAAT GA
 
Protein sequence
MHFRAITRIV GLLVILFSGT MIIPGLVALI YRDGAGRAFT QTFFVALAIG SMLWWPNRKE 
KGELKSREGF LIVVLFWTVL GSVGALPFIF SESPNLTITD AFFESFSGLT TTGATTLVGL
DSLPHAILFY RQMLQWFGGM GIIVLAVAIL PILGVGGMQL YRAEMPGPLK DNKMRPRIAE
TAKTLWLIYV LLTVACALAL WFAGMDAFDA IGHSFATIAI GGFSTHDASI GYFDSPTINT
IIAIFLLISG CNYGLHFSLL SGRSLKVYWR DPEFRMFIGV QFTLVVICTL VLWFHNVYSS
ALMTINQAFF QVVSMATTAG FTTDSIARWP LFLPVLLLCS AFIGGCAGST GGGLKVIRIL
LLFKQGNREL KRLVHPNAVY SIKLGNRALP ERILEAVWGF FSAYALVFIV SMLAIIATGV
DDFSAFASVV ATLNNLGPGL GVVADNFTSM NPVAKWILIA NMLFGRLEVF TLLVLFTPTF
WRE
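
The protein sequence can be related back to the gene sequence by translating codons. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI amino acid ordering, which translation table 11 shares with the standard code for codon-to-residue assignments (the two differ in permitted start codons). The names `translate`, `first_block`, and `last_block` are hypothetical, and only the first and last stretches of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    # Ignore any trailing bases that do not form a complete codon.
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 12 bases of the gene sequence above.
first_block = "ATGCATTTTC GCGCCATTAC CCGAATCGTT"
last_block = "TGGCGTGAAT GA"

print(translate(first_block))  # MHFRAITRIV
print(translate(last_block))   # WRE*
```

The first ten residues and the final `WRE` match the protein sequence listed above, with the trailing `*` corresponding to the terminal TGA stop codon.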