Gene EcSMS35_4230 details

Gene Information

Locus tag: EcSMS35_4230
Symbol: trkH
ID: 6145261
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4328191
End bp: 4329642
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 53%
IMG OID: 641619053
Product: potassium transporter
Protein accession: YP_001746181
Protein GI: 170679843
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0168] Trk-type K+ transport systems, membrane components
TIGRFAM ID: [TIGR00933] potassium uptake protein, TrkH family
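
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4328191
END_BP = 4329642
GENE_LENGTH_BP = 1452
PROTEIN_LENGTH_AA = 483


def span_length(start_bp, end_bp):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Number of codons in an unspliced CDS, minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1452
print(expected_protein_length(GENE_LENGTH_BP))  # 483
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.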


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000116067
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0142841
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTTC GCGCCATTAC CCGAATCGTT GGACTACTGG TCATCTTATT TTCAGGGACC 
ATGATTATCC CTGGGCTGGT AGCACTCATC TACCGGGATG GAGCGGGCCG CGCTTTTACC
CAGACCTTTT TTGTCGCCCT CGCCATTGGC TCTATGCTGT GGTGGCCGAA CCGCAAAGAG
AAGGGCGAAC TGAAATCCCG TGAGGGGTTT CTGATAGTGG TGCTGTTCTG GACCGTGCTG
GGTAGTGTCG GTGCGCTCCC TTTTATCTTC TCGGAAAGCC CGAACCTCAC GATTACCGAT
GCGTTTTTTG AATCTTTCTC TGGCCTGACC ACCACGGGAG CCACTACGCT GGTGGGGCTG
GATTCGCTCC CTCATGCCAT CCTCTTTTAT CGCCAGATGC TGCAATGGTT TGGCGGGATG
GGGATCATCG TGTTGGCGGT TGCGATACTG CCTATCCTCG GCGTGGGTGG GATGCAGCTC
TATCGCGCAG AAATGCCCGG CCCGCTGAAA GATAACAAAA TGCGCCCGCG AATTGCGGAA
ACGGCGAAAA CCCTGTGGTT GATTTATGTC TTGCTGACCG TCGCCTGTGC GCTGGCGTTG
TGGTTTGCTG GAATGGATGC CTTTGATGCC ATCGGCCATA GCTTTGCGAC TATCGCTATT
GGCGGCTTCT CGACACATGA TGCCAGTATC GGTTATTTCG ACAGCCCGAC TATTAACACT
ATCATTGCTA TCTTCCTGCT GATCTCTGGC TGTAACTACG GTCTGCACTT TTCACTGTTA
AGTGGGCGTA GTCTGAAGGT TTATTGGCGC GATCCGGAAT TTCGCATGTT TATCGGCGTA
CAGTTTTCAC TGGTGGTTAT TTGTACACTC GTACTGTGGT TTCATAATGT CTACAGTTCG
GCGCTGATGA CAATTAACCA GGCGTTTTTC CAGGTGGTGT CGATGGCGAC AACCGCCGGG
TTTACAACTG ACAGCATTGC CCGCTGGCCG CTCTTTTTGC CGGTACTGCT CTTATGTTCA
GCTTTTATCG GCGGTTGTGC CGGGTCAACG GGCGGTGGCC TGAAAGTGAT CCGCATCCTG
CTGCTGTTTA AGCAGGGGAA CCGTGAGCTG AAACGACTGG TGCATCCGAA CGCCGTCTAT
AGCATTAAGC TGGGGAATCG CGCACTGCCG GAACGTATCC TCGAAGCCGT GTGGGGATTT
TTCTCCGCCT ATGCATTGGT GTTTATTGTC AGTATGCTGG CGATTATCGC CACGGGCGTG
GATGACTTTT CTGCCTTTGC CTCGGTTGTT GCGACATTGA ATAACCTGGG GCCAGGGCTT
GGTGTGGTTG CTGATAACTT TACCAGTATG AACCCGGTGG CTAAATGGAT CCTGATTGCC
AACATGCTGT TTGGTCGTCT CGAGGTCTTT ACATTGCTGG TGCTCTTTAC CCCGACTTTC
TGGCGTGAAT GA
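
The GC content field can be recomputed from a gene sequence laid out in 10-base blocks like the one above. This is a minimal sketch that assumes GC content means the percentage of G and C bases over the full sequence; the helper name `gc_percent` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_percent(seq):
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base block of the gene sequence: 3 of its 10 bases are G or C.
print(gc_percent("ATGCATTTTC"))  # 30.0
```

Passing the complete gene sequence as one string (spaces and line breaks included) gives the whole-gene value to compare against the reported figure.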
 
Protein sequence
MHFRAITRIV GLLVILFSGT MIIPGLVALI YRDGAGRAFT QTFFVALAIG SMLWWPNRKE 
KGELKSREGF LIVVLFWTVL GSVGALPFIF SESPNLTITD AFFESFSGLT TTGATTLVGL
DSLPHAILFY RQMLQWFGGM GIIVLAVAIL PILGVGGMQL YRAEMPGPLK DNKMRPRIAE
TAKTLWLIYV LLTVACALAL WFAGMDAFDA IGHSFATIAI GGFSTHDASI GYFDSPTINT
IIAIFLLISG CNYGLHFSLL SGRSLKVYWR DPEFRMFIGV QFSLVVICTL VLWFHNVYSS
ALMTINQAFF QVVSMATTAG FTTDSIARWP LFLPVLLLCS AFIGGCAGST GGGLKVIRIL
LLFKQGNREL KRLVHPNAVY SIKLGNRALP ERILEAVWGF FSAYALVFIV SMLAIIATGV
DDFSAFASVV ATLNNLGPGL GVVADNFTSM NPVAKWILIA NMLFGRLEVF TLLVLFTPTF
WRE
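
The protein sequence can be re-derived from the gene sequence by codon translation. The sketch below builds the codon table from the standard NCBI amino-acid string; translation table 11 (bacterial) assigns the same amino acids and stop codons as the standard table and differs only in the alternative start codons it permits, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and is not taken from the source database.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGCATTTTC GCGCCATTAC CCGAATCGTT GGACTACTGG TCATCTTATT TTCAGGGACC"
last_line = "TGGCGTGAAT GA"

print(translate(first_line))  # MHFRAITRIVGLLVILFSGT
print(translate(last_line))   # WRE
```

Both outputs match the start and end of the protein sequence listed above; the final TGA codon is the stop and is not translated.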