Gene B21_03689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03689 
SymboltrkH 
ID8114933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3938036 
End bp3939487 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content53% 
IMG OID644849850 
Producthypothetical protein 
Protein accessionYP_003001423 
Protein GI251787119 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0168] Trk-type K+ transport systems, membrane components 
TIGRFAM ID[TIGR00933] potassium uptake protein, TrkH family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.025775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTTTC GCGCCATTAC CCGAATCGTT GGACTACTGG TCATCTTATT TTCAGGGACC 
ATGATTATCC CTGGGCTGGT AGCACTCATC TACCGGGATG GAGCGGGCCG CGCTTTTACC
CAGACCTTTT TTGTCGCCCT CGCCATTGGC TCTATGCTGT GGTGGCCGAA CCGCAAAGAG
AAAGGCGAAC TTAAATCCCG TGAGGGGTTT CTGATAGTGG TGCTGTTCTG GACCGTGCTG
GGTAGCGTCG GTGCGCTCCC TTTTATCTTC TCGGAAAGCC CGAACCTCAC GATTACCGAT
GCGTTTTTTG AATCTTTCTC TGGCCTGACC ACTACGGGAG CCACTACGCT GGTGGGGCTG
GATTCGCTCC CTCACGCCAT CCTCTTTTAT CGCCAGATGC TGCAATGGTT TGGCGGGATG
GGGATCATCG TGTTGGCGGT TGCGATACTG CCTATCCTCG GCGTGGGTGG GATGCAGCTC
TATCGCGCAG AAATGCCCGG CCCGCTGAAA GATAACAAAA TGCGCCCGCG AATTGCGGAA
ACGGCGAAAA CCCTGTGGTT GATTTATGTC TTGCTGACCG TCGCCTGTGC GCTGGCGTTG
TGGTTTGCCG GAATGGATGC CTTTGATGCC ATCGGCCATA GCTTTGCGAC TATCGCTATT
GGCGGCTTCT CGACACATGA TGCCAGTATC GGTTATTTCG ATAGCCCGAC TATTAACACT
ATCATTGCTA TCTTCCTGCT GATCTCCGGC TGTAACTACG GTCTGCACTT TTCACTGTTA
AGTGGGCGTA GTCTGAAGGT TTATTGGCGC GATCCGGAAT TTCGCATGTT TATCGGCGTA
CAGTTTACGC TGGTGGTTAT TTGTACCCTC GTACTGTGGT TTCATAATGT CTACAGTTCG
GCGCTGATGA CAATTAACCA GGCGTTTTTC CAGGTGGTGT CGATGGCGAC AACCGCCGGG
TTTACAACTG ACAGCATTGC CCGCTGGCCG CTCTTTTTGC CGGTACTGCT TTTATGTTCA
GCTTTTATCG GCGGTTGTGC CGGGTCAACG GGCGGTGGCC TGAAAGTGAT CCGCATCCTG
CTGCTGTTTA AGCAGGGGAA CCGTGAACTG AAACGACTGG TGCATCCGAA CGCCGTCTAT
AGCATTAAGC TGGGGCATCG CGCACTGCCG GAACGTATCC TCGAAGCCGT TTGGGGATTT
TTCTCCGCCT ATGCATTGGT GTTTATTGTC AGTATGCTGG CGATTATCGC CACGGGCGTG
GATGACTTTT CTGCCTTTGC CTCGGTTGTT GCGACATTGA ATAACCTGGG GCCAGGGCTT
GGCGTGGTTG CTGATAACTT TACCAGTATG AACCCGGTGG CTAAATGGAT CCTGATTGCC
AACATGCTGT TTGGTCGTCT CGAGGTCTTT ACATTGCTGG TGCTCTTTAC CCCGACTTTC
TGGCGTGAAT GA
 
Protein sequence
MHFRAITRIV GLLVILFSGT MIIPGLVALI YRDGAGRAFT QTFFVALAIG SMLWWPNRKE 
KGELKSREGF LIVVLFWTVL GSVGALPFIF SESPNLTITD AFFESFSGLT TTGATTLVGL
DSLPHAILFY RQMLQWFGGM GIIVLAVAIL PILGVGGMQL YRAEMPGPLK DNKMRPRIAE
TAKTLWLIYV LLTVACALAL WFAGMDAFDA IGHSFATIAI GGFSTHDASI GYFDSPTINT
IIAIFLLISG CNYGLHFSLL SGRSLKVYWR DPEFRMFIGV QFTLVVICTL VLWFHNVYSS
ALMTINQAFF QVVSMATTAG FTTDSIARWP LFLPVLLLCS AFIGGCAGST GGGLKVIRIL
LLFKQGNREL KRLVHPNAVY SIKLGHRALP ERILEAVWGF FSAYALVFIV SMLAIIATGV
DDFSAFASVV ATLNNLGPGL GVVADNFTSM NPVAKWILIA NMLFGRLEVF TLLVLFTPTF
WRE