Gene Cthe_1658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1658 
Symbol 
ID4808908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1984293 
End bp1985645 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content38% 
IMG OID640107073 
ProductTrkH family potassium uptake protein 
Protein accessionYP_001038074 
Protein GI125974164 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0168] Trk-type K+ transport systems, membrane components 
TIGRFAM ID[TIGR00933] potassium uptake protein, TrkH family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.552595 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAAAAT ATATAAGAAA ACTGAAATCA TTATCATACC CACAGATAAT AGCACTTGGC 
TACTTTCTGA TAATAATTGC CGGGACATTT TTGCTGTCAT TGCCGATAGC CAGCCGCAAT
AATTTATTTC CCGGATTAAT TAATGCACTT TTTACAGCAA CCTCTGCTAC TTGTGTAACA
GGGCTTGTTG TTTTTGACAC CTATACACAA TGGTCGCTAT TTGGACAAGT AGTTATTTTA
CTGCTGATTC AAATAGGTGG CTTAGGGTTT ATGACGGTTG TCACAATGTT TTCTATTTTT
TTAAAAAGAA AAATCGGGCT GAAAGAAAGG GGACTTTTAC GTGAAAGTAT AAATGCTATG
TATATCGGTG GGATTGTACG TCTGACCAAA AGAATATTGC TTGGCACTTT GCTGTTTGAA
GGTATTGGTG CAGTATTGCT ATCATTGAGG TTTATACCCA AAATGGGCTT GCTGGAAGGT
ATATATAATG GCATTTTTCA CTCGGTTTCA GCTTTTTGCA ATGCCGGGTT TGACATTATG
GGAAAGTACG GCAAGTATTC ATCGCTAACA AATTTCGCCG GGGATGCGGT TGTTAACTTA
ACAATTATTT CACTCATCAT TGTCGGAGGA ATAGGGTTCT TTGTATGGGA TGATATTGAG
AAAAATAAAT ATCATTTTAG AAAATATCAG CTACATACAA AAATAGTTTT AACGATGACC
GCAATTCTTA TAGTGTCAGG GACGATATGC TTTTATATTT TTGAAAGAAA TAACCTTCTT
TATGGGATGA CTACAGGAGA AAAAGTTTTA GTCTCTCTCT TTGGCGCAGT TACGCCCAGA
ACAGCAGGCT TCAATACTGT TGACGTTGCA TCATTAACCT CTGCAAGTAA ACTTCTAACT
ATTGTGTTGA TGTTTATAGG CGGCAGTCCC GGATCTACTG CCGGTGGGAT CAAGACTACT
ACTTTAGCGG TTATTATGAT TTCGCTGTGG TCAAGCTTGA AAAATAGGAA GGGTGATAAT
ATATTTGGCA GGAGACTGGA AGATAATGCA CTAAAGAGGT CGTCTGCCGT TGTGACAGTC
AATATACTTC TTATACTGAG TGCGGCTTTA CTTATTAGTG CTACAAATAA AGCTTTAGGA
CTTGACGCTG TTTTGTTTGA GGTTACATCT GCAATTGGTA CTGTTGGTCT TTCTACGGGA
ATCACAGGTG GTCTGAATAC CTTTGCAAAA ATAATTATCA TACTATTGAT GTATAGCGGC
AGAGTCGGAA GTCTTTCTTT TGCTTTGCTG TTTACAGAAC ATGGAGTGAC GTCATCTATA
CAGAATCCGG TGGAAAAAAT AAATATAGGA TAG
 
Protein sequence
MRKYIRKLKS LSYPQIIALG YFLIIIAGTF LLSLPIASRN NLFPGLINAL FTATSATCVT 
GLVVFDTYTQ WSLFGQVVIL LLIQIGGLGF MTVVTMFSIF LKRKIGLKER GLLRESINAM
YIGGIVRLTK RILLGTLLFE GIGAVLLSLR FIPKMGLLEG IYNGIFHSVS AFCNAGFDIM
GKYGKYSSLT NFAGDAVVNL TIISLIIVGG IGFFVWDDIE KNKYHFRKYQ LHTKIVLTMT
AILIVSGTIC FYIFERNNLL YGMTTGEKVL VSLFGAVTPR TAGFNTVDVA SLTSASKLLT
IVLMFIGGSP GSTAGGIKTT TLAVIMISLW SSLKNRKGDN IFGRRLEDNA LKRSSAVVTV
NILLILSAAL LISATNKALG LDAVLFEVTS AIGTVGLSTG ITGGLNTFAK IIIILLMYSG
RVGSLSFALL FTEHGVTSSI QNPVEKINIG