Gene Cthe_1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1654 
Symbol 
ID4808904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1980282 
End bp1982201 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content37% 
IMG OID640107069 
Producthypothetical protein 
Protein accessionYP_001038070 
Protein GI125974160 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGCG TCAGCTCGTT CAGAGATGTT ATTGCAAATA TGTATTACAA TGATCTCTTT 
GACGAATTGT CTGAATATAT AGAGGACAAC CCGGATAAGC TTGAATCCAA CTCATACCAT
GTGCAATCAC CGGATGAAGC AGCATTATCT GATTTTGACA TCATAACGAT AGATATAACC
GACTCGCCAG GTAACAGTAT TTTATTTGAC GTAATTGTTT CTGCCGAAAT TGAAATTGCA
GAAACAGTAC GAAGGAATCG CGAGACTGAT GGTATAGAAC AATGGTTCCG TATCTCCTGC
AGAGCTGACC TTGATGACGG AATTCAGAAT TTTCAAATCA ACTCTGTTTC AATATACAAC
AAGTACAGAG AAAGCAAATT AGGCAGACTG TCTGAGTATT TAGTACCAAT TATAGAAAAG
GAACAGTTTG ACAATGTTGC TACTGAATTT CTAAATGAGT TTTGCCCAGA AGCATTAAGT
ACTCCTATGC CCATTCCAGT AGATGAAGTA GTGAAAAGAA TGGGGCTTAA GGTTAAGGAA
ATCCAGCTTA CAAAGCATTT CACTATATTT GGTCAAATAG TCTTTGGCGA TTGCACAATA
GAGTATTACG ACAGAAATGA AAGAACATAT AAGCCTTTGG AAGTTTCAAG AGGAACAATT
CTCGTGGATC CTAATGTGTA TTTCATGCGA AACATAGGGT GCATGAACAA TACCATTATT
CATGAGTGTG TCCACTGGTA TAAGCATAGA AAATACCATG AGTTAGTTAA GACGTATAAC
AGCGATGCTT TGCTCATAAG CTGCAGGGTA AACGAAACAA CTAAATACAA ACAGCAATGG
ACGCCAGAAG ACTGGATGGA ATGGCATGCT AACGGAATTG CACCACGAAT CCTTATGCCT
AGATCAATGA CCATTAAAAA GATTGAGGAG CTAATTAAAA AGAATGAGCT CCTTTTTGGT
ACTTACGACA GGCTAAATAT AATGGAAAAT GTCGTGTATG AATTAGCTGA CTTCTTCCAG
GTGTCAAGGA TAGCGGCCAA AATAAGGATG CTTGACCTTG GATATAAGGA AGTTGAAGGT
GTATATACCT ACGTAGATGA CCATTTTATC AGCAATTATT CATTTAAGGC AGACTCATTA
CATAAGAATC AAACATACAG TATTAGCCTA AGTGATTCTT TTTTTGAATA CTATGCAAAT
CCGGAATTCG CAAAGATTAT AGACAGCGGT AATTTTATTT ATGTTGATGG TCATTACGTT
ATTAACGACT CCAAATACAT TAAAAAGTTA GAAAATGGAA GCATTGATCT TACAGACTAT
GCAAAACTGC ATGTAGATGA ATGCTGCCTT CTGTTTGATT TAAAATTAAA TAAAGCCTCA
AAAATGGACA TTGTAGTATA CCTCGATTCT ATAATGTTCC GTAAAGCTAC ACCGGATTAT
AACAGAGTGC CGACATTTAA TCCGGACAAG CATAATATGG AAGTATTTAA TCGTTCAGAA
GAGCTAAAGA AGTTTCACGA AGAATTCGTC GAAGAAGGTC AGCATTTGAG CCGTACAACC
CAGACATTTT CCCAAGCGGT ATACGGACAT ATCAAAAGGA AAGGCTACAA TAAGGTTGTT
TTTATAGAAA AGACTTTGCT TTCAGGAAAA ACATATGACA GAATAAAAAA CAATGAACTT
AACAATCCAA CTTTAGAAAC CGTTGTTGCA ATCTGCATCG GATTGGAGCT AAGCCCTACA
TACAGTGAAG AAATATTAAG GCTTGCCGGA TATACTCTCA ATAACACTCC ACAGCAATTG
GCGTATAAAA AGCTAATCCA TTCGTATAGA GGGCATTCAA TATATGAATG CAATGAAGTT
TTGGAAGCCT TGGGACTTTC CCCTCTTTGT GCAAAGGCAT ATAAAGAAAT GATAAGTTAA
 
Protein sequence
MASVSSFRDV IANMYYNDLF DELSEYIEDN PDKLESNSYH VQSPDEAALS DFDIITIDIT 
DSPGNSILFD VIVSAEIEIA ETVRRNRETD GIEQWFRISC RADLDDGIQN FQINSVSIYN
KYRESKLGRL SEYLVPIIEK EQFDNVATEF LNEFCPEALS TPMPIPVDEV VKRMGLKVKE
IQLTKHFTIF GQIVFGDCTI EYYDRNERTY KPLEVSRGTI LVDPNVYFMR NIGCMNNTII
HECVHWYKHR KYHELVKTYN SDALLISCRV NETTKYKQQW TPEDWMEWHA NGIAPRILMP
RSMTIKKIEE LIKKNELLFG TYDRLNIMEN VVYELADFFQ VSRIAAKIRM LDLGYKEVEG
VYTYVDDHFI SNYSFKADSL HKNQTYSISL SDSFFEYYAN PEFAKIIDSG NFIYVDGHYV
INDSKYIKKL ENGSIDLTDY AKLHVDECCL LFDLKLNKAS KMDIVVYLDS IMFRKATPDY
NRVPTFNPDK HNMEVFNRSE ELKKFHEEFV EEGQHLSRTT QTFSQAVYGH IKRKGYNKVV
FIEKTLLSGK TYDRIKNNEL NNPTLETVVA ICIGLELSPT YSEEILRLAG YTLNNTPQQL
AYKKLIHSYR GHSIYECNEV LEALGLSPLC AKAYKEMIS