Gene Cthe_0291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0291 
Symbol 
ID4808509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp363973 
End bp366015 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content43% 
IMG OID640105703 
Producthypothetical protein 
Protein accessionYP_001036723 
Protein GI125972813 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGCTT ATAAACAAGC CAACAGACTA ATCCATGAGA AATCCCCTTA CCTGCTGCAG 
CATGCATATA ACCCTGTAGA CTGGTATCCC TGGTGTGATG AGGCATTTGA AAAAGCCAAA
CGGGAAAACA AGCCCATCTT CCTCTCCATA GGCTATTCCA CCTGCCACTG GTGCCATGTG
ATGGAGAGTG AATCCTTTGA AGATGAAGAA GTTGCCGAAA TTCTAAACAA AAACTTTGTT
TCCATCAAAG TGGACAGAGA AGAACGCCCG GATATAGACA GCATATACAT GACTGCCTGT
CAGGCACTGA CGGGACATGG GGGCTGGCCG CTGACAATCA TTATGACTCC CGACAAAAAG
CCTTTCTTTG CCGGAACATA CTTTCCCAAA AAAGACCGTA TGGGAATGCC CGGACTTATA
TCCATCCTCA AGAGCGTACA CAACACCTGG GTAAATGAAA AAGATTCACT TGCCAAATAC
AGCTCCAAGG TAGTCAGTGT AATCAGCGAA TCAATTGATG ATGACTATTA CTATTCTGTC
GATGAAATTA CAGAAGACAT ATTTGAAGAT GCCTTTTCGC AGTTCAAATA TGACTTTGAC
AACATTTACG GAGGATTTGG GAACGCACCC AAGTTCCCTA TGCCCCACAA CCTGTATTTT
CTTCTGAGAT ACTGGCACAA GGCCAAAGAG GAGTATGCCC TTGTCATGGT CGAAAAAACT
CTTGACTCCA TGTACAGCGG CGGAATATAT GACCACATAG GTTTTGGCTT TTGCCGTTAT
TCGACTGATG AAAAATGGCT GGTGCCTCAT TTCGAAAAAA TGCTGTATGA TAATGCATTG
CTGGCCATAG CATATCTTGA AACCTATCAG GCAACAAAAA ACAAAAAATA CGCCGATATT
GCAAAAGAAA TCTTTACTTA TGTGCTAAGA GACATGACAT CACCGGAAGG TGGATTCTAC
AGCGCAGAAG ACGCAGATTC CGAAGGCGAA GAAGGAAAAT TCTACATCTG GTCTCCAACT
GAAATAAAAG AAGTCTTGGG AGAAAGCGAC GGTGAAAAAT TCTGCAAATA TTACAACATC
ACCGAAGAAG GAAATTTTGA AGGTCTCAAC ATCCCAAACC TTATAAACAG CACAATACCT
GACGAAGATA AGGAGTTTGT CGAATTGTGC AGAAAAAAAC TCTTTGACCA CAGAGAAAAA
AGGGTGCATC CCCATAAGGA TGACAAAATC CTGACTGCCT GGAACGGTCT TATGATAGCC
GCCCTGGCAA TCGGTGGAAG AGTTCTGGGG ATTGAAAAAT ACACTCTCGC CGCTGAGAAA
GCCAGTGAAT TCATATTCTC AAAACTCGTA AGACCTGACG GAAGGCTTCT TGCGCGGTAC
AGGGACGGAG AAGCCGCATT TTTGGCATAT CTCGACGACT ATGCATTCTT AATCTGGGCT
CTTATTGAGC TTTATGAAAC AACTTACAAA CCTATGTATC TCAAAAAAGC CATGGAACTG
ACCAATGACA TGATTAAGTA TTTCTGGGAC AATAAAAAGG GTGGGCTTTT CATATATGGC
AGCGACAGTG AGCAACTCAT TACCAGACCA AAGGAAATAT ACGACGGAGC CATTCCGTCC
GGTAATTCGG TTGCAGCTTT GAATTTTCTA AGACTTTCAC GTTTGACAGG GCAGCAGGAG
TTGGAAGAAA AAGCCCATCA GATGTTTGCC CTTTTTGGAA GCAAAATTGA CAGCATGCCG
CAAGGATATG CTTTTTTCCT TACAGCTATG CTTTTTTCGA AATCCAAATC AAATGAAGTT
GTTCTGGTGG GCAGCAATGA GAAAGACACT CAAAACATGC TCAGTATTCT CAGTGAAGAT
TTCAGACCTT TCACAACTTC AATTTTATAT TCCGAAGAAC ACAAGGATTT AAAAGAACTG
ATACCGTTTA TCGACAATTA CACTACAATT GAAAATAAAC CTACTGCTTA CGTCTGCGAA
AACTTTGTCT GTCATGAACC GATTACTGAC GGCGCCCTGC TCCGCGAAAA GCTAAACCGT
TAG
 
Protein sequence
MSAYKQANRL IHEKSPYLLQ HAYNPVDWYP WCDEAFEKAK RENKPIFLSI GYSTCHWCHV 
MESESFEDEE VAEILNKNFV SIKVDREERP DIDSIYMTAC QALTGHGGWP LTIIMTPDKK
PFFAGTYFPK KDRMGMPGLI SILKSVHNTW VNEKDSLAKY SSKVVSVISE SIDDDYYYSV
DEITEDIFED AFSQFKYDFD NIYGGFGNAP KFPMPHNLYF LLRYWHKAKE EYALVMVEKT
LDSMYSGGIY DHIGFGFCRY STDEKWLVPH FEKMLYDNAL LAIAYLETYQ ATKNKKYADI
AKEIFTYVLR DMTSPEGGFY SAEDADSEGE EGKFYIWSPT EIKEVLGESD GEKFCKYYNI
TEEGNFEGLN IPNLINSTIP DEDKEFVELC RKKLFDHREK RVHPHKDDKI LTAWNGLMIA
ALAIGGRVLG IEKYTLAAEK ASEFIFSKLV RPDGRLLARY RDGEAAFLAY LDDYAFLIWA
LIELYETTYK PMYLKKAMEL TNDMIKYFWD NKKGGLFIYG SDSEQLITRP KEIYDGAIPS
GNSVAALNFL RLSRLTGQQE LEEKAHQMFA LFGSKIDSMP QGYAFFLTAM LFSKSKSNEV
VLVGSNEKDT QNMLSILSED FRPFTTSILY SEEHKDLKEL IPFIDNYTTI ENKPTAYVCE
NFVCHEPITD GALLREKLNR