Gene Cthe_2761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2761 
Symbol 
ID4810264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3258151 
End bp3260274 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content42% 
IMG OID640108181 
Productglycoside hydrolase family protein 
Protein accessionYP_001039153 
Protein GI125975243 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000261631 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGAAAA TAGGAATACT TGTCTTGATA ACCGCTCTTC TGGCGGGAAT AATTCCAAAA 
TCGGCTCTGG CCGAAGAGCC AAAATTTAAC TATGTAGATG CGTTTGCCAA ATCAATTCTG
TTTTACGAAG CCAACTGGTG CGGGCCTGAC GCAGGGAACA ACAGGATAAA ATGGCGTGGT
CCATGCCATA TTGAGGACGG CAAGGATGTG GGTCTTGATT TGACGGGAGG TTTCCATGAC
TGCGGAGACC ATGTCAAGTT TGGATTGCCT CAATGTGCTT CGGCTTCAAC ACTTGCCTGG
GCTTATCATG AATTCTCAGA CGTGTTTATA GAAAAAGGGC AGGATGAATA CATGCTGAAC
ATTTTAAAGC ATTTTTGTGA TTACTTCATG AAATGCTATC CTGAAAAGAA CAAGTTTTAC
TATCAAGTCG GTGACGGTGA TGTGGATCAC CAGTACTGGG GACCGCCTGA GCTTCAAAGT
TATGACAGAC CTACATATTA TGTAGCCACA CCGGAAAATC CGGGTTCGGA TGTTGCCGGG
GATACGGCAG CGGCATTGGC TCTTATGTAT TTGAATTATA AAGACAGGGA TTTGGAATAT
GCAGAGAAAT GCCTGGCTTA TGCAAAGGAT ATTTATGAGT TTGGTATGAC CTACAGAGGA
AACAGTAAAG GACAAAGTTA TTATCTTCCC AGAGATTATC TTGATGAACT TATGTGGGGA
TCTTTGTGGC TTTATGTCGC CACAGGAGAG CAAAAATACA TGGACAATTT GGAAAAACTG
ATGGTTGAAA AAAGGATTGG CGATGAAGCC GGCAATTCCT TTAATGATAA TTGGACCCAA
TGCTGGGATT ATGTTTTGAC CGGGGTGTTT ATCAAGCTTG CAACCCTTAC CGACAAGCCA
TTGTATAAAC AGATTGCCGA AGACCATTTG GATTACTGGC AGAACAGGAT AAAGTCCACT
CCTGGAGGTT TAAAATACCT GGATAGTTGG GGTGTTTGCA AATATCCTGC CGCAGAGAGT
ATGGTTCAGC TGGTGTACTA TAAGTATACC GGGGACAAGA GATGCCTTGA TTTTGCAAAG
AGCCAGATTG ACTACATACT TGGAGATAAT CCTAAAAAAA TGTCCTATGT GGTTGGTTTT
GGAGACAATT ATCCCAAATT CCCGCATCAC AGGGCTGCAA GCGGCAGACT TGAAGGACCG
CCGGCCGACG AAACAAAGAA TGATCCGCAA AGGCATATTT TATATGGCGC TTTGGTTGGT
GGAGCGGATA TAAATGATGA ATATTATGAT GATATAGATA AGTACGTTTA TTCCGAAACG
GGATTGGATT ATAATGCAGG GCTTGTCGGC GCATTGGCAG GTATGTCAAA ATATTTTGGA
CAAGGCCAAA TGCCCGAAGA TACTCCTGGT ATTGAAGGTG AGCCGCCTGT ATACTATGCA
GACGCAAAAA TATACGAAGA AAATGAAAGC GGTATTACAG TTGATCTTAA TATGTATAAT
ATTGTTACAT CGCCTCCGCA ATACGAGTCC GATTTATCCT GCAGATATTT TGTTGATTTG
TCGGAGTATG CAGGGGAAAA TATTGATATG TCAAAATTTG TAACAAAAGT GTATTACTCG
CCTGCAGGTG CTACAATATC CGAGCTTAAG CCTTATGATA AAGAGAAGAA TATTTACTAT
GTGGAAATAA GTTTCCCCAA TCCGGTATAT GCAAGAACTT ATGTGCAGTT CTGTATTTAC
TATTATGAAA ATAAACTGTG GGATTCTTCG AATGACTTTT CTTACCAGGG TATAGGGGAT
ACTTATAAAA CTTTGGAGAA TATTCCTATA TACAAGAATG GCGTTCTGGT GGCAGGAAAA
GAACCCTCCG GAGCAGGACC GGTTGAACCT ACACCTCCAC CGAAGAATTA TGTATACGGA
GATGTAAACG GTGATGGTAA GGTAAATTCA ACGGACTGTT CAATTGTCAA GAGATATTTG
CTCAAGAATA TAGAGGATTT CCCGTACGAG TATGGAAAAG AGGCCGGAGA TGTAAACGGT
GATGGCAAGG TGAATTCAAC GGATTATTCT CTGCTTAAGA GGTTTGTACT GCGCAATATA
GATAAGTTCC CCGTAGAGCA GTAA
 
Protein sequence
MKKIGILVLI TALLAGIIPK SALAEEPKFN YVDAFAKSIL FYEANWCGPD AGNNRIKWRG 
PCHIEDGKDV GLDLTGGFHD CGDHVKFGLP QCASASTLAW AYHEFSDVFI EKGQDEYMLN
ILKHFCDYFM KCYPEKNKFY YQVGDGDVDH QYWGPPELQS YDRPTYYVAT PENPGSDVAG
DTAAALALMY LNYKDRDLEY AEKCLAYAKD IYEFGMTYRG NSKGQSYYLP RDYLDELMWG
SLWLYVATGE QKYMDNLEKL MVEKRIGDEA GNSFNDNWTQ CWDYVLTGVF IKLATLTDKP
LYKQIAEDHL DYWQNRIKST PGGLKYLDSW GVCKYPAAES MVQLVYYKYT GDKRCLDFAK
SQIDYILGDN PKKMSYVVGF GDNYPKFPHH RAASGRLEGP PADETKNDPQ RHILYGALVG
GADINDEYYD DIDKYVYSET GLDYNAGLVG ALAGMSKYFG QGQMPEDTPG IEGEPPVYYA
DAKIYEENES GITVDLNMYN IVTSPPQYES DLSCRYFVDL SEYAGENIDM SKFVTKVYYS
PAGATISELK PYDKEKNIYY VEISFPNPVY ARTYVQFCIY YYENKLWDSS NDFSYQGIGD
TYKTLENIPI YKNGVLVAGK EPSGAGPVEP TPPPKNYVYG DVNGDGKVNS TDCSIVKRYL
LKNIEDFPYE YGKEAGDVNG DGKVNSTDYS LLKRFVLRNI DKFPVEQ