Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2761 |
Symbol | |
ID | 4810264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3258151 |
End bp | 3260274 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640108181 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001039153 |
Protein GI | 125975243 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000261631 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAGAAAA TAGGAATACT TGTCTTGATA ACCGCTCTTC TGGCGGGAAT AATTCCAAAA TCGGCTCTGG CCGAAGAGCC AAAATTTAAC TATGTAGATG CGTTTGCCAA ATCAATTCTG TTTTACGAAG CCAACTGGTG CGGGCCTGAC GCAGGGAACA ACAGGATAAA ATGGCGTGGT CCATGCCATA TTGAGGACGG CAAGGATGTG GGTCTTGATT TGACGGGAGG TTTCCATGAC TGCGGAGACC ATGTCAAGTT TGGATTGCCT CAATGTGCTT CGGCTTCAAC ACTTGCCTGG GCTTATCATG AATTCTCAGA CGTGTTTATA GAAAAAGGGC AGGATGAATA CATGCTGAAC ATTTTAAAGC ATTTTTGTGA TTACTTCATG AAATGCTATC CTGAAAAGAA CAAGTTTTAC TATCAAGTCG GTGACGGTGA TGTGGATCAC CAGTACTGGG GACCGCCTGA GCTTCAAAGT TATGACAGAC CTACATATTA TGTAGCCACA CCGGAAAATC CGGGTTCGGA TGTTGCCGGG GATACGGCAG CGGCATTGGC TCTTATGTAT TTGAATTATA AAGACAGGGA TTTGGAATAT GCAGAGAAAT GCCTGGCTTA TGCAAAGGAT ATTTATGAGT TTGGTATGAC CTACAGAGGA AACAGTAAAG GACAAAGTTA TTATCTTCCC AGAGATTATC TTGATGAACT TATGTGGGGA TCTTTGTGGC TTTATGTCGC CACAGGAGAG CAAAAATACA TGGACAATTT GGAAAAACTG ATGGTTGAAA AAAGGATTGG CGATGAAGCC GGCAATTCCT TTAATGATAA TTGGACCCAA TGCTGGGATT ATGTTTTGAC CGGGGTGTTT ATCAAGCTTG CAACCCTTAC CGACAAGCCA TTGTATAAAC AGATTGCCGA AGACCATTTG GATTACTGGC AGAACAGGAT AAAGTCCACT CCTGGAGGTT TAAAATACCT GGATAGTTGG GGTGTTTGCA AATATCCTGC CGCAGAGAGT ATGGTTCAGC TGGTGTACTA TAAGTATACC GGGGACAAGA GATGCCTTGA TTTTGCAAAG AGCCAGATTG ACTACATACT TGGAGATAAT CCTAAAAAAA TGTCCTATGT GGTTGGTTTT GGAGACAATT ATCCCAAATT CCCGCATCAC AGGGCTGCAA GCGGCAGACT TGAAGGACCG CCGGCCGACG AAACAAAGAA TGATCCGCAA AGGCATATTT TATATGGCGC TTTGGTTGGT GGAGCGGATA TAAATGATGA ATATTATGAT GATATAGATA AGTACGTTTA TTCCGAAACG GGATTGGATT ATAATGCAGG GCTTGTCGGC GCATTGGCAG GTATGTCAAA ATATTTTGGA CAAGGCCAAA TGCCCGAAGA TACTCCTGGT ATTGAAGGTG AGCCGCCTGT ATACTATGCA GACGCAAAAA TATACGAAGA AAATGAAAGC GGTATTACAG TTGATCTTAA TATGTATAAT ATTGTTACAT CGCCTCCGCA ATACGAGTCC GATTTATCCT GCAGATATTT TGTTGATTTG TCGGAGTATG CAGGGGAAAA TATTGATATG TCAAAATTTG TAACAAAAGT GTATTACTCG CCTGCAGGTG CTACAATATC CGAGCTTAAG CCTTATGATA AAGAGAAGAA TATTTACTAT GTGGAAATAA GTTTCCCCAA TCCGGTATAT GCAAGAACTT ATGTGCAGTT CTGTATTTAC TATTATGAAA ATAAACTGTG GGATTCTTCG AATGACTTTT CTTACCAGGG TATAGGGGAT ACTTATAAAA CTTTGGAGAA TATTCCTATA TACAAGAATG GCGTTCTGGT GGCAGGAAAA GAACCCTCCG GAGCAGGACC GGTTGAACCT ACACCTCCAC CGAAGAATTA TGTATACGGA GATGTAAACG GTGATGGTAA GGTAAATTCA ACGGACTGTT CAATTGTCAA GAGATATTTG CTCAAGAATA TAGAGGATTT CCCGTACGAG TATGGAAAAG AGGCCGGAGA TGTAAACGGT GATGGCAAGG TGAATTCAAC GGATTATTCT CTGCTTAAGA GGTTTGTACT GCGCAATATA GATAAGTTCC CCGTAGAGCA GTAA
|
Protein sequence | MKKIGILVLI TALLAGIIPK SALAEEPKFN YVDAFAKSIL FYEANWCGPD AGNNRIKWRG PCHIEDGKDV GLDLTGGFHD CGDHVKFGLP QCASASTLAW AYHEFSDVFI EKGQDEYMLN ILKHFCDYFM KCYPEKNKFY YQVGDGDVDH QYWGPPELQS YDRPTYYVAT PENPGSDVAG DTAAALALMY LNYKDRDLEY AEKCLAYAKD IYEFGMTYRG NSKGQSYYLP RDYLDELMWG SLWLYVATGE QKYMDNLEKL MVEKRIGDEA GNSFNDNWTQ CWDYVLTGVF IKLATLTDKP LYKQIAEDHL DYWQNRIKST PGGLKYLDSW GVCKYPAAES MVQLVYYKYT GDKRCLDFAK SQIDYILGDN PKKMSYVVGF GDNYPKFPHH RAASGRLEGP PADETKNDPQ RHILYGALVG GADINDEYYD DIDKYVYSET GLDYNAGLVG ALAGMSKYFG QGQMPEDTPG IEGEPPVYYA DAKIYEENES GITVDLNMYN IVTSPPQYES DLSCRYFVDL SEYAGENIDM SKFVTKVYYS PAGATISELK PYDKEKNIYY VEISFPNPVY ARTYVQFCIY YYENKLWDSS NDFSYQGIGD TYKTLENIPI YKNGVLVAGK EPSGAGPVEP TPPPKNYVYG DVNGDGKVNS TDCSIVKRYL LKNIEDFPYE YGKEAGDVNG DGKVNSTDYS LLKRFVLRNI DKFPVEQ
|
| |