Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2193 |
Symbol | |
ID | 4811058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2613733 |
End bp | 2616579 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640107599 |
Product | carbohydrate-binding family 6 protein |
Protein accession | YP_001038588 |
Protein GI | 125974678 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.43679 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGGCAA GTATTAAAAC AAGTATTAAA ATTAGAACAG TTGCATTTGT ATCAATTATT GCAATTGCTT TAAGTATTCT AAGTTTTATT CCAAACCGGG CATATGCAAG CCCGCAACGT GGCCGGCCGC GTCTTAATGC AGCCAGAACC ACGTTTGTAG GAGACAATGG GCAGCCGTTA CGCGGTCCAT ACACTTCCAC GGAATGGACG GCGGCAGCTC CTTATGACCA GATTGCGAGA GTTAAGGAAC TGGGATTTAA TGCAGTACAC CTCTACGCAG AATGCTTTGA CCCCAGATAT CCTGCTCCCG GAAGCAAAGC TCCGGGATAC GCGGTTAATG AAATTGACAA GATAGTGGAA AGGACTCGTG AACTTGGTCT TTATCTGGTA ATAACCATAG GCAACGGTGC CAATAACGGA AATCATAACG CGCAATGGGC AAGGGATTTC TGGAAGTTCT ATGCACCGCG TTATGCAAAA GAAACACATG TATTGTATGA AATACACAAT GAGCCTGTGG CATGGGGACC TCCATATTCT TCTTCAACGG CCAATCCTCC CGGTGCAGTT GATATGGAAA TTGATGTTTA CAGGATAATC CGTACTTATG CACCGGAAAC ACCGGTATTA CTTTTCTCCT ATGCAGTATT TGGAGGCAAA GGGGGAGCGG CCGAAGCCCT GAAAGACATA CGCGCTTTCA ACAAGGCGGT TTTTGGCAAT GAAAACGCAG TGTGGACTAA CGAAGCTGTG GCGTTTCACG GATATGCAGG TTGGCAGGAA ACCACCATTG CGGTGGAGGA ACTTTTAAAG GCCGGTTATC CCTGCTTTAT GACTGAGTAT GCCGGAGGTG CCTGGGGCAG CGGCATGGGA GGATTGGATG TTGAACTTAC CTACGAGCTG GAACGCCTGG GAGTCTCCTG GCTTACTTTC CAGTACATTC CGCCAACCGG TGTGTCTGAT GATGTTACAA AGCCGGAATA TTTCTCAGCA TTGGTGGAAA ATTCCGGTCT TTCTTGGACT CCCGATTACG GGAACTGGCC GGCGGCCCGC GGTGTATACG GCAACGGTGG TCTGGCAAGG GAAACTGCGA CGTGGATTAA CAACTTCTTA ACCGGTACAA CCCGTATCGA AGCAGAAGAC TTCGATTGGG GCGGAAACGG GGTTTCGTAT TATGACACGG ATTCGGTGAA TGTTGGAGGA CAATACCGCC CGGATGAAGG AGTGGATATT GAGAAAACTT CAGACACAGG CGGCGGTTAC AATGTCGGAT GGATTTCGGA AGGAGAATGG CTTGAATATA CCATAAGAGT TCGGAATCCC GGATACTATA ACTTGTCGCT CCGTGTGGCA GGCATCAGCG GCAGCAGAGT ACAGGTGAGT TTCGGAAACC AGGACAAGAC CGGAGTTTGG GAACTGCCTG CTACCGGAGG TTTTCAGACT TGGACTACAG CCACAAGGCA GGTGTTTCTT GGAGCCGGCC TGCAAAAATT ACGTATCAAT GCTTTGTCCG GAGGGTTCAA TTTGAATTGG ATTGAACTTT CTCCGATATC AACAGGAACC ATTCCCGACG GAACATATAA GTTTTTGAAC CGCGCAAATG GAAAGACATT GCAGGAAGTA ACCGGCAACA ACAGCATAAT AACCGCCGAT TACAAAGGAA TCACGGAACA GCACTGGAAG ATTCAGCACA TTGGCGGCGG CCAATACAGA ATTTCATCCG CAGGCAGAGG CTGGAACTGG AACTGGTGGA TGGGTTTTGG AACTGTCGGA TGGTGGGGAA CAGGCTCCAG TACGTGTTTT ATTATCAGTC CTACGGGTGA CGGTTACTAC AGAATCGTAC TTGTCGGTGA CGGTACAAAC CTGCAAATAT CCTCAGGTGA TCCGAGCAAG ATAGAGGGAA AGGCTTTTCA TGGTGGAGCC AATCAGCAGT GGGCAATACT TCCGGTTTCC GCTCCCGCGT TTCCGACAGG GCTAAGTGCG GTACTTGATT CTTCCGGCAA TACGGCCAAT TTGACATGGA ATGCCGCTCC GGGTGCGAAC TCTTACAATG TTAAACGTTC CACCAAAAGC GGTGGTCCGT ATACAACTAT TGCCACCAAT ATCACATCGA CAAACTATAC CGACACCGGT GTGGCAACGG GTACTAAATA CTATTATGTG GTAAGTGCGG TAAGCAATGG AGTGGAAACC CTCAACAGTG CGGAAGCGAT ACTGCAATAT CCTAAACTTA CGGGTACCGT TATTGGAACC CAAGGTTCGT GGAATAACAT TGGGAACACA ATTCACAAAG CTTTTGACGG TGACCTGAAC ACGTTTTTTG ACGGTCCTAC AGCAAACGGC TGCTGGCTGG GACTGGATTT TGGGGAAGGT GTGAGGAATG TCATTACACA AATTAAATTC TGCCCGCGTT CCGGCTATGA ACAGCGCATG ATAGGGGGAA TTTTTCAGGG GGCAAATAAA GAAGATTTCA GCGATGCAGT GACGCTGTTT ACCATTACCT CACTACCAGG CTCCGGTACG TTAACTTCGG TGGATGTAGA CAATCCAACC GGCTTCCGCT ATGTCCGCTA TTTGTCCCCG GACGGCAGTA ATGGAAATAT TGCAGAGCTG CAGTTTTTCG GTACACCGGC CGGTGAGGAG AATGATGATG TGCATTTGGG CGATATAAAC GATGACGGAA ATATAAACTC AACAGACCTT CAGATGCTAA AAAGGCATTT GCTCCGCAGT ATCCGGCTTA CGGAAAAACA GCTTTTAAAT GCGGATACAA ACAGAGACGG CAGAGTGGAT TCCACCGACC TTGCTTTATT AAAAAGATAT ATACTCCGTG TCATAACTAC TTTATAA
|
Protein sequence | MGASIKTSIK IRTVAFVSII AIALSILSFI PNRAYASPQR GRPRLNAART TFVGDNGQPL RGPYTSTEWT AAAPYDQIAR VKELGFNAVH LYAECFDPRY PAPGSKAPGY AVNEIDKIVE RTRELGLYLV ITIGNGANNG NHNAQWARDF WKFYAPRYAK ETHVLYEIHN EPVAWGPPYS SSTANPPGAV DMEIDVYRII RTYAPETPVL LFSYAVFGGK GGAAEALKDI RAFNKAVFGN ENAVWTNEAV AFHGYAGWQE TTIAVEELLK AGYPCFMTEY AGGAWGSGMG GLDVELTYEL ERLGVSWLTF QYIPPTGVSD DVTKPEYFSA LVENSGLSWT PDYGNWPAAR GVYGNGGLAR ETATWINNFL TGTTRIEAED FDWGGNGVSY YDTDSVNVGG QYRPDEGVDI EKTSDTGGGY NVGWISEGEW LEYTIRVRNP GYYNLSLRVA GISGSRVQVS FGNQDKTGVW ELPATGGFQT WTTATRQVFL GAGLQKLRIN ALSGGFNLNW IELSPISTGT IPDGTYKFLN RANGKTLQEV TGNNSIITAD YKGITEQHWK IQHIGGGQYR ISSAGRGWNW NWWMGFGTVG WWGTGSSTCF IISPTGDGYY RIVLVGDGTN LQISSGDPSK IEGKAFHGGA NQQWAILPVS APAFPTGLSA VLDSSGNTAN LTWNAAPGAN SYNVKRSTKS GGPYTTIATN ITSTNYTDTG VATGTKYYYV VSAVSNGVET LNSAEAILQY PKLTGTVIGT QGSWNNIGNT IHKAFDGDLN TFFDGPTANG CWLGLDFGEG VRNVITQIKF CPRSGYEQRM IGGIFQGANK EDFSDAVTLF TITSLPGSGT LTSVDVDNPT GFRYVRYLSP DGSNGNIAEL QFFGTPAGEE NDDVHLGDIN DDGNINSTDL QMLKRHLLRS IRLTEKQLLN ADTNRDGRVD STDLALLKRY ILRVITTL
|
| |