Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0275 |
Symbol | |
ID | 4808558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 341280 |
End bp | 343715 |
Gene Length | 2436 bp |
Protein Length | 811 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105687 |
Product | cellobiose phosphorylase |
Protein accession | YP_001036707 |
Protein GI | 125972797 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTCG GTTTTTTTGA TGATGCAAAC AAAGAGTACG TTATTACCGT GCCCAGGACA CCGTATCCGT GGATAAACTA CCTGGGTACA GAGAATTTCT TCTCACTCAT TTCGAATACC GCAGGCGGAT ATTGCTTTTA CAGGGATGCA AGGCTTAGAC GTATAACAAG ATACAGATAC AACAATGTTC CTATTGACAT GGGAGGACGT TATTTCTACA TATATGACAA CGGTGATTTC TGGTCGCCGG GATGGTCTCC GGTAAAAAGG GAGCTTGAAA GCTATGAATG CAGACATGGA CTGGGATATA CAAAAATTGC CGGTAAAAGA AACGGAATAA AAGCGGAGGT CACTTTCTTC GTTCCGTTAA ACTACAATGG TGAAGTCCAA AAGCTTATAT TGAAGAATGA AGGACAGGAC AAAAAGAAAA TAACTCTCTT CTCTTTTATT GAGTTCTGCT TGTGGAATGC TTATGATGAT ATGACCAACT TCCAGAGAAA CTTCAGCACC GGTGAAGTTG AGATTGAAGG CTCGGTTATC TATCACAAGA CAGAGTACAG AGAGCGCAGA AACCATTACG CATTCTATTC TGTAAATGCA AAAATCAGCG GATTTGACAG TGACAGAGAC AGCTTCATAG GACTTTACAA CGGTTTTGAC GCTCCTCAGG CTGTAGTGAA CGGCAAGTCA AACAATTCCG TTGCGGACGG ATGGGCACCG ATTGCGTCCC ACAGCATTGA AATTGAATTG AATCCCGGGG AGCAAAAGGA ATATGTATTT ATTATAGGTT ATGTGGAGAA CAAAGATGAA GAAAAATGGG AGTCAAAAGG TGTCATCAAC AAGAAAAAAG CTTATGAAAT GATAGAGCAG TTCAACACTG TTGAAAAGGT TGACAAAGCA TTTGAAGAAC TCAAGAGCTA TTGGAATGCT CTTCTTTCAA AATACTTTCT TGAAAGCCAC GATGAAAAAC TCAACCGTAT GGTTAATATA TGGAATCAGT ACCAGTGTAT GGTTACATTC AACATGTCAA GAAGCGCTTC ATACTTTGAA TCCGGTATCG GAAGAGGTAT GGGTTTCAGA GATTCAAACC AGGACTTGCT GGGATTTGTA CACCAGATAC CCGAAAGAGC AAGAGAAAGG CTTCTTGACC TGGCTGCAAC TCAGCTTGAA GATGGCGGTG CGTACCATCA GTATCAGCCT CTTACCAAAA AAGGTAACAA TGAAATCGGA AGCAACTTCA ACGATGACCC GTTGTGGCTG ATTCTTGCAA CTGCTGCATA TATTAAGGAA ACCGGTGATT ATTCAATACT GAAGGAGCAA GTTCCGTTCA ACAATGATCC GTCCAAAGCC GACACCATGT TTGAACATTT GACCCGTTCC TTCTACCATG TGGTAAACAA CCTTGGACCT CACGGATTGC CGCTTATAGG TAGGGCGGAC TGGAATGACT GCCTTAACTT AAACTGCTTC TCCACCGTTC CGGATGAGTC GTTCCAGACC ACAACAAGCA AAGACGGAAA AGTGGCAGAG TCAGTTATGA TTGCCGGAAT GTTTGTGTTC ATCGGAAAAG ACTATGTGAA GCTTTGCGAA TACATGGGCC TTGAAGAGGA AGCCAGGAAA GCTCAGCAGC ATATTGACGC AATGAAGGAA GCAATTCTCA AATACGGTTA TGACGGTGAG TGGTTCTTAA GAGCTTACGA CGACTTTGGA AGAAAAGTCG GAAGCAAAGA AAACGAAGAG GGTAAGATTT TCATTGAGTC TCAGGGATTC TGTGTAATGG CTGAAATCGG GCTTGAAGAC GGCAAGGCTT TGAAGGCTCT GGATTCTGTC AAGAAATATC TTGACACTCC ATATGGTCTT GTACTTCAAA ATCCCGCGTT TACAAGATAC TATATTGAGT ACGGAGAAAT TTCAACATAT CCACCGGGAT ACAAAGAAAA TGCCGGTATA TTCTGCCACA ACAATGCATG GATAATCTGT GCTGAAACGG TTGTCGGAAG AGGAGACATG GCGTTTGATT ACTATAGAAA AATAGCACCT GCTTATATTG AAGATGTAAG TGACATCCAC AAGCTTGAGC CTTATGTTTA TGCACAGATG GTTGCCGGAA AAGACGCAAA ACGCCATGGA GAAGCTAAGA ACTCATGGCT GACCGGTACT GCGGCGTGGA ACTTTGTGGC GATTTCACAG TGGATACTGG GTGTAAAACC TGACTATGAC GGATTGAAGA TTGATCCATG CATACCCAAG GCATGGGACG GATACAAAGT TACCAGATAT TTCAGAGGCT CAACTTATGA AATCACTGTG AAGAATCCGA ACCATGTATC AAAAGGTGTG GCTAAAATTA CTGTTGACGG CAATGAAATC AGCGGAAATA TTCTTCCGGT GTTCAATGAC GGAAAGACTC ACAAAGTTGA AGTAATTATG GGATAA
|
Protein sequence | MKFGFFDDAN KEYVITVPRT PYPWINYLGT ENFFSLISNT AGGYCFYRDA RLRRITRYRY NNVPIDMGGR YFYIYDNGDF WSPGWSPVKR ELESYECRHG LGYTKIAGKR NGIKAEVTFF VPLNYNGEVQ KLILKNEGQD KKKITLFSFI EFCLWNAYDD MTNFQRNFST GEVEIEGSVI YHKTEYRERR NHYAFYSVNA KISGFDSDRD SFIGLYNGFD APQAVVNGKS NNSVADGWAP IASHSIEIEL NPGEQKEYVF IIGYVENKDE EKWESKGVIN KKKAYEMIEQ FNTVEKVDKA FEELKSYWNA LLSKYFLESH DEKLNRMVNI WNQYQCMVTF NMSRSASYFE SGIGRGMGFR DSNQDLLGFV HQIPERARER LLDLAATQLE DGGAYHQYQP LTKKGNNEIG SNFNDDPLWL ILATAAYIKE TGDYSILKEQ VPFNNDPSKA DTMFEHLTRS FYHVVNNLGP HGLPLIGRAD WNDCLNLNCF STVPDESFQT TTSKDGKVAE SVMIAGMFVF IGKDYVKLCE YMGLEEEARK AQQHIDAMKE AILKYGYDGE WFLRAYDDFG RKVGSKENEE GKIFIESQGF CVMAEIGLED GKALKALDSV KKYLDTPYGL VLQNPAFTRY YIEYGEISTY PPGYKENAGI FCHNNAWIIC AETVVGRGDM AFDYYRKIAP AYIEDVSDIH KLEPYVYAQM VAGKDAKRHG EAKNSWLTGT AAWNFVAISQ WILGVKPDYD GLKIDPCIPK AWDGYKVTRY FRGSTYEITV KNPNHVSKGV AKITVDGNEI SGNILPVFND GKTHKVEVIM G
|
| |