Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0322 |
Symbol | |
ID | 4808540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 405713 |
End bp | 407047 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640105733 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036753 |
Protein GI | 125972843 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00637507 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA GAGTTCATAT TCTAATAATG ACCTTAATGC TAATTGTGGC TTGTGTCGGA TGCAGCGGCA GTTCCGATAA TCCGCCGCCT GCCAACAGTT TGCATTCGGC ATCAACCATA CCGATATCAG CTATGCCTTC TTTTTCGACT CCTTCACCTA CTGTTGAAGA TACTGAAAAT CAACAATCGA AAACAGTGAA TTACTTGATA AATTCCATGA CCTTGGAGGA AAAGATCGGA CAGATTTTCA TTGTTGCCTT TAGAAAAGGC AAATCTTCAC GCCCGTTAAA AGTATTGGAC AATTCTACAA AACTGAAGAT TCAGAACTTT AATCCCGGTG GTATAATACT TTTCAGCGAA AACATAGACA CCATACCGCA GACACAAAAA CTCATTCGTG ATATGCAAGA GGCAAGCAAA ATTCCCATGT TCATCGCTGT TGACGAAGAA GGCGGGCGAA TAGCAAGAAT CGGAAACAAT CCGAAAATGC ATTCCACAAA AATACCCTCC GCCCAGACAA TAGGACTTGC CGATGACCCC GAACTTGCGT ATGAGGCAGG CAGGATACTG GGTGCGGAGC TGTCTGCCCT TGGCTTTAAC ATGAATTTCG CACCGGTGGC CGATGTAAAT ACAAATCCGG ACAATCCTGT TATCGGAGAC AGATCTTTTG GTTCCGACCC CTATAAAGTC GGCTTAATGG TACAGGCGAT GTCCAAAGGT ATGCAGGAGC AAAATGTCTG CACCGTGCTA AAGCACTTTC CCGGTCATGG CGACACCTCC TACGACTCGC ATCTTGGTCA GGTGGTAATA AATCACGACA TTGAAAGGCT TCGCCAAATA GAACTGACAC CTTTTAAAAT GGGCATCAAG GCCGGCGCCG ACGGCGTAAT GACGGCTCAT ATCATAATGC CCAACATTAC CGGCAGCAAT CTGCCCGCAA CTCTGTCTGA AGAAATCCTC AGCGGGCTTT TGAGAAATGA GTTAAAACAC GAAAAGCTCA TTATAACCGA TGCAATGGAA ATGAAGGCAA TCAGCAACTA CTGGTCTTCT TCCAAGGCTG CGGTCATGGC ATTTAAAGCA GGAGCAGACA TCATACTCAT GCCCGAATCA TTTGAAGAAG CCTATAACGG TATTCTCAAA GCCGTAAAAG ATGGTGAAAT AACGGAAGAA AGGCTAAATC AGTCCCTGCA AAGAATTCTC GCTCTAAAAT TTGAAAGAAA CATACTTGCA AATAAAGAAA GCTCCGTCGA CCCTGAAAAA GTATTGGGCA GACAAGAGCA TACTGACATA GTAGTGAAAA TCATGCAAAA GGCAGAAGAG CAAAATATCC CTTAA
|
Protein sequence | MKKRVHILIM TLMLIVACVG CSGSSDNPPP ANSLHSASTI PISAMPSFST PSPTVEDTEN QQSKTVNYLI NSMTLEEKIG QIFIVAFRKG KSSRPLKVLD NSTKLKIQNF NPGGIILFSE NIDTIPQTQK LIRDMQEASK IPMFIAVDEE GGRIARIGNN PKMHSTKIPS AQTIGLADDP ELAYEAGRIL GAELSALGFN MNFAPVADVN TNPDNPVIGD RSFGSDPYKV GLMVQAMSKG MQEQNVCTVL KHFPGHGDTS YDSHLGQVVI NHDIERLRQI ELTPFKMGIK AGADGVMTAH IIMPNITGSN LPATLSEEIL SGLLRNELKH EKLIITDAME MKAISNYWSS SKAAVMAFKA GADIILMPES FEEAYNGILK AVKDGEITEE RLNQSLQRIL ALKFERNILA NKESSVDPEK VLGRQEHTDI VVKIMQKAEE QNIP
|
| |