Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0211 |
Symbol | |
ID | 4808629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 256900 |
End bp | 257904 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105624 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001036645 |
Protein GI | 125972735 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2273] Beta-glucanase/Beta-glucan synthetase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00263756 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA GGGTAATTTC ATTATTAATG GCTTCCTTGC TTTTGGTTTT GTCGGTAATT GTTGCTCCTT TTTACAAAGC GGAAGCCGCA ACTGTGGTAA ATACGCCTTT TGTTGCAGTG TTTTCGAACT TTGACTCCAG TCAGTGGGAA AAAGCGGATT GGGCGAACGG TTCGGTGTTC AACTGTGTTT GGAAGCCTTC ACAGGTGACA TTTTCGAACG GTAAAATGAT TTTGACCCTT GACAGGGAAT ATGGCGGTTC ATATCCGTAT AAAAGCGGTG AATATCGTAC AAAATCATTT TTCGGATACG GTTATTATGA AGTAAGAATG AAAGCTGCCA AAAACGTAGG AATTGTTTCA TCTTTCTTCA CTTATACAGG ACCTTCGGAC AACAATCCAT GGGACGAAAT CGATATCGAG TTTTTAGGAA AGGACACAAC TAAAGTTCAG TTCAACTGGT ACAAAAATGG AGTCGGTGGA AACGAGTATT TGCACAATCT TGGATTCGAT GCTTCCCAGG ATTTTCATAC ATATGGATTT GAATGGAGGC CGGATTATAT AGACTTCTAT GTTGACGGCA AAAAAGTTTA TCGTGGAACC AGGAACATAC CTGTTACTCC CGGCAAAATT ATGATGAATT TGTGGCCAGG AATAGGAGTG GATGAATGGT TGGGACGTTA CGACGGAAGA ACTCCTTTGC AGGCGGAGTA CGAATATGTA AAATACTATC CTAACGGTGT TCCGCAAGAT AATCCTACTC CTACTCCTAC GATTGCTCCT TCTACTCCGA CTAACCCTAA TTTACCTCTT AAGGGAGACG TAAACGGCGA CGGTCATGTT AACTCATCAG ACTATTCATT ATTTAAAAGA TATTTGCTCA GGGTTATTGA TAGATTCCCT GTTGGAGATC AGAGTGTTGC TGATGTAAAC AGGGACGGAA GGATTGACTC CACAGACCTT ACAATGTTAA AGAGATATCT GATACGGGCA ATTCCGTCAC TTTGA
|
Protein sequence | MKNRVISLLM ASLLLVLSVI VAPFYKAEAA TVVNTPFVAV FSNFDSSQWE KADWANGSVF NCVWKPSQVT FSNGKMILTL DREYGGSYPY KSGEYRTKSF FGYGYYEVRM KAAKNVGIVS SFFTYTGPSD NNPWDEIDIE FLGKDTTKVQ FNWYKNGVGG NEYLHNLGFD ASQDFHTYGF EWRPDYIDFY VDGKKVYRGT RNIPVTPGKI MMNLWPGIGV DEWLGRYDGR TPLQAEYEYV KYYPNGVPQD NPTPTPTIAP STPTNPNLPL KGDVNGDGHV NSSDYSLFKR YLLRVIDRFP VGDQSVADVN RDGRIDSTDL TMLKRYLIRA IPSL
|
| |