Gene Information |
Locus tag | Cthe_0175 |
Symbol | |
ID | 4808663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 211898 |
End bp | 213034 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640105586 |
Product | polysaccharide deacetylase |
Protein accession | YP_001036609 |
Protein GI | 125972699 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
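The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that contributes no amino acid (the gene sequence in this record does end in TAA).

```python
# Values copied from the Gene Information table.
start_bp = 211898
end_bp = 213034
gene_length_bp = 1137
protein_length_aa = 378

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS is read in 3 bp codons; assume the last codon is a stop codon
# and therefore adds no residue to the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("codons minus stop matches Protein Length:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 1137 bp is 379 codons, which gives 378 residues plus one stop codon.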
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.122625 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCTT TCTTTAAGTC AAGAGTTGCA AGAATTCTTC CCTTAATAGC TTTGTCTCTT TCTCTGCTGT TTGCAGGATG TAACACCAGT CCCAAATTGA ATGGCGATTT GAAGTCACCG GTTCCCAATA CCGCCGGACA GGATGCGGGA CAGGAAGGTG GTTCTTTGGC AACACCGACT CCAACACCGA CACCTGAGCC AACGCCTACA ATTGATCTTC AAAGTGTCAA GCCTAATGAG GCTGGCAAAA TAATGGTTGT TATGTTCCAT AATTTTGTTG AGTCTTTTAC TCCTAAGAGT TATGATAAGG GTGAATACAC CACTACTTTC AGTGAGTTTG AAAAGCTGCT TCAAGACTTG TATGACAGAG GTTACAGGCT TATCAGCATG AGTGATTATT TAAATAACAA TATTTCAGTG CCTGCAGGAT GCATACCGAT AATATTTACA TTTGACGACG GAACATCGGG ACAGTTTAAT TTGGTTGAAG AAAACGGAAC TCTCAAAGTA AATAAAAAAT CTGCCGTAGG AATTATGGAA GAGTTTTATG AGAAGCATCC TGATTTCGGG CTTAAAGGCA CTTTCTATGT GAACCTTGGC AACAGCACTT TTGAGGGTGC GGGAACTTTG CAACAGAGGC TTCAATATTT GGTTGACAAA GGATTTGAAA TTGGGAACCA TACTTATACC CATATAAATC TGAAAGATAC CACAAGTGCT CAAAAAATAC AGGAAGAGAT TGCTAAAAAC CAAAGGACTA TTGCTGAGCT TATACCCGGA TATAAAATGA CTACTTTTTC TCTTCCGTAC GGACTTCCAT CTGTAGAAGG CCTGCATAGC TATGTTATAA AAGGTAGTTA TGAAGGGGTG GAATACGAAC ATGCGGCAAT TATGGAGGTA GGCTGGGATC CGGCCCATTC TCCGGTATCA AAAGATTTTA ACCCCCTCTC GACCCACAGG GTTAGAGCTT CGGGAATTAA TCCTGTAGAT TGCGACCTGG CATGGTGGCT TAAAAATTTA TCCAGGGAAG AGCAGTATAT AAGTGACGGA GATCCCAATA CCGTTACAGT TCCTAAGAAA AATGAAGACA AAGTAGACAA GGATAAGCTA AAGGACAAAA AGCTCGTTAT ATATTAA
Protein sequence | MASFFKSRVA RILPLIALSL SLLFAGCNTS PKLNGDLKSP VPNTAGQDAG QEGGSLATPT PTPTPEPTPT IDLQSVKPNE AGKIMVVMFH NFVESFTPKS YDKGEYTTTF SEFEKLLQDL YDRGYRLISM SDYLNNNISV PAGCIPIIFT FDDGTSGQFN LVEENGTLKV NKKSAVGIME EFYEKHPDFG LKGTFYVNLG NSTFEGAGTL QQRLQYLVDK GFEIGNHTYT HINLKDTTSA QKIQEEIAKN QRTIAELIPG YKMTTFSLPY GLPSVEGLHS YVIKGSYEGV EYEHAAIMEV GWDPAHSPVS KDFNPLSTHR VRASGINPVD CDLAWWLKNL SREEQYISDG DPNTVTVPKK NEDKVDKDKL KDKKLVIY