Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1074 |
Symbol | |
ID | 4811372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1281519 |
End bp | 1282481 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106496 |
Product | polysaccharide deacetylase |
Protein accession | YP_001037499 |
Protein GI | 125973589 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | [TIGR02884] delta-lactam-biosynthetic de-N-acetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0121745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA GAAGAGTAAT CAGAGGAGCG ATTGTTCTAA CAATAATGTT AGCTTTACTT TTGTCAATAA GATTTTTGGC GACAAGAGAG CAGGAAAATC TCAAGCTGCC GGACATGGGC GGAGTATTCT CGGTGCGCCT TGAGAATCTG GATTTTACAA CGGAGATCAC CGGAATGTTA AGTGACGGGG TGATAGACGA GGTGCCTTAT GAAGAATATG AGGATGTTTT TGCGGAAGTG GAAGCGATTC CTGTCAGCGG GGGCTACAGC AACAAGGTTC TGCGCTGGGG AATATTAAGA AAAGGTTACG GCGAAACACC GCAGGCAGAT CCCGGGGCAC CGGAGCTTTT GGCAAAATAC GGGGGCGTCT ATTTGGGTGA TACCTCCAAA AAGGTAATAT ACCTTACTTT TGACGAAGGA TATGAAAACG GCTACACTTC AAAGATTCTT GACGTTCTTC GGGACAACAA TGTGAAAGCT GTATTTTTCA TAACAGGGCC TTATTTAAAT GAGCATGAGG ATTTGGTGAG AAGAATGGTA GAAGAGGGTC ATGTGGTGGG AAATCACACG GTACATCATC CCAGCCTTCC GAGCCTTTCA GACAAAGAAC TTGAAGAGGA AATCCTGGGG CTTGACAGAG CCTTTCATGA AAAATTCGGT ATCAACATGA AATTTTTAAG ACCGCCGAAG GGCGAATACA GCGAAAGAAC CCTTGCCATA ACGCAGAAGT TGGGCTATAC CAATCTGTTT TGGAGTTTTG CCTATGATGA CTGGCACAGG GATAAAATAA GAGGTCCGCA ATATGCTTAC GATAAGGTTA TGAACAATCT TCACAACGGT GCGGTGCTGC TTTTGCATGC AGTGTCAAAG GACAATGCGG ATGCCCTTGA TATGATTATC AAGGGAGCCA GGGAAAGAGG ATTTGAGTTT GGAGACGTAA ATGACCTTAT ACTTGAGAAA TGA
|
Protein sequence | MIKRRVIRGA IVLTIMLALL LSIRFLATRE QENLKLPDMG GVFSVRLENL DFTTEITGML SDGVIDEVPY EEYEDVFAEV EAIPVSGGYS NKVLRWGILR KGYGETPQAD PGAPELLAKY GGVYLGDTSK KVIYLTFDEG YENGYTSKIL DVLRDNNVKA VFFITGPYLN EHEDLVRRMV EEGHVVGNHT VHHPSLPSLS DKELEEEILG LDRAFHEKFG INMKFLRPPK GEYSERTLAI TQKLGYTNLF WSFAYDDWHR DKIRGPQYAY DKVMNNLHNG AVLLLHAVSK DNADALDMII KGARERGFEF GDVNDLILEK
|
| |