Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1858 |
Symbol | |
ID | 3831489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1919681 |
End bp | 1920724 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637829790 |
Product | polysaccharide deacetylase |
Protein accession | YP_430701 |
Protein GI | 83590692 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0000052436 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCAAAA AAGCCGCTGC CGTTATTGCT TTAGTCCTGC TCCTGTCATT GTCATTTACA TATGTCCAAA TATCTTCCCG CCTGAACCGG AGCCCGGCTG CTACCACTGC CGCTAGCAGC GATAACGTGA ACCAGCCCGC CCCTGCCCAA CTGGCTACCG CCACTCCCCA TCAAAACGGG CAGGCCCAGC TGGCCGGGGT ACAGTCCAAT GAAGTTTACT ACCAGAATAA GATCGTCGTC CTTATCTATC ACCATATTGA TGTTAAAGAA GAACCGGGGT TAGTTATCTC CCCGGAACGT TTCGCCAGCG AACTGGATAT GCTCCTAGCC AGGGGTTACC ACGTCATCAG CCTGGACCAG TTGCGCGACT TCTTAAATGG CGGTTCAGTA CCCGATAATG CCGTCCTCAT CACCTTTGAC GACGGTTATG AGAGTGTCCA CCAGTATGCC TTGCCGGAAT TACAGAAAAG GCATATGCCT GCTGTAGCCT TTGCCATTGT GAAATACGTC GGGCAAAAGC GGGGCAACCT CCAGTACTAT AGCTGGGACG GGGCCAGGGA GATGGCCGCC GCTGGCTTTA CCACCCAGTC CCACACCTAT AATCTCCATG ACTTCGGCCC CCTGGCCAGC GGCAAAAACG GGCCTCTCCT CAACGGCCCC CTCAAGGGCC AGAGTTTGAG CGACTATAGA AATATGGTCT ACCAGGACCT GAAGCGTTCC CGGGAAGAAA TAGAAAGCCA TCTCCAGCAG CCGGTCTATG CCCTGGCCCT GCCCTTCGGT GCCGGCGGCC AGACGGCCAT CCAGGCCGCT GTTGATGCCG GTTTTAAAAT CGTCTTTACC ACCCATTATG GGGTCGTTAC CCGTCAGAGT AACCCCCTGG CCCTGCCCCG GGTCAACGCC GGCGGCCCGG CTATTACGCC GGCTAAACTA GATGCCCTCA TCCGGGCTAC TGCCGGGGCA AGCACCTCTC CCAAGGGGCA ACCCAAAAAG CCACCTACTC CCCAATCCAG AGTAGTGACC AGTAAAAAGG CGAACCGCAT TTAG
|
Protein sequence | MPKKAAAVIA LVLLLSLSFT YVQISSRLNR SPAATTAASS DNVNQPAPAQ LATATPHQNG QAQLAGVQSN EVYYQNKIVV LIYHHIDVKE EPGLVISPER FASELDMLLA RGYHVISLDQ LRDFLNGGSV PDNAVLITFD DGYESVHQYA LPELQKRHMP AVAFAIVKYV GQKRGNLQYY SWDGAREMAA AGFTTQSHTY NLHDFGPLAS GKNGPLLNGP LKGQSLSDYR NMVYQDLKRS REEIESHLQQ PVYALALPFG AGGQTAIQAA VDAGFKIVFT THYGVVTRQS NPLALPRVNA GGPAITPAKL DALIRATAGA STSPKGQPKK PPTPQSRVVT SKKANRI
|
| |