Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0739 |
Symbol | |
ID | 3831131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 773171 |
End bp | 774145 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637828670 |
Product | polysaccharide deacetylase |
Protein accession | YP_429600 |
Protein GI | 83589591 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0726] Predicted xylanase/chitin deacetylase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000000774609 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGG TAGTTAAATA TGTGGCGGAG TGCTGGTCCT GGGGAAACCG GGTGTTCAGG CAACCCTGGC GGCCGGATTG GCCGGTAATT ATCCTGCGCC TGGAGCAATT TGCAACCCTG GTGGGGCTGG TGGCAGTTAT CATGACCGCC GGCGGTGACA GCCTGGTCCG GTCTGCCGGG AAGCCGGGGG TTCATTTCCC GGCCGCGCCA ACACCACCCG GCCAGTACCT GGAGGCGGTA AAACAGCCGG TAATGACCAT TTCTCCGGTT GTCCCGGCGG TGGCAAAAGG GGAAACGGCT TTCACCCCGG GTGACCAGCC TGGGGGGTGG GGGAGCAGCC CGCCGGGTGC AGCCCCGCCC CGGCAGGACC TCCAGCCCTT ACGCCAGGTA GCTGGTGCCG GCCACCGGGT GGCCATTACC TTTGACGACG GGCCCTCGCC AGGCTGGACG GATCGTTACT TGAAGGTCCT GGCGGCCATG GGGACCAGGG CCACCTTTTT CATGGTCGGC AGCCAGGCGG TAGCCCACCC GGATCTGGTC AAGGCCGTTC TGGCCGGCAA TAACGAGGTG GCCAGCCACT CCTGGCGTCA TGCCAACCTG AGCATGGTTT CCCGGGAAGC GGCCCGGGAG GACTTAAGCC AGACCGCCAG CGCCCTGGCC GCCATTACGG GCCAGAAGGT CAAGTATTTC CGGCCTCCCT ATGGCGCTAT GGGGCCTAAC CTCCTGGCTG CGGCCGGGGA CGTGGGTGAG AAGACCGTCA CCTGGAGCGT CGATCCCCGG GACTGGTCCA ATCCCGGCCC CCAGGCGATC ATCCAGCGGG TTATGGCCAA TGTCCGCGAC GGCAGCATTA TCCTTCTCCA TGAGGCCCAC CCCGGTACCC TGGTCGCCCT GCCGATACTC ATTAAAGAAC TCCGTGACCG GGGCTATGAA ATAGTGACCG TATCGGAACT TATTGCCGCC GGTAAAATCC CCTAA
|
Protein sequence | MSQVVKYVAE CWSWGNRVFR QPWRPDWPVI ILRLEQFATL VGLVAVIMTA GGDSLVRSAG KPGVHFPAAP TPPGQYLEAV KQPVMTISPV VPAVAKGETA FTPGDQPGGW GSSPPGAAPP RQDLQPLRQV AGAGHRVAIT FDDGPSPGWT DRYLKVLAAM GTRATFFMVG SQAVAHPDLV KAVLAGNNEV ASHSWRHANL SMVSREAARE DLSQTASALA AITGQKVKYF RPPYGAMGPN LLAAAGDVGE KTVTWSVDPR DWSNPGPQAI IQRVMANVRD GSIILLHEAH PGTLVALPIL IKELRDRGYE IVTVSELIAA GKIP
|
| |