Gene Moth_1858 details


Gene Information

Locus tag: Moth_1858
Symbol:
ID: 3831489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1919681
End bp: 1920724
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 56%
IMG OID: 637829790
Product: polysaccharide deacetylase
Protein accession: YP_430701
Protein GI: 83590692
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0726] Predicted xylanase/chitin deacetylase
TIGRFAM ID:
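
The coordinates and lengths in this record agree with each other if the start and end positions are read as 1-based and inclusive, and if the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions about how the record is encoded, not statements taken from it. A minimal sketch of that arithmetic, with illustrative variable names:

```python
# Values copied from the Gene Information record.
start_bp = 1919681
end_bp = 1920724

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the length should divide evenly by 3.
codon_count, leftover_bases = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and yields no amino acid.
protein_length_aa = codon_count - 1

print(gene_length_bp, leftover_bases, protein_length_aa)
```

Under these assumptions the computed values match the listed Gene Length (1044 bp) and Protein Length (347 aa).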


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000052436
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCCCAAAA AAGCCGCTGC CGTTATTGCT TTAGTCCTGC TCCTGTCATT GTCATTTACA 
TATGTCCAAA TATCTTCCCG CCTGAACCGG AGCCCGGCTG CTACCACTGC CGCTAGCAGC
GATAACGTGA ACCAGCCCGC CCCTGCCCAA CTGGCTACCG CCACTCCCCA TCAAAACGGG
CAGGCCCAGC TGGCCGGGGT ACAGTCCAAT GAAGTTTACT ACCAGAATAA GATCGTCGTC
CTTATCTATC ACCATATTGA TGTTAAAGAA GAACCGGGGT TAGTTATCTC CCCGGAACGT
TTCGCCAGCG AACTGGATAT GCTCCTAGCC AGGGGTTACC ACGTCATCAG CCTGGACCAG
TTGCGCGACT TCTTAAATGG CGGTTCAGTA CCCGATAATG CCGTCCTCAT CACCTTTGAC
GACGGTTATG AGAGTGTCCA CCAGTATGCC TTGCCGGAAT TACAGAAAAG GCATATGCCT
GCTGTAGCCT TTGCCATTGT GAAATACGTC GGGCAAAAGC GGGGCAACCT CCAGTACTAT
AGCTGGGACG GGGCCAGGGA GATGGCCGCC GCTGGCTTTA CCACCCAGTC CCACACCTAT
AATCTCCATG ACTTCGGCCC CCTGGCCAGC GGCAAAAACG GGCCTCTCCT CAACGGCCCC
CTCAAGGGCC AGAGTTTGAG CGACTATAGA AATATGGTCT ACCAGGACCT GAAGCGTTCC
CGGGAAGAAA TAGAAAGCCA TCTCCAGCAG CCGGTCTATG CCCTGGCCCT GCCCTTCGGT
GCCGGCGGCC AGACGGCCAT CCAGGCCGCT GTTGATGCCG GTTTTAAAAT CGTCTTTACC
ACCCATTATG GGGTCGTTAC CCGTCAGAGT AACCCCCTGG CCCTGCCCCG GGTCAACGCC
GGCGGCCCGG CTATTACGCC GGCTAAACTA GATGCCCTCA TCCGGGCTAC TGCCGGGGCA
AGCACCTCTC CCAAGGGGCA ACCCAAAAAG CCACCTACTC CCCAATCCAG AGTAGTGACC
AGTAAAAAGG CGAACCGCAT TTAG
 
Protein sequence
MPKKAAAVIA LVLLLSLSFT YVQISSRLNR SPAATTAASS DNVNQPAPAQ LATATPHQNG 
QAQLAGVQSN EVYYQNKIVV LIYHHIDVKE EPGLVISPER FASELDMLLA RGYHVISLDQ
LRDFLNGGSV PDNAVLITFD DGYESVHQYA LPELQKRHMP AVAFAIVKYV GQKRGNLQYY
SWDGAREMAA AGFTTQSHTY NLHDFGPLAS GKNGPLLNGP LKGQSLSDYR NMVYQDLKRS
REEIESHLQQ PVYALALPFG AGGQTAIQAA VDAGFKIVFT THYGVVTRQS NPLALPRVNA
GGPAITPAKL DALIRATAGA STSPKGQPKK PPTPQSRVVT SKKANRI
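
The protein sequence can be related to the gene sequence by translating it with translation table 11 (the bacterial, archaeal and plant plastid code), in which an initiating TTG codon is read as methionine. The sketch below is a hedged illustration, not the method used to produce this record. The helper names `gc_content` and `translate` are hypothetical, and the codon table is built from the standard NCBI amino acid ordering for table 11.

```python
# Codon table for NCBI translation table 11, built from the standard
# T, C, A, G base ordering. "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Codons that table 11 allows as initiators; each is read as M at the start.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna):
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(dna.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna, cds_start=True):
    """Translate DNA in frame 1, stopping at the first stop codon.

    If cds_start is True and the first codon is a table 11 initiator,
    it is emitted as M regardless of its usual meaning.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and cds_start and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence shown above.
first_line = "TTGCCCAAAA AAGCCGCTGC CGTTATTGCT TTAGTCCTGC TCCTGTCATT GTCATTTACA"
last_line = "AGTAAAAAGG CGAACCGCAT TTAG"

print(translate(first_line))                  # MPKKAAAVIALVLLLSLSFT
print(translate(last_line, cds_start=False))  # SKKANRI
```

The two fragments reproduce the first 20 and last 7 residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the listed GC content.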