Gene Moth_0739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0739 
Symbol 
ID3831131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp773171 
End bp774145 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content64% 
IMG OID637828670 
Productpolysaccharide deacetylase 
Protein accessionYP_429600 
Protein GI83589591 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0726] Predicted xylanase/chitin deacetylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000774609 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGG TAGTTAAATA TGTGGCGGAG TGCTGGTCCT GGGGAAACCG GGTGTTCAGG 
CAACCCTGGC GGCCGGATTG GCCGGTAATT ATCCTGCGCC TGGAGCAATT TGCAACCCTG
GTGGGGCTGG TGGCAGTTAT CATGACCGCC GGCGGTGACA GCCTGGTCCG GTCTGCCGGG
AAGCCGGGGG TTCATTTCCC GGCCGCGCCA ACACCACCCG GCCAGTACCT GGAGGCGGTA
AAACAGCCGG TAATGACCAT TTCTCCGGTT GTCCCGGCGG TGGCAAAAGG GGAAACGGCT
TTCACCCCGG GTGACCAGCC TGGGGGGTGG GGGAGCAGCC CGCCGGGTGC AGCCCCGCCC
CGGCAGGACC TCCAGCCCTT ACGCCAGGTA GCTGGTGCCG GCCACCGGGT GGCCATTACC
TTTGACGACG GGCCCTCGCC AGGCTGGACG GATCGTTACT TGAAGGTCCT GGCGGCCATG
GGGACCAGGG CCACCTTTTT CATGGTCGGC AGCCAGGCGG TAGCCCACCC GGATCTGGTC
AAGGCCGTTC TGGCCGGCAA TAACGAGGTG GCCAGCCACT CCTGGCGTCA TGCCAACCTG
AGCATGGTTT CCCGGGAAGC GGCCCGGGAG GACTTAAGCC AGACCGCCAG CGCCCTGGCC
GCCATTACGG GCCAGAAGGT CAAGTATTTC CGGCCTCCCT ATGGCGCTAT GGGGCCTAAC
CTCCTGGCTG CGGCCGGGGA CGTGGGTGAG AAGACCGTCA CCTGGAGCGT CGATCCCCGG
GACTGGTCCA ATCCCGGCCC CCAGGCGATC ATCCAGCGGG TTATGGCCAA TGTCCGCGAC
GGCAGCATTA TCCTTCTCCA TGAGGCCCAC CCCGGTACCC TGGTCGCCCT GCCGATACTC
ATTAAAGAAC TCCGTGACCG GGGCTATGAA ATAGTGACCG TATCGGAACT TATTGCCGCC
GGTAAAATCC CCTAA
 
Protein sequence
MSQVVKYVAE CWSWGNRVFR QPWRPDWPVI ILRLEQFATL VGLVAVIMTA GGDSLVRSAG 
KPGVHFPAAP TPPGQYLEAV KQPVMTISPV VPAVAKGETA FTPGDQPGGW GSSPPGAAPP
RQDLQPLRQV AGAGHRVAIT FDDGPSPGWT DRYLKVLAAM GTRATFFMVG SQAVAHPDLV
KAVLAGNNEV ASHSWRHANL SMVSREAARE DLSQTASALA AITGQKVKYF RPPYGAMGPN
LLAAAGDVGE KTVTWSVDPR DWSNPGPQAI IQRVMANVRD GSIILLHEAH PGTLVALPIL
IKELRDRGYE IVTVSELIAA GKIP