Gene Moth_0946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0946 
Symbol 
ID3832831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp978236 
End bp979177 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content63% 
IMG OID637828876 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_429805 
Protein GI83589796 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID[TIGR03151] putative enoyl-(acyl-carrier-protein) reductase II 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.00349563 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGGA CACCCCTCTG CGATCTCCTG GGGATTACTT ATCCCATTAT TCAGGGCGGT 
ATGGCCTGGG TAGCAACGGG AGAGCTGGCG GCCGCTGTTT CGGCTGCCGG GGGACTGGGA
ATTATCGGCG CCGGCAGTGC GCCGCCGGAT GTAGTTCGCC GGGAGATTCG CAAGGTACGG
GAACAAACGG ACCGGCCCTT CGGGGTTAAT ATTTACTATC TATCACCTTA TGTCGAAGAA
TTGGTTGATC TGGTATGCGA GGAAAGGGTG CCGGTGGTCA CCACCGGGGC CGGTAATCCG
GGCAAGCACC TGCCCCGTTT CAAGGAGGCA GGGGTGAAGG TAATTCCGGT GGTAGCCTCG
GTGGCCCTGG CGAAGAGGTT GGAGCGCCTG GGAGTGGACG CCCTGGTGGC CGAAGGCATG
GAATGCGGCG GCCATATTGG AGAGATTGCC ACCATGCCCC TGGTGCCCCA GATTGTCGAT
GCCGTACATA TCCCGGTGAT TGCTGCCGGC GGTATTGCCG ACGGACGCGG CCTGGCCGCC
GCCCTGGCCC TGGGGGCCGC AGGCATCCAG ATGGGGACCA GGTTTATCTG CGCCACCGAG
TGTACCGTCC ACGCCAACTA TAAAGAAGCG GTCCTCAAAG CCGGGGACCG GGACGCCGTC
GTTACCGGTA TGGCCGGGCA CTATGTCCGG GTACTAAAGA ACAAGCTGAC CAGGCAGTTT
GAGGAACTTT CCGCCCGGGG AGCGAGCTGG GAGGAGATGG ACCGCCTGGG AACCGGGAAG
CTGCGGGCGG CGGCAGTCGA TGGCGATGTG GAGTACGGTT CAGTAATGGC CGGCCAGAGC
GCGGCCATGG TGCGGGAAAT CAAGCCGGCA GCAGCCATCA TTGCGGAAAT CATGGCCGAG
GCTGCTGAGG TTATAGCCCG GCTGGGCGCC TTGACAGGGT AG
 
Protein sequence
MLRTPLCDLL GITYPIIQGG MAWVATGELA AAVSAAGGLG IIGAGSAPPD VVRREIRKVR 
EQTDRPFGVN IYYLSPYVEE LVDLVCEERV PVVTTGAGNP GKHLPRFKEA GVKVIPVVAS
VALAKRLERL GVDALVAEGM ECGGHIGEIA TMPLVPQIVD AVHIPVIAAG GIADGRGLAA
ALALGAAGIQ MGTRFICATE CTVHANYKEA VLKAGDRDAV VTGMAGHYVR VLKNKLTRQF
EELSARGASW EEMDRLGTGK LRAAAVDGDV EYGSVMAGQS AAMVREIKPA AAIIAEIMAE
AAEVIARLGA LTG