Gene Mnod_1193 details


Gene Information

Locus tag: Mnod_1193
Symbol:
ID: 7308633
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 1263034
End bp: 1264395
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 69%
IMG OID: 643598939
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_002496501
Protein GI: 220921200
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
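
The start, end, gene length, and protein length fields above are arithmetically linked. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (both are assumptions, though they agree with the values in this record). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: total codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 1263034, 1264395
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1362 453"
```

Both results match the reported Gene Length (1362 bp) and Protein Length (453 aa).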


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.814651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCACC ATCCCTCGCT GGAGGCTGCG GCGAAAAACT CGTCGGCGGC CCTCCATTCC 
GGCTACATGT CGGGCTTCGG CAACGGCTTC GAGACCGAGG CGCTGCCCGG CGCGCTGCCG
ATCGGCCGCA ACTCGCCCCA GAAATGCCCC TATGGGCTCT ACGCCGAGCA GCTCTCGGGC
TCGCCCTTCA CGGCGCCGCG CACCACCAAC GAGCGCTCCT GGCTCTACCG CATCCGCCCG
ACCGTGATGC ATTGGGGCGC CTTCGCCAAG GCCGAGATCG GGCTGTGGCG CACCGCGCCG
GCTGAGGTGG TCGAGCTGCC GATCGCGCCC CTGCGCTGGG ACCCGATCCC GATCCCCTCC
GAGCCGCTCT CCTTCGTCGA GGGCATCCGC ACCATGACTA CGGCCGGGGA CGCCGGGTCC
CAGGCCGGCA TGGGCGCGCA TCTCTACTTC GCCACCCGCT CGATGCGGGA CGAGTACTTC
TACAACGCCG ACGGAGAGAT GCTGGTCGTG CCCCAGCAGG GGGCCTTGCG CTTCTGCACC
GAGTTCGGGG TGATCGACAT CGAGCCCGGC GAGATCGCGG TGATCCCGCG CGGGGTGAAG
ATCCGGGTCG AGCTCCCCGG CGGGCCGGCC CGCGGCTATC TCTGCGAGAA TTACGGCGGC
GCCTTCACGC TGCCCGAGCG CGGCCCGATC GGCGCCAATT GCCTCGCCAA CCAGCGCGAC
TTCCTCACCC CGGTCGCGGC CTACGAGGAC CGCGACGGCC CCGCCACCAT GCTGGTGAAG
TGGGGCGGGA GCCTGTGGGC GGCGACGATC GACCACTCGC CCCTCGACGT GGTCGCCTGG
CACGGCAACT ACGCGCCCTA CAAGTACGAC CTGCGCAAGT ACTCGCCGGT CGGGCCGATC
CTGTTCGACC ATGCCGACCC GTCGATCTTC ACGGTGCTGA CCTCGCCCTC GGAGACGCCC
GGCACCGCCA ACATCGATTT CGTGCTGTTC TCCGACCGCT GGCTGGTGGC CGAGAACACG
TTCCGGCCGC CCTGGTATCA CCTGAACGTG ATGAGCGAGT TCATGGGGCT GGTCTACGGG
GTCTACGACG CCAAGACCGG CGGCGGCTTC CAGCCCGGCG GGGTCTCGCT GCACAACACC
CTGCTGCCGC ACGGGCCGGA CGTGGACGCC TTCGAGCGCG CCTCGAACGC CGAGCTCAAG
CCGCACAAGC TCGAGGGCAC GCTCGCCTTC ATGTTCGAGA CCCGCTTCCC CCAGAAGGTC
AGCCGCTTCG CGGCCGAGCA TCCGGCCCTG CAGAAGGACT ACGCAGGCTA CGGGCGCAAG
CTCGCCAAGC ATTTCGATCC GCGCCGGCCA GAGGCTCGCT GA
 
Protein sequence
MTHHPSLEAA AKNSSAALHS GYMSGFGNGF ETEALPGALP IGRNSPQKCP YGLYAEQLSG 
SPFTAPRTTN ERSWLYRIRP TVMHWGAFAK AEIGLWRTAP AEVVELPIAP LRWDPIPIPS
EPLSFVEGIR TMTTAGDAGS QAGMGAHLYF ATRSMRDEYF YNADGEMLVV PQQGALRFCT
EFGVIDIEPG EIAVIPRGVK IRVELPGGPA RGYLCENYGG AFTLPERGPI GANCLANQRD
FLTPVAAYED RDGPATMLVK WGGSLWAATI DHSPLDVVAW HGNYAPYKYD LRKYSPVGPI
LFDHADPSIF TVLTSPSETP GTANIDFVLF SDRWLVAENT FRPPWYHLNV MSEFMGLVYG
VYDAKTGGGF QPGGVSLHNT LLPHGPDVDA FERASNAELK PHKLEGTLAF MFETRFPQKV
SRFAAEHPAL QKDYAGYGRK LAKHFDPRRP EAR
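
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to rejoin those blocks, compute a GC fraction, and translate the coding sequence. It assumes the elongation codon assignments of NCBI translation table 11, which are identical to the standard code; table 11 differs only in its set of permitted start codons, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first line of the gene sequence is used to keep the example short.

```python
# Codon table built from the conventional TCAG ordering (NCBI tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGACCCACC ATCCCTCGCT GGAGGCTGCG GCGAAAAACT CGTCGGCGGC CCTCCATTCC"
dna = clean(first_line)
print(translate(dna))  # prints "MTHHPSLEAAAKNSSAALHS"
print(f"{gc_fraction(dna):.2f}")
```

The translated fragment matches the first two blocks of the protein sequence above. Passing the full gene sequence through the same functions would allow the reported GC content and protein sequence to be checked.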