Gene M446_2532 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_2532 
Symbol 
ID6134688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp2806980 
End bp2808341 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content69% 
IMG OID641642744 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001769409 
Protein GI170740754 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGC ACGGCCGTCT CGATCCCGCG GGCACGATTG CCGCGGCCCA GATCCACCCC 
GGCTACATGT CAGGCTTCGG CAACGGCTTC GAGACCGAGG CACTGCCGGG CGCGCTGCCG
ATCGGACGCA ACTCGCCGCA GATTTGCCCC TACGGCCTCT ACGCCGAGCA GCTCTCGGGC
TCACCCTTCA CGGCGCCCCG CACCGCCAAC GAGCGCTCCT GGCTCTACCG CATCCGCCCC
ACGGTGATGC ACTGGGGCGA CTTCCGAAAA GTCGACGCTC GCCTCTGGCT TACGGCACCG
GCGGAGCTGG TCGACCTGCC GCCCGCGCCC CTGCGCTGGG ACCCGGTGGC CATCCCCTCG
GAACCCCTGT CCTTCGTGGA GGGCATCTGC ACCGTGACCA CGGCGGGCGA CGCCGGTGCC
CAGGCAGGTA TGGGCGCCCA CTTCTACTTC GCCACCCGCT CGATGCCGGA CGAGTACTTC
TCCAACGCGG ACGGTGAGAT GCTGGTCGTG CCGCAGGAGG GAGCGCTCCG CTGGCGCACC
GAGTTCGGGA TCATCGACGT CGAACCGGGC GAGGTCTGCG TGATCCCGCG CGGGGTGAAG
GTCGCGGTCG ACCTCCTCGG CGGGCCGGCG CGGGGCTACG TCTGCGAGAA TTACGGCGGC
GCCTTCACAC TGCCGGAGCG CGGCCCGATC GGCGCCAACT GCCTCGCCAA TCAGCGCGAC
TTCCTCACCC CGGTCGCCGC CTACGAGGAC CGCGACGCGC CCGGGACCAT GCTGGTGAAG
TGGGGCGGCG CGCTCTGGGC GGCGGAGATC GACCACTCGC CCCTCGACGT GGTCGCCTGG
CACGGCAACT ACGCGCCCTA CAAGTACGAC CTGCGCAAGT TCTCGCCGGT TGGGCCGATC
CTGTTCGACC ACGCCGACCC GTCGATCTTC ACGGTGCTGA CCTCGCCCTC CGAGACGCCC
GGCACCGCCA ACATCGACTT CGTGATCTTC TCCGACCGCT GGCTCGTGGC GGAGAACACC
TTCCGGCCGC CCTGGTACCA CCTCAACGTG ATGAGCGAGT TCATGGGGCT GGTCTACGGG
GTCTACGACG CTAAGACCGG CGGCGGCTTC CGGCCGGGCG GGGCCTCGCT GCACAACACG
TTGCTGCCGC ACGGGCCGGA CGTTGACGCC TTCGAGAGGG CGTCGAACGT CGATCTCAAG
CCGCACAAGT TGGAGGGCAC GCTCGCCTTC ATGTTCGAGA CGCGCTTTCC CCAGAAGGTC
AGCCGCTTCG CCGCTGAGAC GCCCGCCCGG CAGAAGGACT ACGCCGCTTA CGGGCGCAAG
CTCGCCAAGC ACTTCGACCC GAACCGTGCC GAGGCGCGGT GA
 
Protein sequence
MNQHGRLDPA GTIAAAQIHP GYMSGFGNGF ETEALPGALP IGRNSPQICP YGLYAEQLSG 
SPFTAPRTAN ERSWLYRIRP TVMHWGDFRK VDARLWLTAP AELVDLPPAP LRWDPVAIPS
EPLSFVEGIC TVTTAGDAGA QAGMGAHFYF ATRSMPDEYF SNADGEMLVV PQEGALRWRT
EFGIIDVEPG EVCVIPRGVK VAVDLLGGPA RGYVCENYGG AFTLPERGPI GANCLANQRD
FLTPVAAYED RDAPGTMLVK WGGALWAAEI DHSPLDVVAW HGNYAPYKYD LRKFSPVGPI
LFDHADPSIF TVLTSPSETP GTANIDFVIF SDRWLVAENT FRPPWYHLNV MSEFMGLVYG
VYDAKTGGGF RPGGASLHNT LLPHGPDVDA FERASNVDLK PHKLEGTLAF MFETRFPQKV
SRFAAETPAR QKDYAAYGRK LAKHFDPNRA EAR