Gene Mlut_20010 details

Gene Information

Locus tag: Mlut_20010
Symbol:
ID: 7985211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Micrococcus luteus NCTC 2665
Kingdom: Bacteria
Replicon accession: NC_012803
Strand:
Start bp: 2159053
End bp: 2160219
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 69%
IMG OID: 644806941
Product: 3,4-dihydroxyphenylacetate 2,3-dioxygenase
Protein accession: YP_002958029
Protein GI: 239918471
COG category:
COG ID:
TIGRFAM ID: [TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase
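
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
# Values copied from the gene record; the helpers below are illustrative.
start_bp = 2159053
end_bp = 2160219
gene_length_bp = 1167
protein_length_aa = 388


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```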


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCAACC TCGAGAGCCG CCAGAAGACC TCCTCCGGCT TCTTCGTCAC GAAGGAAGCC 
CCCATCGACC CGGAGAACCC CCTCCCCACC CCGACGTCCG AGGCGCCGGA CATCCTGCGC
TGCGCCTCGA TGGAGCTCGT GGTCACGGAC CTCGCCGCCT CCCGCGAGTT CTACGTGGAC
GTGCTCGGCC TCGTGGTCAC CGAGGAGGAC GAGGAGACCG TCTACCTGCG CTCCATGGAG
GAGTTCATCC ACCACAACCT GGTGCTGCGC CAGGGCGAGA CCGCCGCCGT GGCCGCGTTC
TCCTACCGCG TGCGCACCCC CGAGGACCTG GACAAGGCGG TCGCCTTCTA CGAGGAGCTG
GGCTGCCGTG TCGAGCGCCG GAGGGACGGC TTCGTCAAGG GCGTCGGCGA CTCCGTGCGC
GTCGAGGACC CGCTGGGCTT CCCCTACGAG TTCTTCCACG ACGTCGAGCA CGTGGAGCGC
CTGGCCTGGC GCTATGACCT CTACACCCCG GGCGCCCTGG TCCGCCTGGA CCACTTCAAC
CAGGTGACCC CGGATGTGCC GCGCGCCGCC CGCTACATGC AGGACCTGGG CTTCCGCGTC
ACCGAGGACA TCCAGGACGA GAAGGGCACC GTCTACGCCG CGTGGATGCG CCGCAAGCCC
ACCGTGCACG ACACCGCCAT GACCGGCGGC GACGGCCCCC GGATGCACCA CGTGTGCTTC
GCCACGCATG AGAAGCACAA CATCCTGGCG ATCTGCGACA AGCTCGGCGC CCTGCGCATG
TCCGACCACA TCGAGCGCGG CCCGGGACGC CACGGCGTCT CCAACGCGTT CTACCTCTAC
CTGCGCGACC CGGACGGCCA CCGCGTGGAG GTCTACACCC AGGACTACTA CACCGGTGAC
CCGGACAACC CCGTGGTCAC CTGGGACGTG CACGACAACC AGCGTCGCGA CTGGTGGGGC
ACCCCGGTCG TGCCGTCCTG GTACACCGAC GCCTCCCGCG TGCTGGACCT CGACGGCAAC
CTGCAGCCGC TGGTCTCCCG CACCGACGAC TCGGAGATGG CCGTGACCAT CGGCGCCGAC
GGCTTCTCCT ACACCCGCGA GGAGCAGGGC GAGGAGTCCC TGCCGGAGTG GAAGCAGGGC
GAGTACAAGC TCGGCAACCA GCTCTGA
 
Protein sequence
MTNLESRQKT SSGFFVTKEA PIDPENPLPT PTSEAPDILR CASMELVVTD LAASREFYVD 
VLGLVVTEED EETVYLRSME EFIHHNLVLR QGETAAVAAF SYRVRTPEDL DKAVAFYEEL
GCRVERRRDG FVKGVGDSVR VEDPLGFPYE FFHDVEHVER LAWRYDLYTP GALVRLDHFN
QVTPDVPRAA RYMQDLGFRV TEDIQDEKGT VYAAWMRRKP TVHDTAMTGG DGPRMHHVCF
ATHEKHNILA ICDKLGALRM SDHIERGPGR HGVSNAFYLY LRDPDGHRVE VYTQDYYTGD
PDNPVVTWDV HDNQRRDWWG TPVVPSWYTD ASRVLDLDGN LQPLVSRTDD SEMAVTIGAD
GFSYTREEQG EESLPEWKQG EYKLGNQL
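
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method the database itself uses. It assumes the sequence is pasted as space-separated 10-base blocks as shown above, and it uses the standard codon assignments, which translation table 11 shares (table 11 differs in its permitted start codons). The names `clean`, `gc_content`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGACCAACC TCGAGAGCCG CCAGAAGACC TCCTCCGGCT TCTTCGTCAC GAAGGAAGCC"
print(translate(first_line))  # MTNLESRQKTSSGFFVTKEA
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content field listed in the record.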