Gene Mmwyl1_2738 details

Gene Information

Locus tag: Mmwyl1_2738
Symbol:
ID: 5367012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Marinomonas sp. MWYL1
Kingdom: Bacteria
Replicon accession: NC_009654
Strand:
Start bp: 3099958
End bp: 3101283
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 45%
IMG OID: 640805113
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_001341586
Protein GI: 152996751
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
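
The coordinate and length fields above agree with one another, and a short sketch can make the arithmetic explicit. The Python below is illustrative only. It assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions fit the numbers in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3099958, 3101283)
print(length)                  # 1326
print(protein_length(length))  # 441
```

Under these assumptions, 1326 bp corresponds to 442 codons, one of which is the stop codon, leaving the 441 residues reported above.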


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTATC TCAGCGGCTT TAGAAACTAT CATTCGACCG AGGCGTTAGC GGGTGTTTTG 
CCTATTGGAC AGAACAGTCC TCAGCGCTGC GCATTTGGTT TATATGCAGA ACAGATAAGT
GGCTCTGCAT TTACGGCACC CCAAGGACAT AATCTACGTA GCTGGATGTA CCGCATTAGA
CCCAGTGTTG CGCATCGAGC TGACGGAGAC TTGGTTGAGT TCGTTAACTG GGTATCTCGA
TCAGACGATA GCAATACGCA AATTGCCTTT GATCCGCTTA GATGGAATCC ATTAGCGACC
GTAGAAGGGC ATTGGATTGA GTCTGTTCGC ACCTTAACTA TCGCAGGGTC TGCAAGCTCG
CAAATTGGTA TGTCAGCGAG TATTGCTACA TTGCATGAGC AGCAACAGCC TGTATTACAA
AACCATGATG CAGAAATGTT GGTCATGCCG ATCAGTTCGT CTATTAGCTT GCGCACAGAG
TTTGGCGTTT TGGAAATAGG TGTTGGCTCG CTTGGTGTTA TTCCACGTTC TGCATTTGTT
GAGTTAAGCA CTGAAAATAT CACCCAAATT TACTTGTTAG AAAATTATGG TTCGCCTTTT
GAATTGCCTA ATCGTGGCGC CATAGGTGCG AATGGTTTGG CGAATGAACG TGACTTTGAA
TATCCAACGG CGAGCTTTGT AGAGCAACAA ACAGCAACGT GGGTGATTGA AAAAAGAGAA
GGGCGCTTTC GTGAATTCAC CCTTGAACAT TCGCCATTTG ATGTGCTCGG TTGGCATGGA
AATCATGCTC CTTATCGATA TGATTTGAGA CGCTTTAATA CTCTTGGCTC GATAAGCTAT
GATCATCCTG ATCCTTCGAT CTTTACGGTT CTGACGTCAC CAAGCCAAAC TGCCGGTGTG
GCAAACATAG ATGTGGTGGT ATTTGGTGAT CGTTGGCTGG TGGCTGAAAA TACTTTTCGT
CCACCTTGGT ATCATCGTAA TGTTATGGCT GAGTTTATGG GATTAATTTA CGGTACGTAC
GACGCCAAAC CGAATGGCTT TTTGCCTGGT GGTGCTAGTC TGCATAATGC CGGTTCGCCA
CATGGTCCTG ATTATGATGC GTTTTCAAAA GCATCGGTGT CAGAATTGTC TCCCCATAAA
TTATCTGGAA CACTGGCGTT TATGTTTGAA ACCAGTGCAC CCCAGCGAAT TACCCACTTT
GCAGCGAATG CGACTGAACG TCAAAAAGAC TATGTGGACT GTTGGAAAGA CATTCAACCA
GCTAGCCAAA TTATTAAAGA GTTCAACGTA CCGGATAACA AAAGTACGGA CAGCCACAAT
GGGTAA
 
Protein sequence
MSYLSGFRNY HSTEALAGVL PIGQNSPQRC AFGLYAEQIS GSAFTAPQGH NLRSWMYRIR 
PSVAHRADGD LVEFVNWVSR SDDSNTQIAF DPLRWNPLAT VEGHWIESVR TLTIAGSASS
QIGMSASIAT LHEQQQPVLQ NHDAEMLVMP ISSSISLRTE FGVLEIGVGS LGVIPRSAFV
ELSTENITQI YLLENYGSPF ELPNRGAIGA NGLANERDFE YPTASFVEQQ TATWVIEKRE
GRFREFTLEH SPFDVLGWHG NHAPYRYDLR RFNTLGSISY DHPDPSIFTV LTSPSQTAGV
ANIDVVVFGD RWLVAENTFR PPWYHRNVMA EFMGLIYGTY DAKPNGFLPG GASLHNAGSP
HGPDYDAFSK ASVSELSPHK LSGTLAFMFE TSAPQRITHF AANATERQKD YVDCWKDIQP
ASQIIKEFNV PDNKSTDSHN G
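
The relationship between the gene sequence and the protein sequence can be spot-checked with a small translation sketch. The Python below is a minimal illustration, not the pipeline that produced this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first 30 and last 18 nucleotides of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 nt of the gene sequence: matches the start of the protein sequence.
print(translate("ATGTCTTATC TCAGCGGCTT TAGAAACTAT"))  # MSYLSGFRNY
# Last 18 nt of the gene sequence: matches the end of the protein sequence.
print(translate("GACAGCCACAAT GGGTAA"))               # DSHNG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field listed in this record.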