Gene Dshi_3812 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3812 
Symbol 
ID5714341 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp20139 
End bp21494 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content65% 
IMG OID641276727 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_001542023 
Protein GI159046352 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.283332 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.618849 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCC AAGCGCAAGT CAGCGGTCTG ACCCGGGCGG CGACGCCCAT GGGCACGACC 
GAAGGGTACA TGCCCGGTTT CGGCAATGAT TTCGAGACCG AGGCCCTGCC CGGTGCCCTG
CCGCAGGGCA TGAACAGCCC GCAGAAATGC AATTACGGGC TTTATGGCGA GCAGCTCTCG
GGCACCGCCT TCACCGATGT GCGCCCCGAG CGGACCTGGT GTTACCGGAT CCGGCCCTCG
GTCAAGCATT CCCACCGCTA CCGGCGGGTC GAGCTGCCCT ATTTCCGCTC CGCGCCGGAC
ATCCATCCGG AGGTCACGAG CCTCGGCCAG TACCGCTGGG ACCCGGTCCC GCACACGGAC
GCGCCGCTCA CATGGCTGAC GGGGATGCGC ACCATGACCA CGGCGGGGGA CGTGAACACA
CAAGTTGGCA TGGCCGCCCA CGTTTACCTC GTGACCGAGT CCATGCAGGA CGCCTATTTC
TACTCCGCCG ACAGCGAGAT GCTGGTGGTG CCACAGGAAG GTCGTTTGCG CTTTGCCACC
GAGCTGGGGA TCATCGATCT GGAACCGAAG GAGATCGCGA TCCTGCCGCG CGGCCTTCTC
TACCGGGTGG AACTGCTGGA CGGTCCCGCG CGGGGTTTCG TGTGCGAAAA CTACGGCCAG
AAGTTCGAGC TGCCCGGCCG CGGGCCGATC GGGGCCAACT GCATGGCCAA TCCGCGGGAT
TTCAAGACGC CCGTGGCGGC CTTCGAGGAC CGCGAGGTGC CCTCGACCGT GACGGTGAAA
TGGTGCGGCC AGTTCCACGA GACGCAGATC GGCCAGAGCC CGCTGGACGT GGTGGCCTGG
CACGGCAATT ACGCGCCCTG CAAATACGAT CTGCGCAATT ACTGCCCCGT GGGCGCGATC
CTGTTCGACC ATCCGGACCC GTCGATCTTC ACCGTGCTGA CCGCGCCCTC GGGCCAGCCG
GGCACCGCCA ATATCGACTT CGTGCTGTTC CGCGAGCGCT GGATGGTGGC CGAGGATACC
TTCCGCCCGC CCTGGTATCA CAAGAACATC ATGTCCGAGC TGATGGGCAA CATCTACGGC
CAGTACGATG CCAAGCCGCA GGGCTTCGTG CCCGGCGGGA TCAGCCTGCA CAACATGATG
CTGCCGCATG GCCCCGACCG CGACGCGTTC GAGAAGGCCT CCAACGCCAA TCTGGGCCCC
GACAAGCTCG ACAACACCAT GTCCTTCATG TTCGAGACCC GGTTTCCACA GCACCTGACC
CGCTTTGCCG GCACCGAGGC GCCGCTGCAG GACGACTATA TCGACTGCTG GAAGGACATC
GAAAAGAAGT TCGACGGCAC CCCGGGCAAG AAGTGA
 
Protein sequence
MNTQAQVSGL TRAATPMGTT EGYMPGFGND FETEALPGAL PQGMNSPQKC NYGLYGEQLS 
GTAFTDVRPE RTWCYRIRPS VKHSHRYRRV ELPYFRSAPD IHPEVTSLGQ YRWDPVPHTD
APLTWLTGMR TMTTAGDVNT QVGMAAHVYL VTESMQDAYF YSADSEMLVV PQEGRLRFAT
ELGIIDLEPK EIAILPRGLL YRVELLDGPA RGFVCENYGQ KFELPGRGPI GANCMANPRD
FKTPVAAFED REVPSTVTVK WCGQFHETQI GQSPLDVVAW HGNYAPCKYD LRNYCPVGAI
LFDHPDPSIF TVLTAPSGQP GTANIDFVLF RERWMVAEDT FRPPWYHKNI MSELMGNIYG
QYDAKPQGFV PGGISLHNMM LPHGPDRDAF EKASNANLGP DKLDNTMSFM FETRFPQHLT
RFAGTEAPLQ DDYIDCWKDI EKKFDGTPGK K