Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_3812 |
Symbol | |
ID | 5714341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009956 |
Strand | - |
Start bp | 20139 |
End bp | 21494 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641276727 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001542023 |
Protein GI | 159046352 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.283332 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.618849 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCC AAGCGCAAGT CAGCGGTCTG ACCCGGGCGG CGACGCCCAT GGGCACGACC GAAGGGTACA TGCCCGGTTT CGGCAATGAT TTCGAGACCG AGGCCCTGCC CGGTGCCCTG CCGCAGGGCA TGAACAGCCC GCAGAAATGC AATTACGGGC TTTATGGCGA GCAGCTCTCG GGCACCGCCT TCACCGATGT GCGCCCCGAG CGGACCTGGT GTTACCGGAT CCGGCCCTCG GTCAAGCATT CCCACCGCTA CCGGCGGGTC GAGCTGCCCT ATTTCCGCTC CGCGCCGGAC ATCCATCCGG AGGTCACGAG CCTCGGCCAG TACCGCTGGG ACCCGGTCCC GCACACGGAC GCGCCGCTCA CATGGCTGAC GGGGATGCGC ACCATGACCA CGGCGGGGGA CGTGAACACA CAAGTTGGCA TGGCCGCCCA CGTTTACCTC GTGACCGAGT CCATGCAGGA CGCCTATTTC TACTCCGCCG ACAGCGAGAT GCTGGTGGTG CCACAGGAAG GTCGTTTGCG CTTTGCCACC GAGCTGGGGA TCATCGATCT GGAACCGAAG GAGATCGCGA TCCTGCCGCG CGGCCTTCTC TACCGGGTGG AACTGCTGGA CGGTCCCGCG CGGGGTTTCG TGTGCGAAAA CTACGGCCAG AAGTTCGAGC TGCCCGGCCG CGGGCCGATC GGGGCCAACT GCATGGCCAA TCCGCGGGAT TTCAAGACGC CCGTGGCGGC CTTCGAGGAC CGCGAGGTGC CCTCGACCGT GACGGTGAAA TGGTGCGGCC AGTTCCACGA GACGCAGATC GGCCAGAGCC CGCTGGACGT GGTGGCCTGG CACGGCAATT ACGCGCCCTG CAAATACGAT CTGCGCAATT ACTGCCCCGT GGGCGCGATC CTGTTCGACC ATCCGGACCC GTCGATCTTC ACCGTGCTGA CCGCGCCCTC GGGCCAGCCG GGCACCGCCA ATATCGACTT CGTGCTGTTC CGCGAGCGCT GGATGGTGGC CGAGGATACC TTCCGCCCGC CCTGGTATCA CAAGAACATC ATGTCCGAGC TGATGGGCAA CATCTACGGC CAGTACGATG CCAAGCCGCA GGGCTTCGTG CCCGGCGGGA TCAGCCTGCA CAACATGATG CTGCCGCATG GCCCCGACCG CGACGCGTTC GAGAAGGCCT CCAACGCCAA TCTGGGCCCC GACAAGCTCG ACAACACCAT GTCCTTCATG TTCGAGACCC GGTTTCCACA GCACCTGACC CGCTTTGCCG GCACCGAGGC GCCGCTGCAG GACGACTATA TCGACTGCTG GAAGGACATC GAAAAGAAGT TCGACGGCAC CCCGGGCAAG AAGTGA
|
Protein sequence | MNTQAQVSGL TRAATPMGTT EGYMPGFGND FETEALPGAL PQGMNSPQKC NYGLYGEQLS GTAFTDVRPE RTWCYRIRPS VKHSHRYRRV ELPYFRSAPD IHPEVTSLGQ YRWDPVPHTD APLTWLTGMR TMTTAGDVNT QVGMAAHVYL VTESMQDAYF YSADSEMLVV PQEGRLRFAT ELGIIDLEPK EIAILPRGLL YRVELLDGPA RGFVCENYGQ KFELPGRGPI GANCMANPRD FKTPVAAFED REVPSTVTVK WCGQFHETQI GQSPLDVVAW HGNYAPCKYD LRNYCPVGAI LFDHPDPSIF TVLTAPSGQP GTANIDFVLF RERWMVAEDT FRPPWYHKNI MSELMGNIYG QYDAKPQGFV PGGISLHNMM LPHGPDRDAF EKASNANLGP DKLDNTMSFM FETRFPQHLT RFAGTEAPLQ DDYIDCWKDI EKKFDGTPGK K
|
| |