Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1782 |
Symbol | |
ID | 5713350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1851151 |
End bp | 1852302 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641267701 |
Product | putative ring-hydroxylating dioxygenase subunit alpha |
Protein accession | YP_001533125 |
Protein GI | 159044331 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.218178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGACA CCGCCGCAGA GACCATGCCA GAGCCTCCCG CCGCCCCGGC GCGCGGCCTG CCCAACGCTC ATTACACAGA CCCAGACATG CACAGGGCCG AGCAGCACGC GCTGCTCGCC GCGCAGTGGG CCGGGCTCGC AGTGGCTTCG GACGTGCCCG CCCCCGGCGA TGCCTTGCCC CTGACCTTCT GCGGCGAGCC GCTCTTGCTG CTGCGCGACC GCGCAGGCCA GGTGCGCGTG TTCTACAATG TCTGCCGCCA CCGGGGCATG ATCCTTGTCG ACGCCCCCCG CCGGATCGAG GGCGCGATCC GCTGCCCCTA TCATTCGTGG TGCTACGCCA CCGATGGACG CCTCGTCACC ACCCCCCATG TGGGCGGGCC GGGCCGCAAC ACCCATGCCC GGATCGACCG CGCCACCCTC GGGCTGATCG AGGTGCCCTC CCATGTCTGG CGCGATGTGG TCTTTGTGAA CCTGTCGGGC ACCGCCCCGC CCTTCGCCGA GGCCAATGCC GACCTCATCG CCCGCTGGGC CGCCCTGGAC CAGCCCCTGC ACCATGGCGG AGCAGACAGC CGCTTCACCC TGACCGTGCG ATGCAACTGG AAACTCGCGG TGGAAAACTA TTGCGAAAGC TATCATCTGC CCTGGGTTCA CCCCGGCCTG AACGAGATCT CGCGGCTCGA AGACCACTAC AACATCGAGA CCCCCGGCGC TTTCTCCGGC CAGGGGACCG AGGTCTACCA ACGGCCCCGC CGCCCCGACG GCACCCCGGC CTTCCCGGAT TTTACCGGCC TGCCTGCCAT GTGGGACACC GGGGCGGAAT ACGTGGCGCT CTATCCCAAC GTTCTGCTCG GCGCGCACCG GGATCACGCC TTCGCCATCC TGCTCCACCC CGATGGTCCG ACCCGCACCC GGGAGCATGT GCACCTCTAC TATGCCGCGC CCGACACCGA CCCGGCCGCC CGCGCCGCCA ATGCCGCCCA GTGGAAAACC GTGTTCGAAG AGGACGTCTT CGTGGTCGAG GGGATGCAGA GCGGCCGCGC TGCGCGCGGG TTCGACGGCG GCACCTTCTC CCCGGTCATG GACGGGCCGA CCCGGTGCTT CCACGCCTGG GCCGCCGCCG CGCTGTCCGA GACGTCCCGT GCGGCCGAGT GA
|
Protein sequence | MPDTAAETMP EPPAAPARGL PNAHYTDPDM HRAEQHALLA AQWAGLAVAS DVPAPGDALP LTFCGEPLLL LRDRAGQVRV FYNVCRHRGM ILVDAPRRIE GAIRCPYHSW CYATDGRLVT TPHVGGPGRN THARIDRATL GLIEVPSHVW RDVVFVNLSG TAPPFAEANA DLIARWAALD QPLHHGGADS RFTLTVRCNW KLAVENYCES YHLPWVHPGL NEISRLEDHY NIETPGAFSG QGTEVYQRPR RPDGTPAFPD FTGLPAMWDT GAEYVALYPN VLLGAHRDHA FAILLHPDGP TRTREHVHLY YAAPDTDPAA RAANAAQWKT VFEEDVFVVE GMQSGRAARG FDGGTFSPVM DGPTRCFHAW AAAALSETSR AAE
|
| |