Gene Dshi_0847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_0847 
SymbolpyrD2 
ID5711473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp856488 
End bp857546 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content72% 
IMG OID641266756 
Productdihydroorotate dehydrogenase 2 
Protein accessionYP_001532193 
Protein GI159043399 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01036] dihydroorotate dehydrogenase, subfamily 2 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.211753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.196757 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGC TCGAACGCCT CGGCCTCGCG GCCTTGCAGC GAGTCGATCC CGAAACCGCC 
CATGGCCTGG CGCTGCGCGC GCTGAACGCA GGGCTCGGGC CCCGTTCCGG CCCGGTCACG
AGCCCGCGGC TCAGCACCCA ACTGGCCGGG CTGCGCCTGC CCAACCCCGT GGGGCTCGCC
GCCGGGTTCG ACAAGAATGC CGAAACGCTC GGGGCGCTGG CGCAGACCGG CTTCGGGTTT
CTCGAGGTCG GCGCCGCCAC GCCCCTGCCC CAGCCCGGCA ACCCGCGTCC GCGCCTGTTT
CGCCTCTCCG AGGACCGCGC CGCGATCAAC CGGTTCGGCT TCAACAATGA CGGGGCCGAG
GCGATCGCCG CGCGGCTGGC CAGGCGTCCC GAAGGTCGGG TGGTCGGCCT GAACCTCGGC
GCCAACAAGA CCAGCGCGGA CCGGGCCGGG GATTTCGCCC GGGTGCTCGC CACCTGCGGC
GCCCATGTGG ATTTCGCGAC GGTCAACGTT TCGTCGCCCA ACACCGAAAA GCTGCGCGAC
CTGCAAGGCG CCGCCGCCCT GCGCGCGCTG CTGGAGGGGG TGATGGCCGC CCGTGCCGCC
CTCGTCCGCC CGATCCCGGT GTTCCTCAAG ATCGCCCCGG ACATGGACGA CGCCGCCCTG
GACGACATCG CCGGCGTAGT GACGGAGGCG GGCCTGCACG GCATCATCGC CACCAACACA
ACGCTGGCGC GCGACGGGCT CGTCTCGGCC CACAAGGGCG AGGCCGGAGG CCTGTCCGGC
GCACCGCTCT TCGAGGCGTC GACGCGGGTG CTGGCGCGAC TGTCGCAGGC CACCGAAGGC
ACTGTCCCGC TGATCGGCGT CGGCGGCGTG GACAGTGCGG GGGCGGCGTA TGCCAAGATC
CGCGCAGGCG CGTCGGCCGT TCAGCTCTAC ACCGCGCTGG TCTATGGCGG GATCAGCCTC
GCGGCCGAGA TCGCCACGGG GCTGGACACT CTGCTGGAAC GGGACGGGTT TTCCACCGTG
GCAGACGCGG TCGGCACGGG ACGAGGAGAC TGGCTATGA
 
Protein sequence
MSTLERLGLA ALQRVDPETA HGLALRALNA GLGPRSGPVT SPRLSTQLAG LRLPNPVGLA 
AGFDKNAETL GALAQTGFGF LEVGAATPLP QPGNPRPRLF RLSEDRAAIN RFGFNNDGAE
AIAARLARRP EGRVVGLNLG ANKTSADRAG DFARVLATCG AHVDFATVNV SSPNTEKLRD
LQGAAALRAL LEGVMAARAA LVRPIPVFLK IAPDMDDAAL DDIAGVVTEA GLHGIIATNT
TLARDGLVSA HKGEAGGLSG APLFEASTRV LARLSQATEG TVPLIGVGGV DSAGAAYAKI
RAGASAVQLY TALVYGGISL AAEIATGLDT LLERDGFSTV ADAVGTGRGD WL