Gene Rsph17029_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_1820 
Symbol 
ID4896415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp1919888 
End bp1921192 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content67% 
IMG OID640112414 
Productdihydropyrimidine dehydrogenase 
Protein accessionYP_001043699 
Protein GI126462585 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0167] Dihydroorotate dehydrogenase 
TIGRFAM ID[TIGR01037] dihydroorotate dehydrogenase (subfamily 1) family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.22065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACC TCCGTTCCGA CTTCATCGGC ATCAAGTCGC CGAATCCGTT CTGGCTCGCC 
TCGGCGCCGC CGACCGACAA GGAATACAAC GTCCGCCGCG CCTTCGAGGC CGGCTGGGGC
GGCGTCGTCT GGAAGACGCT GGGCTCCGAA GGCCCGCCGG TCGTCAACGT GAACGGCCCC
CGCTACGGCG CGATCTACGG CGCCGACCGG CGGCTCCTCG GGCTGAACAA CATCGAACTC
ATCACCGACC GGCCGCTCGA GGTGAACCTG CGCGAGATCA AGTCGGTCAA GCGCGACTAT
CCCGACCGCG CGCTGGTGGT CTCGCTGATG GTGCCCTGCG ACGAGGAAAG CTGGAAGGCG
ATCCTCGCCC ATGTCGAGGA TACCGGAGCC GATGGCGTCG AGCTGAACTT CGGCTGCCCG
CACGGCATGG CCGAGCGCGG CATGGGCTCG GCCGTGGGGC AGGTGCCCGA ATATATCGAG
ATGGTCACGC GCTGGGTGAA GCAGCACAGC CGGATGCCCT GCATCGTGAA GCTCACGCCC
AATGTGACCG ACATCCGCAA GCCGGCCGAA GCGGCCAGGC GCGGCGGCGC CGATGCGGTG
AGCCTCATCA ACACGATCAA TTCGATCACC GGCGTGGACA TCGACAGTTT CGCGCCGATG
CCCACCATCG ACGGCAAGGG CACCCATGGC GGCTATTGCG GTCCGGCGGT CAAGCCCATC
GCGCTGAACA TGGTGGCCGA GATTGCGCGC AACCCCGAGA CGCACGGGCT GCCGATCTCG
GGCATCGGCG GCGTCACCAC CTGGCGGGAT GCGGTCGAGT TCATGCTGCT CGGGGCGGGC
AATGTGCAGG TCTGCACCGC GGCCATGACC TACGGCTTCC GCGTCGTGCA GGAGATGATC
TCGGGCCTCT CCGACTACAT GGACGCCAAG GGCTTCGCCT CCACCGCCGA TCTCGTGGGG
CGCGCGGTTC CGAACGTGAC CGACTGGCAG TATCTGAACC TCAACTATGT CACCAAGGCG
CAGATCGACC AGGACCTCTG CATCAAGTGC GGCCGCTGCT ACGCCGCCTG CGAGGATACC
AGCCACCAGG CCATCGCCAT GTCCACCGAT CGCACCTTCA CGGTGAAGGA CGAGGAATGC
GTGGCCTGCA ACCTCTGCGT CGATGTCTGC CCGGTGGAGG ACTGCATCAC CATGCGCGAG
CTGCCGAAGG GCGCGCTCGA TCCGCGCACG GGCCGGACGG TGGGGGACTA TGCCAACTGG
CTGGGCCACC CGAACAACCC CTCGGTGCGC GAAGCCGCCG AGTGA
 
Protein sequence
MANLRSDFIG IKSPNPFWLA SAPPTDKEYN VRRAFEAGWG GVVWKTLGSE GPPVVNVNGP 
RYGAIYGADR RLLGLNNIEL ITDRPLEVNL REIKSVKRDY PDRALVVSLM VPCDEESWKA
ILAHVEDTGA DGVELNFGCP HGMAERGMGS AVGQVPEYIE MVTRWVKQHS RMPCIVKLTP
NVTDIRKPAE AARRGGADAV SLINTINSIT GVDIDSFAPM PTIDGKGTHG GYCGPAVKPI
ALNMVAEIAR NPETHGLPIS GIGGVTTWRD AVEFMLLGAG NVQVCTAAMT YGFRVVQEMI
SGLSDYMDAK GFASTADLVG RAVPNVTDWQ YLNLNYVTKA QIDQDLCIKC GRCYAACEDT
SHQAIAMSTD RTFTVKDEEC VACNLCVDVC PVEDCITMRE LPKGALDPRT GRTVGDYANW
LGHPNNPSVR EAAE