Gene Rsph17029_3107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3107 
Symbol 
ID4898141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp120995 
End bp122452 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content70% 
IMG OID640113709 
Productdihydropyrimidinase 
Protein accessionYP_001044979 
Protein GI126463866 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0154348 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.387617 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTTCG ATACGGTCAT CCACGGCGGC ACCATCGTCA CGCCGACCGA AAGCTGGCAG 
GGCGATCTGG GCCTCGTGGG GGGCCGGATC GCGGCTCTGG CCGAGCGGCT GCCCGGCGGC
GCGCGCCGGA TCGACGCCAC CGGGCGGCTC GTCCTTCCCG GCGGCATCGA GGCGCACGCC
CATATCGCGC AGGAAAGCTC CTCGGGGCTG ATGAGCGCGG ACGACTATTA CACGGGCTCG
GTCTCGGCGG CCTTCGGCGG CAACTCGAGC TTCATCCCCT TCGCGGCCCA GCACCGCGGG
CAGTCGGTGG ATGCGGTGAT CGAGACCTAC GACAGCCGGG CGGCGCCGAA CTCGGTGCTC
GACTATTCCT ACCATCTCAT CATCTCGGAC CCGACCGAGA CCGTCCTGAC CGAAGAGCTG
CCGCGCGCCT TCGCCCGCGG CATCACCTCG TTCAAGGTCT TCATGACCTA CGATCTGATG
AACCTCGGCG ACCGCGGGAT GCTCGACATC CTGACCGTCG CCCGCCGTCA CGGCGCGCTC
ACCATGGTCC ATGCCGAGAA CAACGACATG GTGAAATGGA TGAACGCGCG CCTCGCCGCG
GCGGGGCTCA CGGCGCCGAA ATATCATGCG ATCTCGCGCC CGGCGCTGGC CGAGGCCGAG
GCGATCAACC GCGCGATCTC GCTCGCGCGG CTGGTGGGAG CGGGGCTCTT CATCGTCCAT
GTCTCGACGC CCGAGGGGGC GGATCTCGTG GCCCGCGCGC AGGCCTCGGG TCTGCCGATC
CATGCCGAGA CCTGCCCGCA GTATCTGGCC TTCACCCGTG CCGACCTCGA CCGGCCGGGG
ATGGAGGGGG CCAAATACAT CTGCTCGCCC CCCCTGCGCG ATGCGGCGAC GCAGGCCGCG
CTCTGGAACC ATGCCCGGCG CGGCACCTTC GAGAGCGTCT CCTCGGACCA TGCCCCCTAC
CGGTTCGACG CGAGCGGCAA GTTCGCCAAC GGCGCAGAGC CCGCCTATCC CGCCATCGCC
AACGGCCTGC CCGGCATCGC CATGCGCCTG CCCTATCTCT TCTCCGAAGG GGTCGCGGCC
GGGCGGATCA GCCTCCAGCA GTTCGCGGCC CTCTCTTCCT CGAACGCCGC CCGCCTCTTC
GGAATGGAGC GCAAGGGCGC GCTGCTGCCG GGCTATGACG CCGACATCGC GATCTGGAAC
CCCGAGGAAA CGCGCGAGGT CTCGCTCGCC GATCAGCACG ATGCCATGGA CTACACACCC
TTCGAGGGGA TGCGCCTCAC CGGCTGGCCC GAACATGTGC TGAGCCGCGG CGAGACGGTG
GTCGAGGCGG GCGAGCTGAA GGCCGCCCGC GGGCGCGGCC GCTTCGTGGC GCGTGCCCCC
TACCGCCCCG ATCCCAACGC GCCGGTCGAG CCCGAGCTCA CCCCCGCGCT CAACTTCGGC
GCGGAGATCC GGCCGTGA
 
Protein sequence
MEFDTVIHGG TIVTPTESWQ GDLGLVGGRI AALAERLPGG ARRIDATGRL VLPGGIEAHA 
HIAQESSSGL MSADDYYTGS VSAAFGGNSS FIPFAAQHRG QSVDAVIETY DSRAAPNSVL
DYSYHLIISD PTETVLTEEL PRAFARGITS FKVFMTYDLM NLGDRGMLDI LTVARRHGAL
TMVHAENNDM VKWMNARLAA AGLTAPKYHA ISRPALAEAE AINRAISLAR LVGAGLFIVH
VSTPEGADLV ARAQASGLPI HAETCPQYLA FTRADLDRPG MEGAKYICSP PLRDAATQAA
LWNHARRGTF ESVSSDHAPY RFDASGKFAN GAEPAYPAIA NGLPGIAMRL PYLFSEGVAA
GRISLQQFAA LSSSNAARLF GMERKGALLP GYDADIAIWN PEETREVSLA DQHDAMDYTP
FEGMRLTGWP EHVLSRGETV VEAGELKAAR GRGRFVARAP YRPDPNAPVE PELTPALNFG
AEIRP