Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_3107 |
Symbol | |
ID | 4898141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | + |
Start bp | 120995 |
End bp | 122452 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640113709 |
Product | dihydropyrimidinase |
Protein accession | YP_001044979 |
Protein GI | 126463866 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR02033] D-hydantoinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0154348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.387617 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTCG ATACGGTCAT CCACGGCGGC ACCATCGTCA CGCCGACCGA AAGCTGGCAG GGCGATCTGG GCCTCGTGGG GGGCCGGATC GCGGCTCTGG CCGAGCGGCT GCCCGGCGGC GCGCGCCGGA TCGACGCCAC CGGGCGGCTC GTCCTTCCCG GCGGCATCGA GGCGCACGCC CATATCGCGC AGGAAAGCTC CTCGGGGCTG ATGAGCGCGG ACGACTATTA CACGGGCTCG GTCTCGGCGG CCTTCGGCGG CAACTCGAGC TTCATCCCCT TCGCGGCCCA GCACCGCGGG CAGTCGGTGG ATGCGGTGAT CGAGACCTAC GACAGCCGGG CGGCGCCGAA CTCGGTGCTC GACTATTCCT ACCATCTCAT CATCTCGGAC CCGACCGAGA CCGTCCTGAC CGAAGAGCTG CCGCGCGCCT TCGCCCGCGG CATCACCTCG TTCAAGGTCT TCATGACCTA CGATCTGATG AACCTCGGCG ACCGCGGGAT GCTCGACATC CTGACCGTCG CCCGCCGTCA CGGCGCGCTC ACCATGGTCC ATGCCGAGAA CAACGACATG GTGAAATGGA TGAACGCGCG CCTCGCCGCG GCGGGGCTCA CGGCGCCGAA ATATCATGCG ATCTCGCGCC CGGCGCTGGC CGAGGCCGAG GCGATCAACC GCGCGATCTC GCTCGCGCGG CTGGTGGGAG CGGGGCTCTT CATCGTCCAT GTCTCGACGC CCGAGGGGGC GGATCTCGTG GCCCGCGCGC AGGCCTCGGG TCTGCCGATC CATGCCGAGA CCTGCCCGCA GTATCTGGCC TTCACCCGTG CCGACCTCGA CCGGCCGGGG ATGGAGGGGG CCAAATACAT CTGCTCGCCC CCCCTGCGCG ATGCGGCGAC GCAGGCCGCG CTCTGGAACC ATGCCCGGCG CGGCACCTTC GAGAGCGTCT CCTCGGACCA TGCCCCCTAC CGGTTCGACG CGAGCGGCAA GTTCGCCAAC GGCGCAGAGC CCGCCTATCC CGCCATCGCC AACGGCCTGC CCGGCATCGC CATGCGCCTG CCCTATCTCT TCTCCGAAGG GGTCGCGGCC GGGCGGATCA GCCTCCAGCA GTTCGCGGCC CTCTCTTCCT CGAACGCCGC CCGCCTCTTC GGAATGGAGC GCAAGGGCGC GCTGCTGCCG GGCTATGACG CCGACATCGC GATCTGGAAC CCCGAGGAAA CGCGCGAGGT CTCGCTCGCC GATCAGCACG ATGCCATGGA CTACACACCC TTCGAGGGGA TGCGCCTCAC CGGCTGGCCC GAACATGTGC TGAGCCGCGG CGAGACGGTG GTCGAGGCGG GCGAGCTGAA GGCCGCCCGC GGGCGCGGCC GCTTCGTGGC GCGTGCCCCC TACCGCCCCG ATCCCAACGC GCCGGTCGAG CCCGAGCTCA CCCCCGCGCT CAACTTCGGC GCGGAGATCC GGCCGTGA
|
Protein sequence | MEFDTVIHGG TIVTPTESWQ GDLGLVGGRI AALAERLPGG ARRIDATGRL VLPGGIEAHA HIAQESSSGL MSADDYYTGS VSAAFGGNSS FIPFAAQHRG QSVDAVIETY DSRAAPNSVL DYSYHLIISD PTETVLTEEL PRAFARGITS FKVFMTYDLM NLGDRGMLDI LTVARRHGAL TMVHAENNDM VKWMNARLAA AGLTAPKYHA ISRPALAEAE AINRAISLAR LVGAGLFIVH VSTPEGADLV ARAQASGLPI HAETCPQYLA FTRADLDRPG MEGAKYICSP PLRDAATQAA LWNHARRGTF ESVSSDHAPY RFDASGKFAN GAEPAYPAIA NGLPGIAMRL PYLFSEGVAA GRISLQQFAA LSSSNAARLF GMERKGALLP GYDADIAIWN PEETREVSLA DQHDAMDYTP FEGMRLTGWP EHVLSRGETV VEAGELKAAR GRGRFVARAP YRPDPNAPVE PELTPALNFG AEIRP
|
| |