Gene Information |
Locus tag | Rsph17025_2067 |
Symbol | |
ID | 5082772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17025 |
Kingdom | Bacteria |
Replicon accession | NC_009428 |
Strand | + |
Start bp | 2108928 |
End bp | 2110385 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640483629 |
Product | dihydropyrimidinase |
Protein accession | YP_001168263 |
Protein GI | 146278104 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0044] Dihydroorotase and related cyclic amidohydrolases |
TIGRFAM ID | [TIGR02033] D-hydantoinase |
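The coordinate and length fields in this record are consistent with one another, and the relationship can be checked with simple arithmetic. The sketch below is a minimal illustration in Python. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and is not translated.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2108928, 2110385)
print(length, protein_length_aa(length))  # 1458 485
```

The printed values match the Gene Length (1458 bp) and Protein Length (485 aa) fields.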
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.265667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAGTTCG ATACGGTCAT CCACGGCGGC ACCATCGTCA CGCCGACCGA AAGCTGGCAG GGCGATCTCG GCCTCGTGGG GGGCCGGATC GCGGCTCTGG CCGAGCGGCT GCCCGGCGGC GCACGCCGGA TCGACGCCAC CGGGCGGCTC GTCCTGCCCG GCGGCATCGA GGCGCACGCC CATATTGCGC AGGAAAGCTC CTCGGGGCTG ATGAGCGCGG ACGACTATTA CACGGGCTCG GTCTCGGCGG CCTTCGGCGG CAACTCGAGC TTCATCCCCT TCGCGGCCCA GCATCGCGGG CAGTCGGTGG ATGCGGTGAT CGAGACCTAC GACAGCCGCG CGGCGCCGAA CTCGGTGCTC GACTATTCCT ACCATCTCAT CATCTCGGAC CCGACCGAAA AGGTCCTGAC CGAGGAGCTG CCGCGCGCCT TCGCCCGCGG GATCACCTCG TTCAAGGTCT TCATGACCTA CGACCTGATG AACCTCGGCG ACCGCGGGAT GCTCGACATC CTGACCGTCG CCCGCCGTCA CGGCGCGCTC ACCATGGTCC ATGCCGAGAA CAACGACATG GTGAAATGGA TGAACGCGCG GCTGGCCGCG GCGGGGCTGA CGGCGCCGAA ATATCACGCG ATCTCGCGCC CGGCGCTCGC CGAGGCCGAG GCGATCAACC GCGCGATCGC GCTTGCGCGG CTGGTGGGCG CGGGGCTCTT CATCGTCCAT GTCTCGACGC CCGAGGGGGC GGACCTCGTG GCCCGCGCGC AGGCCTGCGG GCTGCCGATC CACGCCGAGA CCTGCCCGCA GTATCTGGCC TTCACCCGCG CCGACCTCGA CCGGCCGGGG ATGGAGGGGG CCAAATACAT CTGCTCGCCT CCCTTGCGGG ATACGGCGAC GCAGGCCGCG CTCTGGAGCC ATGCCCGGCG CGGCACCTTC GAGAGCGTCT CGTCGGACCA TGCCCCCTAC CGGTTCGACG CGAGCGGCAA GTTCGCGAAC GGGGCAGAGC CCGCCTACCC CGCCATCGCC AACGGCCTGC CCGGCATCGC CATGCGCCTG CCCTATCTCT TCTCCGAAGG GGTCGCGGCG GGGCGGATCA GCCTCCAGCA GTTCGCGGCC CTCTCCTCCT CGAACGCCGC GCGCCTCTTC GGGATGGAGC GCAAGGGCGC GCTGCTGCCG GGCTACGACG CCGACATTGC GATCTGGAAC CCCGAGGAAA CGCGCGAGGT CACGCTGGCC GACCAGCACG ATGCGATGGA CTATACGCCC TTCGAGGGGA TGCGCCTCAC CGGCTGGCCC GAGCATGTGC TGAGCCGCGG CGAGACGGTG GTCGAGGCGG GCGAACTGAA GGCCGCCCGC GGGCGCGGCC GTTTCGTGGC GCGTGCCCCC TACCGCCCCG ATCCCAACGC GCCGGTCGAG CCCGAACTTG ACCCCGCGCT CAACTTCGGC GCGGAGATCC GGCCGTGA
Protein sequence | MEFDTVIHGG TIVTPTESWQ GDLGLVGGRI AALAERLPGG ARRIDATGRL VLPGGIEAHA HIAQESSSGL MSADDYYTGS VSAAFGGNSS FIPFAAQHRG QSVDAVIETY DSRAAPNSVL DYSYHLIISD PTEKVLTEEL PRAFARGITS FKVFMTYDLM NLGDRGMLDI LTVARRHGAL TMVHAENNDM VKWMNARLAA AGLTAPKYHA ISRPALAEAE AINRAIALAR LVGAGLFIVH VSTPEGADLV ARAQACGLPI HAETCPQYLA FTRADLDRPG MEGAKYICSP PLRDTATQAA LWSHARRGTF ESVSSDHAPY RFDASGKFAN GAEPAYPAIA NGLPGIAMRL PYLFSEGVAA GRISLQQFAA LSSSNAARLF GMERKGALLP GYDADIAIWN PEETREVTLA DQHDAMDYTP FEGMRLTGWP EHVLSRGETV VEAGELKAAR GRGRFVARAP YRPDPNAPVE PELDPALNFG AEIRP
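The sequences in this record are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any downstream use. The following is a minimal sketch in Python of how that might be done, along with a GC-fraction helper of the kind that could be used to compare against the GC content field. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(displayed: str) -> str:
    """Remove the whitespace used for block display and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, as displayed in the record.
fragment = clean_sequence("ATGGAGTTCG ATACGGTCAT")
print(len(fragment), gc_fraction(fragment))  # 20 0.45
```

Applying the same two functions to the full Gene sequence value would give a figure to compare with the reported GC content, which the record rounds to a whole percent.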