Gene Rsph17029_1816 details


Gene Information

Locus tag: Rsph17029_1816
Symbol:
ID: 4895492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1915437
End bp: 1916888
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 66%
IMG OID: 640112410
Product: phenylhydantoinase
Protein accession: YP_001043695
Protein GI: 126462581
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR02033] D-hydantoinase
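
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1915437
end_bp = 1916888

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is the stop codon,
# which is assumed not to be counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1452
print(protein_length_aa)  # 483
```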


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.168058
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.23209
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACAG TCATCAGGAA CGGAACGATC GTCACGGCCG ATCTCACCTA CAGGGCCGAT 
GTGCGGATCG AGGGCGGTAT CATCACCGAG ATCGGACCCG ATCTCGCAGG GGACGAGGTG
CTGGACGCGA CCGGCTGCTA TGTCATGCCG GGCGGCATCG ATCCGCATAC CCATCTCGAG
ATGCCCTTCA TGGGCACCTA TTCCTCGGAC GATTTCGAGA GCGGCACCCG CGCGGCGCTG
GCGGGCGGGA CGACGATGGT GGTCGATTTC GCGCTGCCCG CCCCCGGGCA GGGGCTGATG
GATGCGCTCG CCATGTGGCA CAACAAGTCG GGGCGCGCCA ACTGCGACTA TTCCTACCAT
ATGGCGATCA CCTGGTGGGG CGAGCAGGTC TTCGACGAGA TGCAGGCGGT GGTGGATCAG
GGGATCACCT CGTTCAAGCA TTTCATGGCC TACAAGGGCG CGCTGATGGT GAACGACGAC
GAGCTCTATG CGAGCTTCCG TCGCTGCGCC GATCTGGGGG CGCTCGCCAT GGTCCATGCC
GAGAACGGCG ATGTGGTGGC CGAGCTGTCG GCGCGCCTGC TGGCTGAGGG CAACCGCGGC
CCCGAGGCCC ATGCCTATTC GCGCCCCCCG CAGGTCGAGG GCGAGGCCAC CAACCGCGCG
ATCATGATCG CGGACATGGC GGGCGTGCCG CTCTATGTCG TCCATACCTC CTGCGAGGAG
GCGCACGAGG CGATCCGGCG CGCGCGGATG CAGGGCAAGA GGGTCTGGGG CGAGCCGCTG
ATCCAGCATC TGACGCTGGA CGAGAGCGAA TATTTCCACC CCGACTGGGA CCATGCTGCC
CGCCGGGTGA TGAGCCCGCC TTTCCGGAAC AGGCAGCATC AGGACAGTCT CTGGGCCGGG
CTGCAGTCGG GATCCCTGTC GGTCGTGGCG ACGGACCATT GCGCCTTCAC CACCGAGCAG
AAGCGCTTCG GGGTGGGCGA TTTCACCAAG ATCCCGAACG GCACAGGCGG GCTCGAGGAC
CGGATGCCGA TGCTCTGGAC CCAAGGGGTG AATACGGGGC GCCTGACGCC GAACGAATTC
GTGGCGGTGA CCTCGACCAA CATCGCGAAG ATCCTGAACT GCTACCCGAA GAAGGGCGCG
GTTCTGGTGG GGGCGGATGC CGATCTCGTG GTCTGGGATC CGGAGAAGAC CAAGGTGATC
TCGGCGGGCG CGCAGCAGTC GGCTATCGAT TACAACGTGT TCGAGGGCAA GGAGGTGAAG
GGCCTGCCGC GCTTCACGCT CAGCCGCGGC AAGGTCGCGG TGGCGGATGG CGAGATCCGC
ACCGAGGAAG GCCACGGGCA GTTCGTGGCG CGCCAGCCGC GTCCGGCTGT CAACCGCGCC
CTCTCGGCCT GGAAGGAGCT GACGGCGCCG CGCGCGGTCG AACGGTCCGG CATTCCGGCC
ACGGGGGTGT GA
 
Protein sequence
MSTVIRNGTI VTADLTYRAD VRIEGGIITE IGPDLAGDEV LDATGCYVMP GGIDPHTHLE 
MPFMGTYSSD DFESGTRAAL AGGTTMVVDF ALPAPGQGLM DALAMWHNKS GRANCDYSYH
MAITWWGEQV FDEMQAVVDQ GITSFKHFMA YKGALMVNDD ELYASFRRCA DLGALAMVHA
ENGDVVAELS ARLLAEGNRG PEAHAYSRPP QVEGEATNRA IMIADMAGVP LYVVHTSCEE
AHEAIRRARM QGKRVWGEPL IQHLTLDESE YFHPDWDHAA RRVMSPPFRN RQHQDSLWAG
LQSGSLSVVA TDHCAFTTEQ KRFGVGDFTK IPNGTGGLED RMPMLWTQGV NTGRLTPNEF
VAVTSTNIAK ILNCYPKKGA VLVGADADLV VWDPEKTKVI SAGAQQSAID YNVFEGKEVK
GLPRFTLSRG KVAVADGEIR TEEGHGQFVA RQPRPAVNRA LSAWKELTAP RAVERSGIPA
TGV
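
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in which start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are hypothetical and not part of any database tool; the sequence blocks above can be pasted in as-is, since whitespace is stripped.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-letter block layout."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGCACAG TCATCAGGAA CGGAACGATC"))  # MSTVIRNGTI
# The final 12 bases end in the stop codon TGA.
print(translate("ACGGGGGTGT GA"))  # TGV
```

Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content field, under the assumption that the field was computed over the same 1452 bases.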