Gene Rsph17025_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0222 
Symbol 
ID5083770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp215374 
End bp216648 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content71% 
IMG OID640481777 
Productdihydroorotase, multifunctional complex type 
Protein accessionYP_001166437 
Protein GI146276278 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.357972 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT TCCTGACCAA CGCCCGCCTG ATCGACCCCG AGGCCGGGAC CGAGACCGAG 
GGCGACCTGC TGATCGAGGG CGGGCTCATC GCCGCCGTGG GCGCGCTCGA GCCGCCACCC
GGCACCGAGG TGATCGACTG CGGCGGCAAG TGCCTCGCGC CCGGCATTGT CGATCTGGGC
GTGAAGGTGG GCGAGCCCGG CGAGCGCCAC CGCGAGAGCT TCCGCTCGGC GGGCCTGGCC
GCGGCCGCGG GCGGCGTCAC CACGATCATC GCCCGCCCCG ACACGATGCC CGCCATCGAT
ACGCCCGAGG TGCTGGAATT CGTCACCCGC CGCGCCGCCG AGGCGAGCCC GGTCCGCATC
CGCCACATGG CGGCGCTGAC GAAGGGCCGC GAGGGGCGCG AGATGGTCGA GCTGGGCTTC
CTGCTCGACA CGGGCGCCAT CGCCTTCACC GACTGCGACC ATGTGATCGA GACCACCAAG
GTCGCCGCGC GCTGCATGAC CTATGCCCGC AGCCTCGGCG CGCTGGTGAT CGGCCATCCG
CAGGATCCGG GCCTCTCGGC GGGCGCCTCG GCCACCAACG GCAAGTTCGC CAGCCTGCGC
GGCATCCCCG CCGTTCATCC AATGGCCGAG CGCATGGGCT TCGACCGCGA CATGGCGCTG
GTCGAGATGT CGGGCGTGCG CTACCACGCC GACCAGGTCA CCACCGCCCG CACCCTGCCC
GCGCTCGAGC GGGCCAAGCG CAACGGCCTC GATGTGACGG CCGGCATCGG CATCCACCAC
CTGACGCTGA ACGAGTTCGA CGTCGGCGAC TACCGCACCT TCTTCAAGCT GAAGCCGCCG
CTGCGCTCGG AGGAGGATCG GCTGGCCATG GTCGAGGCGG TGGGATCGGG TCTGATCGAC
ATCATCTCCT CGATGCACAC GCCGCAGGAC GAGGAATCGA AGCGCCTGCC CTTCGAAGAG
GCCGCCTGGG GCGCGGTGGC GCTCGAAACC TTCCTGCCCG CGGCGCTCCG GCTCCATCAC
GCGGGCCTCC TGAGCCTGCC GCAGCTCTTC CGGGCGATGG CGCTCAACCC GGCGAAGCGG
CTCGGCCTGC CGCAGGGGCG GCTCTCCGAG GGGGCACCCG CCGACCTCGT GCTCTTCGAC
CCCGATGCCC CCTTCGTCCT CGACCGCTTC ACCCTGCGGT CGAAATCGAA GAACACGCCC
TTTGACGGGC AGCGGATGGA GGGGCGCGTG CTGGCGACCT TCGTCGGCGG CCGGCAGGTC
TTTGCGGTCG AGTGA
 
Protein sequence
MSLFLTNARL IDPEAGTETE GDLLIEGGLI AAVGALEPPP GTEVIDCGGK CLAPGIVDLG 
VKVGEPGERH RESFRSAGLA AAAGGVTTII ARPDTMPAID TPEVLEFVTR RAAEASPVRI
RHMAALTKGR EGREMVELGF LLDTGAIAFT DCDHVIETTK VAARCMTYAR SLGALVIGHP
QDPGLSAGAS ATNGKFASLR GIPAVHPMAE RMGFDRDMAL VEMSGVRYHA DQVTTARTLP
ALERAKRNGL DVTAGIGIHH LTLNEFDVGD YRTFFKLKPP LRSEEDRLAM VEAVGSGLID
IISSMHTPQD EESKRLPFEE AAWGAVALET FLPAALRLHH AGLLSLPQLF RAMALNPAKR
LGLPQGRLSE GAPADLVLFD PDAPFVLDRF TLRSKSKNTP FDGQRMEGRV LATFVGGRQV
FAVE