Gene RPC_2246 details

Gene Information

Locus tag: RPC_2246
Symbol:
ID: 3973263
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2450238
End bp: 2451551
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 68%
IMG OID: 637925354
Product: dihydroorotase
Protein accession: YP_532119
Protein GI: 90423749
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
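
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the record does not state either convention explicitly, although both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2450238
END_BP = 2451551
GENE_LENGTH_BP = 1314
PROTEIN_LENGTH_AA = 437


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends in a single stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2451551 - 2450238 + 1 gives 1314 bp, and 1314 / 3 - 1 gives 437 aa, matching the reported gene and protein lengths.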


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.607356
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.303532
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGTA ACATGCTGAC AGACCGCCGC CCGATCCTGC TCGCCAACGC CCGCCTGATC 
GATCCCTCGC GCGACGTCGA CGGCATCGGC GACGTGCTGA TCGCCGACGG CACCATTCGC
GAAGCGCGCC GCGGCATCGG CGCCGCCGGC GTGCCGGAAG GCACCGACAT CATCAACTGC
GCCGGCAAGA TCGTGGCGCC CGGCCTGATC GACATGCGCG CCTTCGTCGG CGAGCCCGGC
GCCGGTCACC GCGAGACCTT CGCCTCGGCG AGCCAGGCCG CCGCCGCCGG CGGCATCACC
ACCATTATCT GCCAGCCCGA CACCTCGCCC ACTATCGACA ATTCGGCGAC CGTCGACTTC
GTGTTGCGCC GCGCCCGCGA TACCGCGATC GTCAATATCC ATCCGATGGC GGCGCTGACC
AAGGATCTTG CCGGCCACGA GATGACCGAG ATCGGGCTAC TGAAAGCCGC CGGCGCGGTC
GCCTTCACCG ATGGCGTCAG AAGCGTTATG GACGCCCAAG TGATGCGCCG CGCGCTGACC
TATGCGCGCG ACTTCGACGC CTTGATCGTG CATCACACCG AAGATCCCAA TCTGGTCGGC
GAAGGCGTGA TGAACGAGGG CGAACTCGCT TCACGGCTCG GGTTGATCGG GGTGCCGAAC
ATCGCCGAAG CTGTGGTGCT GGAGCGCGAC ATGCGGCTCG TCGCGCTGAC CGGCGGCCGT
TACCACGCCG CCTCGATCAC CTGTGTGGAG TCGATCGAGA TCCTGCGCCG GGCCCGCGAG
GCCGGGCTCA AGGTCACCGC CTCCGCCTCG ATCAACCATC TGACGCTGAA CGAGAACGAC
ATCGGCCCCT ACCGCTCGTT CCTGAAGCTG TCGCCGCCGC TGCGCACCGA GGACGACCGC
CAGGCGCTGG TCGCCGCGGT CGCCGCGGGT CTTATCGACG TCATCATGTC CGACCACAAT
CCACAGGACG TCGAGGTGAA GCGGCTGCCG TTCGCCGAGG CCGCCGCCGG CGCCATCGGC
CTTGAGACCA TGCTGCCGGC CGGCCTGCGG CTGATCCACT CCGGCGAGTT GGATTTCCTG
ACGCTGATCC GGGCGATGTC GACCAAGCCT GCCGAATTGC TCGGCCTGCC CGGCGGCACG
CTGCGTGCCG GCTCCCCCGC CGACCTGATT GTGATCGACG CCGACGTGCC CTGGGTGGTC
GACCCCAACG AACTAAAGTC GAAGTGCAAG AACACGCCGT TCGACGAGGC CAAATTCGCC
GGACGCGTCG CCCGCACCAT CGTCGCAGGC CGCACCGTGT ACGAACACGT CTGA
 
Protein sequence
MRSNMLTDRR PILLANARLI DPSRDVDGIG DVLIADGTIR EARRGIGAAG VPEGTDIINC 
AGKIVAPGLI DMRAFVGEPG AGHRETFASA SQAAAAGGIT TIICQPDTSP TIDNSATVDF
VLRRARDTAI VNIHPMAALT KDLAGHEMTE IGLLKAAGAV AFTDGVRSVM DAQVMRRALT
YARDFDALIV HHTEDPNLVG EGVMNEGELA SRLGLIGVPN IAEAVVLERD MRLVALTGGR
YHAASITCVE SIEILRRARE AGLKVTASAS INHLTLNEND IGPYRSFLKL SPPLRTEDDR
QALVAAVAAG LIDVIMSDHN PQDVEVKRLP FAEAAAGAIG LETMLPAGLR LIHSGELDFL
TLIRAMSTKP AELLGLPGGT LRAGSPADLI VIDADVPWVV DPNELKSKCK NTPFDEAKFA
GRVARTIVAG RTVYEHV
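
The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python, not the method the database itself uses. It builds the codon table from the standard amino-acid assignments (which table 11 shares with the standard code), ignores the spaces and line breaks used to group the sequence into blocks of ten, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table: amino-acid assignments in TCAG order, as used by the
# standard code and by bacterial translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    bases = clean(seq)
    residues = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence from this record.
first_row = "ATGCGTAGTA ACATGCTGAC AGACCGCCGC CCGATCCTGC TCGCCAACGC CCGCCTGATC"
print(translate(first_row))
print(round(gc_content(first_row), 1))
```

Translating the first row yields the first 20 residues of the protein sequence listed above (MRSNMLTDRR PILLANARLI). Passing the full gene sequence to `gc_content` and `translate` gives values that can be compared with the reported GC content and protein sequence.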