Gene RPB_2423 details

Gene Information

Locus tag: RPB_2423
Symbol:
ID: 3909557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2779053
End bp: 2780363
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 68%
IMG OID: 637884322
Product: dihydroorotase
Protein accession: YP_486039
Protein GI: 86749543
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type
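
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2779053, 2780363)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```

Under those assumptions, 2780363 - 2779053 + 1 gives 1311 bp, and 1311 / 3 - 1 gives 436 aa, matching the Gene Length and Protein Length fields.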


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.148654
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATCG ACCGCCGCCC GATCCTGCTC GCCAACGCCC GCCTGATCGA TCCGGCGCGC 
GACTTCGACG GCCTCGGCGA CGTGCTGATC GCCGACGGCG TGATCCGCGA CGCGCGCCGC
GGCATCGGCG CCGCCGGCGT GCCCGAGGGC ACCGACATTA TTAATTGCGC CGGCATGGTG
GTCGCCCCCG GGCTGGTCGA TATGCGCGCC TTCGTCGGCG AGCCCGGCGC CAGCCACCGC
GAGACCTTCG CCTCGGCCAG CCAGGCCGCC GCCGCCGGCG GCATCACCAC CATCATCTGC
CAGCCCAACA CCTCGCCGGT GATCGACAAT TCGGCGACGG TCGATTTCGT GATGCGTCGC
GCCCGCGACA CCGCGATCGT CAACATCCAT CCGATGGCCG CGCTGACCAA GGGTCTGAAC
GGAATCGAGA TGACCGAGAT CGGGCTGTTG AAGGCCGCCG GCGCCGTGGC GTTCAGCGAC
GGCGACCGCA GTGTGATGAA TGCGCGGGTG ATGCGCAGCG CGCTGACCTA TGCGCGCGAT
TTCGATGCGC TGATCGTCCA TCACACCGAA GACCCCGATC TGGTTGCCGA AGGCGTGATG
AACGAAAGCG AGTTCGCCAC CCGCCTCGGC CTGTCCGGCG TGCCGAGTGC CGCAGAAGCC
GTGGTGCTGG AGCGCGACGT CCGCCTCGCC GCATTGACCG GCGGGCGCTA TCACGCCGCC
TCGCTGACCT GCATCGAGTC GCTGGAGATC CTGCAGCGCG CTCGCGACGC CGGGATCAAC
GTCTCGGCGT CGGTGTCGAT CAATCACGTC ACGCTCAACG AGAACGACAT CGGCCCGTAT
CGCACCTTCC TCAAGCTGTC GCCGCCGCTG CGCAGCGAGG ACGACCGCAA GGCGCTGATC
GCCGCGGTCT CGTCCGGTCT GATCGACGTC ATCATGTCGG ACCACAATCC GCAGGATGTC
GAGGTCAAGC GGCTGCCCTT CGCCGAGGCC GCCGCCGGCG CGATCGGGCT GGAGACGATG
CTGCCGGCCG GCCTGCGATT GCTGCACGCC GGCGAACTCG ATCTCTTGAG TCTGATCCGC
GCGATGTCGA CCCGCCCGGC CGAACTGCTC GGCCTGCCCG GCGGCACGCT GCGCGCAGGC
AGCCCGGCCG ACCTGATCGT GATCGACCTC GACACGCCGT GGATCGTCGA TCCGAACGAA
CTGAAATCGA AGTGCAAGAA CACCCCGTTC GACGAGGCGC GGTTCTCCGG ACGGGTGGTC
CGCACCATCG TCGGCGGACG CACGGTGTAC GAGCACGTCA GCGCACATTG A
 
Protein sequence
MLIDRRPILL ANARLIDPAR DFDGLGDVLI ADGVIRDARR GIGAAGVPEG TDIINCAGMV 
VAPGLVDMRA FVGEPGASHR ETFASASQAA AAGGITTIIC QPNTSPVIDN SATVDFVMRR
ARDTAIVNIH PMAALTKGLN GIEMTEIGLL KAAGAVAFSD GDRSVMNARV MRSALTYARD
FDALIVHHTE DPDLVAEGVM NESEFATRLG LSGVPSAAEA VVLERDVRLA ALTGGRYHAA
SLTCIESLEI LQRARDAGIN VSASVSINHV TLNENDIGPY RTFLKLSPPL RSEDDRKALI
AAVSSGLIDV IMSDHNPQDV EVKRLPFAEA AAGAIGLETM LPAGLRLLHA GELDLLSLIR
AMSTRPAELL GLPGGTLRAG SPADLIVIDL DTPWIVDPNE LKSKCKNTPF DEARFSGRVV
RTIVGGRTVY EHVSAH
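
The sequences above are laid out in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the database's own method: all function names are hypothetical, and the codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs in its set of permitted start codons, which this sketch does not model). Ambiguous bases such as N are not handled.

```python
BASES = "TCAG"
# Standard codon-to-residue assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate a CDS codon by codon, dropping one trailing stop codon."""
    dna = clean_sequence(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 18 bases of the gene sequence, copied with their block spacing.
print(translate_cds("ATGCTGATCG ACCGCCGC"))  # MLIDRR
```

The printed residues match the first six residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the protein sequence and the GC content field to be cross-checked in the same way.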