Gene RPD_3029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3029 
Symbol 
ID4023532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3374537 
End bp3375838 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID637963228 
Productdihydroorotase 
Protein accessionYP_570156 
Protein GI91977497 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.678009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.507033 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAACG ACCGCCGCCC GATCCTGCTC GCCAATGCTC GCCTGATCGA TCCGGCGCGC 
GACTTCGACG GCGTCGGCGA CGTGCTGATC GCCGACGGCG TGATCCGCGA CGCCCGCCGC
GGCATCGGCG CCGCCGGCGT GCCGGAAGGC ACCGACATCA TCAATTGCGG CGGCATGATC
GTGGCCCCCG GCCTGATCGA CATGCGCGCC TTCGTCGGCG AACCCGGCGC CAGCCATCGC
GAGACCTTCG CCTCGGCCAG CCAGGCCGCA GCCGCCGGCG GCATCACCAC CATCATCTGC
CAGCCGAACA CTTCGCCTGT GATCGACAAT TCGGCGACGG TGGACTTCGT GATGCGCCGC
GCCCGCGACA CCGCGATCGT CAACATTCAC CCGATGGCGG CGCTGACCAA GGGTCTCGCC
GGCGCGGAGA TGACCGAGAT CGGCCTGTTG AAGGCCGCCG GCGCCGTGGC CTTCAGCGAC
GGCGATCGCA GCGTCATGAA CGCGCGGGTG ATGCGCAGCG CGCTGACCTA CGCCCGCGAT
TTCGACGCCC TGATCGTTCA CCACACCGAA GACCCCGATC TGGTCGGCGA AGGCGTGATG
AACGAGGGTG AATTCGCCAC CCGCCTCGGG CTCTCCGGTA TGCCGAACGC CGCCGAGGCC
GTGGTGCTGG AGCGCGACGT CCGCCTCGCC GCACTGACCG GCGGCCGCTA TCACGCCGCG
TCGCTGACCT GCATCGAGTC GCTGGAGATT TTGCAGCGCG CGCGAGACAC CGGCATCAAC
GTCTCGGCCT CGGTATCGAT CAATCATGTC TCGCTGAATG AGAACGACAT CGGGCCGTAC
CGCACGTTCC TCAAGCTGTC GCCGCCGCTG CGCACCGAGA ACGACCGCAA GGCTCTGATC
GCCGCCGTCG CTTCGGGTCT CGTCGACGTC ATCATGTCGG ACCACAATCC GCAGGACGTC
GAGGTCAAGC GGCTGCCGTT CGCCGAGGCC GCCGCCGGCG CGATCGGCCT GGAGACGATG
CTGCCGGCCG GCTTGCGGCT GGTGCACAAT GGCGAGCTGG ACCTGCTGAC CCTGATCCGT
GCGATGTCGA CCCGCCCGGC CGAATTGCTC GGCCTGCCCG GCGGCACGCT GCGCGCAGGC
TCGCCAGCCG ATCTGATCAT GATCGACATC GACACCCCGT GGGTGGTCGA TCCGAACGAA
CTGAAATCGA AGTGCAAGAA TACCCCGTTC GACGAAGCTC GGTTCTCGGG ACGGGTGACG
CGGACCATCG TCGGCGGACG CACCGTCTAC GAACATGTGT GA
 
Protein sequence
MLNDRRPILL ANARLIDPAR DFDGVGDVLI ADGVIRDARR GIGAAGVPEG TDIINCGGMI 
VAPGLIDMRA FVGEPGASHR ETFASASQAA AAGGITTIIC QPNTSPVIDN SATVDFVMRR
ARDTAIVNIH PMAALTKGLA GAEMTEIGLL KAAGAVAFSD GDRSVMNARV MRSALTYARD
FDALIVHHTE DPDLVGEGVM NEGEFATRLG LSGMPNAAEA VVLERDVRLA ALTGGRYHAA
SLTCIESLEI LQRARDTGIN VSASVSINHV SLNENDIGPY RTFLKLSPPL RTENDRKALI
AAVASGLVDV IMSDHNPQDV EVKRLPFAEA AAGAIGLETM LPAGLRLVHN GELDLLTLIR
AMSTRPAELL GLPGGTLRAG SPADLIMIDI DTPWVVDPNE LKSKCKNTPF DEARFSGRVT
RTIVGGRTVY EHV