Gene RPD_1903 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1903 
SymboldnaE2 
ID4022385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2137950 
End bp2141282 
Gene Length3333 bp 
Protein Length1110 aa 
Translation table11 
GC content66% 
IMG OID637962096 
Producterror-prone DNA polymerase 
Protein accessionYP_569039 
Protein GI91976380 
COG category[L] Replication, recombination and repair 
COG ID[COG0587] DNA polymerase III, alpha subunit 
TIGRFAM ID[TIGR00594] DNA-directed DNA polymerase III (polc) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.989909 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTCC CGGCCTATGC CGAGATCGGC GTCACCACCA ATTTCTCGTT CCTCGAAGGC 
GGCTCGCATC CGCAGGACTA TGTTCACGAG GCGAGCCGGC TCGGGCTGGA GGCGATCGGC
ATCGCCGACC GCAACACGCT GGCCGGCGTG GTGCGGGCCT ATAGCGAACT CGGCAATGAG
GACCTCATCC ATAAGCCGAG GCTGCTGATC GGCGCGCGGC TGGTGTTCGC CGACGGCACG
CCGGACGTGC TCGCTTATCC GGTCGATCGG GCCGCCTATG GGCGGCTGTG CCGTTTGCTC
AGTGTCGGCA AGCTGCGCGG GGCGAAAGGC GAATGCCATC TCGCCGTCGC CGATCTCGAA
GCGTTCAGTC AGGGCCTGTC GCTGGTGCTG ATGCCGCCTT ATCGTTTCCA GGCCAGCGCC
ATCGCCGCCG CGTTGCAGCG TCTGACCGCG CTCGAATCCG GCGGAGTCTG GCTCGGGCTG
ACGCCGTATT ATCGCGGCGA CGACAAGCGG CGGCTGGCGC GGCTGAAGCG CGTGGCGTGG
GCCGCCCGGG TGCCGGGCAT CGCCACCAAC GACGTGCTGT ATCACCACCC CGAGCGCCGC
GCGCTGCAGG ATGTACTGAG CTGCGTGCGC GAGAAGACCA CGATCGACAA GATCGGGCGG
CGGCTGGAGG CGAATGCCGA GCGGCATCTG AAACCCGCAG CCGAAATGGC CCGGCTATTC
CGCGCGGACC CCGACGCCAT CGCCGAGACG TTGCGCTTTG CCGCCGGCAT CTCTTTCACC
CTGGACGAAT TGAAATATCA CTATCCCGAC GAGCCGGTGC CGCCCGGCAA GACCGCGCAG
CAGCATCTCG AGGATCTGAC CTGGGAAGGC GTCGCCGAAT ATTTTCCGGC TGGCATCAGC
GATAAGCTAC ACGCCACCAT CGACAAGGAA CTGGGGATCA TCGCGCATCG CGGCTATGCG
CAATACTTCC TCACCGTGCA CGACATCGTT CGCTACGCGC GGTCGCAGGA CATCCTGTGC
CAGGGGCGCG GCTCGGCCGC TAATTCGGCG GTGTGCTACG TCCTCGGCAT CACCTGCGTC
GATCCGACAG AGATCGACTT GCTGTTCGAA CGTTTCGTCT CCGAGGAGCG CGACGAGCCG
CCGGACATCG ACGTCGATTT CGAGCATTCG CGCCGCGAAG AGGTGATGCA ATACATCTAT
CGCCGCTATG GTCGCCACCG CGCTGCGATC GTCGCCACCG TGATTCACTA TCGGCCGCGC
TCGGCGATCC GCGACGTCGG CAAGGCGCTG GGCCTCAGCG AGGACGTCAC CGCCGCGCTG
GCCGATACGG TGTGGGGAAG CTGGGGCAAG GGCCTGAACG AGATGCAGGT GAAGCAGGCC
GGGCTCGATC CGCACAACCC GCTGATCGGG CGCGCGGTCG AACTCGCCAC CGAGCTGATC
GGCTTTCCGC GGCATCTGTC GCAGCATGTC GGCGGTTATG TGCTGACCCA GGACCGGCTC
GACAGCTACG TACCGATCGG CAACGCGGCG ATGGAGGGGC GTACCTTCAT CGAATGGGAC
AAGGACGACA TCGACGCCAT CAAGATGATG AAGGTCGACG TGCTGGCGCT CGGCATGCTG
ACCTGCATCC GCAAAGGGTT CGACCTGATC GCGCAGCACA AGGGCGTACG CTACGCGCTG
TCCGACATCA AGTCGGAGGA CGACAACGCC GTCTACCAGA TGCTCCAGCG CGGCGAGTCG
ATCGGCGTGT TTCAGGTCGA GAGCCGGGCG CAGATGAACA TGCTGCCGCG TTTGAAGCCG
AAATGTTTCT ACGACCTCGT CATCGAGGTT GCGATCGTGC GGCCGGGCCC GATTCAGGGC
GACATGGTGC ATCCCTATCT GCGCCGTCGC AACAAGCAGG AGCCGGTGGT GTATCCCGCG
CCGGCTGGTC ACGCAGGCGA CGCCAACGAA CTCGAAGTCA TTCTCGGCAA GACGCTCGGC
GTGCCGTTGT TCCAGGAGCA GGCGATGCGG ATCGCGATCG AGGCCGCGCA TTTCACGCCG
GACGAAGCCA ATCAGTTGCG CCGGGCGATG GCGACGTTCC GCAATGTCGG CACCATCGGA
AAGTTCGAGA GCAAGATGGT CGGCAATCTG GTGGCGCGCG GCTATGATCC GGTGTTCGCC
AAGAACTGCT TCGAGCAGAT CAAGGGGTTC GGCTCCTACG GCTTTCCCGA GAGCCATGCT
GCGAGCTTCG CCAAGCTGGT CTATGTCTCG GCCTGGATGA AGTGCGAGCA TCCCGACGCG
TTCTGCTGCG CGCTGTTGAA TTCGCAGCCG ATGGGATTCT ATGCGCCGGC GCAGATCGTC
GGCGACGCGC GAGCCAACGA GGTCGAGGTG CGGCCGGTCG ACGTGTCGTT CAGCGACGGC
CAGTGCACGC TGGAGGAACG TTGCGGCAAG CATCACGCGG TGCGGCTCGG CTTTCGCCAG
ATCGACGGTT TCGTCTGGGC CGACCCGGAT GAGGAGCGGG TGCGGCGGGA GGCAGGACTT
CTGCCCAGCG AGGATTGGGC GGCGCGGATT GTCGCGGCGC GTGCGCGGGG GCCGTTCAAT
TCGCTTGAGC GGTTCGCGCG CCTGACCGCA TTGCCGAAGC GGGCGCTGAT CCTGCTTGCG
GATGCCGATG CGTTCCGCTC GCTCGGGCTC GATCGCCGCG CGGCGTTGTG GGCGGTGCGG
CGACTGCCCG ACGACGTGCC GCTGCCGCTG TTCGAAGCCG CCAGCGCATC CGAGCAGTTG
GATGAGAATG CAGCGCCGCT GCCGCAAATG CCGACGGCCG AGCACGTCGT CGCCGATTAT
CAAACCGTGC GGCTGTCGCT GAAGGGGCAC CCGATGGAGT TTTTGCGCCC GCTGTTTGCG
GCCGAGCGCG TCGTGACCTG CCGCTCGATA TCGGAGTCCC GCGTCAGCGG CCAGCGGATG
CGCTGCGCCG GCGTGGTGCT GGTGCGGCAG CGGCCGGGCA GCGCCAAAGG CGTGGTGTTC
ATCACGCTCG AGGACGAAAC CGGAATCGCC AACCTCGTGG TGTGGCCGGC GGTGATGGAG
ACGTTTCGCA AGGAGGTGAT GGGTGCGCGG CTGCTGTGGG TCGAGGGCCG AATCCAGGCG
AGCCCGGAAG GCGTGGTGCA TCTGGTCGCC GAGCGCCTGA GCGACCGCAG CTTCGAGATG
ACGCGGCTGT CCGACAATCT CGCGGCGCCG CGGCTCGGCG AATTGCACGA ACCGCTCAAC
GACGACCGCC GCGAGCATCC CGACAACCCC GCCCAGCGCA TCCGCCACCC CCGAGACGTC
CGCATCCTGC CGCCGTCGCG GGATTTTCAT TGA
 
Protein sequence
MKVPAYAEIG VTTNFSFLEG GSHPQDYVHE ASRLGLEAIG IADRNTLAGV VRAYSELGNE 
DLIHKPRLLI GARLVFADGT PDVLAYPVDR AAYGRLCRLL SVGKLRGAKG ECHLAVADLE
AFSQGLSLVL MPPYRFQASA IAAALQRLTA LESGGVWLGL TPYYRGDDKR RLARLKRVAW
AARVPGIATN DVLYHHPERR ALQDVLSCVR EKTTIDKIGR RLEANAERHL KPAAEMARLF
RADPDAIAET LRFAAGISFT LDELKYHYPD EPVPPGKTAQ QHLEDLTWEG VAEYFPAGIS
DKLHATIDKE LGIIAHRGYA QYFLTVHDIV RYARSQDILC QGRGSAANSA VCYVLGITCV
DPTEIDLLFE RFVSEERDEP PDIDVDFEHS RREEVMQYIY RRYGRHRAAI VATVIHYRPR
SAIRDVGKAL GLSEDVTAAL ADTVWGSWGK GLNEMQVKQA GLDPHNPLIG RAVELATELI
GFPRHLSQHV GGYVLTQDRL DSYVPIGNAA MEGRTFIEWD KDDIDAIKMM KVDVLALGML
TCIRKGFDLI AQHKGVRYAL SDIKSEDDNA VYQMLQRGES IGVFQVESRA QMNMLPRLKP
KCFYDLVIEV AIVRPGPIQG DMVHPYLRRR NKQEPVVYPA PAGHAGDANE LEVILGKTLG
VPLFQEQAMR IAIEAAHFTP DEANQLRRAM ATFRNVGTIG KFESKMVGNL VARGYDPVFA
KNCFEQIKGF GSYGFPESHA ASFAKLVYVS AWMKCEHPDA FCCALLNSQP MGFYAPAQIV
GDARANEVEV RPVDVSFSDG QCTLEERCGK HHAVRLGFRQ IDGFVWADPD EERVRREAGL
LPSEDWAARI VAARARGPFN SLERFARLTA LPKRALILLA DADAFRSLGL DRRAALWAVR
RLPDDVPLPL FEAASASEQL DENAAPLPQM PTAEHVVADY QTVRLSLKGH PMEFLRPLFA
AERVVTCRSI SESRVSGQRM RCAGVVLVRQ RPGSAKGVVF ITLEDETGIA NLVVWPAVME
TFRKEVMGAR LLWVEGRIQA SPEGVVHLVA ERLSDRSFEM TRLSDNLAAP RLGELHEPLN
DDRREHPDNP AQRIRHPRDV RILPPSRDFH