Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_1903 |
Symbol | dnaE2 |
ID | 4022385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2137950 |
End bp | 2141282 |
Gene Length | 3333 bp |
Protein Length | 1110 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637962096 |
Product | error-prone DNA polymerase |
Protein accession | YP_569039 |
Protein GI | 91976380 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0587] DNA polymerase III, alpha subunit |
TIGRFAM ID | [TIGR00594] DNA-directed DNA polymerase III (polc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.989909 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTCC CGGCCTATGC CGAGATCGGC GTCACCACCA ATTTCTCGTT CCTCGAAGGC GGCTCGCATC CGCAGGACTA TGTTCACGAG GCGAGCCGGC TCGGGCTGGA GGCGATCGGC ATCGCCGACC GCAACACGCT GGCCGGCGTG GTGCGGGCCT ATAGCGAACT CGGCAATGAG GACCTCATCC ATAAGCCGAG GCTGCTGATC GGCGCGCGGC TGGTGTTCGC CGACGGCACG CCGGACGTGC TCGCTTATCC GGTCGATCGG GCCGCCTATG GGCGGCTGTG CCGTTTGCTC AGTGTCGGCA AGCTGCGCGG GGCGAAAGGC GAATGCCATC TCGCCGTCGC CGATCTCGAA GCGTTCAGTC AGGGCCTGTC GCTGGTGCTG ATGCCGCCTT ATCGTTTCCA GGCCAGCGCC ATCGCCGCCG CGTTGCAGCG TCTGACCGCG CTCGAATCCG GCGGAGTCTG GCTCGGGCTG ACGCCGTATT ATCGCGGCGA CGACAAGCGG CGGCTGGCGC GGCTGAAGCG CGTGGCGTGG GCCGCCCGGG TGCCGGGCAT CGCCACCAAC GACGTGCTGT ATCACCACCC CGAGCGCCGC GCGCTGCAGG ATGTACTGAG CTGCGTGCGC GAGAAGACCA CGATCGACAA GATCGGGCGG CGGCTGGAGG CGAATGCCGA GCGGCATCTG AAACCCGCAG CCGAAATGGC CCGGCTATTC CGCGCGGACC CCGACGCCAT CGCCGAGACG TTGCGCTTTG CCGCCGGCAT CTCTTTCACC CTGGACGAAT TGAAATATCA CTATCCCGAC GAGCCGGTGC CGCCCGGCAA GACCGCGCAG CAGCATCTCG AGGATCTGAC CTGGGAAGGC GTCGCCGAAT ATTTTCCGGC TGGCATCAGC GATAAGCTAC ACGCCACCAT CGACAAGGAA CTGGGGATCA TCGCGCATCG CGGCTATGCG CAATACTTCC TCACCGTGCA CGACATCGTT CGCTACGCGC GGTCGCAGGA CATCCTGTGC CAGGGGCGCG GCTCGGCCGC TAATTCGGCG GTGTGCTACG TCCTCGGCAT CACCTGCGTC GATCCGACAG AGATCGACTT GCTGTTCGAA CGTTTCGTCT CCGAGGAGCG CGACGAGCCG CCGGACATCG ACGTCGATTT CGAGCATTCG CGCCGCGAAG AGGTGATGCA ATACATCTAT CGCCGCTATG GTCGCCACCG CGCTGCGATC GTCGCCACCG TGATTCACTA TCGGCCGCGC TCGGCGATCC GCGACGTCGG CAAGGCGCTG GGCCTCAGCG AGGACGTCAC CGCCGCGCTG GCCGATACGG TGTGGGGAAG CTGGGGCAAG GGCCTGAACG AGATGCAGGT GAAGCAGGCC GGGCTCGATC CGCACAACCC GCTGATCGGG CGCGCGGTCG AACTCGCCAC CGAGCTGATC GGCTTTCCGC GGCATCTGTC GCAGCATGTC GGCGGTTATG TGCTGACCCA GGACCGGCTC GACAGCTACG TACCGATCGG CAACGCGGCG ATGGAGGGGC GTACCTTCAT CGAATGGGAC AAGGACGACA TCGACGCCAT CAAGATGATG AAGGTCGACG TGCTGGCGCT CGGCATGCTG ACCTGCATCC GCAAAGGGTT CGACCTGATC GCGCAGCACA AGGGCGTACG CTACGCGCTG TCCGACATCA AGTCGGAGGA CGACAACGCC GTCTACCAGA TGCTCCAGCG CGGCGAGTCG ATCGGCGTGT TTCAGGTCGA GAGCCGGGCG CAGATGAACA TGCTGCCGCG TTTGAAGCCG AAATGTTTCT ACGACCTCGT CATCGAGGTT GCGATCGTGC GGCCGGGCCC GATTCAGGGC GACATGGTGC ATCCCTATCT GCGCCGTCGC AACAAGCAGG AGCCGGTGGT GTATCCCGCG CCGGCTGGTC ACGCAGGCGA CGCCAACGAA CTCGAAGTCA TTCTCGGCAA GACGCTCGGC GTGCCGTTGT TCCAGGAGCA GGCGATGCGG ATCGCGATCG AGGCCGCGCA TTTCACGCCG GACGAAGCCA ATCAGTTGCG CCGGGCGATG GCGACGTTCC GCAATGTCGG CACCATCGGA AAGTTCGAGA GCAAGATGGT CGGCAATCTG GTGGCGCGCG GCTATGATCC GGTGTTCGCC AAGAACTGCT TCGAGCAGAT CAAGGGGTTC GGCTCCTACG GCTTTCCCGA GAGCCATGCT GCGAGCTTCG CCAAGCTGGT CTATGTCTCG GCCTGGATGA AGTGCGAGCA TCCCGACGCG TTCTGCTGCG CGCTGTTGAA TTCGCAGCCG ATGGGATTCT ATGCGCCGGC GCAGATCGTC GGCGACGCGC GAGCCAACGA GGTCGAGGTG CGGCCGGTCG ACGTGTCGTT CAGCGACGGC CAGTGCACGC TGGAGGAACG TTGCGGCAAG CATCACGCGG TGCGGCTCGG CTTTCGCCAG ATCGACGGTT TCGTCTGGGC CGACCCGGAT GAGGAGCGGG TGCGGCGGGA GGCAGGACTT CTGCCCAGCG AGGATTGGGC GGCGCGGATT GTCGCGGCGC GTGCGCGGGG GCCGTTCAAT TCGCTTGAGC GGTTCGCGCG CCTGACCGCA TTGCCGAAGC GGGCGCTGAT CCTGCTTGCG GATGCCGATG CGTTCCGCTC GCTCGGGCTC GATCGCCGCG CGGCGTTGTG GGCGGTGCGG CGACTGCCCG ACGACGTGCC GCTGCCGCTG TTCGAAGCCG CCAGCGCATC CGAGCAGTTG GATGAGAATG CAGCGCCGCT GCCGCAAATG CCGACGGCCG AGCACGTCGT CGCCGATTAT CAAACCGTGC GGCTGTCGCT GAAGGGGCAC CCGATGGAGT TTTTGCGCCC GCTGTTTGCG GCCGAGCGCG TCGTGACCTG CCGCTCGATA TCGGAGTCCC GCGTCAGCGG CCAGCGGATG CGCTGCGCCG GCGTGGTGCT GGTGCGGCAG CGGCCGGGCA GCGCCAAAGG CGTGGTGTTC ATCACGCTCG AGGACGAAAC CGGAATCGCC AACCTCGTGG TGTGGCCGGC GGTGATGGAG ACGTTTCGCA AGGAGGTGAT GGGTGCGCGG CTGCTGTGGG TCGAGGGCCG AATCCAGGCG AGCCCGGAAG GCGTGGTGCA TCTGGTCGCC GAGCGCCTGA GCGACCGCAG CTTCGAGATG ACGCGGCTGT CCGACAATCT CGCGGCGCCG CGGCTCGGCG AATTGCACGA ACCGCTCAAC GACGACCGCC GCGAGCATCC CGACAACCCC GCCCAGCGCA TCCGCCACCC CCGAGACGTC CGCATCCTGC CGCCGTCGCG GGATTTTCAT TGA
|
Protein sequence | MKVPAYAEIG VTTNFSFLEG GSHPQDYVHE ASRLGLEAIG IADRNTLAGV VRAYSELGNE DLIHKPRLLI GARLVFADGT PDVLAYPVDR AAYGRLCRLL SVGKLRGAKG ECHLAVADLE AFSQGLSLVL MPPYRFQASA IAAALQRLTA LESGGVWLGL TPYYRGDDKR RLARLKRVAW AARVPGIATN DVLYHHPERR ALQDVLSCVR EKTTIDKIGR RLEANAERHL KPAAEMARLF RADPDAIAET LRFAAGISFT LDELKYHYPD EPVPPGKTAQ QHLEDLTWEG VAEYFPAGIS DKLHATIDKE LGIIAHRGYA QYFLTVHDIV RYARSQDILC QGRGSAANSA VCYVLGITCV DPTEIDLLFE RFVSEERDEP PDIDVDFEHS RREEVMQYIY RRYGRHRAAI VATVIHYRPR SAIRDVGKAL GLSEDVTAAL ADTVWGSWGK GLNEMQVKQA GLDPHNPLIG RAVELATELI GFPRHLSQHV GGYVLTQDRL DSYVPIGNAA MEGRTFIEWD KDDIDAIKMM KVDVLALGML TCIRKGFDLI AQHKGVRYAL SDIKSEDDNA VYQMLQRGES IGVFQVESRA QMNMLPRLKP KCFYDLVIEV AIVRPGPIQG DMVHPYLRRR NKQEPVVYPA PAGHAGDANE LEVILGKTLG VPLFQEQAMR IAIEAAHFTP DEANQLRRAM ATFRNVGTIG KFESKMVGNL VARGYDPVFA KNCFEQIKGF GSYGFPESHA ASFAKLVYVS AWMKCEHPDA FCCALLNSQP MGFYAPAQIV GDARANEVEV RPVDVSFSDG QCTLEERCGK HHAVRLGFRQ IDGFVWADPD EERVRREAGL LPSEDWAARI VAARARGPFN SLERFARLTA LPKRALILLA DADAFRSLGL DRRAALWAVR RLPDDVPLPL FEAASASEQL DENAAPLPQM PTAEHVVADY QTVRLSLKGH PMEFLRPLFA AERVVTCRSI SESRVSGQRM RCAGVVLVRQ RPGSAKGVVF ITLEDETGIA NLVVWPAVME TFRKEVMGAR LLWVEGRIQA SPEGVVHLVA ERLSDRSFEM TRLSDNLAAP RLGELHEPLN DDRREHPDNP AQRIRHPRDV RILPPSRDFH
|
| |