Gene Information |
Locus tag | RPB_0848 |
Symbol | |
ID | 3909106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 965253 |
End bp | 968351 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637882741 |
Product | DNA polymerase I |
Protein accession | YP_484470 |
Protein GI | 86747974 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
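The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the annotated span includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bases."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if includes_stop else codons


# Values taken from the record for RPB_0848.
length_bp = gene_length(965253, 968351)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the result agrees with the Gene Length (3099 bp) and Protein Length (1032 aa) fields. The strand does not affect the length calculation; it only determines which end of the span holds the start codon.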
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAT CCCCCTCGAA AGCCGCCGCG GCGCCCGCCG CAAACTCCCC TGCCCCGGCC GCCGCCAAGG CGCCGGGCAC GGGCGATCAC ATCTTCCTGG TCGACGGATC GTCCTACATC TTCCGCGCCT ATCACGCGCT GCCGCCGCTG ACCCGCAAGT CGGACGGGCT GCAGGTCAAC GCCGTGCTCG GCTTCTGCAA CATGCTGTGG AAGCTGCTCC GCGAGATGCC GCCGGACAAC CGGCCGACGC ATCTGGCGAT CATCTTCGAC AAATCCGAGC ACACCTTCCG CAACCAGCTC TACCCCGACT ACAAGGCGCA CCGGCCGCCG GCGCCGGACG ATCTGATCCC GCAATTCGCC CTGATCCGCG AGGCGGTGCG GGCGTTCGAC CTGCCGTGCC TCGAACAATC CGGCTTCGAG GCCGACGATC TGATCGCCAC CTATGTGCGC GAGGCCTGCG AGCGCGGCGC CACGGCTACA ATTGTTTCGT CCGACAAGGA TTTGATGCAG CTCGTGACCG ATTGCGTCAC GATGTACGAC ACCATGAAGG ACCGCCGCAT CGGCATTGCC GAGGTGATCG AGAAATTCGG CGTGCCGCCC GAGAAGGTGG TCGAAGTGCA GGCGCTGGCC GGCGACAGCG TCGACAACGT GCCGGGCGTG CCCGGCATCG GCATCAAGAC CGCGGCCCAA CTGATCAATG AATATGGCGA CCTCGACACG CTCTTGGCGC GCGCCGGCGA AATCAAGCAG CCGAAGCGGC GCGAGGCGCT GATCGAGAAC GCCGAGAAGG CGCGGATCTC GCGGCAACTG GTGCTGCTCG ACGACAAGGT GAAGCTCGAC GTCCCGCTCG ACGAACTCGC CGTGCACGAG CCCGACGCGC GAAAGCTGAT CTCGTTTCTC AAGGCGATGG AATTCACCAC GCTGACGCGG CGGGTCGCGG ACTACTCCCA GATCGATCCG TCCGACGTCG AGGCCGAAGC CGCGCTGAAG TCTTCACCTC TCCCGCTTGC GGGGGAGGTC GGCGCGCGGA GCGCGACGGG TGGGGGCACG TCCACAGGCG GAGACCTTTT TTCGGGGCAA GTGCCCTCAC CCCAACCCTC TCCCGCAGGC GGGAGAGGGA GCGCGCCGCA TTCGGCGGAA GGTGGTCCTC TGAATGCCGG CCGCGGCCGC GACGGCCAGC CGGGCGAGGT GCTGTCGCCA CAGATCCTCG CCGCCAAGCG CGCCGAAGCC GCGCGAAAAA TTCCGGTCGA TCGCACCGCA TACAAGACCG TGCGCACGCG CGACGAGCTG CAGGGCTGGA TCGCGCGCAT CCACGACGCC GGCGCCTTCG CGGTCGATGC GATCGCGACC TCGATCGATC CGATGCAGGC GGAGCTGTGC GGCATCGCCT TGTCGCTCGG GCCGAACGAC GCCTGCTACA TCCCGCTCGG CCATCGTCAG ACCGGCGACG GAAGCGGTCT GTTCGCCGCA GGATTGGCGC CCGACCAGCT CGGCGCGCGC GATGTGCTCG ATGCGCTGCG GCCGCTGCTG GACTCCGCAG GCCTCGCCAA GATCGGCTTC AACATCAAAT TCACCGCGGT GCTGCTGGCG CAGCACGGCG TCACCTTGCG TAACATCGAC GATGTGCAAC TGATCTCCTA TGTGCTCGAT GCCGGCCGCG GCAGCCACGG TCTGGATGCG CTGTCCGAAA GCAATCTCGG CCACACCCTG CACGTGCTCG GCGCATTGAC CGGCAGCGGC AAGGCGAAGA TCGCGTTCGA TCAGGTGCCG ATCGACCGCG CCACCGAATA TGGCGGCGAG 
CGCTCCGACG TCGCACTCCG GCTGTGGCGC GTGCTGAAGC CGCGGCTGGT CGCCGAGCGG ATGATGGCAG TGTACGAGAC GCTGGAGCGG CCGCTGGTCG GCGTGCTGGC GCGGATGGAG CGGCGCGGCA TCTCGATCGA TCGCCAAGTG CTGTCGCGAT TGTCCGCCGA CTTCGCCCAG ACCGCCGCAA GGATCGAGGC AGAGATCCGC GAACTCGCCG GCGAGGACAT CAACATCGGC AGTCCGAAGC AGCTCGGCGA CATCCTGTTC GGCAAGATGG GCCTGCCCGG TGGCAGCAAG ACCAAGACCG GCGCGTGGTC GACCTCGGCG CAGGTGCTCG ACGAGCTCGC CGAACAAGGG CACGAATTCC CGCGCAAAAT TCTCGACTGG CGCCAGGTCT CGAAGCTGCG CTCGACCTAC ACGGACGCGC TGCCGACCTA TGTGCATCCG CAGACCCAGC GCGTCCACAC CACCTACGCG CTCGCCGCCA CCACCACCGG GCGGCTGTCG TCGAACGAAC CCAATCTGCA GAACATCCCG GTGCGCACCG AGGACGGCCG TAAAATCCGC CGCGCCTTCG TGGCGACGCC CGGCCACAGA CTGGTCTCGG CCGACTACTC GCAGATCGAA TTGCGCCTGC TGTCCGAAGT CGCCGATGTG CCGGCGCTGC GGAAAGCGTT TCAGGACGGC ATCGACATTC ATGCGATGAC GGCGTCCGAA ATGTTCGGCG TGCCGGTCGA GGGCATGCCG TCGGACATCC GCCGCCGTGC GAAAGCGATC AATTTCGGCA TCATCTACGG CATCTCGGCG TTCGGCCTCG CCAATCAGCT CGGCATCCCG CGCGAGGAGG CCGGCGCCTA TATCAAGCGC TATTTCGAGC GCTTCCCGGG CATCCGCGCC TATATGGACG AGACCCGCGA TTTCTGCCGG ACGCACGGCT ATGTCGAGAC GCTGTTCGGC CGCAAATGCC ACTACCCGGA CATCAAGGCC TCGAACCCGT CGATCCGCGC CTTCAACGAA CGCGCCGCCA TCAATGCGAG GCTGCAAGGC TCCGCCGCCG ACATCATCCG CCGCGCCATG GTGCGGATGG AGGACGCGCT GGCCGAGAAG AAGCTGGCGG CGCAGATGCT GCTGCAGGTG CACGACGAAC TGATCTTCGA AGTGCCAGAG GATGAAGTGA CGGCGACGCT GCCGGTGGTG AGCCACGTCA TGCAGGACGC GCCGTTCCCG GCCGTGATCC TCAACGTGCC GCTGCAGGTC GACGCAAGAG CCGCGGACAA TTGGGACGAG GCGCATTGA
|
Protein sequence | MPKSPSKAAA APAANSPAPA AAKAPGTGDH IFLVDGSSYI FRAYHALPPL TRKSDGLQVN AVLGFCNMLW KLLREMPPDN RPTHLAIIFD KSEHTFRNQL YPDYKAHRPP APDDLIPQFA LIREAVRAFD LPCLEQSGFE ADDLIATYVR EACERGATAT IVSSDKDLMQ LVTDCVTMYD TMKDRRIGIA EVIEKFGVPP EKVVEVQALA GDSVDNVPGV PGIGIKTAAQ LINEYGDLDT LLARAGEIKQ PKRREALIEN AEKARISRQL VLLDDKVKLD VPLDELAVHE PDARKLISFL KAMEFTTLTR RVADYSQIDP SDVEAEAALK SSPLPLAGEV GARSATGGGT STGGDLFSGQ VPSPQPSPAG GRGSAPHSAE GGPLNAGRGR DGQPGEVLSP QILAAKRAEA ARKIPVDRTA YKTVRTRDEL QGWIARIHDA GAFAVDAIAT SIDPMQAELC GIALSLGPND ACYIPLGHRQ TGDGSGLFAA GLAPDQLGAR DVLDALRPLL DSAGLAKIGF NIKFTAVLLA QHGVTLRNID DVQLISYVLD AGRGSHGLDA LSESNLGHTL HVLGALTGSG KAKIAFDQVP IDRATEYGGE RSDVALRLWR VLKPRLVAER MMAVYETLER PLVGVLARME RRGISIDRQV LSRLSADFAQ TAARIEAEIR ELAGEDINIG SPKQLGDILF GKMGLPGGSK TKTGAWSTSA QVLDELAEQG HEFPRKILDW RQVSKLRSTY TDALPTYVHP QTQRVHTTYA LAATTTGRLS SNEPNLQNIP VRTEDGRKIR RAFVATPGHR LVSADYSQIE LRLLSEVADV PALRKAFQDG IDIHAMTASE MFGVPVEGMP SDIRRRAKAI NFGIIYGISA FGLANQLGIP REEAGAYIKR YFERFPGIRA YMDETRDFCR THGYVETLFG RKCHYPDIKA SNPSIRAFNE RAAINARLQG SAADIIRRAM VRMEDALAEK KLAAQMLLQV HDELIFEVPE DEVTATLPVV SHVMQDAPFP AVILNVPLQV DARAADNWDE AH
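The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own pipeline: it strips the block spacing, computes GC content, and translates DNA with the codon assignments of translation table 11 (which match the standard code for elongation codons). The helper names are hypothetical, and only the first 30 bases of the gene sequence are used as input; the 67% GC figure in the record refers to the whole gene, not this prefix.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): AMINO_ACIDS[index]
    for index, codon in enumerate(product(BASES, repeat=3))
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean_sequence("ATGCCGAAAT CCCCCTCGAA AGCCGCCGCG")
print(translate(prefix))             # first ten residues
print(round(gc_percent(prefix), 1))  # GC content of the 30-base prefix only
```

The translated prefix matches the first block of the protein sequence (MPKSPSKAAA). Because the gene sequence is already given in coding orientation (it begins with ATG and ends with TGA), no reverse complement is needed even though the gene lies on the minus strand. Table 11 also permits alternative start codons such as GTG and TTG, which this sketch does not treat specially; that is harmless here because the gene starts with ATG.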
|
| |