Gene RPB_0848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0848 
Symbol 
ID3909106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp965253 
End bp968351 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content67% 
IMG OID637882741 
ProductDNA polymerase I 
Protein accessionYP_484470 
Protein GI86747974 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAT CCCCCTCGAA AGCCGCCGCG GCGCCCGCCG CAAACTCCCC TGCCCCGGCC 
GCCGCCAAGG CGCCGGGCAC GGGCGATCAC ATCTTCCTGG TCGACGGATC GTCCTACATC
TTCCGCGCCT ATCACGCGCT GCCGCCGCTG ACCCGCAAGT CGGACGGGCT GCAGGTCAAC
GCCGTGCTCG GCTTCTGCAA CATGCTGTGG AAGCTGCTCC GCGAGATGCC GCCGGACAAC
CGGCCGACGC ATCTGGCGAT CATCTTCGAC AAATCCGAGC ACACCTTCCG CAACCAGCTC
TACCCCGACT ACAAGGCGCA CCGGCCGCCG GCGCCGGACG ATCTGATCCC GCAATTCGCC
CTGATCCGCG AGGCGGTGCG GGCGTTCGAC CTGCCGTGCC TCGAACAATC CGGCTTCGAG
GCCGACGATC TGATCGCCAC CTATGTGCGC GAGGCCTGCG AGCGCGGCGC CACGGCTACA
ATTGTTTCGT CCGACAAGGA TTTGATGCAG CTCGTGACCG ATTGCGTCAC GATGTACGAC
ACCATGAAGG ACCGCCGCAT CGGCATTGCC GAGGTGATCG AGAAATTCGG CGTGCCGCCC
GAGAAGGTGG TCGAAGTGCA GGCGCTGGCC GGCGACAGCG TCGACAACGT GCCGGGCGTG
CCCGGCATCG GCATCAAGAC CGCGGCCCAA CTGATCAATG AATATGGCGA CCTCGACACG
CTCTTGGCGC GCGCCGGCGA AATCAAGCAG CCGAAGCGGC GCGAGGCGCT GATCGAGAAC
GCCGAGAAGG CGCGGATCTC GCGGCAACTG GTGCTGCTCG ACGACAAGGT GAAGCTCGAC
GTCCCGCTCG ACGAACTCGC CGTGCACGAG CCCGACGCGC GAAAGCTGAT CTCGTTTCTC
AAGGCGATGG AATTCACCAC GCTGACGCGG CGGGTCGCGG ACTACTCCCA GATCGATCCG
TCCGACGTCG AGGCCGAAGC CGCGCTGAAG TCTTCACCTC TCCCGCTTGC GGGGGAGGTC
GGCGCGCGGA GCGCGACGGG TGGGGGCACG TCCACAGGCG GAGACCTTTT TTCGGGGCAA
GTGCCCTCAC CCCAACCCTC TCCCGCAGGC GGGAGAGGGA GCGCGCCGCA TTCGGCGGAA
GGTGGTCCTC TGAATGCCGG CCGCGGCCGC GACGGCCAGC CGGGCGAGGT GCTGTCGCCA
CAGATCCTCG CCGCCAAGCG CGCCGAAGCC GCGCGAAAAA TTCCGGTCGA TCGCACCGCA
TACAAGACCG TGCGCACGCG CGACGAGCTG CAGGGCTGGA TCGCGCGCAT CCACGACGCC
GGCGCCTTCG CGGTCGATGC GATCGCGACC TCGATCGATC CGATGCAGGC GGAGCTGTGC
GGCATCGCCT TGTCGCTCGG GCCGAACGAC GCCTGCTACA TCCCGCTCGG CCATCGTCAG
ACCGGCGACG GAAGCGGTCT GTTCGCCGCA GGATTGGCGC CCGACCAGCT CGGCGCGCGC
GATGTGCTCG ATGCGCTGCG GCCGCTGCTG GACTCCGCAG GCCTCGCCAA GATCGGCTTC
AACATCAAAT TCACCGCGGT GCTGCTGGCG CAGCACGGCG TCACCTTGCG TAACATCGAC
GATGTGCAAC TGATCTCCTA TGTGCTCGAT GCCGGCCGCG GCAGCCACGG TCTGGATGCG
CTGTCCGAAA GCAATCTCGG CCACACCCTG CACGTGCTCG GCGCATTGAC CGGCAGCGGC
AAGGCGAAGA TCGCGTTCGA TCAGGTGCCG ATCGACCGCG CCACCGAATA TGGCGGCGAG
CGCTCCGACG TCGCACTCCG GCTGTGGCGC GTGCTGAAGC CGCGGCTGGT CGCCGAGCGG
ATGATGGCAG TGTACGAGAC GCTGGAGCGG CCGCTGGTCG GCGTGCTGGC GCGGATGGAG
CGGCGCGGCA TCTCGATCGA TCGCCAAGTG CTGTCGCGAT TGTCCGCCGA CTTCGCCCAG
ACCGCCGCAA GGATCGAGGC AGAGATCCGC GAACTCGCCG GCGAGGACAT CAACATCGGC
AGTCCGAAGC AGCTCGGCGA CATCCTGTTC GGCAAGATGG GCCTGCCCGG TGGCAGCAAG
ACCAAGACCG GCGCGTGGTC GACCTCGGCG CAGGTGCTCG ACGAGCTCGC CGAACAAGGG
CACGAATTCC CGCGCAAAAT TCTCGACTGG CGCCAGGTCT CGAAGCTGCG CTCGACCTAC
ACGGACGCGC TGCCGACCTA TGTGCATCCG CAGACCCAGC GCGTCCACAC CACCTACGCG
CTCGCCGCCA CCACCACCGG GCGGCTGTCG TCGAACGAAC CCAATCTGCA GAACATCCCG
GTGCGCACCG AGGACGGCCG TAAAATCCGC CGCGCCTTCG TGGCGACGCC CGGCCACAGA
CTGGTCTCGG CCGACTACTC GCAGATCGAA TTGCGCCTGC TGTCCGAAGT CGCCGATGTG
CCGGCGCTGC GGAAAGCGTT TCAGGACGGC ATCGACATTC ATGCGATGAC GGCGTCCGAA
ATGTTCGGCG TGCCGGTCGA GGGCATGCCG TCGGACATCC GCCGCCGTGC GAAAGCGATC
AATTTCGGCA TCATCTACGG CATCTCGGCG TTCGGCCTCG CCAATCAGCT CGGCATCCCG
CGCGAGGAGG CCGGCGCCTA TATCAAGCGC TATTTCGAGC GCTTCCCGGG CATCCGCGCC
TATATGGACG AGACCCGCGA TTTCTGCCGG ACGCACGGCT ATGTCGAGAC GCTGTTCGGC
CGCAAATGCC ACTACCCGGA CATCAAGGCC TCGAACCCGT CGATCCGCGC CTTCAACGAA
CGCGCCGCCA TCAATGCGAG GCTGCAAGGC TCCGCCGCCG ACATCATCCG CCGCGCCATG
GTGCGGATGG AGGACGCGCT GGCCGAGAAG AAGCTGGCGG CGCAGATGCT GCTGCAGGTG
CACGACGAAC TGATCTTCGA AGTGCCAGAG GATGAAGTGA CGGCGACGCT GCCGGTGGTG
AGCCACGTCA TGCAGGACGC GCCGTTCCCG GCCGTGATCC TCAACGTGCC GCTGCAGGTC
GACGCAAGAG CCGCGGACAA TTGGGACGAG GCGCATTGA
 
Protein sequence
MPKSPSKAAA APAANSPAPA AAKAPGTGDH IFLVDGSSYI FRAYHALPPL TRKSDGLQVN 
AVLGFCNMLW KLLREMPPDN RPTHLAIIFD KSEHTFRNQL YPDYKAHRPP APDDLIPQFA
LIREAVRAFD LPCLEQSGFE ADDLIATYVR EACERGATAT IVSSDKDLMQ LVTDCVTMYD
TMKDRRIGIA EVIEKFGVPP EKVVEVQALA GDSVDNVPGV PGIGIKTAAQ LINEYGDLDT
LLARAGEIKQ PKRREALIEN AEKARISRQL VLLDDKVKLD VPLDELAVHE PDARKLISFL
KAMEFTTLTR RVADYSQIDP SDVEAEAALK SSPLPLAGEV GARSATGGGT STGGDLFSGQ
VPSPQPSPAG GRGSAPHSAE GGPLNAGRGR DGQPGEVLSP QILAAKRAEA ARKIPVDRTA
YKTVRTRDEL QGWIARIHDA GAFAVDAIAT SIDPMQAELC GIALSLGPND ACYIPLGHRQ
TGDGSGLFAA GLAPDQLGAR DVLDALRPLL DSAGLAKIGF NIKFTAVLLA QHGVTLRNID
DVQLISYVLD AGRGSHGLDA LSESNLGHTL HVLGALTGSG KAKIAFDQVP IDRATEYGGE
RSDVALRLWR VLKPRLVAER MMAVYETLER PLVGVLARME RRGISIDRQV LSRLSADFAQ
TAARIEAEIR ELAGEDINIG SPKQLGDILF GKMGLPGGSK TKTGAWSTSA QVLDELAEQG
HEFPRKILDW RQVSKLRSTY TDALPTYVHP QTQRVHTTYA LAATTTGRLS SNEPNLQNIP
VRTEDGRKIR RAFVATPGHR LVSADYSQIE LRLLSEVADV PALRKAFQDG IDIHAMTASE
MFGVPVEGMP SDIRRRAKAI NFGIIYGISA FGLANQLGIP REEAGAYIKR YFERFPGIRA
YMDETRDFCR THGYVETLFG RKCHYPDIKA SNPSIRAFNE RAAINARLQG SAADIIRRAM
VRMEDALAEK KLAAQMLLQV HDELIFEVPE DEVTATLPVV SHVMQDAPFP AVILNVPLQV
DARAADNWDE AH