Gene RPD_0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0956 
Symbol 
ID4021431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1075040 
End bp1078141 
Gene Length3102 bp 
Protein Length1033 aa 
Translation table11 
GC content66% 
IMG OID637961147 
ProductDNA polymerase I 
Protein accessionYP_568095 
Protein GI91975436 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAT CCCCTTCGAA AGCCGCCGCG ACGCCAGCCG CCGCCACCTC CCCTGCCCCG 
ATCGCCGGCA AGGCGCCGGG CAAGGGGGAT CACATCTTCC TGGTCGACGG ATCGTCCTAC
ATCTTCCGCG CCTATCACGC GCTGCCGCCG CTGAGCCGCA AATCCGACGG GTTGCAGGTC
AACGCCGTGC TCGGCTTCTG CAACATGCTG TGGAAGCTGT TGCGCGACAT GCCGCCGGAC
AACCGGCCGA CGCATCTGGC GATCATCTTC GACAAGTCGG AAGCGACGTT CCGCAACGAA
ATGTATCCCG CCTACAAGGC GCATCGGCCG CCGGCGCCCG ACGACCTGAT CCCGCAATTC
GCGCTGATCC GCGAGGCGGT GCGCGCGTTC GATCTGCCCT GTCTCGAACA GTCCGGCTTC
GAAGCCGATG ATTTGATCGC CACCTATGTG CGCGAGGCCT GCGAGGCGGG CGCGACAGCC
ACCATCGTGT CGTCCGACAA GGACCTGATG CAGCTCGTGA CCGACTGCGT CACGATGTAC
GACACCATGA AGGACCGCCG CATCGGCGTT GCCGAAGTGA TCGAGAAATT CGGCGTGCCG
CCGGATAAAG TCGTCGAGGT CCAGGCGCTG GCCGGCGACA GCGTCGACAA CGTGCCGGGC
GTGCCCGGCA TCGGCATCAA GACCGCGGCG CAATTGATTA CCGAATACGG CGATCTCGAA
ACCTTGCTGG CGCGCGCCGG AGAGATCAAG CAGCCGAAGC GGCGCGAGGC GCTCATCGAG
AACGCCGAGA AGGCGCGGAT CTCGCGCAAG CTCGTCTTGC TCGATGATCA CGTCAAACTC
GACGTGCCGC TCGACGAGCT CGCGGTGCAC GAGCCCGATG CGCGAAAGCT GATCTCGTTT
CTCAAGGCGA TGGAATTCAC CACGCTGACG CGGCGGGTCG CCGACTACGC GCAGATCGAT
CCGTCGGATG TCGAGGCGGA AGCCGCGCTG AAGTCTTCAC CTCTCCCGCT TGCGGGGGAA
GTCGGCGCGC GTAGCGCGAC GGGTGGGGGC ACTGCCGCAA GCGGGGACCT GTTTTCGGAG
CAAGTGCCCT CAACCCAGCC CTCTCCCGCA AGCGGGAGAG GGAGCGCGCA GCAATCGGGG
GAAGGTGGTG CGCTGAACGC CGGGCGTGGG CGCGACGGCA GGCCGGGCGA GGTTCTGTCG
CCGCAGATCC TCGCCGCCAG GCGCGCCGAG ACCGCGCGAA AAATTCCGGT CGATCGCACC
GCATACAAGA CGATCCGGAC GCTCGACGAA TTGCACGGCT GGATCGCACG CATCCACGAC
GCAGGCTTCG TCGCGGTCGA CGCCATCGCG ACCTCGATCG ATCCGATGCA GGCGGAGCTT
TGCGGCATTG CCCTGGCGCT GGCGCCGAAT GATGCGGCCT ACATCCCGCT CGGCCATCGT
CAGACCGGCG ACGGAAGCGG CCTGTTCGCC GCGGGACTCG CGCCCGACCA GCTCGGCGCG
CGCGAGGCGC TGGATGCCTT GAAGCCGCTG CTGGAATCTG CCGGCCTCGC CAAGATCGGC
TTCAACATCA AGTTCACCGC AGTGTTGCTG GCGCAGCACG GCATCACGTT GCGCAACATC
GACGATCTGC AGCTGATGTC CTACGCGCTC GACGCCGGCC GCGGCAGCCA CGGACTCGAT
GCGCTGAGCG AGAGCAATCT CGGCCACACC CTGCACGCGC TCGGCGAACT CACCGGCAGC
GGCAAGGCAA AGATCGGTTT CGATCAGGTG CCGATCGAGC GCGCCACCGA ATATGCGGGC
GAGCGCGCCG ACGTGGCGCT GCGGCTATGG CGGGTGCTGA AGCCGCGGCT GGTCGCCGAG
CGGATGATGG CCGTGTACGA GACGCTGGAG CGGCCATTGG TCGGCGTGCT GGCGCGGATG
GAACGGCGCG GCATCTCGAT CGATCGAAGC GTGCTATCGC GGCTGTCGGC CGATTTCGCC
CAGACCGCGG CGCGGATCGA AGCGGAGATT CGCGAACTCG CCGGCGAAGA CATCAACATC
GGCAGTCCGA AGCAGCTCGG CGACATCCTG TTCGGCAAGA TGGGTCTGCC GGGCGGCAGC
AAGACCAAGA CCGGCGCGTG GTCGACCTCG GCGCAGGTGC TCGACGAACT CGCCGAACAG
GGCCATGAAT TTCCAAGGAA GATTCTCGAC TGGCGACAGG TGAGCAAGCT GCGCTCGACC
TACACCGACG CGCTGCCGAA CTACGTGCAT CCGCAGACGC AACGGGTCCA CACCACTTAC
GCGCTCGCCG CCACCACCAC CGGCCGGCTG TCGTCGAACG AGCCGAATTT GCAGAACATC
CCGGTGCGCA CCGAGGACGG CCGCAAGATT CGCCGCGCCT TCATCGCATC GCCGGGCTAC
AAACTGGTGT CGGCGGACTA CTCCCAGATC GAGCTGCGAT TGCTGTCCGA GGTCGCCGAG
GTGCCGGCGC TGCGCAAGGC GTTCCAAGAC GGCATCGACA TCCACGCCAT GACCGCGTCG
GAAATGTTCG GCGTGCCGGT CGAGGGCATG CCGTCGGAAA TCCGCCGCCG CGCCAAGGCG
ATCAATTTCG GCATCATCTA CGGCATCTCG GCGTTCGGCC TCGCCAACCA GCTCGGCATC
CCGCGCGAGG AGGCCGGCGC CTATATCAAG CGCTACTTCG AGCGCTTCCC CGGCATCCGG
GCCTATATGG ACGAGACCCG CGATTTCTGC CGGGCTCACG GCTATGTCGA GACGCTGTTC
GGCCGGAAAT GTCACTACCC GGACATCAAG GCGTCGAATC CGTCGATCCG CGCCTTCAAC
GAACGCGCCG CGATCAACGC CCGGCTGCAG GGCTCCGCCG CCGACATCAT CCGCCGCGCC
ATGGGGCGGA TGGAGGACGC GCTGGCGGAG AAGAAGCTCA GCGCGCAGAT GCTGCTGCAG
GTCCACGACG AGCTGATCTT CGAAGTGCCC GACGACGAGG TCGCCGCGAC GCTGCCGGTG
GTCCGGCACG TGATGCAGGA TGCGCCGTTC CCGGCGGTGC TGCTGAACGT GCCGCTACAG
GTCGACGCGC GGGCCGCGGA CAACTGGGAC GAGGCGCATT GA
 
Protein sequence
MPKSPSKAAA TPAAATSPAP IAGKAPGKGD HIFLVDGSSY IFRAYHALPP LSRKSDGLQV 
NAVLGFCNML WKLLRDMPPD NRPTHLAIIF DKSEATFRNE MYPAYKAHRP PAPDDLIPQF
ALIREAVRAF DLPCLEQSGF EADDLIATYV REACEAGATA TIVSSDKDLM QLVTDCVTMY
DTMKDRRIGV AEVIEKFGVP PDKVVEVQAL AGDSVDNVPG VPGIGIKTAA QLITEYGDLE
TLLARAGEIK QPKRREALIE NAEKARISRK LVLLDDHVKL DVPLDELAVH EPDARKLISF
LKAMEFTTLT RRVADYAQID PSDVEAEAAL KSSPLPLAGE VGARSATGGG TAASGDLFSE
QVPSTQPSPA SGRGSAQQSG EGGALNAGRG RDGRPGEVLS PQILAARRAE TARKIPVDRT
AYKTIRTLDE LHGWIARIHD AGFVAVDAIA TSIDPMQAEL CGIALALAPN DAAYIPLGHR
QTGDGSGLFA AGLAPDQLGA REALDALKPL LESAGLAKIG FNIKFTAVLL AQHGITLRNI
DDLQLMSYAL DAGRGSHGLD ALSESNLGHT LHALGELTGS GKAKIGFDQV PIERATEYAG
ERADVALRLW RVLKPRLVAE RMMAVYETLE RPLVGVLARM ERRGISIDRS VLSRLSADFA
QTAARIEAEI RELAGEDINI GSPKQLGDIL FGKMGLPGGS KTKTGAWSTS AQVLDELAEQ
GHEFPRKILD WRQVSKLRST YTDALPNYVH PQTQRVHTTY ALAATTTGRL SSNEPNLQNI
PVRTEDGRKI RRAFIASPGY KLVSADYSQI ELRLLSEVAE VPALRKAFQD GIDIHAMTAS
EMFGVPVEGM PSEIRRRAKA INFGIIYGIS AFGLANQLGI PREEAGAYIK RYFERFPGIR
AYMDETRDFC RAHGYVETLF GRKCHYPDIK ASNPSIRAFN ERAAINARLQ GSAADIIRRA
MGRMEDALAE KKLSAQMLLQ VHDELIFEVP DDEVAATLPV VRHVMQDAPF PAVLLNVPLQ
VDARAADNWD EAH