Gene Information |
Locus tag | RPD_0956 |
Symbol | |
ID | 4021431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 1075040 |
End bp | 1078141 |
Gene Length | 3102 bp |
Protein Length | 1033 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961147 |
Product | DNA polymerase I |
Protein accession | YP_568095 |
Protein GI | 91975436 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
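The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon; both are assumptions that match the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for RPD_0956.
start_bp, end_bp = 1075040, 1078141
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 3102 1033
```

Both results agree with the Gene Length and Protein Length fields. The strand does not affect this calculation, since it only determines which end of the interval holds the start codon.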
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAT CCCCTTCGAA AGCCGCCGCG ACGCCAGCCG CCGCCACCTC CCCTGCCCCG ATCGCCGGCA AGGCGCCGGG CAAGGGGGAT CACATCTTCC TGGTCGACGG ATCGTCCTAC ATCTTCCGCG CCTATCACGC GCTGCCGCCG CTGAGCCGCA AATCCGACGG GTTGCAGGTC AACGCCGTGC TCGGCTTCTG CAACATGCTG TGGAAGCTGT TGCGCGACAT GCCGCCGGAC AACCGGCCGA CGCATCTGGC GATCATCTTC GACAAGTCGG AAGCGACGTT CCGCAACGAA ATGTATCCCG CCTACAAGGC GCATCGGCCG CCGGCGCCCG ACGACCTGAT CCCGCAATTC GCGCTGATCC GCGAGGCGGT GCGCGCGTTC GATCTGCCCT GTCTCGAACA GTCCGGCTTC GAAGCCGATG ATTTGATCGC CACCTATGTG CGCGAGGCCT GCGAGGCGGG CGCGACAGCC ACCATCGTGT CGTCCGACAA GGACCTGATG CAGCTCGTGA CCGACTGCGT CACGATGTAC GACACCATGA AGGACCGCCG CATCGGCGTT GCCGAAGTGA TCGAGAAATT CGGCGTGCCG CCGGATAAAG TCGTCGAGGT CCAGGCGCTG GCCGGCGACA GCGTCGACAA CGTGCCGGGC GTGCCCGGCA TCGGCATCAA GACCGCGGCG CAATTGATTA CCGAATACGG CGATCTCGAA ACCTTGCTGG CGCGCGCCGG AGAGATCAAG CAGCCGAAGC GGCGCGAGGC GCTCATCGAG AACGCCGAGA AGGCGCGGAT CTCGCGCAAG CTCGTCTTGC TCGATGATCA CGTCAAACTC GACGTGCCGC TCGACGAGCT CGCGGTGCAC GAGCCCGATG CGCGAAAGCT GATCTCGTTT CTCAAGGCGA TGGAATTCAC CACGCTGACG CGGCGGGTCG CCGACTACGC GCAGATCGAT CCGTCGGATG TCGAGGCGGA AGCCGCGCTG AAGTCTTCAC CTCTCCCGCT TGCGGGGGAA GTCGGCGCGC GTAGCGCGAC GGGTGGGGGC ACTGCCGCAA GCGGGGACCT GTTTTCGGAG CAAGTGCCCT CAACCCAGCC CTCTCCCGCA AGCGGGAGAG GGAGCGCGCA GCAATCGGGG GAAGGTGGTG CGCTGAACGC CGGGCGTGGG CGCGACGGCA GGCCGGGCGA GGTTCTGTCG CCGCAGATCC TCGCCGCCAG GCGCGCCGAG ACCGCGCGAA AAATTCCGGT CGATCGCACC GCATACAAGA CGATCCGGAC GCTCGACGAA TTGCACGGCT GGATCGCACG CATCCACGAC GCAGGCTTCG TCGCGGTCGA CGCCATCGCG ACCTCGATCG ATCCGATGCA GGCGGAGCTT TGCGGCATTG CCCTGGCGCT GGCGCCGAAT GATGCGGCCT ACATCCCGCT CGGCCATCGT CAGACCGGCG ACGGAAGCGG CCTGTTCGCC GCGGGACTCG CGCCCGACCA GCTCGGCGCG CGCGAGGCGC TGGATGCCTT GAAGCCGCTG CTGGAATCTG CCGGCCTCGC CAAGATCGGC TTCAACATCA AGTTCACCGC AGTGTTGCTG GCGCAGCACG GCATCACGTT GCGCAACATC GACGATCTGC AGCTGATGTC CTACGCGCTC GACGCCGGCC GCGGCAGCCA CGGACTCGAT GCGCTGAGCG AGAGCAATCT CGGCCACACC CTGCACGCGC TCGGCGAACT CACCGGCAGC GGCAAGGCAA AGATCGGTTT CGATCAGGTG CCGATCGAGC GCGCCACCGA ATATGCGGGC 
GAGCGCGCCG ACGTGGCGCT GCGGCTATGG CGGGTGCTGA AGCCGCGGCT GGTCGCCGAG CGGATGATGG CCGTGTACGA GACGCTGGAG CGGCCATTGG TCGGCGTGCT GGCGCGGATG GAACGGCGCG GCATCTCGAT CGATCGAAGC GTGCTATCGC GGCTGTCGGC CGATTTCGCC CAGACCGCGG CGCGGATCGA AGCGGAGATT CGCGAACTCG CCGGCGAAGA CATCAACATC GGCAGTCCGA AGCAGCTCGG CGACATCCTG TTCGGCAAGA TGGGTCTGCC GGGCGGCAGC AAGACCAAGA CCGGCGCGTG GTCGACCTCG GCGCAGGTGC TCGACGAACT CGCCGAACAG GGCCATGAAT TTCCAAGGAA GATTCTCGAC TGGCGACAGG TGAGCAAGCT GCGCTCGACC TACACCGACG CGCTGCCGAA CTACGTGCAT CCGCAGACGC AACGGGTCCA CACCACTTAC GCGCTCGCCG CCACCACCAC CGGCCGGCTG TCGTCGAACG AGCCGAATTT GCAGAACATC CCGGTGCGCA CCGAGGACGG CCGCAAGATT CGCCGCGCCT TCATCGCATC GCCGGGCTAC AAACTGGTGT CGGCGGACTA CTCCCAGATC GAGCTGCGAT TGCTGTCCGA GGTCGCCGAG GTGCCGGCGC TGCGCAAGGC GTTCCAAGAC GGCATCGACA TCCACGCCAT GACCGCGTCG GAAATGTTCG GCGTGCCGGT CGAGGGCATG CCGTCGGAAA TCCGCCGCCG CGCCAAGGCG ATCAATTTCG GCATCATCTA CGGCATCTCG GCGTTCGGCC TCGCCAACCA GCTCGGCATC CCGCGCGAGG AGGCCGGCGC CTATATCAAG CGCTACTTCG AGCGCTTCCC CGGCATCCGG GCCTATATGG ACGAGACCCG CGATTTCTGC CGGGCTCACG GCTATGTCGA GACGCTGTTC GGCCGGAAAT GTCACTACCC GGACATCAAG GCGTCGAATC CGTCGATCCG CGCCTTCAAC GAACGCGCCG CGATCAACGC CCGGCTGCAG GGCTCCGCCG CCGACATCAT CCGCCGCGCC ATGGGGCGGA TGGAGGACGC GCTGGCGGAG AAGAAGCTCA GCGCGCAGAT GCTGCTGCAG GTCCACGACG AGCTGATCTT CGAAGTGCCC GACGACGAGG TCGCCGCGAC GCTGCCGGTG GTCCGGCACG TGATGCAGGA TGCGCCGTTC CCGGCGGTGC TGCTGAACGT GCCGCTACAG GTCGACGCGC GGGCCGCGGA CAACTGGGAC GAGGCGCATT GA
|
Protein sequence | MPKSPSKAAA TPAAATSPAP IAGKAPGKGD HIFLVDGSSY IFRAYHALPP LSRKSDGLQV NAVLGFCNML WKLLRDMPPD NRPTHLAIIF DKSEATFRNE MYPAYKAHRP PAPDDLIPQF ALIREAVRAF DLPCLEQSGF EADDLIATYV REACEAGATA TIVSSDKDLM QLVTDCVTMY DTMKDRRIGV AEVIEKFGVP PDKVVEVQAL AGDSVDNVPG VPGIGIKTAA QLITEYGDLE TLLARAGEIK QPKRREALIE NAEKARISRK LVLLDDHVKL DVPLDELAVH EPDARKLISF LKAMEFTTLT RRVADYAQID PSDVEAEAAL KSSPLPLAGE VGARSATGGG TAASGDLFSE QVPSTQPSPA SGRGSAQQSG EGGALNAGRG RDGRPGEVLS PQILAARRAE TARKIPVDRT AYKTIRTLDE LHGWIARIHD AGFVAVDAIA TSIDPMQAEL CGIALALAPN DAAYIPLGHR QTGDGSGLFA AGLAPDQLGA REALDALKPL LESAGLAKIG FNIKFTAVLL AQHGITLRNI DDLQLMSYAL DAGRGSHGLD ALSESNLGHT LHALGELTGS GKAKIGFDQV PIERATEYAG ERADVALRLW RVLKPRLVAE RMMAVYETLE RPLVGVLARM ERRGISIDRS VLSRLSADFA QTAARIEAEI RELAGEDINI GSPKQLGDIL FGKMGLPGGS KTKTGAWSTS AQVLDELAEQ GHEFPRKILD WRQVSKLRST YTDALPNYVH PQTQRVHTTY ALAATTTGRL SSNEPNLQNI PVRTEDGRKI RRAFIASPGY KLVSADYSQI ELRLLSEVAE VPALRKAFQD GIDIHAMTAS EMFGVPVEGM PSEIRRRAKA INFGIIYGIS AFGLANQLGI PREEAGAYIK RYFERFPGIR AYMDETRDFC RAHGYVETLF GRKCHYPDIK ASNPSIRAFN ERAAINARLQ GSAADIIRRA MGRMEDALAE KKLSAQMLLQ VHDELIFEVP DDEVAATLPV VRHVMQDAPF PAVLLNVPLQ VDARAADNWD EAH
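The relationship between the two sequences can be illustrated with a short translation routine. This is a minimal sketch, not the method used by the database: it assumes the gene sequence is shown on the coding strand (it begins with ATG and ends with TGA even though the gene lies on the minus strand), and it relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. Table 11 also permits alternative start codons, which this sketch ignores because the gene starts with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
prefix = "ATGCCGAAAT CCCCTTCGAA AGCCGCCGCG"
print(translate(prefix))  # MPKSPSKAAA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein and the reported GC content.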