Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_4857 |
Symbol | |
ID | 3973600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 5419471 |
End bp | 5422536 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637927969 |
Product | DNA polymerase I |
Protein accession | YP_534698 |
Protein GI | 90426328 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.450809 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAA CCTCTGAACA CGCCGCGTCC TCCGTCGCCG TCAAAGCCCC GAGCAAGGGC GACCACGTCT TTCTGGTCGA CGGTTCGTCG TATATTTTCC GCGCCTACCA CGCGCTGCCG CCGCTCAACC GCAAGTCCGA CGGGCTGCAG GTCAACGCCG TGCTGGGTTT CTGCAACATG CTGTGGAAGC TGCTGCGCGA AATGCCGAAC GACGAGCGGC CGACCCATCT GGCCATCATC TTCGACAAGG CGGAAAAGAC CTTCCGCAAC GAACTCTATC CTCTCTACAA GGCGCAGCGG CCGCCGGCGC CGGACGATCT GATCCCGCAA TTCGCGCTGA TCCGCGAGGC GGTGAGGGCG TTCGATCTGC CCTGCCTGGA ACAGATCGGC TTCGAGGCCG ACGATCTGAT CGCGACCTAT GTGCGGCAGG CCTGCGAACG CGGCGCCACC GCCACCATCG TCTCCTCCGA CAAGGACCTG ATGCAGCTCG TTACCGACTG CGTCACCATG TTCGACACCA TGAAGGACCG CCGCATCGGC ATCCCGCAAG TGATCGAGAA GTTCGGCGTG CCGCCCGACA AGGTGGTCGA GGTGCAGGCG CTGGCCGGCG ACAGCGTCGA CAACGTTCCG GGGGTGCCGG GCATCGGCGT CAAGACCGCG GCGCAATTGA TCAACGAGTA TGGCGATCTC GACACGCTGC TCGCCCGCGC CGCCGAGATC AAGCAGCCGA AGCGCCGCGA GGCGCTGATC GCCAACGCCG AGAAGGCGCG GATCTCGCGG CAGCTGGTGT TGCTCGATGA CAAGGTCGCG CTCGACGTGC CGCTCGACGA ACTCGCGGTG CACGAGCCGG ATGCGCGAAA ACTGATCGCG TTTCTGAAGG CGATGGAATT CTCCACGCTG ACGCGGCGGG TCGCGGAGTA TTCGCAGATC GATCCGGCCG ACGTCGAGGC CGATGCTGCG GTGAAGAGCG GGACGTCTTC TCCCTCCCCC CTTGCGGGGG AGGGTCGGGG TGGGGGGGCC ACGGACGCCG GCGATCTTTT TGGGGCACCC CCCTCCCCAA CCCTCCCCCG CAAGGGGGGA GGGAGCGCGC CGCAAACGGG GCAAGGCCTT GCCTCATCGG CGCCGCAATC GAGCGATGCC CTCACCCCGC AACTCCTCGC CGCCGCGCGG GCCGAAGAGG CGCGGAAGAT TCCGGTCAAC CGCGACGGCT ACGCCACGCT GCGCACGCGC GACGAACTGC ACGGCTGGAT CGCGAAAATC CACGACCTCG GCCATGTCGC GCTGGAGGCC AAGGCCACCT TCGACATCAA GAGCGCGGTG GTCGATCCGA TGCAGGCGGA GTTGACCGGG CTGGCGCTGG CGCTGGGCCC CAATGAGGCC TGCTACGTTC CGCTCGGCCA TCGGCAATCC GGCGACGCCG CGGGACTGTT CGCTTCCGGG CTCGAACCCG ATCAGATCGC CGCCCGCGAC GCGCTGGAAG CGCTGCGGCC GATCCTGGAA TCCCCCGGCG TGCTGAAGAT CGGCTTCAAC ATCAAATTCA CCGCGGTGCT GCTGGCGCAG CACGGCATCA CGCTGCAGAA CGCCGACGAC GTGCAGCTGA TGTCCTATGC GTTGGACGCC GGCCGCCACG CCCACGGGCT CGACGCCCTC GCCGAGACCT GGCTCGGCCA CAAGACGCTG AGCTATGGCG AGGTGATCGG CAGCGGCAAG GCCAAGCTGT CGTTCGACCA GGTCGCGATC GACCGCGCCA CCTGCTACGC CGCCGAGGAT GCCGACGTCG CGCTGCGGCT GTGGCGGGTG TTGAAGCCGC GGCTGGTCGC CGAACGCATG ACCACGGTTT ACGAGACACT GGAGCGGCCG CTGATCGGCG TGCTGGCGCG GATGGAGCGG CGCGGCATCT CGATCGACCG CGCAGTGCTG GCGCGGCTGT CCGGCGATTT CGCCCAGACC GCGGCGCGCA TCGAGGCGGA GATCCGCGAG ATCGCCGGCG AAGAGATCAA TATCGGCAGC CCGAAGCAGC TCGGCGACAT TTTGTTCGGC AAGATGCAGC TGCCCGGCGG CTCCAAGACC AAGACCGGGG CGTGGTCGAC CTCGGCGCAG GTCCTGGAAG AGCTCGCCGA GCAGGGCCAC GAATTCCCGC GAAAAATCCT GGATTGGCGG CAGGTGTCGA AGCTGCGCTC GACCTATACG GACGCGCTGC CGACCTACGT GCATCCGCAG ACCCACCGGG TGCACACCAC CTACGCGCTG GCCGCGACCA CCACCGGACG GCTGTCGTCG AACGAACCCA ACCTGCAGAA CATCCCGGTG CGCACCGAGG ACGGCCGCAA GATCCGCCGC GCCTTCATCG CCGCCCCCGG CCACAAGCTG GTGTCGGCGG ATTATTCCCA GATCGAGCTG CGGCTGCTCG CCGAAATCGC CGACATCCCG GTGTTGAAAC AGGCGTTCCA GGACGGTCTC GACATCCACG CCATGACCGC CTCGGAAATG TTCGGGGTGC CGATCAAGGA CATGCCGAGC GAGGTGCGGC GCCGCGCCAA AGCGATCAAT TTCGGCATCA TCTACGGCAT CTCGGCGTTT GGGCTGGCCA ACCAGCTCGG CATTCCGCGC GAGGAGGCCG GCGCCTATAT CAAGAAGTAC TTCGAGCGCT TTCCCGGCAT CCGCGCCTAT ATGGACGCCA CCCGCGACTT CTGCCGCGCC CATGGCTTTG TCGAAACGCT GTTCGGCCGA AAATGCCATT ACCCCGACAT CAAGGCCTCG AACGCCTCGG TGCGCGCCTT CAACGAGCGC GCCGCGATCA ACGCCAGGCT GCAGGGCACC GCCGCCGACA TCATCCGCCG CGCCATGGTG CGGATGGAAG ACGCGCTGGC CGAAAAGAAA TTATCGGCGC AGATGTTGTT GCAGGTGCAC GACGAATTGA TCTTCGAGGT GGCCGACGAC GAGGTCGCGG CAACGCTGCC GGTGGTGCAG CAGGTGATGC AGGACGCCCC GTTCCCGGCG GTGCTGCTGT CGGTGCCGCT GCAGGTCGAC GCGCGCGCGG CGAACAACTG GGACGAGGCG CATTGA
|
Protein sequence | MPKTSEHAAS SVAVKAPSKG DHVFLVDGSS YIFRAYHALP PLNRKSDGLQ VNAVLGFCNM LWKLLREMPN DERPTHLAII FDKAEKTFRN ELYPLYKAQR PPAPDDLIPQ FALIREAVRA FDLPCLEQIG FEADDLIATY VRQACERGAT ATIVSSDKDL MQLVTDCVTM FDTMKDRRIG IPQVIEKFGV PPDKVVEVQA LAGDSVDNVP GVPGIGVKTA AQLINEYGDL DTLLARAAEI KQPKRREALI ANAEKARISR QLVLLDDKVA LDVPLDELAV HEPDARKLIA FLKAMEFSTL TRRVAEYSQI DPADVEADAA VKSGTSSPSP LAGEGRGGGA TDAGDLFGAP PSPTLPRKGG GSAPQTGQGL ASSAPQSSDA LTPQLLAAAR AEEARKIPVN RDGYATLRTR DELHGWIAKI HDLGHVALEA KATFDIKSAV VDPMQAELTG LALALGPNEA CYVPLGHRQS GDAAGLFASG LEPDQIAARD ALEALRPILE SPGVLKIGFN IKFTAVLLAQ HGITLQNADD VQLMSYALDA GRHAHGLDAL AETWLGHKTL SYGEVIGSGK AKLSFDQVAI DRATCYAAED ADVALRLWRV LKPRLVAERM TTVYETLERP LIGVLARMER RGISIDRAVL ARLSGDFAQT AARIEAEIRE IAGEEINIGS PKQLGDILFG KMQLPGGSKT KTGAWSTSAQ VLEELAEQGH EFPRKILDWR QVSKLRSTYT DALPTYVHPQ THRVHTTYAL AATTTGRLSS NEPNLQNIPV RTEDGRKIRR AFIAAPGHKL VSADYSQIEL RLLAEIADIP VLKQAFQDGL DIHAMTASEM FGVPIKDMPS EVRRRAKAIN FGIIYGISAF GLANQLGIPR EEAGAYIKKY FERFPGIRAY MDATRDFCRA HGFVETLFGR KCHYPDIKAS NASVRAFNER AAINARLQGT AADIIRRAMV RMEDALAEKK LSAQMLLQVH DELIFEVADD EVAATLPVVQ QVMQDAPFPA VLLSVPLQVD ARAANNWDEA H
|
| |