Gene RPC_4857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4857 
Symbol 
ID3973600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5419471 
End bp5422536 
Gene Length3066 bp 
Protein Length1021 aa 
Translation table11 
GC content67% 
IMG OID637927969 
ProductDNA polymerase I 
Protein accessionYP_534698 
Protein GI90426328 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.450809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAA CCTCTGAACA CGCCGCGTCC TCCGTCGCCG TCAAAGCCCC GAGCAAGGGC 
GACCACGTCT TTCTGGTCGA CGGTTCGTCG TATATTTTCC GCGCCTACCA CGCGCTGCCG
CCGCTCAACC GCAAGTCCGA CGGGCTGCAG GTCAACGCCG TGCTGGGTTT CTGCAACATG
CTGTGGAAGC TGCTGCGCGA AATGCCGAAC GACGAGCGGC CGACCCATCT GGCCATCATC
TTCGACAAGG CGGAAAAGAC CTTCCGCAAC GAACTCTATC CTCTCTACAA GGCGCAGCGG
CCGCCGGCGC CGGACGATCT GATCCCGCAA TTCGCGCTGA TCCGCGAGGC GGTGAGGGCG
TTCGATCTGC CCTGCCTGGA ACAGATCGGC TTCGAGGCCG ACGATCTGAT CGCGACCTAT
GTGCGGCAGG CCTGCGAACG CGGCGCCACC GCCACCATCG TCTCCTCCGA CAAGGACCTG
ATGCAGCTCG TTACCGACTG CGTCACCATG TTCGACACCA TGAAGGACCG CCGCATCGGC
ATCCCGCAAG TGATCGAGAA GTTCGGCGTG CCGCCCGACA AGGTGGTCGA GGTGCAGGCG
CTGGCCGGCG ACAGCGTCGA CAACGTTCCG GGGGTGCCGG GCATCGGCGT CAAGACCGCG
GCGCAATTGA TCAACGAGTA TGGCGATCTC GACACGCTGC TCGCCCGCGC CGCCGAGATC
AAGCAGCCGA AGCGCCGCGA GGCGCTGATC GCCAACGCCG AGAAGGCGCG GATCTCGCGG
CAGCTGGTGT TGCTCGATGA CAAGGTCGCG CTCGACGTGC CGCTCGACGA ACTCGCGGTG
CACGAGCCGG ATGCGCGAAA ACTGATCGCG TTTCTGAAGG CGATGGAATT CTCCACGCTG
ACGCGGCGGG TCGCGGAGTA TTCGCAGATC GATCCGGCCG ACGTCGAGGC CGATGCTGCG
GTGAAGAGCG GGACGTCTTC TCCCTCCCCC CTTGCGGGGG AGGGTCGGGG TGGGGGGGCC
ACGGACGCCG GCGATCTTTT TGGGGCACCC CCCTCCCCAA CCCTCCCCCG CAAGGGGGGA
GGGAGCGCGC CGCAAACGGG GCAAGGCCTT GCCTCATCGG CGCCGCAATC GAGCGATGCC
CTCACCCCGC AACTCCTCGC CGCCGCGCGG GCCGAAGAGG CGCGGAAGAT TCCGGTCAAC
CGCGACGGCT ACGCCACGCT GCGCACGCGC GACGAACTGC ACGGCTGGAT CGCGAAAATC
CACGACCTCG GCCATGTCGC GCTGGAGGCC AAGGCCACCT TCGACATCAA GAGCGCGGTG
GTCGATCCGA TGCAGGCGGA GTTGACCGGG CTGGCGCTGG CGCTGGGCCC CAATGAGGCC
TGCTACGTTC CGCTCGGCCA TCGGCAATCC GGCGACGCCG CGGGACTGTT CGCTTCCGGG
CTCGAACCCG ATCAGATCGC CGCCCGCGAC GCGCTGGAAG CGCTGCGGCC GATCCTGGAA
TCCCCCGGCG TGCTGAAGAT CGGCTTCAAC ATCAAATTCA CCGCGGTGCT GCTGGCGCAG
CACGGCATCA CGCTGCAGAA CGCCGACGAC GTGCAGCTGA TGTCCTATGC GTTGGACGCC
GGCCGCCACG CCCACGGGCT CGACGCCCTC GCCGAGACCT GGCTCGGCCA CAAGACGCTG
AGCTATGGCG AGGTGATCGG CAGCGGCAAG GCCAAGCTGT CGTTCGACCA GGTCGCGATC
GACCGCGCCA CCTGCTACGC CGCCGAGGAT GCCGACGTCG CGCTGCGGCT GTGGCGGGTG
TTGAAGCCGC GGCTGGTCGC CGAACGCATG ACCACGGTTT ACGAGACACT GGAGCGGCCG
CTGATCGGCG TGCTGGCGCG GATGGAGCGG CGCGGCATCT CGATCGACCG CGCAGTGCTG
GCGCGGCTGT CCGGCGATTT CGCCCAGACC GCGGCGCGCA TCGAGGCGGA GATCCGCGAG
ATCGCCGGCG AAGAGATCAA TATCGGCAGC CCGAAGCAGC TCGGCGACAT TTTGTTCGGC
AAGATGCAGC TGCCCGGCGG CTCCAAGACC AAGACCGGGG CGTGGTCGAC CTCGGCGCAG
GTCCTGGAAG AGCTCGCCGA GCAGGGCCAC GAATTCCCGC GAAAAATCCT GGATTGGCGG
CAGGTGTCGA AGCTGCGCTC GACCTATACG GACGCGCTGC CGACCTACGT GCATCCGCAG
ACCCACCGGG TGCACACCAC CTACGCGCTG GCCGCGACCA CCACCGGACG GCTGTCGTCG
AACGAACCCA ACCTGCAGAA CATCCCGGTG CGCACCGAGG ACGGCCGCAA GATCCGCCGC
GCCTTCATCG CCGCCCCCGG CCACAAGCTG GTGTCGGCGG ATTATTCCCA GATCGAGCTG
CGGCTGCTCG CCGAAATCGC CGACATCCCG GTGTTGAAAC AGGCGTTCCA GGACGGTCTC
GACATCCACG CCATGACCGC CTCGGAAATG TTCGGGGTGC CGATCAAGGA CATGCCGAGC
GAGGTGCGGC GCCGCGCCAA AGCGATCAAT TTCGGCATCA TCTACGGCAT CTCGGCGTTT
GGGCTGGCCA ACCAGCTCGG CATTCCGCGC GAGGAGGCCG GCGCCTATAT CAAGAAGTAC
TTCGAGCGCT TTCCCGGCAT CCGCGCCTAT ATGGACGCCA CCCGCGACTT CTGCCGCGCC
CATGGCTTTG TCGAAACGCT GTTCGGCCGA AAATGCCATT ACCCCGACAT CAAGGCCTCG
AACGCCTCGG TGCGCGCCTT CAACGAGCGC GCCGCGATCA ACGCCAGGCT GCAGGGCACC
GCCGCCGACA TCATCCGCCG CGCCATGGTG CGGATGGAAG ACGCGCTGGC CGAAAAGAAA
TTATCGGCGC AGATGTTGTT GCAGGTGCAC GACGAATTGA TCTTCGAGGT GGCCGACGAC
GAGGTCGCGG CAACGCTGCC GGTGGTGCAG CAGGTGATGC AGGACGCCCC GTTCCCGGCG
GTGCTGCTGT CGGTGCCGCT GCAGGTCGAC GCGCGCGCGG CGAACAACTG GGACGAGGCG
CATTGA
 
Protein sequence
MPKTSEHAAS SVAVKAPSKG DHVFLVDGSS YIFRAYHALP PLNRKSDGLQ VNAVLGFCNM 
LWKLLREMPN DERPTHLAII FDKAEKTFRN ELYPLYKAQR PPAPDDLIPQ FALIREAVRA
FDLPCLEQIG FEADDLIATY VRQACERGAT ATIVSSDKDL MQLVTDCVTM FDTMKDRRIG
IPQVIEKFGV PPDKVVEVQA LAGDSVDNVP GVPGIGVKTA AQLINEYGDL DTLLARAAEI
KQPKRREALI ANAEKARISR QLVLLDDKVA LDVPLDELAV HEPDARKLIA FLKAMEFSTL
TRRVAEYSQI DPADVEADAA VKSGTSSPSP LAGEGRGGGA TDAGDLFGAP PSPTLPRKGG
GSAPQTGQGL ASSAPQSSDA LTPQLLAAAR AEEARKIPVN RDGYATLRTR DELHGWIAKI
HDLGHVALEA KATFDIKSAV VDPMQAELTG LALALGPNEA CYVPLGHRQS GDAAGLFASG
LEPDQIAARD ALEALRPILE SPGVLKIGFN IKFTAVLLAQ HGITLQNADD VQLMSYALDA
GRHAHGLDAL AETWLGHKTL SYGEVIGSGK AKLSFDQVAI DRATCYAAED ADVALRLWRV
LKPRLVAERM TTVYETLERP LIGVLARMER RGISIDRAVL ARLSGDFAQT AARIEAEIRE
IAGEEINIGS PKQLGDILFG KMQLPGGSKT KTGAWSTSAQ VLEELAEQGH EFPRKILDWR
QVSKLRSTYT DALPTYVHPQ THRVHTTYAL AATTTGRLSS NEPNLQNIPV RTEDGRKIRR
AFIAAPGHKL VSADYSQIEL RLLAEIADIP VLKQAFQDGL DIHAMTASEM FGVPIKDMPS
EVRRRAKAIN FGIIYGISAF GLANQLGIPR EEAGAYIKKY FERFPGIRAY MDATRDFCRA
HGFVETLFGR KCHYPDIKAS NASVRAFNER AAINARLQGT AADIIRRAMV RMEDALAEKK
LSAQMLLQVH DELIFEVADD EVAATLPVVQ QVMQDAPFPA VLLSVPLQVD ARAANNWDEA
H