Gene Rpal_5204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5204 
Symbol 
ID6412904 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5611612 
End bp5614689 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content67% 
IMG OID642715094 
ProductDNA polymerase I 
Protein accessionYP_001994167 
Protein GI192293562 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAAAA CGACAAAGTC CGCCGCTGCG ACGCCTGCCG CTTCCTCCGT CACGCCCCCT 
GCCCCCGTCG GCGTCAAGCC GCCCGGCAAG GGCGACCACA TCTTCCTGGT CGACGGCTCG
TCGTACATCT TCCGCGCCTA TCACGCGCTG CCGCCGCTGA ACCGCAAGTC GGACGGGCTG
CAGGTCAATG CCGTGCTCGG CTTCTGCAAC ATGCTGTGGA AGCTGCTCCG CGACATGCCG
CCGGAGAACC GGCCGACGCA TCTCGCCATC ATCTTCGACA AGTCGGAAGT CACCTTCCGC
AACCATCTCT ACCCCGAATA CAAGGCGCAC CGGCCGCCGG CACCGGAGGA TCTGATCCCG
CAATTTGCCC TGATCCGCGA GGCGGTGCGG GCGTTCGACC TGCCGTGCCT GGAACAGTCC
GGTTTCGAGG CCGACGATCT GATCGCCACT TACGTGCGCG AGGCGTGCGA GGCCGGGGCC
ACGGCGACGA TCGTGTCGTC CGACAAGGAC CTGATGCAGC TCGTGACCGA CTGCGTGGTG
ATGTACGACA CCATGAAGGA CCGCCGCATC GGCGTGCCCG AGGTGATCGA GAAATTCGGC
GTGCCGCCCA ATAAGGTGGT CGAAGTCCAG GCGCTCGCCG GCGACAGCGT CGACAACGTG
CCGGGCGTGC CGGGCATCGG CATCAAGACC GCCGCGCAGC TGATCAACGA ATATGGCGAC
CTCGAAACCC TGCTGGCCCG CGCACCTGAG ATCAAGCAGC CGAAGCGGCG CGAGGCGCTG
ATCGAGAACG CCGAGAAGGC GCGGATCTCG CGCAAGCTGG TGCTGCTCGA CGACCATGTG
CAACTAGAGG TCCCGCTGGA AGAGCTCGCA GTGCACGAGC CCGACGCGCG AAAACTGGTG
GCATTCCTCA AGGCGATGGA ATTCACCACG CTGACGCGGC GCGTCGCCGA CTATGCGCAG
ATCGATCCGT CGGATGTCGA GGCGGAGACG GCGCTGAAGT CGGCCGTGCC GCCGCTTGCG
AAGAGCGCGC CCCCGAGCGG CGACCTGTTT GCGGGCCAAG AGCCCTCACC CCAGCCCTCT
CCCGCAGGCG GGGGAGGGAG CGCGCCGGCC TCGGGCCAAG GTGGTCCGCT GAATGCCGGC
CGCGGCCGCG ACGGCAGGCC GGGCGTCGTG CTGTCGCCGG AAACGCTGGT GGCGGCGCGG
GCCGAGGCCG CCCGCAAGAT TCCGGTCGAC CGCACTGCCT ACAAGACGCT GCACAGCCTC
GACGAACTGC AGGGCTTCAT CGCCCGCATC CACGACACCG GAATCGTCGC GCTTGAAGCG
GCCGCGACCT CGATCGATCC GATGCAGGCG GAGCTGATCG GCCTTGCCCT GGCGCTCGGC
CCGAACGACG CCTGCTACAT TCCGCTGCCG CATAAGCAGA GCGGCGACGG TGACGGCCTG
TTCGCCGCCG GCCTCGCGCC CGACCAGATC GGCCCCCGCG ACGCGCTCGA TGCGCTGAAG
CCGATCCTGG AATCCGCCGG CGTCGCCAAG GTCGGCTTCG CGATCAAATT CGCCGCGGTG
CTGCTGGCGC AGCACGGCCT GACCTTGCGC AACATCGACG ATCCGCAGCT GATGTCCTAC
GCGCTCGACG CCGGCCGCGG CAGCCACGCG CTCGATGCGC TGAGCGAAAG CAATCTCGGC
CACACCCTGC ACACGCTCAC CGAGCTGACC GGCACCGGCA AAAACAAGAT CGGCTTCGAT
CAGGTGCCGG TCGACCGTGC CACCGCCTAT GCGGCCGAAC GCGCCGACGT GAGCTTGCGG
CTGGCCCGCG TGCTGAAGCC GCGCCTCGTC GCAGGCAGCA TGACCGCGGT GTACGAGACG
CTGGAGCGCC CTCTTGTCGG CGTGCTGGCG CGGATGGAGC GGCGCGGCAT CTCGATCGAC
CGGCAGGTGC TGTCGCGGCT GTCGGCGGAT TTCGCCCAGA CCGCGGCGCG GATCGAGGCC
GAGATCCGGG AGCTCGCCGG CGAGGACATC AATATCGGCA GTCCCAAGCA GCTCGGCGAC
ATCCTGTTCG GCAAGATGGG CCTGCCGGGC GGCAGCAAGA CCAAGACCGG CGCGTGGTCG
ACCTCTGCGC AGGTGCTCGA CGAGCTCGCC GAACAGGGCC ACGAATTCCC GCGCAAGATT
TTGGACTGGC GGCAGGTCAG CAAGCTGCGC TCGACCTACA CCGACGCGCT GCCGAATTAC
GTGCATCCGC AGACCCACCG CGTCCACACC ACCTATGCAC TCGCCGCCAC CACCACCGGT
CGGCTGTCGT CGAACGAGCC GAACCTGCAG AACATCCCGG TGCGCACCGA GGATGGCCGC
AAGATCCGCC GCGCTTTCGT GGCGACGCCC GGCAACAAGC TGGTGTCGGC GGACTATTCG
CAGATCGAAC TGCGCCTCTT GTCCGAAGTC GCCGACGTGC CGGCGCTGCG CAAGGCGTTC
CAGGACGGCA TCGACATTCA CGCGATGACG GCGTCCGAGA TGTTCGGCGT GCCGGTCGAG
GGCATGCCGT CGGAAATTCG CCGCCGGGCG AAAGCGATCA ATTTCGGCAT CATCTACGGC
ATCTCGGCAT TCGGCCTCGC CAACCAGCTG AGCATCCCAC GCGAGGAAGC CGGCGCCTAC
ATCAAGCGCT ACTTCGAGCG CTTCCCAGGC ATCCGCGCCT ATATGGACGA GACCCGCGAT
TTCTGCCGGA CGCACGGCTA TGTCACCACG CTGTTCGGCC GCAAATGCCA CTACCCGGAC
ATCAAGGCGT CCAATCCGTC GATCCGCGCC TTCAACGAGC GCGCCGCCAT CAACGCAAGG
CTGCAAGGCT CCGCCGCCGA CATCATCCGC CGCGCCATGG TGCGGATGGA GGACGCACTC
GCCGAGAAGA AGCTCTCGGC GCAGATGCTG CTGCAGGTCC ACGACGAACT GATCTTCGAA
GTGCCGGAGG CCGAGGTGGA AGCGACGCTG CCGGTGGTCC GCTCGGTGAT GCAGGACGCG
CCGTTCCCGG CGGTGATCCT CAACGTCCCG CTGCAGGTGG ATGCGCGGGC GGCGGATAAC
TGGGACGAAG CGCATTAG
 
Protein sequence
MPKTTKSAAA TPAASSVTPP APVGVKPPGK GDHIFLVDGS SYIFRAYHAL PPLNRKSDGL 
QVNAVLGFCN MLWKLLRDMP PENRPTHLAI IFDKSEVTFR NHLYPEYKAH RPPAPEDLIP
QFALIREAVR AFDLPCLEQS GFEADDLIAT YVREACEAGA TATIVSSDKD LMQLVTDCVV
MYDTMKDRRI GVPEVIEKFG VPPNKVVEVQ ALAGDSVDNV PGVPGIGIKT AAQLINEYGD
LETLLARAPE IKQPKRREAL IENAEKARIS RKLVLLDDHV QLEVPLEELA VHEPDARKLV
AFLKAMEFTT LTRRVADYAQ IDPSDVEAET ALKSAVPPLA KSAPPSGDLF AGQEPSPQPS
PAGGGGSAPA SGQGGPLNAG RGRDGRPGVV LSPETLVAAR AEAARKIPVD RTAYKTLHSL
DELQGFIARI HDTGIVALEA AATSIDPMQA ELIGLALALG PNDACYIPLP HKQSGDGDGL
FAAGLAPDQI GPRDALDALK PILESAGVAK VGFAIKFAAV LLAQHGLTLR NIDDPQLMSY
ALDAGRGSHA LDALSESNLG HTLHTLTELT GTGKNKIGFD QVPVDRATAY AAERADVSLR
LARVLKPRLV AGSMTAVYET LERPLVGVLA RMERRGISID RQVLSRLSAD FAQTAARIEA
EIRELAGEDI NIGSPKQLGD ILFGKMGLPG GSKTKTGAWS TSAQVLDELA EQGHEFPRKI
LDWRQVSKLR STYTDALPNY VHPQTHRVHT TYALAATTTG RLSSNEPNLQ NIPVRTEDGR
KIRRAFVATP GNKLVSADYS QIELRLLSEV ADVPALRKAF QDGIDIHAMT ASEMFGVPVE
GMPSEIRRRA KAINFGIIYG ISAFGLANQL SIPREEAGAY IKRYFERFPG IRAYMDETRD
FCRTHGYVTT LFGRKCHYPD IKASNPSIRA FNERAAINAR LQGSAADIIR RAMVRMEDAL
AEKKLSAQML LQVHDELIFE VPEAEVEATL PVVRSVMQDA PFPAVILNVP LQVDARAADN
WDEAH