Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_5204 |
Symbol | |
ID | 6412904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5611612 |
End bp | 5614689 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642715094 |
Product | DNA polymerase I |
Protein accession | YP_001994167 |
Protein GI | 192293562 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAAA CGACAAAGTC CGCCGCTGCG ACGCCTGCCG CTTCCTCCGT CACGCCCCCT GCCCCCGTCG GCGTCAAGCC GCCCGGCAAG GGCGACCACA TCTTCCTGGT CGACGGCTCG TCGTACATCT TCCGCGCCTA TCACGCGCTG CCGCCGCTGA ACCGCAAGTC GGACGGGCTG CAGGTCAATG CCGTGCTCGG CTTCTGCAAC ATGCTGTGGA AGCTGCTCCG CGACATGCCG CCGGAGAACC GGCCGACGCA TCTCGCCATC ATCTTCGACA AGTCGGAAGT CACCTTCCGC AACCATCTCT ACCCCGAATA CAAGGCGCAC CGGCCGCCGG CACCGGAGGA TCTGATCCCG CAATTTGCCC TGATCCGCGA GGCGGTGCGG GCGTTCGACC TGCCGTGCCT GGAACAGTCC GGTTTCGAGG CCGACGATCT GATCGCCACT TACGTGCGCG AGGCGTGCGA GGCCGGGGCC ACGGCGACGA TCGTGTCGTC CGACAAGGAC CTGATGCAGC TCGTGACCGA CTGCGTGGTG ATGTACGACA CCATGAAGGA CCGCCGCATC GGCGTGCCCG AGGTGATCGA GAAATTCGGC GTGCCGCCCA ATAAGGTGGT CGAAGTCCAG GCGCTCGCCG GCGACAGCGT CGACAACGTG CCGGGCGTGC CGGGCATCGG CATCAAGACC GCCGCGCAGC TGATCAACGA ATATGGCGAC CTCGAAACCC TGCTGGCCCG CGCACCTGAG ATCAAGCAGC CGAAGCGGCG CGAGGCGCTG ATCGAGAACG CCGAGAAGGC GCGGATCTCG CGCAAGCTGG TGCTGCTCGA CGACCATGTG CAACTAGAGG TCCCGCTGGA AGAGCTCGCA GTGCACGAGC CCGACGCGCG AAAACTGGTG GCATTCCTCA AGGCGATGGA ATTCACCACG CTGACGCGGC GCGTCGCCGA CTATGCGCAG ATCGATCCGT CGGATGTCGA GGCGGAGACG GCGCTGAAGT CGGCCGTGCC GCCGCTTGCG AAGAGCGCGC CCCCGAGCGG CGACCTGTTT GCGGGCCAAG AGCCCTCACC CCAGCCCTCT CCCGCAGGCG GGGGAGGGAG CGCGCCGGCC TCGGGCCAAG GTGGTCCGCT GAATGCCGGC CGCGGCCGCG ACGGCAGGCC GGGCGTCGTG CTGTCGCCGG AAACGCTGGT GGCGGCGCGG GCCGAGGCCG CCCGCAAGAT TCCGGTCGAC CGCACTGCCT ACAAGACGCT GCACAGCCTC GACGAACTGC AGGGCTTCAT CGCCCGCATC CACGACACCG GAATCGTCGC GCTTGAAGCG GCCGCGACCT CGATCGATCC GATGCAGGCG GAGCTGATCG GCCTTGCCCT GGCGCTCGGC CCGAACGACG CCTGCTACAT TCCGCTGCCG CATAAGCAGA GCGGCGACGG TGACGGCCTG TTCGCCGCCG GCCTCGCGCC CGACCAGATC GGCCCCCGCG ACGCGCTCGA TGCGCTGAAG CCGATCCTGG AATCCGCCGG CGTCGCCAAG GTCGGCTTCG CGATCAAATT CGCCGCGGTG CTGCTGGCGC AGCACGGCCT GACCTTGCGC AACATCGACG ATCCGCAGCT GATGTCCTAC GCGCTCGACG CCGGCCGCGG CAGCCACGCG CTCGATGCGC TGAGCGAAAG CAATCTCGGC CACACCCTGC ACACGCTCAC CGAGCTGACC GGCACCGGCA AAAACAAGAT CGGCTTCGAT CAGGTGCCGG TCGACCGTGC CACCGCCTAT GCGGCCGAAC GCGCCGACGT GAGCTTGCGG CTGGCCCGCG TGCTGAAGCC GCGCCTCGTC GCAGGCAGCA TGACCGCGGT GTACGAGACG CTGGAGCGCC CTCTTGTCGG CGTGCTGGCG CGGATGGAGC GGCGCGGCAT CTCGATCGAC CGGCAGGTGC TGTCGCGGCT GTCGGCGGAT TTCGCCCAGA CCGCGGCGCG GATCGAGGCC GAGATCCGGG AGCTCGCCGG CGAGGACATC AATATCGGCA GTCCCAAGCA GCTCGGCGAC ATCCTGTTCG GCAAGATGGG CCTGCCGGGC GGCAGCAAGA CCAAGACCGG CGCGTGGTCG ACCTCTGCGC AGGTGCTCGA CGAGCTCGCC GAACAGGGCC ACGAATTCCC GCGCAAGATT TTGGACTGGC GGCAGGTCAG CAAGCTGCGC TCGACCTACA CCGACGCGCT GCCGAATTAC GTGCATCCGC AGACCCACCG CGTCCACACC ACCTATGCAC TCGCCGCCAC CACCACCGGT CGGCTGTCGT CGAACGAGCC GAACCTGCAG AACATCCCGG TGCGCACCGA GGATGGCCGC AAGATCCGCC GCGCTTTCGT GGCGACGCCC GGCAACAAGC TGGTGTCGGC GGACTATTCG CAGATCGAAC TGCGCCTCTT GTCCGAAGTC GCCGACGTGC CGGCGCTGCG CAAGGCGTTC CAGGACGGCA TCGACATTCA CGCGATGACG GCGTCCGAGA TGTTCGGCGT GCCGGTCGAG GGCATGCCGT CGGAAATTCG CCGCCGGGCG AAAGCGATCA ATTTCGGCAT CATCTACGGC ATCTCGGCAT TCGGCCTCGC CAACCAGCTG AGCATCCCAC GCGAGGAAGC CGGCGCCTAC ATCAAGCGCT ACTTCGAGCG CTTCCCAGGC ATCCGCGCCT ATATGGACGA GACCCGCGAT TTCTGCCGGA CGCACGGCTA TGTCACCACG CTGTTCGGCC GCAAATGCCA CTACCCGGAC ATCAAGGCGT CCAATCCGTC GATCCGCGCC TTCAACGAGC GCGCCGCCAT CAACGCAAGG CTGCAAGGCT CCGCCGCCGA CATCATCCGC CGCGCCATGG TGCGGATGGA GGACGCACTC GCCGAGAAGA AGCTCTCGGC GCAGATGCTG CTGCAGGTCC ACGACGAACT GATCTTCGAA GTGCCGGAGG CCGAGGTGGA AGCGACGCTG CCGGTGGTCC GCTCGGTGAT GCAGGACGCG CCGTTCCCGG CGGTGATCCT CAACGTCCCG CTGCAGGTGG ATGCGCGGGC GGCGGATAAC TGGGACGAAG CGCATTAG
|
Protein sequence | MPKTTKSAAA TPAASSVTPP APVGVKPPGK GDHIFLVDGS SYIFRAYHAL PPLNRKSDGL QVNAVLGFCN MLWKLLRDMP PENRPTHLAI IFDKSEVTFR NHLYPEYKAH RPPAPEDLIP QFALIREAVR AFDLPCLEQS GFEADDLIAT YVREACEAGA TATIVSSDKD LMQLVTDCVV MYDTMKDRRI GVPEVIEKFG VPPNKVVEVQ ALAGDSVDNV PGVPGIGIKT AAQLINEYGD LETLLARAPE IKQPKRREAL IENAEKARIS RKLVLLDDHV QLEVPLEELA VHEPDARKLV AFLKAMEFTT LTRRVADYAQ IDPSDVEAET ALKSAVPPLA KSAPPSGDLF AGQEPSPQPS PAGGGGSAPA SGQGGPLNAG RGRDGRPGVV LSPETLVAAR AEAARKIPVD RTAYKTLHSL DELQGFIARI HDTGIVALEA AATSIDPMQA ELIGLALALG PNDACYIPLP HKQSGDGDGL FAAGLAPDQI GPRDALDALK PILESAGVAK VGFAIKFAAV LLAQHGLTLR NIDDPQLMSY ALDAGRGSHA LDALSESNLG HTLHTLTELT GTGKNKIGFD QVPVDRATAY AAERADVSLR LARVLKPRLV AGSMTAVYET LERPLVGVLA RMERRGISID RQVLSRLSAD FAQTAARIEA EIRELAGEDI NIGSPKQLGD ILFGKMGLPG GSKTKTGAWS TSAQVLDELA EQGHEFPRKI LDWRQVSKLR STYTDALPNY VHPQTHRVHT TYALAATTTG RLSSNEPNLQ NIPVRTEDGR KIRRAFVATP GNKLVSADYS QIELRLLSEV ADVPALRKAF QDGIDIHAMT ASEMFGVPVE GMPSEIRRRA KAINFGIIYG ISAFGLANQL SIPREEAGAY IKRYFERFPG IRAYMDETRD FCRTHGYVTT LFGRKCHYPD IKASNPSIRA FNERAAINAR LQGSAADIIR RAMVRMEDAL AEKKLSAQML LQVHDELIFE VPEAEVEATL PVVRSVMQDA PFPAVILNVP LQVDARAADN WDEAH
|
| |