Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ppro_2835 |
Symbol | |
ID | 4573025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pelobacter propionicus DSM 2379 |
Kingdom | Bacteria |
Replicon accession | NC_008609 |
Strand | + |
Start bp | 3100114 |
End bp | 3102795 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639756888 |
Product | DNA polymerase I |
Protein accession | YP_902491 |
Protein GI | 118581241 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.333743 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCACG ACGATACGCT CTACCTGGTC GACGGCTCGT CCTATATCTA CCGGGCCTAC TACGCCATCC GCCACCTCTC CTCCCCCACG GGGCACCCCA CCAACACCAT CTATGGATTC ACCCAGATGC TGCTGAAGCT CCTCAAGGAG CACCAGCCGA GACATGTGGC AGTGGTTTTC GATGTGGGGA GAGAAACATT CCGCACCGAA CTGTACCCCG ACTACAAGGC CAACCGCGCC GCATTGCCCG ATGACCTGCG GGTTCAGATG GGGCCGATCC GCGACCTGGT GCGCGCCTTC AACATCCCGG CCCTGGAACT GGCCGGCTTC GAGGCCGACG ACATCATCGG CTGCCTGGCC GCCCGCTTCA GCGCCACCGG CGGCAGGGTG GTGATAGTCA CCGGCGACAA GGATCTGATG CAGATCGTCA CCGAAAGGGT GACCCTGCTG GACACCATGA AGGAGAAGCG CTCCGCCATC ACCGACGTTA TCGAGCGCTT CGGCGTGGAA CCGCGCCTGG TAACGGACAT CCTTGGCCTG GCCGGCGACG CGTCGGACAA CATTCCCGGC GTGCCGGGCA TCGGCGAGAA GACCGCCATG AAACTGATCA GAGAGTATGG TTCCCTGGAC CAACTGCTTG AACGCGCCTC CGAGGTCAAA GGAAAGAACG GCGAGAAGCT ACGCGAGTTC CGCGACCAGG CCCTGCTGTC GCGCCGACTG GCCACCATCA GATGCGACGT ACCGATCCAG GTGGAACTGG AGGAGTTAGC CAGCCAGGAG CCGGACCGGG AGGCGCTGAA CACCCTCTTC CGCCTGTACG GGTTCACCAC CCTGATCAAG GAACTGACCG GTCGTTCCAC CCTCTCCAGC GAAGGTTACT GCACGGTCAC AGCCCAGGAG GAGTTGGAGG AACTCGTCCA TGCCCTGGAG AGGTCCGAAG AGTTCGCCAT CGACCTGGAG ACCACCAGCC TGGATCCCCG GGACGCGGAG ATCGTCGGCC TCTCCTTCTC CTTCAGGGAC CACCAGGCCT GGTATATCCC GGTGGGGCAC ACAAGCGATG CCGGCCTCCA GCCGGGCCAA CTCCCCCGCG ACCTGGTGCT GGAGCGACTG CGCCCGCAGC TGGAGAGCCC ATCTCCCACC AAGATCGGCC AGAACATAAA GTTCGACATG CAGGTGCTGG CCACCAACGG CGTCTCACTG AGCGGCATCC GCTTCGACAC CATGCTGGCC TCCTATGTGC TCAACCCCTC CCGCCAGGGG CACGGACTTG ACGCCCTGGC CCTGGAGCAC CTGGGCCACC GCATGATCAG CTACGCGGAA GTAACCGGCA GCGGCAAGAC GCAGAAAAAC TTCTCCCAGG TGGAGATCGA AACAGCCTCG CGCTACGCCT GCGAGGATGC CGACGCAACC TGGCTCCTGC ATCGAAAGTT CTCCCCGCTG CTGGCCGAGA ACGGGGTTGA GGAGTTGTTC CATCGCATCG AGATGCCGCT GGTACCCATC CTGGCCGGGA TGGAGAACCA TGGCGTGCTG CTGGACATTC GACTGCTGGC GGATCTTTCA CGCGATTTTT CCAGCCGCAT GGCAACACTG GAGGGGCGCA TATTCCAGCT GGCGGGCACA ACCTTCAACC TTAACTCCCC CAAGCAGCTG GGCGAAGTAC TCTTCGAGCG CCTGCAGCTG AAGACCGGCA AGAAGACCAA GGGCAAGACC GGCTGGTCCA CGGACAACGA GGTGCTCAGT TCCCTGGCCG AGGAGCACGA GATCGCCCGC CTGATCCTGG ACTACCGCGG CCTGTCCAAA CTGAAATCAA CCTACAGCGA CGCCCTGCCG CGCCTGGTGC ATCCCCGCAC CGGACGGGTA CACACCTCCT ACAACCAGAC CGTCACCAGC ACCGGCCGCC TCTCCTCATC AGACCCCAAC CTGCAGAACA TCCCCATCCG CAGCGACGAG GGGCGCATGA TCCGGCACGC CTTCATCGCC CCGCCCGGGC ATGTGATCAT CTCTGCTGAC TACTCCCAGA TCGAGCTGCG CGTACTGGCC CACCTGTCCG GCGATCCGGT CTTCTGTCAG GCCTTCGAGC ACGACGAGGA CATCCACACC CGCACCGCCA GCGAGGTTTT CGGCCTGTTC CCCGAGATGG TCACAGCCGA GATGCGCCGC CAGGCCAAGA CCATCAACTT CGGCATCATC TACGGCCAGG GTTCCTTCAG CCTGGCCAGG CAGCTGGGCA TCGCCCGCAA AACCGCCGAA GAGTTCATCA GCGCCTACAA GGAGCGCCAT GCAAAGGCGG TTGGCTTCCT GGACGACTGC ATCCGCCAGG CGGAGGAGCA GGGGTTCGTC ACCACCATCC TGGGGCGGCG ACTGCCCATT GCGGACATCA CCAGCACGAA CGGCAACATC CGCGCCTTTG CCCAGCGCAA CGCCATCAAC TACCCGATCC AAGGTTCGGC GGCCGACATC ATCAAGAGCG CCATGATCCG GGTGGACGGC CGCATCCGCA GCGAGGGGCT CAAGAGCCGC CTGATCATGC AGGTTCACGA CGAACTGGTC TTCGAGGTTC GGGAAGATGA GCTGCTGGCC ATGGAGCTCT TGGTGGAGGA GGAAATGTCA CGGGCGGTGG AATTGCGCAT CCCGCTCAAG GTTGACATCA GCCACGGCGT TAACTGGAGC GAGGCGCACT GA
|
Protein sequence | MSHDDTLYLV DGSSYIYRAY YAIRHLSSPT GHPTNTIYGF TQMLLKLLKE HQPRHVAVVF DVGRETFRTE LYPDYKANRA ALPDDLRVQM GPIRDLVRAF NIPALELAGF EADDIIGCLA ARFSATGGRV VIVTGDKDLM QIVTERVTLL DTMKEKRSAI TDVIERFGVE PRLVTDILGL AGDASDNIPG VPGIGEKTAM KLIREYGSLD QLLERASEVK GKNGEKLREF RDQALLSRRL ATIRCDVPIQ VELEELASQE PDREALNTLF RLYGFTTLIK ELTGRSTLSS EGYCTVTAQE ELEELVHALE RSEEFAIDLE TTSLDPRDAE IVGLSFSFRD HQAWYIPVGH TSDAGLQPGQ LPRDLVLERL RPQLESPSPT KIGQNIKFDM QVLATNGVSL SGIRFDTMLA SYVLNPSRQG HGLDALALEH LGHRMISYAE VTGSGKTQKN FSQVEIETAS RYACEDADAT WLLHRKFSPL LAENGVEELF HRIEMPLVPI LAGMENHGVL LDIRLLADLS RDFSSRMATL EGRIFQLAGT TFNLNSPKQL GEVLFERLQL KTGKKTKGKT GWSTDNEVLS SLAEEHEIAR LILDYRGLSK LKSTYSDALP RLVHPRTGRV HTSYNQTVTS TGRLSSSDPN LQNIPIRSDE GRMIRHAFIA PPGHVIISAD YSQIELRVLA HLSGDPVFCQ AFEHDEDIHT RTASEVFGLF PEMVTAEMRR QAKTINFGII YGQGSFSLAR QLGIARKTAE EFISAYKERH AKAVGFLDDC IRQAEEQGFV TTILGRRLPI ADITSTNGNI RAFAQRNAIN YPIQGSAADI IKSAMIRVDG RIRSEGLKSR LIMQVHDELV FEVREDELLA MELLVEEEMS RAVELRIPLK VDISHGVNWS EAH
|
| |