Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4479 |
Symbol | |
ID | 4070962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5314821 |
End bp | 5317610 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986518 |
Product | DNA polymerase I |
Protein accession | YP_593553 |
Protein GI | 94971505 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.578142 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCGCGA AGAAGAAATC TGCCAGCAGC CCCGCCGACT CCACTGCGCC CATAACCGAA TACAAGGCGC AGGTTCCAGC CGACGATAAA AAGCTCGGCC GGGTTTTCCT GATCGATACC TTCGGCTTCG TCTTCCGCGC TTATCACGCC ATGGCGCGGC AGCGTCCCAT GTCCACGAAA ACTGGCATCC CAACGAGCGC GACCTATGTG TTCGTCAACA TGCTCAACAA GCTGCGCCAG GACTTCGCGC CCGAGCACAT TGCGGCGATC ATGGAGGGCG GCAAGACGTT TCGTGATGAA GAGGCCGCAG CGGTCGCGAC GATCAATAAG TTCGACATCA AAACCCAGAC CTTCCAGGAG ATCGCATACG GCGGTTACAA GGCGAACCGC ACCGAGATGC CAGAAGATCT CACCCAGCAG ATGCCATACA TCGAACGTGC GCTGAATGCC TATCGCATCC CGATGATCTC GGCCGAAGGC TTCGAGGCTG ACGATGTGAT CGGCACACTG GCGAAGAAGG CTGCGGATGG AGGCTACCCG GTTTATATCG TCTCCAGCGA CAAGGACATG ATGCAGCTCG TGACCGAACG CGTCTGCATT CTGAATCCGC CGAAAGACAA CCTGATCTGC GATCCGAAGA AAGTGGAAGA GATCCTTGGC GTACCGCCCG AACGCGTGGT GGACGTGATG GCGCTGCGCG GCGACTCCAT CGACAACGTC CCAGGCGCAC CCGGAATCGG CGACAAAGGC TCAGTGCAAT TGATCCAGCG CTTCGGCACC GTCGAGGCCG CACTCGATCA TGCTGGCGAA GTCGAAAGCA AGCGCCAGCG CGAATCGTTG CAGCAGAACC GCGACGCCGT GCTCTTCAGC AAGCGCATGG TGACCATTCG TACCGACATC GATATGCCGT TCGAGCCTCA GGCAATGCGC GCGCAAGACC CGGATTACGA GGCTTGCAAG GCGCTCTTCG CGGAGCTCGA ATTCAACAAC CTGCTCAAGC AATTTCTGAC CGAAGGCTCC GAAGTTGGCG AGACCGATTA CGCCGATGCG AAGTCGACTG ACGAAATCAA AGCGCTGCTG AGAGATGTAA ACGCCGATCA CCCGCTCGCG GTGGCAATCG CGCACCTCGA TGCTCCATCA CTGGCGGCAG AAGAAGCCGA GCCGGAGGAG GACGCACCGC AACTCGCGCT CGCGATGGCC GAACCTGTTG CTACACCGCA GGTCACGAGT GTCGCGCTAT CCAGCAGAGA AGGTGCCGCC CGCGCCGTGG AATTGAAAGG CGAGTCCGGT GAAGTGGTTC GCCGCGCGCT CGCCGATCCA ACGGTCCCCA AGGCCGTGCA TGATGCGAAA GCCGCGATGC ATGCGGGCCT GCCGCTCGAG TCGGTTGAGC ACGACACCAT GCTCTACGAA TACCTGCTCG ATCCGACGTA CACAACCTAC CGACTTCCCG ATGTTGTGCT CCGCCGGTTG AACCTGAAGC TTGCTGGAAC TCTGCCCGAA GCGGCGGACA TGACCCATCG CCTCACGACC AAGCTGCACA AGCTGGTCGA AGATGGCGGG CTGATGAAGG TGTATGAGGA CATCGATCGC CCGCTGGTTA CGGTGCTCTA TGCGATGGAA GCCGCCGGCG TGAAGCTTGA CTGCGATGTG CTCGCCGAGA TGTCGACGCG CCTGCAGAGA GATGCGGATG CGCTCGCTCG CAAGATCTAC GGCCTCTCCG GACAGGAGTT CAACATCAAC TCGCCGAAGC AACTCGGAGA CGTGTTGTTC AATAAACTCA ATCTGCCTAA GCCGGTGAAG TACGGGAAGG GCAAGACCAT ATCGACCGCC GTCGATGTAT TAGAAGGGCT CGCCGGGGAA CACGAAGTTC CGAAGCTGGT GCTCGAATAT CGCCAGTTCA CGAAGCTCAA ATCAACCTAC GTGGACGCGC TGCCGAATCT CTGCCACGCC GGCACCGGGC GCTTGCACAC GACCTTCGCA CAAGCGGCGA CTTCCACTGG GCGTCTCTCT TCCGTCAATC CGAATTTGCA GAACATTCCC ATCCGCACCG AGCTAGGCCG CGAGATTCGC GCTGCGTTCG TCGCCGAAAA AGGCAACGTG CTGCTTGCCG CGGATTATTC GCAAATCGAA CTCCGACTCC TCGCACACTT CTCGCAAGAC CGCCTGCTGG TGGACGCCTA CAACAACGAC CGCGACATTC ATGCCCTAAC CGCCAGCGAA GTTTTCGGTG TGCCGCCGAT GATGATTGAT GCCGAACATC GCCGCCGGGC GAAGGCCGTG AACTTCGGCA TCGTCTATGG CATCTCGCCG TTCGGACTCT CGCAGCAACT CGGCATAGAC ACGAAAGAAT CGAAGCGCTA TATCGAGAGT TACTTCGAAC GTTACAGCGG CGTTCGCGAG TGGCTCAACA GCGTGCTGGA GCAAGTTCGC AAGGACGAAA AAGTCAGCAC ACTCTTCGGT CGCATCCGGC CAATCCCCGA CATCCACAGC CGCAATCCAA ATCTCCGCGG TTTTGCCGAA CGCACGGCGA CGAACACACC GCTGCAGGGC ACGGCAGCCG ATCTCATCAA GCTGGCGATG ATCCGCATTC ACCGCGATCT CATCGAGCGC AAGTTGAAAA CGCGCATGCT GCTCCAGGTG CATGACGAGC TCGTCTTCGA AGTTCCGCAG GCGGAAGTCG AAGAAGTGCG TGCGCTGGTT CAGGACCGAA TGGAAAACGT GCATCCCGAG CTGACGGTGC CGCTGAAAGT GGATGTCGGC GTAGGAAAAA ACTGGCGCGA TATGGATTAA
|
Protein sequence | MPAKKKSASS PADSTAPITE YKAQVPADDK KLGRVFLIDT FGFVFRAYHA MARQRPMSTK TGIPTSATYV FVNMLNKLRQ DFAPEHIAAI MEGGKTFRDE EAAAVATINK FDIKTQTFQE IAYGGYKANR TEMPEDLTQQ MPYIERALNA YRIPMISAEG FEADDVIGTL AKKAADGGYP VYIVSSDKDM MQLVTERVCI LNPPKDNLIC DPKKVEEILG VPPERVVDVM ALRGDSIDNV PGAPGIGDKG SVQLIQRFGT VEAALDHAGE VESKRQRESL QQNRDAVLFS KRMVTIRTDI DMPFEPQAMR AQDPDYEACK ALFAELEFNN LLKQFLTEGS EVGETDYADA KSTDEIKALL RDVNADHPLA VAIAHLDAPS LAAEEAEPEE DAPQLALAMA EPVATPQVTS VALSSREGAA RAVELKGESG EVVRRALADP TVPKAVHDAK AAMHAGLPLE SVEHDTMLYE YLLDPTYTTY RLPDVVLRRL NLKLAGTLPE AADMTHRLTT KLHKLVEDGG LMKVYEDIDR PLVTVLYAME AAGVKLDCDV LAEMSTRLQR DADALARKIY GLSGQEFNIN SPKQLGDVLF NKLNLPKPVK YGKGKTISTA VDVLEGLAGE HEVPKLVLEY RQFTKLKSTY VDALPNLCHA GTGRLHTTFA QAATSTGRLS SVNPNLQNIP IRTELGREIR AAFVAEKGNV LLAADYSQIE LRLLAHFSQD RLLVDAYNND RDIHALTASE VFGVPPMMID AEHRRRAKAV NFGIVYGISP FGLSQQLGID TKESKRYIES YFERYSGVRE WLNSVLEQVR KDEKVSTLFG RIRPIPDIHS RNPNLRGFAE RTATNTPLQG TAADLIKLAM IRIHRDLIER KLKTRMLLQV HDELVFEVPQ AEVEEVRALV QDRMENVHPE LTVPLKVDVG VGKNWRDMD
|
| |