Gene Acid345_4479 details


Gene Information

Locus tag: Acid345_4479
Symbol:
ID: 4070962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5314821
End bp: 5317610
Gene Length: 2790 bp
Protein Length: 929 aa
Translation table: 11
GC content: 59%
IMG OID: 637986518
Product: DNA polymerase I
Protein accession: YP_593553
Protein GI: 94971505
COG category: [L] Replication, recombination and repair
COG ID: [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
        [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains
TIGRFAM ID: [TIGR00593] DNA polymerase I
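
The coordinates and lengths reported above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon. Both are assumptions about how this record reports its values, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 5314821
end_bp = 5317610

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2790 929
```

Under these assumptions the computed values match the reported Gene Length (2790 bp) and Protein Length (929 aa).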


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.578142
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCGA AGAAGAAATC TGCCAGCAGC CCCGCCGACT CCACTGCGCC CATAACCGAA 
TACAAGGCGC AGGTTCCAGC CGACGATAAA AAGCTCGGCC GGGTTTTCCT GATCGATACC
TTCGGCTTCG TCTTCCGCGC TTATCACGCC ATGGCGCGGC AGCGTCCCAT GTCCACGAAA
ACTGGCATCC CAACGAGCGC GACCTATGTG TTCGTCAACA TGCTCAACAA GCTGCGCCAG
GACTTCGCGC CCGAGCACAT TGCGGCGATC ATGGAGGGCG GCAAGACGTT TCGTGATGAA
GAGGCCGCAG CGGTCGCGAC GATCAATAAG TTCGACATCA AAACCCAGAC CTTCCAGGAG
ATCGCATACG GCGGTTACAA GGCGAACCGC ACCGAGATGC CAGAAGATCT CACCCAGCAG
ATGCCATACA TCGAACGTGC GCTGAATGCC TATCGCATCC CGATGATCTC GGCCGAAGGC
TTCGAGGCTG ACGATGTGAT CGGCACACTG GCGAAGAAGG CTGCGGATGG AGGCTACCCG
GTTTATATCG TCTCCAGCGA CAAGGACATG ATGCAGCTCG TGACCGAACG CGTCTGCATT
CTGAATCCGC CGAAAGACAA CCTGATCTGC GATCCGAAGA AAGTGGAAGA GATCCTTGGC
GTACCGCCCG AACGCGTGGT GGACGTGATG GCGCTGCGCG GCGACTCCAT CGACAACGTC
CCAGGCGCAC CCGGAATCGG CGACAAAGGC TCAGTGCAAT TGATCCAGCG CTTCGGCACC
GTCGAGGCCG CACTCGATCA TGCTGGCGAA GTCGAAAGCA AGCGCCAGCG CGAATCGTTG
CAGCAGAACC GCGACGCCGT GCTCTTCAGC AAGCGCATGG TGACCATTCG TACCGACATC
GATATGCCGT TCGAGCCTCA GGCAATGCGC GCGCAAGACC CGGATTACGA GGCTTGCAAG
GCGCTCTTCG CGGAGCTCGA ATTCAACAAC CTGCTCAAGC AATTTCTGAC CGAAGGCTCC
GAAGTTGGCG AGACCGATTA CGCCGATGCG AAGTCGACTG ACGAAATCAA AGCGCTGCTG
AGAGATGTAA ACGCCGATCA CCCGCTCGCG GTGGCAATCG CGCACCTCGA TGCTCCATCA
CTGGCGGCAG AAGAAGCCGA GCCGGAGGAG GACGCACCGC AACTCGCGCT CGCGATGGCC
GAACCTGTTG CTACACCGCA GGTCACGAGT GTCGCGCTAT CCAGCAGAGA AGGTGCCGCC
CGCGCCGTGG AATTGAAAGG CGAGTCCGGT GAAGTGGTTC GCCGCGCGCT CGCCGATCCA
ACGGTCCCCA AGGCCGTGCA TGATGCGAAA GCCGCGATGC ATGCGGGCCT GCCGCTCGAG
TCGGTTGAGC ACGACACCAT GCTCTACGAA TACCTGCTCG ATCCGACGTA CACAACCTAC
CGACTTCCCG ATGTTGTGCT CCGCCGGTTG AACCTGAAGC TTGCTGGAAC TCTGCCCGAA
GCGGCGGACA TGACCCATCG CCTCACGACC AAGCTGCACA AGCTGGTCGA AGATGGCGGG
CTGATGAAGG TGTATGAGGA CATCGATCGC CCGCTGGTTA CGGTGCTCTA TGCGATGGAA
GCCGCCGGCG TGAAGCTTGA CTGCGATGTG CTCGCCGAGA TGTCGACGCG CCTGCAGAGA
GATGCGGATG CGCTCGCTCG CAAGATCTAC GGCCTCTCCG GACAGGAGTT CAACATCAAC
TCGCCGAAGC AACTCGGAGA CGTGTTGTTC AATAAACTCA ATCTGCCTAA GCCGGTGAAG
TACGGGAAGG GCAAGACCAT ATCGACCGCC GTCGATGTAT TAGAAGGGCT CGCCGGGGAA
CACGAAGTTC CGAAGCTGGT GCTCGAATAT CGCCAGTTCA CGAAGCTCAA ATCAACCTAC
GTGGACGCGC TGCCGAATCT CTGCCACGCC GGCACCGGGC GCTTGCACAC GACCTTCGCA
CAAGCGGCGA CTTCCACTGG GCGTCTCTCT TCCGTCAATC CGAATTTGCA GAACATTCCC
ATCCGCACCG AGCTAGGCCG CGAGATTCGC GCTGCGTTCG TCGCCGAAAA AGGCAACGTG
CTGCTTGCCG CGGATTATTC GCAAATCGAA CTCCGACTCC TCGCACACTT CTCGCAAGAC
CGCCTGCTGG TGGACGCCTA CAACAACGAC CGCGACATTC ATGCCCTAAC CGCCAGCGAA
GTTTTCGGTG TGCCGCCGAT GATGATTGAT GCCGAACATC GCCGCCGGGC GAAGGCCGTG
AACTTCGGCA TCGTCTATGG CATCTCGCCG TTCGGACTCT CGCAGCAACT CGGCATAGAC
ACGAAAGAAT CGAAGCGCTA TATCGAGAGT TACTTCGAAC GTTACAGCGG CGTTCGCGAG
TGGCTCAACA GCGTGCTGGA GCAAGTTCGC AAGGACGAAA AAGTCAGCAC ACTCTTCGGT
CGCATCCGGC CAATCCCCGA CATCCACAGC CGCAATCCAA ATCTCCGCGG TTTTGCCGAA
CGCACGGCGA CGAACACACC GCTGCAGGGC ACGGCAGCCG ATCTCATCAA GCTGGCGATG
ATCCGCATTC ACCGCGATCT CATCGAGCGC AAGTTGAAAA CGCGCATGCT GCTCCAGGTG
CATGACGAGC TCGTCTTCGA AGTTCCGCAG GCGGAAGTCG AAGAAGTGCG TGCGCTGGTT
CAGGACCGAA TGGAAAACGT GCATCCCGAG CTGACGGTGC CGCTGAAAGT GGATGTCGGC
GTAGGAAAAA ACTGGCGCGA TATGGATTAA
 
Protein sequence
MPAKKKSASS PADSTAPITE YKAQVPADDK KLGRVFLIDT FGFVFRAYHA MARQRPMSTK 
TGIPTSATYV FVNMLNKLRQ DFAPEHIAAI MEGGKTFRDE EAAAVATINK FDIKTQTFQE
IAYGGYKANR TEMPEDLTQQ MPYIERALNA YRIPMISAEG FEADDVIGTL AKKAADGGYP
VYIVSSDKDM MQLVTERVCI LNPPKDNLIC DPKKVEEILG VPPERVVDVM ALRGDSIDNV
PGAPGIGDKG SVQLIQRFGT VEAALDHAGE VESKRQRESL QQNRDAVLFS KRMVTIRTDI
DMPFEPQAMR AQDPDYEACK ALFAELEFNN LLKQFLTEGS EVGETDYADA KSTDEIKALL
RDVNADHPLA VAIAHLDAPS LAAEEAEPEE DAPQLALAMA EPVATPQVTS VALSSREGAA
RAVELKGESG EVVRRALADP TVPKAVHDAK AAMHAGLPLE SVEHDTMLYE YLLDPTYTTY
RLPDVVLRRL NLKLAGTLPE AADMTHRLTT KLHKLVEDGG LMKVYEDIDR PLVTVLYAME
AAGVKLDCDV LAEMSTRLQR DADALARKIY GLSGQEFNIN SPKQLGDVLF NKLNLPKPVK
YGKGKTISTA VDVLEGLAGE HEVPKLVLEY RQFTKLKSTY VDALPNLCHA GTGRLHTTFA
QAATSTGRLS SVNPNLQNIP IRTELGREIR AAFVAEKGNV LLAADYSQIE LRLLAHFSQD
RLLVDAYNND RDIHALTASE VFGVPPMMID AEHRRRAKAV NFGIVYGISP FGLSQQLGID
TKESKRYIES YFERYSGVRE WLNSVLEQVR KDEKVSTLFG RIRPIPDIHS RNPNLRGFAE
RTATNTPLQG TAADLIKLAM IRIHRDLIER KLKTRMLLQV HDELVFEVPQ AEVEEVRALV
QDRMENVHPE LTVPLKVDVG VGKNWRDMD
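
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. Under translation table 11, GTG is one of the permitted initiation codons, and an initiation codon is read as methionine. The following is a minimal sketch of that behaviour on the first 30 nucleotides of the gene. The codon map is deliberately partial (it covers only the codons in this fragment), the set of start codons is a subset chosen for illustration, and the function name `translate_fragment` is hypothetical.

```python
# Partial codon map: only the codons that occur in the fragment below.
CODON_TO_AA = {
    "GTG": "V", "CCC": "P", "GCG": "A", "AAG": "K",
    "AAA": "K", "TCT": "S", "GCC": "A", "AGC": "S",
}

# Assumption: a subset of the initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_fragment(dna):
    """Translate a 5' gene fragment, reading the first codon as an initiator."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First 30 nucleotides of the gene sequence, with spaces removed.
first_30_nt = "GTGCCCGCGAAGAAGAAATCTGCCAGCAGC"
print(translate_fragment(first_30_nt))  # MPAKKKSASS
```

The result matches the first ten residues of the protein sequence above. A complete translation would need the full 64-codon table and stop-codon handling, which this sketch omits.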