Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0105 |
Symbol | |
ID | 5454288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 114688 |
End bp | 117627 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640875665 |
Product | DNA polymerase I |
Protein accession | YP_001411385 |
Protein GI | 154250561 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.1277 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAAGCG GCAAGGCGGA AACGGCAATG GCGGAAAAGA CCGGAAGAGC ACTGAAAAAA GGCGACCATC TCTTCCTGGT GGATGGTTCG GGCTATATTT TCCGCGCCTA TCATGCCCTC CCGCCGCTGA CGCGGAAATC GGACGGCATG CCGGTCGGCG CCGTCGCCGG CTTCTGCAAC ATGCTCTACA AGCTCATCGA GGACACCAAG GACGAGTTCG AGCCGACGCA TCTCGCCGTC ATTTTCGACG CCGCCGCCAA GACCTTCCGC AACGACATCT ACCCCGAATA CAAGGCCAAT CGTGTCGAGC CGTCGGAAGA CCTGCGCCCG CAATTCGCGC TTGTGCGCGA CGCGACCCGC GCCTTCGGTG TCCCCTGCAT CGAGAAGAAG GGCTACGAGG CCGACGACAT CATCGCCACC TATGCGCGCC TCGCGCATGA AGCGGGCGCC CGCGTCACCA TCGTCTCCTC CGACAAGGAT CTGATGCAGC TCGTGAACGA CAATGTCGAC ATGCTCGATA CCATGAAGCT GAAGACAATC GCGCGCGAAC AGGTAATCGA GAAATTCGGC GTGCCGCCGG AAAAGGTCGT CGATGTGCAG GCACTGGCGG GCGACTCCAC CGACAACGTT CCCGGCGTCC CCGGCATCGG TATCAAGACC GCGGCCCAGC TCATCGGCGA ATATGGCGAT CTCGAAACGC TGCTCGCCCG CGCCGGAGAG ATCAAGCAGC AGAAGCGCCG CGAAAACCTT ATCGAATTTG CCGAGCAGGC CCGCATCTCC CGCCGTCTTG TCGAGCTCGA CAATAACGTC CCCCTCGAAG AGCCGCTTGA GGGCATGGGC GTCCGCGAGC CCGATCCTGA AACGCTGATC GGCTTCTTCA AGGACATGGA ATTCAACACG CTGACGCGCC GCGTCGGCGA GCGTTTCAAC ATCGACGTCG ACGCCATTCC CGCTGCCGGA AAGCATGCCC TCATTACCGA TGGCGCGCTC GCTGCCCCCG CGGGCGAGGA GCCGAAAAAG GAAAAGCAGA CGGTCGCGCG CACCGCACGG GGCGGCACGC CGGGTGCAGT TCCGAAGGGC ATCGACGCGG ATTTCAACGA TGCGAATTAT GTTGCGGTAA CGGCGCTTGC CGATCTCGAC GAGTGGATCG CGCGCGCCCG CGAGCAGGGC TTCCTCGCCG TCGATACGGA GACGGACAGC CTCTTCCCGA TGCAGGCGCG CCTTGTTGGC GTCTCCCTTT CGCTGCTGCC CGGCGAGGCC TGTTACATCC CGCTGCAGCA TGGCGCTGGC GGCGGCCTCG ACTTCGCGGA TGCCGGTGGC CAGCCGCAAA TTCCGCTTAA GGAGGCTATC GCCCGCCTGA AACCGCTGCT TGAGGATCCT TCCATCCTGA AGATCGGTCA GAACCTGAAA TTCGACATGA CGGTCCTGCG TCAGCATGGC ATCCAATTGA AAGGTCTCGA CGACACGATG CTCATGTCCT ACGCGCTCGA CGCAGGCGTG CATGGCCACG GCATGGACGA ATTGTCGGAA CTGCATCTCG GCCACAAGCC GATTTCCTTC GCGGAAGTCG CGGGCAAGGG CAAGGCGCAG ATCACCTTCG ACCAGGTGCC GGTGGACCGC GCCACCGCCT ATGCCGCCGA AGATGCCGAC GTCACACTCC GCCTCTGGCA TATCCTGAAG CCGCGCCTCG TCGCGGAGCG TCGCGTTACT GTTTATGAAA CGCTGGAGCG TCCGCTCGTT TCCGTTCTCG CGGAAATGGA GCGAGCCGGC GTCAAGGTCG ACAAGGCGGT GCTCGCGCGC CTCTCCGGCG ATTTTTCGCA GAAGATGGCG CAATATGAGG ATGAGATCTA CGAGCTTGCC GGCGAACGCT TCAATATCGG CTCGCCGAAA CAGCTCGGCG AAATCCTCTT CGACAAGCAA AGCCTCGAAG GCGGCCGCAA AACCAAGACC GGCGCCTGGT CGACCGACGC CGACACGCTT GAGGCGCTGG CCGCGAAAGG CCATGAGCTG CCGCAGCGCG TGCTCGACTG GCGCGGGCTT TCCAAGCTGA AAAGCACCTA TACGGATGCA CTCCCTGAAT ATATCAACCC CGAGACCGGC CGCATCCACA CCTGCTACTC GCTCGCCTCG ACATCGACCG GCCGCCTCGC GTCAACCGAG CCGAACCTGC AGAACATTCC CGTGCGCACG GAAGACGGCC GGAAAATCAG AACGGCCTTT GTCGCCGAGA AGGGAAATCT TCTCATCTCC GCCGACTACA GCCAGATCGA GTTGCGCCTC CTCGCCCATA TCGCGGATAT CGAGGCGCTG AAGAAGGCCT TTGCCGAAGG TCTCGATATT CATGCGATGA CGGCATCGGA AATGTTCGGC GTGCCCATCG AGGGCATGGA GTCTTCCGTT CGCCGCCGCG CCAAGGCCAT CAATTTCGGC ATCATCTACG GCATATCCGC CTTCGGCCTG GCCAACCAGC TCGGCATCCC GCGGCAGGAG GCGGGAGAAT ATATCGATCG CTACTTCAAG CGTTTCCCCG GCATCCGCGC CTATATGGAC GACACCCGGG ATTTCGCTCA CAAGAACGGT TATGTCGAAA CGATCTTCGG CCGCCGCATT CACCTCCCCG CGATCAATTC TAAGAATCCC GCGGAGAAAA GCTTCATGGA GCGCGCCGCC ATCAACGCGC CGATTCAGGG CTCGGCCGCC GACATCATCC GCCGCGCCAT GATCCGCATG CCGCAGGCAT TGGCGGATGC GAAGCTCGCC GCGCGGATGC TGCTGCAGGT TCATGACGAA TTGATTTTCG AAGTGCCGGA AAAGGAAGCC GAGAAGACGA GCAAGGTGGT GTCGCGCATC ATGTCGGATG CCGCCGCGCC CGCCGTGGCG CTGACTGTGC CGCTCGATGT CGATGCCCGT GCCGCGAAAA ACTGGGACGA GGCGCATTAG
|
Protein sequence | MASGKAETAM AEKTGRALKK GDHLFLVDGS GYIFRAYHAL PPLTRKSDGM PVGAVAGFCN MLYKLIEDTK DEFEPTHLAV IFDAAAKTFR NDIYPEYKAN RVEPSEDLRP QFALVRDATR AFGVPCIEKK GYEADDIIAT YARLAHEAGA RVTIVSSDKD LMQLVNDNVD MLDTMKLKTI AREQVIEKFG VPPEKVVDVQ ALAGDSTDNV PGVPGIGIKT AAQLIGEYGD LETLLARAGE IKQQKRRENL IEFAEQARIS RRLVELDNNV PLEEPLEGMG VREPDPETLI GFFKDMEFNT LTRRVGERFN IDVDAIPAAG KHALITDGAL AAPAGEEPKK EKQTVARTAR GGTPGAVPKG IDADFNDANY VAVTALADLD EWIARAREQG FLAVDTETDS LFPMQARLVG VSLSLLPGEA CYIPLQHGAG GGLDFADAGG QPQIPLKEAI ARLKPLLEDP SILKIGQNLK FDMTVLRQHG IQLKGLDDTM LMSYALDAGV HGHGMDELSE LHLGHKPISF AEVAGKGKAQ ITFDQVPVDR ATAYAAEDAD VTLRLWHILK PRLVAERRVT VYETLERPLV SVLAEMERAG VKVDKAVLAR LSGDFSQKMA QYEDEIYELA GERFNIGSPK QLGEILFDKQ SLEGGRKTKT GAWSTDADTL EALAAKGHEL PQRVLDWRGL SKLKSTYTDA LPEYINPETG RIHTCYSLAS TSTGRLASTE PNLQNIPVRT EDGRKIRTAF VAEKGNLLIS ADYSQIELRL LAHIADIEAL KKAFAEGLDI HAMTASEMFG VPIEGMESSV RRRAKAINFG IIYGISAFGL ANQLGIPRQE AGEYIDRYFK RFPGIRAYMD DTRDFAHKNG YVETIFGRRI HLPAINSKNP AEKSFMERAA INAPIQGSAA DIIRRAMIRM PQALADAKLA ARMLLQVHDE LIFEVPEKEA EKTSKVVSRI MSDAAAPAVA LTVPLDVDAR AAKNWDEAH
|
| |