Gene Plav_0105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_0105 
Symbol 
ID5454288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp114688 
End bp117627 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content62% 
IMG OID640875665 
ProductDNA polymerase I 
Protein accessionYP_001411385 
Protein GI154250561 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI)
[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.1277 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCAAGCG GCAAGGCGGA AACGGCAATG GCGGAAAAGA CCGGAAGAGC ACTGAAAAAA 
GGCGACCATC TCTTCCTGGT GGATGGTTCG GGCTATATTT TCCGCGCCTA TCATGCCCTC
CCGCCGCTGA CGCGGAAATC GGACGGCATG CCGGTCGGCG CCGTCGCCGG CTTCTGCAAC
ATGCTCTACA AGCTCATCGA GGACACCAAG GACGAGTTCG AGCCGACGCA TCTCGCCGTC
ATTTTCGACG CCGCCGCCAA GACCTTCCGC AACGACATCT ACCCCGAATA CAAGGCCAAT
CGTGTCGAGC CGTCGGAAGA CCTGCGCCCG CAATTCGCGC TTGTGCGCGA CGCGACCCGC
GCCTTCGGTG TCCCCTGCAT CGAGAAGAAG GGCTACGAGG CCGACGACAT CATCGCCACC
TATGCGCGCC TCGCGCATGA AGCGGGCGCC CGCGTCACCA TCGTCTCCTC CGACAAGGAT
CTGATGCAGC TCGTGAACGA CAATGTCGAC ATGCTCGATA CCATGAAGCT GAAGACAATC
GCGCGCGAAC AGGTAATCGA GAAATTCGGC GTGCCGCCGG AAAAGGTCGT CGATGTGCAG
GCACTGGCGG GCGACTCCAC CGACAACGTT CCCGGCGTCC CCGGCATCGG TATCAAGACC
GCGGCCCAGC TCATCGGCGA ATATGGCGAT CTCGAAACGC TGCTCGCCCG CGCCGGAGAG
ATCAAGCAGC AGAAGCGCCG CGAAAACCTT ATCGAATTTG CCGAGCAGGC CCGCATCTCC
CGCCGTCTTG TCGAGCTCGA CAATAACGTC CCCCTCGAAG AGCCGCTTGA GGGCATGGGC
GTCCGCGAGC CCGATCCTGA AACGCTGATC GGCTTCTTCA AGGACATGGA ATTCAACACG
CTGACGCGCC GCGTCGGCGA GCGTTTCAAC ATCGACGTCG ACGCCATTCC CGCTGCCGGA
AAGCATGCCC TCATTACCGA TGGCGCGCTC GCTGCCCCCG CGGGCGAGGA GCCGAAAAAG
GAAAAGCAGA CGGTCGCGCG CACCGCACGG GGCGGCACGC CGGGTGCAGT TCCGAAGGGC
ATCGACGCGG ATTTCAACGA TGCGAATTAT GTTGCGGTAA CGGCGCTTGC CGATCTCGAC
GAGTGGATCG CGCGCGCCCG CGAGCAGGGC TTCCTCGCCG TCGATACGGA GACGGACAGC
CTCTTCCCGA TGCAGGCGCG CCTTGTTGGC GTCTCCCTTT CGCTGCTGCC CGGCGAGGCC
TGTTACATCC CGCTGCAGCA TGGCGCTGGC GGCGGCCTCG ACTTCGCGGA TGCCGGTGGC
CAGCCGCAAA TTCCGCTTAA GGAGGCTATC GCCCGCCTGA AACCGCTGCT TGAGGATCCT
TCCATCCTGA AGATCGGTCA GAACCTGAAA TTCGACATGA CGGTCCTGCG TCAGCATGGC
ATCCAATTGA AAGGTCTCGA CGACACGATG CTCATGTCCT ACGCGCTCGA CGCAGGCGTG
CATGGCCACG GCATGGACGA ATTGTCGGAA CTGCATCTCG GCCACAAGCC GATTTCCTTC
GCGGAAGTCG CGGGCAAGGG CAAGGCGCAG ATCACCTTCG ACCAGGTGCC GGTGGACCGC
GCCACCGCCT ATGCCGCCGA AGATGCCGAC GTCACACTCC GCCTCTGGCA TATCCTGAAG
CCGCGCCTCG TCGCGGAGCG TCGCGTTACT GTTTATGAAA CGCTGGAGCG TCCGCTCGTT
TCCGTTCTCG CGGAAATGGA GCGAGCCGGC GTCAAGGTCG ACAAGGCGGT GCTCGCGCGC
CTCTCCGGCG ATTTTTCGCA GAAGATGGCG CAATATGAGG ATGAGATCTA CGAGCTTGCC
GGCGAACGCT TCAATATCGG CTCGCCGAAA CAGCTCGGCG AAATCCTCTT CGACAAGCAA
AGCCTCGAAG GCGGCCGCAA AACCAAGACC GGCGCCTGGT CGACCGACGC CGACACGCTT
GAGGCGCTGG CCGCGAAAGG CCATGAGCTG CCGCAGCGCG TGCTCGACTG GCGCGGGCTT
TCCAAGCTGA AAAGCACCTA TACGGATGCA CTCCCTGAAT ATATCAACCC CGAGACCGGC
CGCATCCACA CCTGCTACTC GCTCGCCTCG ACATCGACCG GCCGCCTCGC GTCAACCGAG
CCGAACCTGC AGAACATTCC CGTGCGCACG GAAGACGGCC GGAAAATCAG AACGGCCTTT
GTCGCCGAGA AGGGAAATCT TCTCATCTCC GCCGACTACA GCCAGATCGA GTTGCGCCTC
CTCGCCCATA TCGCGGATAT CGAGGCGCTG AAGAAGGCCT TTGCCGAAGG TCTCGATATT
CATGCGATGA CGGCATCGGA AATGTTCGGC GTGCCCATCG AGGGCATGGA GTCTTCCGTT
CGCCGCCGCG CCAAGGCCAT CAATTTCGGC ATCATCTACG GCATATCCGC CTTCGGCCTG
GCCAACCAGC TCGGCATCCC GCGGCAGGAG GCGGGAGAAT ATATCGATCG CTACTTCAAG
CGTTTCCCCG GCATCCGCGC CTATATGGAC GACACCCGGG ATTTCGCTCA CAAGAACGGT
TATGTCGAAA CGATCTTCGG CCGCCGCATT CACCTCCCCG CGATCAATTC TAAGAATCCC
GCGGAGAAAA GCTTCATGGA GCGCGCCGCC ATCAACGCGC CGATTCAGGG CTCGGCCGCC
GACATCATCC GCCGCGCCAT GATCCGCATG CCGCAGGCAT TGGCGGATGC GAAGCTCGCC
GCGCGGATGC TGCTGCAGGT TCATGACGAA TTGATTTTCG AAGTGCCGGA AAAGGAAGCC
GAGAAGACGA GCAAGGTGGT GTCGCGCATC ATGTCGGATG CCGCCGCGCC CGCCGTGGCG
CTGACTGTGC CGCTCGATGT CGATGCCCGT GCCGCGAAAA ACTGGGACGA GGCGCATTAG
 
Protein sequence
MASGKAETAM AEKTGRALKK GDHLFLVDGS GYIFRAYHAL PPLTRKSDGM PVGAVAGFCN 
MLYKLIEDTK DEFEPTHLAV IFDAAAKTFR NDIYPEYKAN RVEPSEDLRP QFALVRDATR
AFGVPCIEKK GYEADDIIAT YARLAHEAGA RVTIVSSDKD LMQLVNDNVD MLDTMKLKTI
AREQVIEKFG VPPEKVVDVQ ALAGDSTDNV PGVPGIGIKT AAQLIGEYGD LETLLARAGE
IKQQKRRENL IEFAEQARIS RRLVELDNNV PLEEPLEGMG VREPDPETLI GFFKDMEFNT
LTRRVGERFN IDVDAIPAAG KHALITDGAL AAPAGEEPKK EKQTVARTAR GGTPGAVPKG
IDADFNDANY VAVTALADLD EWIARAREQG FLAVDTETDS LFPMQARLVG VSLSLLPGEA
CYIPLQHGAG GGLDFADAGG QPQIPLKEAI ARLKPLLEDP SILKIGQNLK FDMTVLRQHG
IQLKGLDDTM LMSYALDAGV HGHGMDELSE LHLGHKPISF AEVAGKGKAQ ITFDQVPVDR
ATAYAAEDAD VTLRLWHILK PRLVAERRVT VYETLERPLV SVLAEMERAG VKVDKAVLAR
LSGDFSQKMA QYEDEIYELA GERFNIGSPK QLGEILFDKQ SLEGGRKTKT GAWSTDADTL
EALAAKGHEL PQRVLDWRGL SKLKSTYTDA LPEYINPETG RIHTCYSLAS TSTGRLASTE
PNLQNIPVRT EDGRKIRTAF VAEKGNLLIS ADYSQIELRL LAHIADIEAL KKAFAEGLDI
HAMTASEMFG VPIEGMESSV RRRAKAINFG IIYGISAFGL ANQLGIPRQE AGEYIDRYFK
RFPGIRAYMD DTRDFAHKNG YVETIFGRRI HLPAINSKNP AEKSFMERAA INAPIQGSAA
DIIRRAMIRM PQALADAKLA ARMLLQVHDE LIFEVPEKEA EKTSKVVSRI MSDAAAPAVA
LTVPLDVDAR AAKNWDEAH