Gene PG2072 details


Gene Information

Locus tag: PG2072
Symbol:
ID: 2552024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2167509
End bp: 2170817
Gene Length: 3309 bp
Protein Length: 1102 aa
Translation table: 11
GC content: 51%
IMG OID: 637150646
Product: UvrD/REP helicase domain-containing protein
Protein accession: NP_906134
Protein GI: 34541655
COG category: [L] Replication, recombination and repair
COG ID: [COG1074] ATP-dependent exoDNAse (exonuclease V) beta subunit (contains helicase and exonuclease domains)
TIGRFAM ID:
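
The coordinate and length fields above are consistent with each other if the start and end positions are read as 1-based, inclusive coordinates and the final codon is a stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check, with hypothetical function names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2167509, 2170817)
print(length)                     # 3309
print(protein_length_aa(length))  # 1102
```

The two printed values match the Gene Length and Protein Length fields listed for PG2072.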


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGCT CCAAGCAACT CCGCATTTAT ACAGCTTCGG CGGGTTCCGG CAAGACTCAT 
ACACTTACCG GCGAATATCT TCGTTTAGCC CTGCGTACGC GCGGGGCTTT CCGTTATATA
CAGGCTGTCA CGTTCACGAA CAAAGCCACT GCCGAAATGA AGGAGCGTAT TCTCGAGGAG
CTGTACAGTC TGGCTGTGGG CGGATCGTCC CCTTTCGCCG AGGAGCTGAT GCAGGAGTTG
GCTCTGACTA CCGAGCAGCT ACAAGTCAGA GCACAGGAAG TCCTGACCGA AATACTAAAC
GACTATTCTT CTTTGCGAGT CAAGACCATT GACTCCTTCT TTCAGGAAGT AATGCGTGCC
TTCTCTCATG AATTGGGACT GCCGGGTGGT TTTCGGATCG AGATGGAGCA GAAGGCCGTG
CTCGAACAGG CTGTCGTCCG TCTGCTGCAC AGCTTGGGAG AAAAAGATAC CTCCGACGTC
GAGAATTGGA TCAGGCGTTT GGCTGAAGAC CTGATCGAAG AGGGGCGTGG ACATAACATC
CGCAGGGAGA TAGTCAGCTT GGGTGATGAG CTTTTCAAAG AACAGCTACT CCTGCTATCC
GAGGAAGGCA AACTACCGAC CAAAGCTGCC ATTCATCGCT ATCAAACCGA AATGAACAAG
CTGATGGAAG GCTTCGAACA GCGACGTCTG TCTATTGCAC GTCGGGCGGA AGAAATCGTC
GCCACAGCCG GAATCAGCTT CTACGATTTC AAAGGGGGTA CCAAAGGAGG TATCTTAGAG
TTTGCCAAGG TGCTCAAAGG AGGAGAAGTC AAGCCTCCCA CAAAGACCTT TATGGCGATG
GCAGAAGGCG ATCCGGAGAC CACACTCTAC GCCAAAACCA CTCCTGCCAC CACCCAGGCA
GCCATCCTCT CGGCTTACCA ATCCGGTCTC AAAGAGTGTC TGACCGAAAT GGCAACACTC
TATCTCGGCA GGGAATGGCA AGAGTATTCC ACGGCCAAGC AGTCGCTTCC CTTTCTGAAC
CGGCTTGGTA TCATCTCCGA CCTTTGGCGA CAAATCGAAG GAATCAGACA AGAAGAGAAC
AAAATGCTCA TTTCCGATGC TCCGTCTCTA TTGCACAGGA TCATCGACGG GAGCGAGACC
CCTTTCGTCT ATGACAAAAT CGGGGTACGC ATCGAACACG AAATGATCGA TGAATTTCAA
GATACCAGCC GCTTACAGTA CGAAAACTTC AAGCCCCTGC TGTCCGAGAG TCTGGCTCAC
GGCAAGTACA ACCTCCTCGT CGGCGATGCC AAACAGAGTA TATACCGCTT CCGCAATGCC
GACCGGCGTC TGCTCACAGA GGTCGTCAGT CGGGATTTTG CCGAAACATC GGAGAGGGTC
AATCTACCAT ATAATTGGCG AAGCACTCCC GAAATAATCG AGTTCAACAA CTCCCTCTAC
AAGCATCTGC CACAAATCCT CTGTGAGGCT ATGACTCGCG AAGCCGAGAC GATGGCAATG
CCCGATCCTA AGTTGCCGGA GGAGATCAAC CGTACATTCA TGCAGACCTA TGCCGATGTG
GAGCAGCTTG TGCCACCGGC TAAAGTAGAC CGGCATGGAA GCGTCTGCAT TTATCTTCCG
TCGCCGGCAT CATCAGAAGA AGCGGAAGCC AATCTTTCGT GGGAGGAGCA GATCTTGCAG
GATCTTCCCC GTTTTATCAT CGGTCTGCAA AAAAGAGGGT ATGCTCCTTC GGATATTGCT
ATTCTCGTGC GCAAAACTTA TCAGGCTCGC GAGATAGCTC GTGCCATGCT CTCTTATCAA
CCCGAACCGG ATGAAGAGGA CTATCCCCTT ATCCCGATGT CGGACGAATC TTTGTCGCTG
AGTGGAGCAG CATCAATTCG TTTTCTATCG AACCTACTCA AGTTTATATC CCGCCCCCAA
TCCGATGCTT TGCGACAGAT CGCCTACCTG TCGTACGAAG AACTCCGCAA GGAAAAAGGC
TTAGCTCCTA CAGAGGAGGG CAACTTCAGT GCCGCAGAGC TTGCAGAATT TGCCAACCTT
CGTCGTCGCT CCTTATACGA ATTGGCAGAA GGATTGGTCT CTTTTTTCCA TTCATACCTA
CCTGAAAGAG AGATGCCCTA TCTAATTGCT TGGCTGGATT TAATCAATGA TTTCGGTCAC
GAACGATCTG CCGATCTGCA CTCTTTCCTG CAATGGTGGG ACGAAACGGG GCATGCCAAA
AGCCGTATTT CCTCCGCTCC CAACAGTCAG GCCGTCACGC TAATGACCAT TCACAAGGCG
AAAGGTCTAG GCTTCCGCGT TGTTCTTATT CCTTTCTTGG ACTGGAATCT GGACGACGAA
GCAGCACATC GACACATCCT CTGGTGCAAG ATCGATCCTG CCCGCAGTCC TTTCAATATC
CTGCCCGTAG TGCCGATCAG ATATAAGAAG GAGATGGCAC AAACAATGTT TGCCACAGAT
TATTTCCGAG AAAGAGCCGA TATTCTCCTT GACAACCTCA ATCTTCTTTA TGTGGCTACG
ACACGAGCGA AGGACGAGAT GCATCTGTGG TTACATCCTT CCCAAAAGCC CGAATCGCTC
TCTACCGTGG GAGATCTTAT ACATTTAGCT CTTGCTTCCT TGGACGAGAA TAAGACAGAG
TCCGATATGT ACTGCTGGGG AAGTCCGGTC ATATCCGCGA GACAAAAAAC GACAGATCAC
CCCTCTTCTG GCTTACCTTT TACCCTGCCC AAAGGAAGTC TCTCGGCCGT AGCCGACCGC
TTGGCTATCC GGCCTGAAGG CAGTGAATTT TATCGCAGGC ACAAACCGCT TTACCACGGA
CATGTAATGC ACAGAATCCT TGCCGACATC GTTCTCGCCA AGGACATCGA GCCTGCTCTT
GAGCGTTATG TGTCGGGAGG AATCGTCACG AGAGATGAGG CGATCGAATT GGTCGATCGC
CTATCTGTTG TCACTTCGGA CAGTCGATTG TCTCGTTGGT TCGATGGTAG CGGACGAGTT
CTTAATGAAC AGGATATTCT CCTCCCGGAA GGGGAACAGC GTCGTCCCGA TCGGATCATT
CTCTACGACG ATCACACAGA CATCGTAGAC TATAAATTCG GAGCAGTTCG CAAAGTTCAT
CATGCACAGA TGGAGAATTA TATCCGGCTG CTCGTCTCGA TGGGCTATCC ATCCGTCCGC
GGCTACCTAT GGTACCTACC GAACAATGAG ATAGTTGGCG TACGATCTGT TAGAGGAATA
ACGCGCAGCA AGGGGGAGGA AAGGAAGGAA GGAAAAGGAG AAAGAGGAAT AGAGAGAAAA
GCAAAATGA
 
Protein sequence
MKRSKQLRIY TASAGSGKTH TLTGEYLRLA LRTRGAFRYI QAVTFTNKAT AEMKERILEE 
LYSLAVGGSS PFAEELMQEL ALTTEQLQVR AQEVLTEILN DYSSLRVKTI DSFFQEVMRA
FSHELGLPGG FRIEMEQKAV LEQAVVRLLH SLGEKDTSDV ENWIRRLAED LIEEGRGHNI
RREIVSLGDE LFKEQLLLLS EEGKLPTKAA IHRYQTEMNK LMEGFEQRRL SIARRAEEIV
ATAGISFYDF KGGTKGGILE FAKVLKGGEV KPPTKTFMAM AEGDPETTLY AKTTPATTQA
AILSAYQSGL KECLTEMATL YLGREWQEYS TAKQSLPFLN RLGIISDLWR QIEGIRQEEN
KMLISDAPSL LHRIIDGSET PFVYDKIGVR IEHEMIDEFQ DTSRLQYENF KPLLSESLAH
GKYNLLVGDA KQSIYRFRNA DRRLLTEVVS RDFAETSERV NLPYNWRSTP EIIEFNNSLY
KHLPQILCEA MTREAETMAM PDPKLPEEIN RTFMQTYADV EQLVPPAKVD RHGSVCIYLP
SPASSEEAEA NLSWEEQILQ DLPRFIIGLQ KRGYAPSDIA ILVRKTYQAR EIARAMLSYQ
PEPDEEDYPL IPMSDESLSL SGAASIRFLS NLLKFISRPQ SDALRQIAYL SYEELRKEKG
LAPTEEGNFS AAELAEFANL RRRSLYELAE GLVSFFHSYL PEREMPYLIA WLDLINDFGH
ERSADLHSFL QWWDETGHAK SRISSAPNSQ AVTLMTIHKA KGLGFRVVLI PFLDWNLDDE
AAHRHILWCK IDPARSPFNI LPVVPIRYKK EMAQTMFATD YFRERADILL DNLNLLYVAT
TRAKDEMHLW LHPSQKPESL STVGDLIHLA LASLDENKTE SDMYCWGSPV ISARQKTTDH
PSSGLPFTLP KGSLSAVADR LAIRPEGSEF YRRHKPLYHG HVMHRILADI VLAKDIEPAL
ERYVSGGIVT RDEAIELVDR LSVVTSDSRL SRWFDGSGRV LNEQDILLPE GEQRRPDRII
LYDDHTDIVD YKFGAVRKVH HAQMENYIRL LVSMGYPSVR GYLWYLPNNE IVGVRSVRGI
TRSKGEERKE GKGERGIERK AK
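
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It uses only the Python standard library. The function names are hypothetical, and the codon map is the standard genetic code: translation table 11 assigns the same amino acids and differs only in which codons may start translation, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, listed in TCAG order.
# Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAACGCT CCAAGCAACT CCGCATTTAT ACAGCTTCGG CGGGTTCCGG CAAGACTCAT"
print(translate(first_line))   # MKRSKQLRIYTASAGSGKTH
print(translate("GCAAAATGA"))  # AK
```

The first call reproduces the first twenty residues of the protein sequence, and the second reproduces its final two residues followed by the TGA stop. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.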