Gene PG1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPG1036 
SymboluvrA-1 
ID2553083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePorphyromonas gingivalis W83 
KingdomBacteria 
Replicon accessionNC_002950 
Strand
Start bp1098576 
End bp1101479 
Gene Length2904 bp 
Protein Length967 aa 
Translation table11 
GC content47% 
IMG OID637149741 
Productexcinuclease ABC, A subunit 
Protein accessionNP_905256 
Protein GI34540777 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000712855 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCACGATA CAGTTATAAA CGTAAAAGAC AAGATAGCTC GACACGATGC TGTCGAAGTC 
TATGGAGCAC GGGTGCACAA TCTGAAGAAT ATCGATGTGT GCATTCCGCG TCATAGCCTT
ACAGTCATCA CTGGAATGAG TGGTAGCGGA AAGTCGTCAT TGGCTTTCGA TACCCTTTTT
GCCGAAGGAC AGCGACGGTA CATAGAAACT TTTTCGGCTT ATGCGCGGAA TTTCCTCGGT
GGTGGTATGG AGCGCCCCGA CGTGGACGAG ATAAAAGGGC TGAGTCCGGT CATTTCCATA
GAACAGAAGA CGACCAATCG GAATCCGAGA TCGACTGTCG GTACCGTTAC AGAGGTGTAC
GACTTCTTTC GTTTGCTCTA TGCTCGTGTT GCTGAGGCTT ATTCTTATGC CGGAGGAAAG
AAGATGGTGA AATACACCGA GGAGCAAATC TTCCACCTCA TTTTGGAAAA GTATGATGGC
CGTAAGATAG CTTTGTTAGC CCCGGTGGTT CGCTCGCGTA AGGGGCATTA TAAGGAGCTA
TTCGAAAGGC TTCGTAAGAA AGGCTATCTG CACGTACGCG TGGATGGTGA GATTCGCGAG
ATTTTGCATG GGATGAAGCT TGATCGCTAT AAAATGCATG ACGTCGAAGT GCAGATCGAC
AAGCTGTGCG TAGCATCGAA TCAAAGCAAA CGTATAGCGG AGAGCTTGGC ACTGGCTATG
CGTGAAGGCG AAGGGCTTGT GATGGTGTTG GATTCGGAAA AAAACGAAGT GGGACATTTT
AGTCGGATGA TGATGTGTCC TGATACGGGA ATTTCGTATA GCGATCCGGC TCCGCACAAT
TTCAGTTTCA ATTCCCCTCA TGGCTATTGT CCGCGCTGTA AGGGCTTGGG CGAAGTGAAT
CTGCCGGATA TGGACAAGAT CATTCCATCT CGTGAGAAGA GTATATACGA AGGTGGCATT
GAGCCATTGG GCAAATACAA AAACAATCTT TTCTTTTGGC AAATAGAGGC ATTGTGCGAG
AAGTACGATG TGACGATCAA AACGCCTTTG CGCGATTTGC CGGAAGAGTT GATCGAGGAT
ATCCTATATG GTACAGACGA ACTACTGACC ATAAACAACA AAGCTTTGGG GCAGTCGAGG
TATGCTCTCA GCTTCGATGG CGTGGCTAAG TATATATTGA TGCAGGCAGA AGAATCGGAT
GCTTCGGCTA CTGCGCAGAA ATGGGCAGAT CAGTTTCTCA AGACTACAAC TTGCCCCGAC
TGTGCCGGCA AGCGTCTGAA CAAAGAAGCT CTGTCCTATC GTTTGGCCGG CAAGGACATT
GCAGAAGTGA ATGCAATGGA CATCAAGACT CTTATTGAAT GGGTAGATTC TCTGGATGAA
CATCTGTTGG ATACGCAACG GGCTATTTCT ATAGAAATAC TGAAAGAGAT ACGGACACGC
TTGGGGTTTC TTAAAGATGT GGGGTTGGAG TACTTGACGA TGAACCGAGC TGCAGCATCC
TTATCCGGAG GAGAAAGCCA GCGTATTCGT TTGGCTACGC AGATCGGGAG CAAACTGGTT
GAGGTCTTGT ACATATTGGA TGAACCGAGT ATCGGTCTTC ACCAAAGGGA CAATCTTCGG
TTGATTCATT CGCTTCAGGA TTTGCGTGAT ATAGGCAATA CAGTGGTAGT GGTGGAGCAT
GATCAGGATA TGATGCTGCA CGCCGACTAT GTAATAGACT TAGGACCACG AGCCGGCAGA
CATGGTGGTG AAGTGGTGTT TGCAGGTAGT CCGGAAGAGA TGGTGCAGGC TAATACCCTG
ACTGCCGATT ATATAAGTGG ACGAAAGCGT ATAGAAAAGT CGATCGGCAG GAGAGATGGT
AGTGGGAAAA CGATAAAATT ATTTGGAGCC AAAGGCAATA ACCTGCAGAA TATAGACGTT
ATCTTTCCGC TGGGGGTATT TATATGTGTT ACCGGGGTCT CCGGTAGTGG GAAAAGTACA
TTGATCAATA AGACTCTATT CCCTGCTATC AGCCAAAAGC TGTACCGCTC TTTGCAGGAT
CCGATGCCCT ACGACCGCAT AGAGGGGTTA AAGCATATAG ATAAGATAAT AGCCGTAGAC
CAAAGTCCTA TCGGGCGGAC GCTTAGAAGC AATGCAGCCA CATATACGGG ATTGTTTACC
GATATTCGCG CTCTTTTTGT CGGTCTGCCC GAGAGTAAAG CTCGTGGATA TAAGCCGGGG
CGTTTTTCTT TCAACGTCAA AGGCGGACGC TGTGAGGTTT GTAAGGGTAA TGGCTACAAG
ACGATCGAAA TGAATTTTTT GCCCGATGTA TTTGCTCCGT GTGAGGGATG CCGGGGAAAG
CGATACAACA GGGAGACACT CGAAGTCCGT TACAAAGGAA AATCCATTGC GGATGTTTTG
GATATGACTA TCAATAAGGC TGTCGAATTT TTCGAACATG CACCTCATAT TTTATCCAAG
CTTTCCGTTT TACAGGAGGT CGGTTTGGGC TATATCAAAC TGGGACAACC ATCCTCGACT
TTGTCGGGAG GAGAGTGTCA ACGTGTTAAA TTAGCCACCG AGCTGAGTAA ACGTGACACG
GGAAATACAC TCTATGTATT AGATGAACCG ACTACGGGCT TACATTTCGA AGATGTACGC
GTTTTGTTGG GGATCCTTAA TCGACTGATA GAGAGAGGGA ATACGGTAAT AGTGATCGAA
CATAATTTGG ATGTGATCCG CTGTGCGGAT TATTTGATTG ATATAGGACC GGAGGGAGGA
GCAGGTGGCG GACAGTTGCT TTACCAAGGG AAAATGGAGG ATATAATAGA ATGCAAGAAT
AGTTATACAG CTCAATTTGT GAAAGCCGAA CTCGAAAAAG GCCGGATTGA TACTGTACAC
ATGAATTCTG CAGACAATAT ATAA
 
Protein sequence
MHDTVINVKD KIARHDAVEV YGARVHNLKN IDVCIPRHSL TVITGMSGSG KSSLAFDTLF 
AEGQRRYIET FSAYARNFLG GGMERPDVDE IKGLSPVISI EQKTTNRNPR STVGTVTEVY
DFFRLLYARV AEAYSYAGGK KMVKYTEEQI FHLILEKYDG RKIALLAPVV RSRKGHYKEL
FERLRKKGYL HVRVDGEIRE ILHGMKLDRY KMHDVEVQID KLCVASNQSK RIAESLALAM
REGEGLVMVL DSEKNEVGHF SRMMMCPDTG ISYSDPAPHN FSFNSPHGYC PRCKGLGEVN
LPDMDKIIPS REKSIYEGGI EPLGKYKNNL FFWQIEALCE KYDVTIKTPL RDLPEELIED
ILYGTDELLT INNKALGQSR YALSFDGVAK YILMQAEESD ASATAQKWAD QFLKTTTCPD
CAGKRLNKEA LSYRLAGKDI AEVNAMDIKT LIEWVDSLDE HLLDTQRAIS IEILKEIRTR
LGFLKDVGLE YLTMNRAAAS LSGGESQRIR LATQIGSKLV EVLYILDEPS IGLHQRDNLR
LIHSLQDLRD IGNTVVVVEH DQDMMLHADY VIDLGPRAGR HGGEVVFAGS PEEMVQANTL
TADYISGRKR IEKSIGRRDG SGKTIKLFGA KGNNLQNIDV IFPLGVFICV TGVSGSGKST
LINKTLFPAI SQKLYRSLQD PMPYDRIEGL KHIDKIIAVD QSPIGRTLRS NAATYTGLFT
DIRALFVGLP ESKARGYKPG RFSFNVKGGR CEVCKGNGYK TIEMNFLPDV FAPCEGCRGK
RYNRETLEVR YKGKSIADVL DMTINKAVEF FEHAPHILSK LSVLQEVGLG YIKLGQPSST
LSGGECQRVK LATELSKRDT GNTLYVLDEP TTGLHFEDVR VLLGILNRLI ERGNTVIVIE
HNLDVIRCAD YLIDIGPEGG AGGGQLLYQG KMEDIIECKN SYTAQFVKAE LEKGRIDTVH
MNSADNI