Gene PG1982 details


Gene Information

Locus tag: PG1982
Symbol:
ID: 2552124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2069615
End bp: 2072710
Gene Length: 3096 bp
Protein Length: 1031 aa
Translation table: 11
GC content: 51%
IMG OID: 637150566
Product: CRISPR-associated Cas1 family protein
Protein accession: NP_906056
Protein GI: 34541577
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
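
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(2069615, 2072710)
print(length)                              # 3096
print(expected_protein_length_aa(length))  # 1031
```

Under these assumptions the computed values agree with the listed Gene Length (3096 bp) and Protein Length (1031 aa).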


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.675546
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACTCT CCACAGATTC GTTGCCGATC TTTTTATCGG ACTTCACCGC CTACCATTTT 
AGCGTTCGTT TCAGAGCCGA ACGTGCTATC GCTTTCGAAC GCAAATGGTA TTTCATGCCT
CGGTTTGCAT TGGGCAATGC CCTCAAAAAT AGTGAGCAAT ACGCCTACCT CTATGGGCAG
ATATTCAAGC CACAAGAGGA AGACACGGAC GAAAGCAAGG GACCGGGCAA CACCTCGCGC
CTGATCATTC GAGCCGACAA ACCCTCTCGG AAGTCTCTGG AGGCAGGCGA AGCCATGGAT
CTGTATATTA CCGTCGTAAC GAGAGATCCT CTGCTGGTCG GAGACTTCCT CTCTTTCCTT
CCGGAATGGC AAGCGTACAA CTTCTTTCGG GAAAATGATC TCACATACGA CTCCTACCGC
TTGTACAATC CCACGACACA GAAATACGAG TCCGGGCTAA GGGTAGAGGA TGCGGCACTT
ACGGTAGACT TCTTTTCTCG GCAAGCAATC CGTTGGGGTG AAATCCTCTC TGTACGCTTT
CTGTCTCCGG CCAGTATCAA GGTAGATCAA ATACTGTCCG CCGAAATACC CTACTCGCGT
CTCATGAATA GACTCTCACG GCGTCTGTAT GAGCTATATA CGCAATACCT GAGCAGGGGA
GAGACATCTG TGGAGCGATA TATTTTCCCC GACCATGATG GGCTGATTTA CTCCCAGATA
AGTATGCCAC GCAAGGCGAC TATCAAGGAA AACAGACAGT ACGACATGTC CGGCATATTG
GGTCAGCTCT TCTATCGTGT TCCCTACGAT CCCGTAGCAG CCCTTATGCT CTCAATGGCG
CATTGGGTGC ATATAGGCAA TCATACAATC GTGGGGAATG GTCAGATAGA GAGCACTCCG
GGCAACGATA CCTTGTATCG CAAGTGGCTC TCTTCGCTGG CTGCGGATCG TGACCTTCCC
ATGGCAGAAG CGGAGCGACA AGATTTGTTG GAAGCCCTAC GTATTTGCAG CTATATACCG
CAGCCCTATC ACTCTGTGAA CATACCCAAG GGGGATGGCT CTTACCGACA GCTACATATC
CCTTCGGCAG TGGATCTCCA CTTGCAGAGA AGCCTTGCCG GCATACTCTA TCCGATCACC
GAGTCTTTAT CCATTGCACA GAGCTATGCC TATCGCAAGG GGAAGGGAGC TGTGGCTGCC
GTTCGTAGAG TGCAGCATCT GCTGGATTCT CTGGATGAGA ACCATACAGT GGTGCGCTGC
GATATTGACA ACTTCTTCGA TTCTATCCCT GTCCCCTCTC TATTGCAGAA AGTCCAAAGA
ACAACCGAAG ATCCCTTCCT TACCCGTATG TTGAGCCTCT GGATGAAGTC CGGAGTCGTG
GACCGGAAGC AGCAATATGC TCGTGCATCC TCCGGCATCC CACAGGGAAG CCCTCTGGCC
CCTCTGCTTT CCAACCTCTA CTTGGAAGAC ACGGATCGCT ACATCGCCGG GCATATAACA
ACCGAATTTA TCCGCTATGC CGACGACCTG CTACTCTTCT TGCCGGAGAA AGTAGATCCG
CTAAATGCTC TCCAAGACCT GAGCGAACAT TTGAAATACC GCAAAGGGCT GAAGCTGAAT
AGGGACTTTG TGGTATCGAG CATAAAAAGC TCTTTCAGTT TCCTCGGTAT TACGTTCTGT
GCAGATGGTT CAAGGAGCAT GAGCCGGGAC AAGAAAGAGG GACTCAAACG CAAGATCACA
CTCGCGCTCC ATCGCGACAC CGAAAACTTC TCAGCTCTCT CGGAGACAAT CCATGGCATG
GAGCAGTACT ACCGCAAACT GCTCGAAAAG GTGGACATAG AGGCTATTGA CGAAGTTGCC
GCCACTGTCT ATGCCACTCA CATCGCATCA TTACCGACCT CCGAGGCTCG GAAAAGCGCC
AAGGACAATC TTCTACGGCT GGGCTTTCTC TCTTCGGAAA CGGCCAAACA GACTCTTCGA
GAGGCTATGC GGCAGACAGT GGTCTCAAGT GCAGACAATT TCCCGATAAA GAAAGAGAGC
GAAATCCTGC GTGAGCAGCA AAAAAGACAA TTGCAAGAGA GGGGAGAGAT ATTCGATTTG
GTAGTTACCG AGCCAGGAGC TTTCATCGGT ATCAGTCGCA ACCATGTGCT CGTAAGGAAG
TACGGGAAGA CAATCTGCAA ACAACCTGCT GCACAGATAG AGCAGATCAG CATCATCAGT
GACGGTGTCT CTCTCAGCAG TAATGTCACC AAATACTGTA GGAAGAAGAA TATCAGAGTC
ATATTCTACA ATGCAACGGG CCAGGCCTAT GCCTCTCTCA ACGGCATGAA TACCATCTTG
CCTTCCGTGA TGGAGGCCCA GATGCGCCTG AGCGAAGAAA AAAAGCGAGA GTTCATCCTT
ACTCTTATCA AGAACAAGGT TCGCAATCAG GGCAAACTCC TGCGCTACTA CCATAAGTAT
TATCGCCATG ACAAAGAGCT GAAAGAGCCT CTCTCCAACG CTATCGCCGA GCTAAAGCAG
CTGGAAGGTA TACCCATAGC GGAGGGCAGC TCCCTTGCGG ACTTCCGACA GCATGCCATG
CTTCATGAAG CCCGCTGCGC ACAAGTCTAT TGGAGGGCAT TTGCCCTGCT GGTGCATCGC
TCCGGGCATG AATTTGAAGG GAGAGAGCAT AAGGGAGCCG AGGGATTGGT CAATCAGATG
CTAAACTATG GCTACGCTAT CCTCCGGAGC TATGTGATGA AGACAATAGT CCTTTGGCAG
CTCAACCCGA ATATCGGTAT CCTGCACAGC ACGCAAGACA ATAAACCGGC CCTGTGTTTC
GATCTCATGG AGCAATACCG GGCCTTTGTC GTCGATAGGA GCATCCTGGC CCTACTGGCC
AAGGGGGAAG ATGTGGGGCA GAATAGCAAA GGGCTGTTGG ATATGCCTAC ACGCAGTCGC
ATTATATCGA AGATAAACGA ACGCTGGTTT GCTACCGAAT ACTATCGATC CGGGGAAAAG
CTTTTCTCCG ATATTATGAA GTTGCAAACC AAAGATGTGA GTGCTTTTTG CTGTGGCAAG
GTGAAGCGCA TCAAATTCTA TACCCCTAAA TGGTAG
 
Protein sequence
MPLSTDSLPI FLSDFTAYHF SVRFRAERAI AFERKWYFMP RFALGNALKN SEQYAYLYGQ 
IFKPQEEDTD ESKGPGNTSR LIIRADKPSR KSLEAGEAMD LYITVVTRDP LLVGDFLSFL
PEWQAYNFFR ENDLTYDSYR LYNPTTQKYE SGLRVEDAAL TVDFFSRQAI RWGEILSVRF
LSPASIKVDQ ILSAEIPYSR LMNRLSRRLY ELYTQYLSRG ETSVERYIFP DHDGLIYSQI
SMPRKATIKE NRQYDMSGIL GQLFYRVPYD PVAALMLSMA HWVHIGNHTI VGNGQIESTP
GNDTLYRKWL SSLAADRDLP MAEAERQDLL EALRICSYIP QPYHSVNIPK GDGSYRQLHI
PSAVDLHLQR SLAGILYPIT ESLSIAQSYA YRKGKGAVAA VRRVQHLLDS LDENHTVVRC
DIDNFFDSIP VPSLLQKVQR TTEDPFLTRM LSLWMKSGVV DRKQQYARAS SGIPQGSPLA
PLLSNLYLED TDRYIAGHIT TEFIRYADDL LLFLPEKVDP LNALQDLSEH LKYRKGLKLN
RDFVVSSIKS SFSFLGITFC ADGSRSMSRD KKEGLKRKIT LALHRDTENF SALSETIHGM
EQYYRKLLEK VDIEAIDEVA ATVYATHIAS LPTSEARKSA KDNLLRLGFL SSETAKQTLR
EAMRQTVVSS ADNFPIKKES EILREQQKRQ LQERGEIFDL VVTEPGAFIG ISRNHVLVRK
YGKTICKQPA AQIEQISIIS DGVSLSSNVT KYCRKKNIRV IFYNATGQAY ASLNGMNTIL
PSVMEAQMRL SEEKKREFIL TLIKNKVRNQ GKLLRYYHKY YRHDKELKEP LSNAIAELKQ
LEGIPIAEGS SLADFRQHAM LHEARCAQVY WRAFALLVHR SGHEFEGREH KGAEGLVNQM
LNYGYAILRS YVMKTIVLWQ LNPNIGILHS TQDNKPALCF DLMEQYRAFV VDRSILALLA
KGEDVGQNSK GLLDMPTRSR IISKINERWF ATEYYRSGEK LFSDIMKLQT KDVSAFCCGK
VKRIKFYTPK W
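
Both sequences above are laid out in space-separated blocks of 10 residues, 60 per row. A minimal sketch of how such text could be joined back into a continuous string and its GC fraction estimated is shown below. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the sketch assumes the pasted text contains only sequence letters and whitespace.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already-joined nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Final row of the gene sequence from this record, used as a small sample.
last_row = "GTGAAGCGCA TCAAATTCTA TACCCCTAAA TGGTAG"
tail = join_blocks(last_row)
print(len(tail))             # 36
print(tail.endswith("TAG"))  # True
```

Applied to the full gene sequence, `join_blocks` should yield a 3096-character string if the text was copied intact, and `gc_fraction` of that string can be compared against the GC content listed in the Gene Information section.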