Gene PG2016 details


Gene Information

Locus tag: PG2016
Symbol: cas3
ID: 2553085
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Porphyromonas gingivalis W83
Kingdom: Bacteria
Replicon accession: NC_002950
Strand:
Start bp: 2106124
End bp: 2108460
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 38%
IMG OID: 637150594
Product: CRISPR-associated helicase Cas3
Protein accession: NP_906084
Protein GI: 34541605
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3
            [TIGR01596] CRISPR-associated endonuclease Cas3-HD
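
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2106124
end_bp = 2108460
gene_length_bp = 2337
protein_length_aa = 778

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons,
# and the last codon is a stop codon that is not part of the protein.
codons, remainder = divmod(span, 3)
expected_protein = codons - 1

print(span, remainder, expected_protein)
```

Under these assumptions the span, the reported gene length, and the reported protein length agree with one another.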


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.580429
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGAACA ACAAGATGAT CAATAAAGAA TCGCCGATTT TAGCAAAGCC CTCCGGGATT 
ACGTTGGACG TTCATGCCGA AAACGTAATT CAAGAGGGCG CATTGCTACT AAAAGGTCTT
GGTTGCACAA AAGAAAAATA CTTCCAAACC ACTGCCAAAC AGATTGTGCA GCGAGTGGAG
TTAGCTTGTA AATACCATGA CATAGGAAAA AGAAATAAGA TATGGCAAGA TGCTTGTAAA
AAAGACTATA AGGCATATCT TATTTGGAAG AAAATTCATC CAAATAAATC TTTTGAAGAT
TACACTATAG AATGTAGAGA AGAGGCCGGT AAATTTTTAC GACAAACACA TGTGCGACAT
GAATTTTATT CAATAAAAGG GCTTCTCAAT AACAACCCTC CACCCAATAA CAACCCTCCA
TACTGGCTGA AGATTGCGAT CGCTGCACAT CATCGTAAGC TAAGCATGAA GCATGAAGAA
CGATGGCTAA GCAGTGATGA AGATATTTGC TCTGTGTGGA AGGATATCAG AAGATTGTCA
AATTATTTCT TTGATGATCG GAATAGCGTT CTGTCCGATA TTGCAATCAA GCAATATGAA
TTTGCAGGAC CACGAAGCTT ACTTCAATTA GCGGATCACA GAGCGAGTGC AAAGGAAGAG
TCAAAATGCA TCCCGGATCC GATTGCTTTT TCTTATACCT TTCCTGAAAA GTGGGAAAAA
AGACCTGTTC AAAAGCTGGT AGAAAAGCAT TGGAGAGACG AACTTCTTTT AGTACGAGCA
CCCACTGGTG CTGGAAAAAC GGATGCAGCA CTCTTGTGGG CTTCTCATCA AATCAAACAC
AGAAAAGCAG ACCGCTTGAT TATTGCTATG CCCACTCGTT TTACAACAAA TGCACTAGCT
CTAAGCGTAT CTTCGACCCT TTCTTCTACA GGCCTGTATC ATTCCAGTGC ATGGACACAG
AATTTCAGTA GCAAAATAGA CAACGGGGAA ATAGCTTTAG AGCAGGCAAG ATACTATCAT
AACCAAGCTC GCTTACTACA GACCCCTATA ACGGTATGCA CCATAGATCA TCTCCTCTCT
TCATTGACTC TTTGTGACGA GGAACATCAG ACCATTACTT TTGCACTGGC AAATGCATGT
CTCGTTATTG ATGAGGCAGA TTTCTATGAT CAATTTACAC AAGCTAATAT ATTAGTATTA
CTAGAGGTTC TAAAATATTG GAAAGTGCCG ATATTGCTGA TGAGTGCCTC ATTGCCAGAC
AATATAGTAC AAGAATATCG AAAGATTGGT TATGATATTA AACACATACT TGAAGACACG
TCAGACATAG AGAGAGCACG TTTTTCTTTA ATAGAGAAAC GTGATGTTTC AGAAATAAGT
CAGATGCAAG ACTTATTACA ATTATGCCTT GATAGAGGTA GTGCGATTAT TTTCGCAAAT
ACTGTCGATA GAGCACAAGC ATACTATCAA TGGTTTTTGG AGAACGGTCA TGAAGAAGAA
GTAATACTTT ATCATAGTAG GTATACTGAA ACAGATAAAA TTGAAAAGGA AGCACAATTG
CTTCTTAGAT TAGGGAGAAG CGCTTGGGAT AATGATCGAG CAAATGGGGT TGCCATATTG
ACACAGATTG GAGAAATGAG TGTGAATATC AGTGCAGATA TAATGATTTC TGATCTTTGT
CCTATTGATC GATTGACCCA AAGATGCGGT CGTTTGTGTC GGTTTTCGAG AAAGATTGGA
GACTTATATA TAATCAGGCC TATGCGCGAG GGAAATTTAT TTCCAGCCCC TTATTATCAA
AATGTCGGCC CATCACAAAA ATTAGAGCCA AATGATGCAT TAATCCAAAC AGATGAACTA
TTGGAAAAAG GGATGTATTC TGCAGCTAAT TTAGTTCAAT TGCTAAACAA AGTATATGCT
AATCAGACAG TATTCTCCAC AGAAGCGAGC CGTAATGCGG ATCGCTTGAG AGAGATGTTC
AAAGCGAATT GGTTGATAAA TCCAATAGAG TACCTTGATG AAGAAATAGG TTCTACAAAT
TTTTGGCGAG CTCGAAACAT AATGCCACGA TGTAGAATAT ACATAAAAAG GCCATTACAA
TCTTTTCGGA ATTACTCGGA TTTTAGAAGA TGGGAAACAG ATGTAGCGAT AGATATCCCT
ATTTATGTCA TTAAAAAAGC TATGCGTATG AGCGATACCG TTTGCTCTCA AGTAAAAGTG
TGTATTGGCA AAAATGATTA TACATTGATT GTAATAAATG CCAAATGCTA CAATAATGTT
AGAGGACTGT TTTTCTCTGA ATCAGTCCTC GAAGAACCTA ACTTTGAGTT CTTGTAA
 
Protein sequence
MQNNKMINKE SPILAKPSGI TLDVHAENVI QEGALLLKGL GCTKEKYFQT TAKQIVQRVE 
LACKYHDIGK RNKIWQDACK KDYKAYLIWK KIHPNKSFED YTIECREEAG KFLRQTHVRH
EFYSIKGLLN NNPPPNNNPP YWLKIAIAAH HRKLSMKHEE RWLSSDEDIC SVWKDIRRLS
NYFFDDRNSV LSDIAIKQYE FAGPRSLLQL ADHRASAKEE SKCIPDPIAF SYTFPEKWEK
RPVQKLVEKH WRDELLLVRA PTGAGKTDAA LLWASHQIKH RKADRLIIAM PTRFTTNALA
LSVSSTLSST GLYHSSAWTQ NFSSKIDNGE IALEQARYYH NQARLLQTPI TVCTIDHLLS
SLTLCDEEHQ TITFALANAC LVIDEADFYD QFTQANILVL LEVLKYWKVP ILLMSASLPD
NIVQEYRKIG YDIKHILEDT SDIERARFSL IEKRDVSEIS QMQDLLQLCL DRGSAIIFAN
TVDRAQAYYQ WFLENGHEEE VILYHSRYTE TDKIEKEAQL LLRLGRSAWD NDRANGVAIL
TQIGEMSVNI SADIMISDLC PIDRLTQRCG RLCRFSRKIG DLYIIRPMRE GNLFPAPYYQ
NVGPSQKLEP NDALIQTDEL LEKGMYSAAN LVQLLNKVYA NQTVFSTEAS RNADRLREMF
KANWLINPIE YLDEEIGSTN FWRARNIMPR CRIYIKRPLQ SFRNYSDFRR WETDVAIDIP
IYVIKKAMRM SDTVCSQVKV CIGKNDYTLI VINAKCYNNV RGLFFSESVL EEPNFEFL
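
The gene sequence begins with TTG while the protein begins with M, which is consistent with translation table 11, where TTG is one of the permitted initiation codons. The sketch below illustrates how the first block of the gene sequence maps to the first block of the protein. It is a hedged illustration rather than the method used to produce this record: `translate_cds` is a hypothetical helper, and it simply assumes that the first codon is a valid initiator and reads it as Met without checking.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
# Table 11 assigns the same amino acids to codons as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the first codon as Met (assumption)."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    protein = ["M"]  # assumption: the first codon is a valid initiator
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from this record.
first_block = "TTGCAGAACA ACAAGATGAT CAATAAAGAA"
print(translate_cds(first_block))  # MQNNKMINKE
```

Note that TTG is read as Met only at the initiation position. The internal TTG just before the terminal TAA is read as Leu, which matches the final L of the protein sequence.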