Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG1982 |
Symbol | |
ID | 2552124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | - |
Start bp | 2069615 |
End bp | 2072710 |
Gene Length | 3096 bp |
Protein Length | 1031 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637150566 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | NP_906056 |
Protein GI | 34541577 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.675546 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACTCT CCACAGATTC GTTGCCGATC TTTTTATCGG ACTTCACCGC CTACCATTTT AGCGTTCGTT TCAGAGCCGA ACGTGCTATC GCTTTCGAAC GCAAATGGTA TTTCATGCCT CGGTTTGCAT TGGGCAATGC CCTCAAAAAT AGTGAGCAAT ACGCCTACCT CTATGGGCAG ATATTCAAGC CACAAGAGGA AGACACGGAC GAAAGCAAGG GACCGGGCAA CACCTCGCGC CTGATCATTC GAGCCGACAA ACCCTCTCGG AAGTCTCTGG AGGCAGGCGA AGCCATGGAT CTGTATATTA CCGTCGTAAC GAGAGATCCT CTGCTGGTCG GAGACTTCCT CTCTTTCCTT CCGGAATGGC AAGCGTACAA CTTCTTTCGG GAAAATGATC TCACATACGA CTCCTACCGC TTGTACAATC CCACGACACA GAAATACGAG TCCGGGCTAA GGGTAGAGGA TGCGGCACTT ACGGTAGACT TCTTTTCTCG GCAAGCAATC CGTTGGGGTG AAATCCTCTC TGTACGCTTT CTGTCTCCGG CCAGTATCAA GGTAGATCAA ATACTGTCCG CCGAAATACC CTACTCGCGT CTCATGAATA GACTCTCACG GCGTCTGTAT GAGCTATATA CGCAATACCT GAGCAGGGGA GAGACATCTG TGGAGCGATA TATTTTCCCC GACCATGATG GGCTGATTTA CTCCCAGATA AGTATGCCAC GCAAGGCGAC TATCAAGGAA AACAGACAGT ACGACATGTC CGGCATATTG GGTCAGCTCT TCTATCGTGT TCCCTACGAT CCCGTAGCAG CCCTTATGCT CTCAATGGCG CATTGGGTGC ATATAGGCAA TCATACAATC GTGGGGAATG GTCAGATAGA GAGCACTCCG GGCAACGATA CCTTGTATCG CAAGTGGCTC TCTTCGCTGG CTGCGGATCG TGACCTTCCC ATGGCAGAAG CGGAGCGACA AGATTTGTTG GAAGCCCTAC GTATTTGCAG CTATATACCG CAGCCCTATC ACTCTGTGAA CATACCCAAG GGGGATGGCT CTTACCGACA GCTACATATC CCTTCGGCAG TGGATCTCCA CTTGCAGAGA AGCCTTGCCG GCATACTCTA TCCGATCACC GAGTCTTTAT CCATTGCACA GAGCTATGCC TATCGCAAGG GGAAGGGAGC TGTGGCTGCC GTTCGTAGAG TGCAGCATCT GCTGGATTCT CTGGATGAGA ACCATACAGT GGTGCGCTGC GATATTGACA ACTTCTTCGA TTCTATCCCT GTCCCCTCTC TATTGCAGAA AGTCCAAAGA ACAACCGAAG ATCCCTTCCT TACCCGTATG TTGAGCCTCT GGATGAAGTC CGGAGTCGTG GACCGGAAGC AGCAATATGC TCGTGCATCC TCCGGCATCC CACAGGGAAG CCCTCTGGCC CCTCTGCTTT CCAACCTCTA CTTGGAAGAC ACGGATCGCT ACATCGCCGG GCATATAACA ACCGAATTTA TCCGCTATGC CGACGACCTG CTACTCTTCT TGCCGGAGAA AGTAGATCCG CTAAATGCTC TCCAAGACCT GAGCGAACAT TTGAAATACC GCAAAGGGCT GAAGCTGAAT AGGGACTTTG TGGTATCGAG CATAAAAAGC TCTTTCAGTT TCCTCGGTAT TACGTTCTGT GCAGATGGTT CAAGGAGCAT GAGCCGGGAC AAGAAAGAGG GACTCAAACG CAAGATCACA CTCGCGCTCC ATCGCGACAC CGAAAACTTC TCAGCTCTCT CGGAGACAAT CCATGGCATG GAGCAGTACT ACCGCAAACT GCTCGAAAAG GTGGACATAG AGGCTATTGA CGAAGTTGCC GCCACTGTCT ATGCCACTCA CATCGCATCA TTACCGACCT CCGAGGCTCG GAAAAGCGCC AAGGACAATC TTCTACGGCT GGGCTTTCTC TCTTCGGAAA CGGCCAAACA GACTCTTCGA GAGGCTATGC GGCAGACAGT GGTCTCAAGT GCAGACAATT TCCCGATAAA GAAAGAGAGC GAAATCCTGC GTGAGCAGCA AAAAAGACAA TTGCAAGAGA GGGGAGAGAT ATTCGATTTG GTAGTTACCG AGCCAGGAGC TTTCATCGGT ATCAGTCGCA ACCATGTGCT CGTAAGGAAG TACGGGAAGA CAATCTGCAA ACAACCTGCT GCACAGATAG AGCAGATCAG CATCATCAGT GACGGTGTCT CTCTCAGCAG TAATGTCACC AAATACTGTA GGAAGAAGAA TATCAGAGTC ATATTCTACA ATGCAACGGG CCAGGCCTAT GCCTCTCTCA ACGGCATGAA TACCATCTTG CCTTCCGTGA TGGAGGCCCA GATGCGCCTG AGCGAAGAAA AAAAGCGAGA GTTCATCCTT ACTCTTATCA AGAACAAGGT TCGCAATCAG GGCAAACTCC TGCGCTACTA CCATAAGTAT TATCGCCATG ACAAAGAGCT GAAAGAGCCT CTCTCCAACG CTATCGCCGA GCTAAAGCAG CTGGAAGGTA TACCCATAGC GGAGGGCAGC TCCCTTGCGG ACTTCCGACA GCATGCCATG CTTCATGAAG CCCGCTGCGC ACAAGTCTAT TGGAGGGCAT TTGCCCTGCT GGTGCATCGC TCCGGGCATG AATTTGAAGG GAGAGAGCAT AAGGGAGCCG AGGGATTGGT CAATCAGATG CTAAACTATG GCTACGCTAT CCTCCGGAGC TATGTGATGA AGACAATAGT CCTTTGGCAG CTCAACCCGA ATATCGGTAT CCTGCACAGC ACGCAAGACA ATAAACCGGC CCTGTGTTTC GATCTCATGG AGCAATACCG GGCCTTTGTC GTCGATAGGA GCATCCTGGC CCTACTGGCC AAGGGGGAAG ATGTGGGGCA GAATAGCAAA GGGCTGTTGG ATATGCCTAC ACGCAGTCGC ATTATATCGA AGATAAACGA ACGCTGGTTT GCTACCGAAT ACTATCGATC CGGGGAAAAG CTTTTCTCCG ATATTATGAA GTTGCAAACC AAAGATGTGA GTGCTTTTTG CTGTGGCAAG GTGAAGCGCA TCAAATTCTA TACCCCTAAA TGGTAG
|
Protein sequence | MPLSTDSLPI FLSDFTAYHF SVRFRAERAI AFERKWYFMP RFALGNALKN SEQYAYLYGQ IFKPQEEDTD ESKGPGNTSR LIIRADKPSR KSLEAGEAMD LYITVVTRDP LLVGDFLSFL PEWQAYNFFR ENDLTYDSYR LYNPTTQKYE SGLRVEDAAL TVDFFSRQAI RWGEILSVRF LSPASIKVDQ ILSAEIPYSR LMNRLSRRLY ELYTQYLSRG ETSVERYIFP DHDGLIYSQI SMPRKATIKE NRQYDMSGIL GQLFYRVPYD PVAALMLSMA HWVHIGNHTI VGNGQIESTP GNDTLYRKWL SSLAADRDLP MAEAERQDLL EALRICSYIP QPYHSVNIPK GDGSYRQLHI PSAVDLHLQR SLAGILYPIT ESLSIAQSYA YRKGKGAVAA VRRVQHLLDS LDENHTVVRC DIDNFFDSIP VPSLLQKVQR TTEDPFLTRM LSLWMKSGVV DRKQQYARAS SGIPQGSPLA PLLSNLYLED TDRYIAGHIT TEFIRYADDL LLFLPEKVDP LNALQDLSEH LKYRKGLKLN RDFVVSSIKS SFSFLGITFC ADGSRSMSRD KKEGLKRKIT LALHRDTENF SALSETIHGM EQYYRKLLEK VDIEAIDEVA ATVYATHIAS LPTSEARKSA KDNLLRLGFL SSETAKQTLR EAMRQTVVSS ADNFPIKKES EILREQQKRQ LQERGEIFDL VVTEPGAFIG ISRNHVLVRK YGKTICKQPA AQIEQISIIS DGVSLSSNVT KYCRKKNIRV IFYNATGQAY ASLNGMNTIL PSVMEAQMRL SEEKKREFIL TLIKNKVRNQ GKLLRYYHKY YRHDKELKEP LSNAIAELKQ LEGIPIAEGS SLADFRQHAM LHEARCAQVY WRAFALLVHR SGHEFEGREH KGAEGLVNQM LNYGYAILRS YVMKTIVLWQ LNPNIGILHS TQDNKPALCF DLMEQYRAFV VDRSILALLA KGEDVGQNSK GLLDMPTRSR IISKINERWF ATEYYRSGEK LFSDIMKLQT KDVSAFCCGK VKRIKFYTPK W
|
| |