Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG2014 |
Symbol | cas1 |
ID | 2553099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | - |
Start bp | 2104579 |
End bp | 2105595 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 637150592 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | NP_906082 |
Protein GI | 34541603 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAGA CATATTATCT ATTTAACCCC GGCGAATTGT CCCGAAAAGA CAATACCATC CGCTTTGTCC CGATCCAAGA GGGAGAAAAC GGACAGGAAC AAGCAGGACA AGCCCGCTAC ATTCCCGTTG AAGGAATTAG CGATTTCTAT GTGTTTGGCT CTTTGAGGGC CAATAGCAGC TTATACAATT TCCTTGGGAG CAATGACATC GCAGTCCATT TTTTCGATTA TTACGAAAAT TACACCGGTT CTTTTATGCC TCGCGACTTC TTGCTATCAG GAAAAATGCT CCTGGCACAG GCTTCGGCAT ATAAGAATAA GAAGAAACGA TTGTTTCTGG CCAGAAAGTT TATTGAAGGT GCAGCCTCTA ATATGCAGAA AAACTTAGCC TACTACAACA ATCGGGGAAA AGATATGCAG CCGATGATGG AGCTAATCGA CAAGTATTCA TTGCGCCTGA AGGAAACGAC AACGATAGAG GCTCTTATGG GTATCGAAGG TAATATCCGG CAAGCATACT ATGATGCTTT TAACCTTATT ATCGATCCTT TCGAAATGGG GGTGCGGAGC AAACAGCCTC CTCAAAATGA GGTAAATGCA CTCATCTCTT TTGGGAATAT GATGTGCTAC ACCTTATGCC TGAAGGCAAT TCATCAAAGC CAACTGAATC CCACGATCAG CTTCCTACAT ACGCCCGGGG AGAGGCGATA CTCTCTATGC TTGGATATAT CGGAAGTGTT TAAGCCCATA TTGGTAGATC GCACAATCTT CAAAGTCATG AACAAGAGAA TAATACAGGC CAAACACTTC GACAAGCAAC TCAATAAATG TATCCTAAAT CCATCGGGAA AGAAGCTTTT CGTACAAGCC TTCGAAGAAA GACTTTCTGA AACGATTCGA CATCGCAAGC TCAATCGTTC CGTCAGTTTC AGGCACCTGG TCAAGCTCGA ATGCTACAAA ATAGCCAAAG ACATTCTTGG GATCGAAGAG TATCAACCCT TTAAGATGTA TTGGTAA
|
Protein sequence | MKKTYYLFNP GELSRKDNTI RFVPIQEGEN GQEQAGQARY IPVEGISDFY VFGSLRANSS LYNFLGSNDI AVHFFDYYEN YTGSFMPRDF LLSGKMLLAQ ASAYKNKKKR LFLARKFIEG AASNMQKNLA YYNNRGKDMQ PMMELIDKYS LRLKETTTIE ALMGIEGNIR QAYYDAFNLI IDPFEMGVRS KQPPQNEVNA LISFGNMMCY TLCLKAIHQS QLNPTISFLH TPGERRYSLC LDISEVFKPI LVDRTIFKVM NKRIIQAKHF DKQLNKCILN PSGKKLFVQA FEERLSETIR HRKLNRSVSF RHLVKLECYK IAKDILGIEE YQPFKMYW
|
| |