Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TDE0328 |
Symbol | cas1 |
ID | 2741544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Treponema denticola ATCC 35405 |
Kingdom | Bacteria |
Replicon accession | NC_002967 |
Strand | + |
Start bp | 365205 |
End bp | 366077 |
Gene Length | 873 bp |
Protein Length | 290 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637159200 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | NP_970942 |
Protein GI | 42525844 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTGGA GAACCGTTGT TATTTCAAAT AGGGCAAAGC TGGATTTACA CTTAAATCAT CTAGTAGTTC GGGGTGAAAA AACTCAAAAA GTTTTTATCG AAGAAATTTC AGTTTTAATC ATTGAAACGA CAGCGGTATC TATAACCGCA GCCTTATTAA ATGAGCTTAT AAAACAAAAG GTTAAGGTAA TTTTTTGTGA CGAAAAACGA AATCCGGCTT CTGAATTGAT AGGATATTAC GGCAGCCACG ATACCAGCGA AAAAATCAGG CTTCAAATAA AATGGGATAA GAATATCAAA CAGCTTGTAT GGACTGAGGT CGTAACAGAA AAAATAAGAC AGCAAAAATA CCTCTTAGAA AAACTGAATC TTCCGCAAGC AAGTTTACTG GCTGAGTATA TTACGGATAT AGATATTAAC GATAAAACAA ATAAGGAAGC CCATGCTGCA AAAGCATATT TTGCAGCGTT ATTTGGAGCA GGTTTTTCCA GAAGTTTGGA TATTCCTATA AATGCCGCTC TAAATTACGG ATATAGCATC TTACTTTCTG CATTTAATCG TGAAATAATA GCAAACGGTT ATATTACACA ACTGGGAATT TTTCATGATA ATATGTTTAA TCCCTTTAAT CTCGGATCAG ACCTTATGGA GCCTTTTAGG CCTCTCGTTG ATGCAGAAGT TTTTAAACTT AATCCTCAAA AATTTGAACA TGAAGAAAAA CTAAAAATAG TTAGTGTTAT AAATAAAAAG GTGCTCATAA ACAATAAAGA GCATTATCTA AATAAAGCTA TCGAAATTTT TGTACACAGT ATTTTTGATG CTCTAAATGA AAAGGATATT TCACAAATTA ACTTTTACCG AAATGAGTTA TAG
|
Protein sequence | MSWRTVVISN RAKLDLHLNH LVVRGEKTQK VFIEEISVLI IETTAVSITA ALLNELIKQK VKVIFCDEKR NPASELIGYY GSHDTSEKIR LQIKWDKNIK QLVWTEVVTE KIRQQKYLLE KLNLPQASLL AEYITDIDIN DKTNKEAHAA KAYFAALFGA GFSRSLDIPI NAALNYGYSI LLSAFNREII ANGYITQLGI FHDNMFNPFN LGSDLMEPFR PLVDAEVFKL NPQKFEHEEK LKIVSVINKK VLINNKEHYL NKAIEIFVHS IFDALNEKDI SQINFYRNEL
|
| |