Gene Information |
Locus tag | Dgeo_0234 |
Symbol | |
ID | 4059142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | - |
Start bp | 219242 |
End bp | 220270 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641229234 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_603706 |
Protein GI | 94984342 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
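The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (a convention the record does not state) and that the CDS includes a terminal stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the Gene Information table.
start_bp = 219242
end_bp = 220270
gene_length_bp = 1029
protein_length_aa = 342


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both computed values should agree with the table entries.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions, 220270 - 219242 + 1 gives 1029 bp, and 1029 / 3 gives 343 codons, of which one is the stop codon, leaving 342 residues.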
Plasmid Coverage Information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGACAAC TTCTCAACAC CCTCTACATC CAGACCCAGG GCACCTACCT GCACCTTGAC ACCGACAACA TCCGGGTCGA GGTGGAGCGA ACAAAGAAGG CGATGCTGCC CCTGCACCAC ATCGAGGGCG TGGTGGTGTT CGGCAACGTG CTGCTCTCGC CCTTTCTGAT TCACCGCCTC GCCCGCGAGC ACAAGCCGGT CACCTGGCTG AGCGAACACG GGCGCTTCAT GGCCCGCACC GAAACGCCGA TGAGCGGAAA CGTCCTCCTG CGAACGGCCC AGCACGCCTG CGCAGGGAAT GCCGCAAGAA CGCTGGCCAT CGCCCGCCTG ATTGCCGCTG GGAAGCTCCA GAATCAGAAA GTCACCCTGC TGCGCGCGGC TCGCGAAGCA GAAGCCGACG ACGCCGCGCT GCTGCGCCAA GCCGCCCGTG ACATCAACGT CCAGATCGCC TGCCTCCCCC TGACCGAGAC GGTGGACGAG GTCCGCGGCA CCGAGGGCAC CGCCGCTCGC CTCTACTGGG AGGTCTTCCC GCTCATGCTG CGGCAAAACC GCGATTTCTT CTGGCTCTCG GAGCGCCACC GCCGCCCGGC CCGCGACCCC ATCAATGCCC TGCTGAACTT CGTGTACACC GTGCTGGCCA ATGACTGTGC CTCAGCGTGT CAGGCGGTGG GTCTTGACCC GCAGCTCGGC TTCCTGCACG CCCTGCGTCC GGGCAGAAGC AGCCTGGCCC TCGACCTCAT GGAAGAACTG CGCCCCGTCA TCGCCGACCG TGCCATCCTC ACCCTGATTA ACCGCCAGCA GCTCACCCCT CGCGACTTTG TGCTGCATGA GGGCGGCACT GTCAGCATCA CCGAAGAGGG ACGCAAAACC ATTCTGGCGC ATCTGGCTGA ACGCCGCCGG GAGGAAGTCA TGCACCCCCT CACCGCCCGC AAAACTCCGC TGGGGCTGCT GTCACACGTT CAGGCTCGTC TGCTTGCCCA GCACCTCCGC GGTGACCGCC CCCATTACCC CCCCTACCTG CACCGATGA
|
Protein sequence | MRQLLNTLYI QTQGTYLHLD TDNIRVEVER TKKAMLPLHH IEGVVVFGNV LLSPFLIHRL AREHKPVTWL SEHGRFMART ETPMSGNVLL RTAQHACAGN AARTLAIARL IAAGKLQNQK VTLLRAAREA EADDAALLRQ AARDINVQIA CLPLTETVDE VRGTEGTAAR LYWEVFPLML RQNRDFFWLS ERHRRPARDP INALLNFVYT VLANDCASAC QAVGLDPQLG FLHALRPGRS SLALDLMEEL RPVIADRAIL TLINRQQLTP RDFVLHEGGT VSITEEGRKT ILAHLAERRR EEVMHPLTAR KTPLGLLSHV QARLLAQHLR GDRPHYPPYL HR
|
| |
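The relationship between the gene sequence and the protein sequence can be illustrated by translating short fragments of the CDS. This is a rough sketch rather than the database's own procedure: it assumes the gene sequence is already given on the coding strand, uses the amino acid assignments of translation table 11 (which match the standard code), and treats a leading `GTG` or `TTG` as an initiator methionine. `START_CODONS` lists only a subset of the table 11 initiators, and `translate` is a hypothetical helper written for this example.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only these initiators are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds):
    """Translate a coding-strand fragment, stopping at the first stop codon."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in the record.
first_fragment = "GTGAGACAAC TTCTCAACAC CCTCTACATC"
last_fragment = "CACCGATGA"

print(translate(first_fragment))
print(translate(last_fragment))
```

The first fragment yields the opening ten residues of the listed protein sequence, and the last fragment yields its final two residues followed by the `TGA` stop codon, which is why the 1029 bp gene encodes 342 rather than 343 amino acids.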