Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_2636 |
Symbol | |
ID | 4073867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008010 |
Strand | - |
Start bp | 407219 |
End bp | 408169 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641228839 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_594144 |
Protein GI | 94972104 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.34834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAC CTGGCATGTC CAACGCCATC ATCTGGCAGC GGCAAAACCT GCGCGAGCTG CCCAAGTTCC GCGACGGCAC GACCTACCTC TATCTCGAAC ACACACGGCT GGAACAGGAC GGGCGCGGGG TACGTGCCTA CCACCCTGAA GGGATGGTGA CCCTCCCGGC GGCGAGCCTC AGTGTGCTGC TGTTGGGGCC GGGCTGCTCG GTCAGTCACG AGGCAGTCAA GGCCCTCTCC GACACGGGCT GCTCGCTGCT GTGGGTGGGG GAGGGAGGCG TACGGCTTTA TGCCAGTGGC CTGGGTGAGA CGCGCAGCGC CGCCCGTCTC CAGCGTCAGG CGCTGCTGTG GGCCAACCCC CAGAGCCGCC TGCGGGTGGT CCGGCAGATG TACGCGATGC GTTTCCCCGA GGGACTGCCC CCGGACCTCA CCCTCGAGCA GATTCGTGGG CGCGAGGGAG CACGGGTGCG CGACGCTTAT GCCCGCTCGA GCCAGGCGTA CGGCGTCAGG TGGGACACCC GCCAGTACAA GCAGCAGGAC TGGCACCGCG CCACGCCCGT GAACAAGGCC ATCAGCGCTG GAAACGCCTG CCTGTATGGC CTCGCCCACG CCGCCATCTT GAGCATGGGC TACAGCCCCG CGCTGGGCTT CATTCACACC GGCAAGATGC TGAGCTTCGT GTACGACGTG GCCGACCTCT ACAAGCTGGA GGTGGTCCTG CCGGTCGCCT TTCGCGAGGC GGCCACTCCC GGCGACGACC TGGAACGGCG GGTCCGCACC GGCCTGCGCG ACCACATGAC CAAGTTGCGT TTGCTGGAGC GCATGGCCGC CGATCTCCTG CGCCTGCTGG GCGGGGACGA CACCGACCCG AACTCCACCG CCCCCGGCGA CCTGTGGGAC CCGGAGGGCC ACGCGGCGGG AGGGGTGAAC CATGCTGGTC ATGACTCTTG A
|
Protein sequence | MTEPGMSNAI IWQRQNLREL PKFRDGTTYL YLEHTRLEQD GRGVRAYHPE GMVTLPAASL SVLLLGPGCS VSHEAVKALS DTGCSLLWVG EGGVRLYASG LGETRSAARL QRQALLWANP QSRLRVVRQM YAMRFPEGLP PDLTLEQIRG REGARVRDAY ARSSQAYGVR WDTRQYKQQD WHRATPVNKA ISAGNACLYG LAHAAILSMG YSPALGFIHT GKMLSFVYDV ADLYKLEVVL PVAFREAATP GDDLERRVRT GLRDHMTKLR LLERMAADLL RLLGGDDTDP NSTAPGDLWD PEGHAAGGVN HAGHDS
|
| |