Gene Information |
Locus tag | Csal_0232 |
Symbol | |
ID | 4027315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 263339 |
End bp | 264259 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637965383 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_572295 |
Protein GI | 92112367 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
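
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon (the listed gene sequence ends in TGA). The function names are illustrative and are not part of the source database.

```python
# Values are copied from the record; helper names are hypothetical.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(263339, 264259)
print(length)                  # 921
print(protein_length(length))  # 306
```

Under these assumptions both results agree with the Gene Length (921 bp) and Protein Length (306 aa) fields.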
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTTCA TTCCATTGAA GCCCATCCCT GTGAAAGATC GCATGTCGAT GATCTTCGTC GGGATGGGGC AGATCGACGT GCGTGATGGT GCCTTCGTCG TCATCGATGA TGTGAATGGC GAGCGCATGC ACATTCCCGT GGGGTCTGTC GCGTGCTTGC TGCTCGAGCC CGGTACTCGC GTGTCGCACG CTGCTGTCAA ACTCGCGTCA GTGGTGGGTA CGCTGTTGAT TTGGGTCGGT GATGCCGGCG TGCGTCTATA TAGCGCGGGT CAGCCTGGCG GCGCGCGTTC GGACAAGCTG CTCTATCAAG CGCAGTTGGC ACGGGATGAA AAACTGCGGC TGAAAGTCGT GCGCAAGATG TTCGAGCTTC GTTTCGGCGA AGAGCCGCCC TCGCGGCGTA GCGTCGATCA ATTGCGAGGT ATGGAAGGGG CTCGGGTACG CAAGACATAC CAGCTTCTTG CCAAGCAGTA TGGCGTCAAG TGGCACGGGC GTCGCTATGA CCCGACTCAA TGGGATGCTT CCGATGTGGC CAACCAGTGC CTTTCGGCCG CGACCGCCTG TCTTTACGGC ATCACGGAAG CCGCCATCCT GGCAGCCGGT TATGCGCCGG CGATTGGCTT TCTGCATACC GGCAAACCCC TGAGCTTCGT TTACGACATC GCCGATATCG TCAAATTCGA GACGGTGGTG CCGGCTGCGT TTCGCGTAGC GGCTCGTAAT CCGCCGATGC CGGAGCGGGA AGTACGCGTT GCGTGTCGCG ATGCTTTCAA GCAAGCCCGC TTATTGCAGC GGTTGATCCC CATGATCGAG GACGTATTGG CCGCTGGAGA AATCGAGCCT CCACCTCCGC CGCCGGATGC GGTCCCACCG GCGATCCCGG AGCCTGAATC CGTCGGCGAT GCAGGGCATC GGAGTCAATG A
|
Protein sequence | MGFIPLKPIP VKDRMSMIFV GMGQIDVRDG AFVVIDDVNG ERMHIPVGSV ACLLLEPGTR VSHAAVKLAS VVGTLLIWVG DAGVRLYSAG QPGGARSDKL LYQAQLARDE KLRLKVVRKM FELRFGEEPP SRRSVDQLRG MEGARVRKTY QLLAKQYGVK WHGRRYDPTQ WDASDVANQC LSAATACLYG ITEAAILAAG YAPAIGFLHT GKPLSFVYDI ADIVKFETVV PAAFRVAARN PPMPEREVRV ACRDAFKQAR LLQRLIPMIE DVLAAGEIEP PPPPPDAVPP AIPEPESVGD AGHRSQ
|
| |