Gene Information |
Locus tag | ECH74115_4008 |
Symbol | cas1 |
ID | 6968155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3703218 |
End bp | 3704141 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643387776 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_002272219 |
Protein GI | 209397933 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype |
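
The coordinate and length fields above are linked by simple arithmetic, which can be used as a sanity check on the record. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3703218, 3704141)
print(length, "bp")                     # 924 bp
print(protein_length_aa(length), "aa")  # 307 aa
```

Both results agree with the Gene Length and Protein Length fields of this record.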
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGTTTG TACCACTGAG TCCGATCCCG TTAAAAGATC GCACCTCTAT GATCTTCCTC CAGTACGGTC AAATCGACGT ACTGGACGGC GCTTTCGTGC TGATCGACAA AACCGGGATC CGCACGCACA TTCCGGTGGG ATCGGTCGCC TGCATTATGC TCGAACCGGG AACGAGAGTT TCCCACGCGG CGGTGCATCT GGCCGCCACG GTGGGAACAC TGCTGGTCTG GGTCGGTGAA GCGGGCGTTC GCGTTTACTC TTCCGGACAA CCCGGAGGGG CGCGGGCAGA TAAATTACTC TACCAGGCAA AGCTGGCTTT AACGGAAGAT CTACGCCTGA AGGTGGTGCG CAAAATGTAT GAATTACGTT TTCGTGAGCC ACCGCCAGCT CGCCGTTCAG TGGATCAGCT ACGGGGAATT GAGGGATCCC GCGTTCGCCA GACCTATGCA TTACTGGCGA AACAATATGG TGTGAAATGG AATGGTCGCA AATACGATCC TAAAGACTGG GAAAAAGGCG ATGTTGTGAA TCGCTGCATC AGTGCTGCCA CATCATGTCT GTACGGTATT TCTGAAGCGG CAGTATTAGC CGCGGGATAT GCGCCCGCTA TTGGATTTAT TCATAGTGGC AAACCGCTTT CATTTGTTTA TGACATAGCC GATATCATTA AATTTGATTC GGTTGTGCCA AAGGCATTTG AAATAGCAGC GAGGCAACCC GCAGAACCTG ATAAAGAAGT CAGATTAGCC TGTCGCGATA TTTTCCGTAG CACTAAGTTA ACGGGCAAAT TAATACCGTT AATTGAGAAA GTCCTTGCTG CAGGTGAAAT TGAACCACCA CAACCCGCGC CGGATATGTT ACCGCCTGCC ATCCCTGAAC CTGAAACGCT GGGTGATAGT GGTCACCGGG GGCGCGGCGG ATGA
|
Protein sequence | MTFVPLSPIP LKDRTSMIFL QYGQIDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV SHAAVHLAAT VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALTED LRLKVVRKMY ELRFREPPPA RRSVDQLRGI EGSRVRQTYA LLAKQYGVKW NGRKYDPKDW EKGDVVNRCI SAATSCLYGI SEAAVLAAGY APAIGFIHSG KPLSFVYDIA DIIKFDSVVP KAFEIAARQP AEPDKEVRLA CRDIFRSTKL TGKLIPLIEK VLAAGEIEPP QPAPDMLPPA IPEPETLGDS GHRGRGG
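
The gene sequence is listed in coding orientation (it begins with the GTG start codon and ends with the TGA stop codon), so it can be translated directly even though the gene lies on the minus strand. The sketch below shows one way to reproduce the protein sequence and the GC content from the gene sequence. It rests on stated assumptions: translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and only a subset of its alternative start codons (ATG, GTG, TTG) is handled here. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 shares
# them and differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation CDS, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
head = "GTGACGTTTG TACCACTGAG TCCGATCCCG"
print(translate(head))  # MTFVPLSPIP
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields of this record.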