Gene ECH74115_4008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4008 
Symbolcas1 
ID6968155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3703218 
End bp3704141 
Gene Length924 bp 
Protein Length307 aa 
Translation table11 
GC content52% 
IMG OID643387776 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002272219 
Protein GI209397933 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTTG TACCACTGAG TCCGATCCCG TTAAAAGATC GCACCTCTAT GATCTTCCTC 
CAGTACGGTC AAATCGACGT ACTGGACGGC GCTTTCGTGC TGATCGACAA AACCGGGATC
CGCACGCACA TTCCGGTGGG ATCGGTCGCC TGCATTATGC TCGAACCGGG AACGAGAGTT
TCCCACGCGG CGGTGCATCT GGCCGCCACG GTGGGAACAC TGCTGGTCTG GGTCGGTGAA
GCGGGCGTTC GCGTTTACTC TTCCGGACAA CCCGGAGGGG CGCGGGCAGA TAAATTACTC
TACCAGGCAA AGCTGGCTTT AACGGAAGAT CTACGCCTGA AGGTGGTGCG CAAAATGTAT
GAATTACGTT TTCGTGAGCC ACCGCCAGCT CGCCGTTCAG TGGATCAGCT ACGGGGAATT
GAGGGATCCC GCGTTCGCCA GACCTATGCA TTACTGGCGA AACAATATGG TGTGAAATGG
AATGGTCGCA AATACGATCC TAAAGACTGG GAAAAAGGCG ATGTTGTGAA TCGCTGCATC
AGTGCTGCCA CATCATGTCT GTACGGTATT TCTGAAGCGG CAGTATTAGC CGCGGGATAT
GCGCCCGCTA TTGGATTTAT TCATAGTGGC AAACCGCTTT CATTTGTTTA TGACATAGCC
GATATCATTA AATTTGATTC GGTTGTGCCA AAGGCATTTG AAATAGCAGC GAGGCAACCC
GCAGAACCTG ATAAAGAAGT CAGATTAGCC TGTCGCGATA TTTTCCGTAG CACTAAGTTA
ACGGGCAAAT TAATACCGTT AATTGAGAAA GTCCTTGCTG CAGGTGAAAT TGAACCACCA
CAACCCGCGC CGGATATGTT ACCGCCTGCC ATCCCTGAAC CTGAAACGCT GGGTGATAGT
GGTCACCGGG GGCGCGGCGG ATGA
 
Protein sequence
MTFVPLSPIP LKDRTSMIFL QYGQIDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV 
SHAAVHLAAT VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALTED LRLKVVRKMY
ELRFREPPPA RRSVDQLRGI EGSRVRQTYA LLAKQYGVKW NGRKYDPKDW EKGDVVNRCI
SAATSCLYGI SEAAVLAAGY APAIGFIHSG KPLSFVYDIA DIIKFDSVVP KAFEIAARQP
AEPDKEVRLA CRDIFRSTKL TGKLIPLIEK VLAAGEIEPP QPAPDMLPPA IPEPETLGDS
GHRGRGG