Gene EcSMS35_2883 details

Gene Information

Locus tag: EcSMS35_2883
Symbol: cas1
ID: 6143521
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2952596
End bp: 2953519
Gene Length: 924 bp
Protein Length: 307 aa
Translation table: 11
GC content: 52%
IMG OID: 641617752
Product: CRISPR-associated Cas1 family protein
Protein accession: YP_001744907
Protein GI: 170681830
COG category: [L] Replication, recombination and repair
COG ID: [COG1518] Uncharacterized protein predicted to be involved in DNA repair
TIGRFAM ID: [TIGR00287] CRISPR-associated endonuclease Cas1
            [TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype
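
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2952596
END_BP = 2953519
GENE_LENGTH_BP = 924
PROTEIN_LENGTH_AA = 307


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons print True for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2953519 - 2952596 + 1 gives 924 bp, and 924 / 3 - 1 gives 307 aa, which agrees with the listed lengths.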


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.921772
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTTTG TACCACTGAG TCCGATCCCG TTAAAAGATC GCACATCTAT GATCTTCCTC 
CAGTACGGTC AGGTCGACGT ACTGGACGGC GCTTTCGTGC TGATCGACAA AACCGGGATC
CGCACGCATA TTCCAGTGGG ATCGGTCGCC TGCATTATGC TCGAACCGGG AACAAGAGTT
TCCCACGCGG CAGTCCATCT GGCCGCCACG GTGGGAACAC TGCTGGTCTG GGTCGGTGAA
GCGGGCGTTC GCGTTTACTC CTCCGGACAA CCCGGAGGGG CGCGGGCAGA TAAATTACTC
TACCAGGCAA AGCTGGCTTT AACGGAAGAT CTACGGTTGA AGGTGGTGCG CAAAATGTAT
GAATTACGTT TTCGTGAACC ACCGCCAGCT CGCCGTTCAG TGGAGCAGTT ACGGGGAATT
GAGGGATCCC GTGTTCGCCA GACCTATGCA TTACTGGCGA AGCAATATGG TGTGAAATGG
AATGGTCGCA AATACGATCC TAAAGACTGG GAAAAAGGCG ATGTTGTCAA TCGCTGCATC
AGTGCTGCCA CATCATGCCT GTACGGTATA TCTGAAGCGG CGGTATTAGC CGCGGGATAT
GCGCCCGCTA TTGGATTTAT CCATAGTGGC AAACCGCTTT CATTTGTTTA TGACATAGCC
GATATCATTA AATTTGATTT GGTTGTGCCA AAGGCATTTG AAATAGCGGC AAGACACCCG
GCAGAACCTG ATAAAGAAGT GAGACTGGCC TGCCGTGATA TTTTCCGTAG TAGTAAATTA
ACGGGCAAAT TAATTCCGTT AATTGAGGAA GTCCTTGCCG CCGGTGAAAT TGAACCACCA
CAACCTGCGC CGGATATGTT ACCACCAGCT ATCCCGGAAC CTGAATTGCT AGGTGATAGT
GGTCACCGGG GGCGCAGCGG ATGA
 
Protein sequence
MTFVPLSPIP LKDRTSMIFL QYGQVDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV 
SHAAVHLAAT VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALTED LRLKVVRKMY
ELRFREPPPA RRSVEQLRGI EGSRVRQTYA LLAKQYGVKW NGRKYDPKDW EKGDVVNRCI
SAATSCLYGI SEAAVLAAGY APAIGFIHSG KPLSFVYDIA DIIKFDLVVP KAFEIAARHP
AEPDKEVRLA CRDIFRSSKL TGKLIPLIEE VLAAGEIEPP QPAPDMLPPA IPEPELLGDS
GHRGRSG
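
The gene and protein sequences above can be compared with a small translation routine. This is a minimal sketch rather than a reference implementation: it assumes translation table 11 uses the same codon-to-amino-acid mapping as the standard genetic code and differs only in its set of permitted start codons (listed in `START_CODONS`), and it assumes the listed gene sequence is already in coding orientation. The helper names `clean`, `gc_percent`, and `translate_cds` are hypothetical.

```python
from itertools import product

# Codon table built in the usual T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons assumed for translation table 11; any of these at the first
# position of the CDS is read as methionine.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("GTGACGTTTG TACCACTGAG TCCGATCCCG"))  # MTFVPLSPIP
```

Passing the full gene sequence to `translate_cds` allows a comparison against the listed protein sequence, and `gc_percent` on the same input can be compared against the reported GC content of 52%.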