Gene EcE24377A_3057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3057 
Symbolcas1 
ID5589688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3057621 
End bp3058541 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content52% 
IMG OID640926701 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001464077 
Protein GI157155405 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.82457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGTTTG TACCACTGAG CCCGATCCCA TTAAAAGATC GCACCACCAT GATCTTCCTC 
CAGTACGGTC AGATCGACGT ACTGGACGGC GCTTTCGTGC TGATCGACAA AACAGGGATC
CGCACACATA TTCCGGTAGG ATCGGTCGCC TGCATTATGC TCGAACCGGG AACGAGAGTT
TCCCACGCGG CGGTGCATCT GGCCGCCACG GTGGGAACAC TGCTGGTCTG GGTCGGTGAA
GCGGGCGTTC GCGTTTACTC ATCCGGACAA CCCGGAGGGG CGCGGGCAGA TAAATTACTC
TACCAGGCAA AGCTGGCTTT AACGGAAGAT CTACGCCTGA AGGTGGTGCG CAAAATGTAT
GAATTACGTT TTCGTGAGCC ACCGCCAGCT CGCCGTTCAG TGGAGCAGCT ACGTGGAATT
GAGGGATCTC GCGTTCGCCA GACGTATGCA TTACTGGCGA AACAATATGG TGTGAAATGG
AATGGTCGCA AATACGATCC TAAAGACTGG GAAAAAGGCG ATGTTGTGAA TCGCTGCATC
AGTGCTGCTA CATCATGTCT GTACGGTATA TCTGAAGCGG CAGTATTAGC CGCGGGATAT
GCGCCCGCTA TTGGGTTTAT CCATAGTGGC AAACCGCTTT CATTTGTTTA TGACATAGCC
GATATCATTA AATTTGATTC GGTTGTACCA AAGGCATTTG AAATAGCGGC GAGGCAACCC
GCAGAACCTG ATAAAGAAGT CAGATTAGCC TGTCGTGATA TTTTCCGTAG CACTAAGTTA
ACGGGCAAAT TAATACCGTT AATTGAGGAA GTCCTTGCCG CAGGTGAAAT TGAACCGCCA
CAACCTGCGC CGGATATGTT ACCGCCAGCC ATCCCCGAAC CTGAAACGTT GGGCGATAGC
GGTCATCGAG GACGCGGCTG A
 
Protein sequence
MTFVPLSPIP LKDRTTMIFL QYGQIDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV 
SHAAVHLAAT VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALTED LRLKVVRKMY
ELRFREPPPA RRSVEQLRGI EGSRVRQTYA LLAKQYGVKW NGRKYDPKDW EKGDVVNRCI
SAATSCLYGI SEAAVLAAGY APAIGFIHSG KPLSFVYDIA DIIKFDSVVP KAFEIAARQP
AEPDKEVRLA CRDIFRSTKL TGKLIPLIEE VLAAGEIEPP QPAPDMLPPA IPEPETLGDS
GHRGRG