Gene EcE24377A_3060 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3060 
Symbolcse4 
ID5589843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3059927 
End bp3060982 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content50% 
IMG OID640926704 
ProductCRISPR-associated Cse4 family protein 
Protein accessionYP_001464080 
Protein GI157157725 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAT TTATTCAGCT TCATTTGTTA ACCGCTTACC CTGCTGCCAA CCTTAACCGT 
GATGATACCG GTGCGCCAAA AACTGTGGTG CTGGGTGGAG CAACGCGTCT GCGCGTTTCC
TCGCAAAGTC TGAAACGTGC GTGGCGCACT TCTGCACTTT TTGAACAGGC ACTGGCGGGC
CATATTGGTA TTCGCAGTGG GCGTATTGCG CGTGAGGCGG CAACTATCCT GATTGAGAAA
GGAATCGAAG AGAAAAAAGC CATCGAATGG GCGGCAAAAA TTGCAGATTA TCTTGGGAAA
GCTAAAAACG ACAAAAAACC AAAAGATCCG CTCACTAACG CCGAAACTGA ACAGTTAGTC
CATATCAGCC CGGCAGAATT TGACGCCGTA AAAGTGCTGG CCCATCAGCT TGCAGAAGAA
AAGCGCGCGC CAAAAGAGGA AGATCTCGCG TTGTTACGCA AAGACCGTAT GGCGGTAGAT
ATTGCCATGT TTGGTCGTAT GCTGGCGAAT AAACCCGAGT TTAATGTTGA AGCCGCCTGC
CAGGTAGCGC ACGCATTTGG TGTCAGTGAA ACGATTGTCG AAGATGATTT TTTCACCGCC
GTTGATGATT TGCGCCAGGC TTCTGAAGAT GCCGGTGCCG GGCATCTTGG TGAAACCGGA
TTCGGTTCTG CACTGTTCTA CACCTATATC TGCATTGATA AAGATCTGCT GGTCGAAAAC
CTTGGCGGTG ACGAAGCGTT AGCTAATCAG ACCATACGTG CCTTTACGGA AGCCGCACTT
AAAGTCTCCC CAACCGGCAA ACAGAACAGC TTTGCCAGCC GTGCCTACGC CTCCTGGGCG
CTGGCAGAAA AAGGCACCGA TCAACCGCGT TCTCTGGCGG CAGCTTTCTA TGAACCCATT
AATGGCACCC GTCAGTTAGA GGTGGCGGTG CAGCGTATTA CAACTCTTCG CGAAAATATG
AATACGGTCT ACGAACAGAA GACCGATTAC GCAAGTTTTG ACGTGATGAA CAAACAGGGA
AGCATGAAGG ACGTGCTGGA CTTTATCTGC GCGTAA
 
Protein sequence
MTTFIQLHLL TAYPAANLNR DDTGAPKTVV LGGATRLRVS SQSLKRAWRT SALFEQALAG 
HIGIRSGRIA REAATILIEK GIEEKKAIEW AAKIADYLGK AKNDKKPKDP LTNAETEQLV
HISPAEFDAV KVLAHQLAEE KRAPKEEDLA LLRKDRMAVD IAMFGRMLAN KPEFNVEAAC
QVAHAFGVSE TIVEDDFFTA VDDLRQASED AGAGHLGETG FGSALFYTYI CIDKDLLVEN
LGGDEALANQ TIRAFTEAAL KVSPTGKQNS FASRAYASWA LAEKGTDQPR SLAAAFYEPI
NGTRQLEVAV QRITTLRENM NTVYEQKTDY ASFDVMNKQG SMKDVLDFIC A