Gene EcE24377A_3062 details


Gene Information

Locus tag: EcE24377A_3062
Symbol: cse1
ID: 5590399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 3061527
End bp: 3063089
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 51%
IMG OID: 640926706
Product: CRISPR-associated Cse1 family protein
Protein accession: YP_001464082
Protein GI: 157156933
COG category:
COG ID:
TIGRFAM ID: [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1
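
The coordinate and length fields above agree with each other, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the record does not state either convention, although its numbers are consistent with both. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3061527, 3063089)
print(length)                  # 1563
print(protein_length(length))  # 520
```

Both values match the Gene Length and Protein Length fields listed for this gene.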


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCTG TTCGTTTTAA AGACGGAACG 
ACAGGCAAGC TGGCACCAGT CGATCTGGCT GATGAAAATG TTGTCGATAT CGCTGCGCCG
CGGGCAGATC TCCAGGGTGC GGCCTGGCAG TTTTTGCTGG GGCTACTACA AACCAGCTTC
GCGCCCAAAA ATCACGGTCG TTGGGATGAT ATCTGGGAAG ACGGACTGGA GGCTGAAAAG
CTACGGGAAG CATTGCTGTC CTTAGAACAC GCTTTCCAGT TTGGGGCAGA TTCCCCTTCA
TTTATGCAGG ATTTCGAGGC GCTCAAGGGA GATAAAGTTC AGGTCGCTTC GCTACTGCCT
GAGATTCCCG GCGCTCAAAC AACGAAGTTT AATAAAGACC ACTTTATTAA GCGTGGCGTG
ACTGAACACG TATGCCCTCA TTGTTCCGCG TTAGCTCTGT TCTCCCTACA GTTAAATGCG
CCATCAGGCG GCAAAGGCTA TCGCACCGGT TTGCGCGGCG GTGGGCCGAT GACCACTCTG
ATTGAATTAC AGGAATATCA AGGCAATCAA CAAACGCCCT TGTGGCGCAA ACTGTGGCCC
AATGTGATGC CGCAGGATGA AGCCGACTTA CCGCTACCCA AAAAATTTGA CGATCTGGTT
TTCCCCTGGC TTGGCCCGAC GCGAACCAGC GAACTGGCCG GTGCGGTGGT AACCCATGAT
CAGGTCAATA AACTCCAGGC TTACTGGGGA ATGCCGCGCC GTATTCGTAT TGATTTTAAT
ACCACGACAG TCGGCAACTG CGATATTTGC GGTGAACAGA GCGACGCGCT CCTGAGCCTG
ATGACGACCA AAAATTACGG TGCGAATTAT GCCATGTGGC AGCATCCCTT AACGCCTTAC
CGTATACCAC TTAAAGAGGG TGGCGAGTTT TACTCGGTCA AACCACAACC GGGCGGTTTA
ATCTGGCGCG ACTGGTTAGG CCTTATCGAA ACGGGTAAAT CAGAAAACAA TACGGAACTT
CCCGCGCTGG TGGTGAAACT CTTTAATGCC AGCAGTCTGA AACAGGCAAA AGTAGGCCTG
TGGGGATTTG GCTATGATTT CGACAACATG AAAGCGCGCT GTTGGTACGA ACACCATTTC
CCGCTGCTGC TCAAGAAAAA AGAAGGCCAG ATACCGAAAC TGCGTCTGGC TGCGCAAACG
GCTTCACGGA TTCTAAGTCT GTTACGAAGT GCATTGAAAG AAGCGTGGTT CTCCGATCCA
AAAGGTGCAC GGGGTGATTT CAGTTTTGTG GATATCGACT TCTGGAACAA AACTCAGCAT
CGCTTCCTGA GGTTAGTGCG CCAAATTGAA GAAGGTCAGG ATGCGGACGA ATTACTCGGC
AAATGGCAAA AGGAAATTTG GTTATTCGCG CGTCAGGATT TTGACGAGCG TGTATTCACC
AATCCATATG AGCCTGTTGA TTTGAAACGC GTTATGACCG CGCGCAAGAA ATATTTCACA
ACATCGGCGG AGAAGCAAAG TGCTAAAGCC GCCAGGGAGA AAAAGCAGGA GGCTGCTGAA
TGA
 
Protein sequence
MNSFSLLTTP WLPVRFKDGT TGKLAPVDLA DENVVDIAAP RADLQGAAWQ FLLGLLQTSF 
APKNHGRWDD IWEDGLEAEK LREALLSLEH AFQFGADSPS FMQDFEALKG DKVQVASLLP
EIPGAQTTKF NKDHFIKRGV TEHVCPHCSA LALFSLQLNA PSGGKGYRTG LRGGGPMTTL
IELQEYQGNQ QTPLWRKLWP NVMPQDEADL PLPKKFDDLV FPWLGPTRTS ELAGAVVTHD
QVNKLQAYWG MPRRIRIDFN TTTVGNCDIC GEQSDALLSL MTTKNYGANY AMWQHPLTPY
RIPLKEGGEF YSVKPQPGGL IWRDWLGLIE TGKSENNTEL PALVVKLFNA SSLKQAKVGL
WGFGYDFDNM KARCWYEHHF PLLLKKKEGQ IPKLRLAAQT ASRILSLLRS ALKEAWFSDP
KGARGDFSFV DIDFWNKTQH RFLRLVRQIE EGQDADELLG KWQKEIWLFA RQDFDERVFT
NPYEPVDLKR VMTARKKYFT TSAEKQSAKA AREKKQEAAE
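
The sequences above are displayed in space-separated blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of that clean-up and of a GC-fraction calculation; the helper names are hypothetical, and the way the page derives its own "GC content" figure is not stated, so this is only one plausible way to compute such a value.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied as shown on the page.
first_line = "ATGAACTCGT TTTCACTTCT GACAACCCCG TGGTTGCCTG TTCGTTTTAA AGACGGAACG"
seq = clean_sequence(first_line)
print(len(seq))   # 60
print(seq[:3])    # ATG

# Tiny illustrative input (not taken from the record): 6 of 8 bases are G or C.
print(gc_fraction(clean_sequence("ATGC GGCC")))  # 0.75
```

To analyze the whole gene, pass all of the gene sequence lines, joined together, to `clean_sequence` before calling `gc_fraction`.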