Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3153 |
Symbol | cse1 |
ID | 6486845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | - |
Start bp | 3060910 |
End bp | 3062445 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642738461 |
Product | CRISPR-associated protein Cse1 family |
Protein accession | YP_002042185 |
Protein GI | 194445089 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.670831 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAA CCAAAGAGAA ATGGCTTCCC GTCATATTCT CAAACGGCGA TAAAAAGAAA ATATCATTAC GCGATCTTCT GGATAACCGC ATTCAGGATC TTGCCTATCC TCGGGCTGAT TTTCAGGGGG CGGCATGGCA AATGCTGATT GGTATTTTAC AATGTACCGT CGCGCCGGAA GATAAAGAAG AATGGGCAGA TATCTGGCAT GAAAGTATCG AATTCGAACA ATGGGAGAAG GCGTTAAATA CGATTTCTCT GGCTCTACAG TTCGGCGAGC AAAAACCTTC CTTCCTGCAA AGTTTTGATC CTCTCGATAG TGAATATGGT TCTATTGCCG GGCTGCTGGT GGATGCGCCG GGCGGGAATG CGCTCAAGCT CAATAAAGAT CATTTTGTAA AACGTGGCAA CGTAGAACAA ATATGTCCTC ACTGCGCGGC GATAGCGCTA TTTGCGATTC AAACCAATTC ACCTGCCGGC GGGGCGGGTT ACCGGGTAGG GATGCGCGGC GGTGGTCCGC TGACTACGCT GGTGGTACCG CAGGAAGAAG ATAAATATCC ACTATGGAAA AAACTTTGGC TTAACGTTTT GCCGCAGGAA GAGCCGCCGA ATGTTACACA GCATCCACTC ATTTTTCCCT GGCTTGCGCC GACGAAAACC AGCGAAAAAG CGGGGAATGT GGTCACACCG GATAATGCGC ACCCCTTGCA AGCCTACTGG GGGATGCCGC GGCGCATAGA ACTGGATTTC ACCCACACTG TGGCAGGTAT CTGCGATTTG TGCGGGGAGC ATCACGAATC ACTGCTACTG CAAATGTGTA GTAAAAATTA TGGCGTTCAG TACGACAGTT GGCTACATCC CTTCTCCCCA TATCGGCAGG CATTGAAAGA TCCATCCGCA CCCTGGCTGG CGTTTAAAGG GCAGCCGGGC GGGTCAAGTT ATAAAGACTG GTTGGGGCTG ATGCTCAATC GTGAGGATAA GTTCAACAAA ATGCAGCCTG CAAAGGTCGT TCGTGCCGCT GGTCAGCGGA ACAAAATGAG CCTGTGGTGC TTTGCCTGGG ATATGGATAA GGCCAAGGTC CGCTGCTGGT ATCAGCACCG TATTCCGCTC ATTAGCGTTT TGCACGAAGA GCAATTTCTC GCTGCACTTA ACACCGTGCT GGTGCTGGCT AGTGAGTCGC TGTCGCTGTT ACGGAACGCG TTAAAGAGCG CCAAATTCGA TTGTCCGAAA GAAGCCAAAA TGGATTTTAG TATGGTGGAT ATCGCCTTCT GGCAGGAAAC CGAACCCGCT TTTCGGACGT TGCAAGAGGC GCTGGCTGTC GATCCGCTTC GGCAGGATAC GCAGACTCGC CACGCAGTAA GTCAGTGGGA GGCGGAATTA GCACACTATC TATTTCACGT TTTTGACCGT GATGCCCTGA CCAACCCCGA CTGCCCGGAC GATATCCTGC AGCGCCAGCT GACGGCCCGA CAGGATTTAG CCAGCAGCTA TCGTAAACAT AAAGCGCGCA AGGATGTGTT GGCGCTGGTC GAATAA
|
Protein sequence | MDLTKEKWLP VIFSNGDKKK ISLRDLLDNR IQDLAYPRAD FQGAAWQMLI GILQCTVAPE DKEEWADIWH ESIEFEQWEK ALNTISLALQ FGEQKPSFLQ SFDPLDSEYG SIAGLLVDAP GGNALKLNKD HFVKRGNVEQ ICPHCAAIAL FAIQTNSPAG GAGYRVGMRG GGPLTTLVVP QEEDKYPLWK KLWLNVLPQE EPPNVTQHPL IFPWLAPTKT SEKAGNVVTP DNAHPLQAYW GMPRRIELDF THTVAGICDL CGEHHESLLL QMCSKNYGVQ YDSWLHPFSP YRQALKDPSA PWLAFKGQPG GSSYKDWLGL MLNREDKFNK MQPAKVVRAA GQRNKMSLWC FAWDMDKAKV RCWYQHRIPL ISVLHEEQFL AALNTVLVLA SESLSLLRNA LKSAKFDCPK EAKMDFSMVD IAFWQETEPA FRTLQEALAV DPLRQDTQTR HAVSQWEAEL AHYLFHVFDR DALTNPDCPD DILQRQLTAR QDLASSYRKH KARKDVLALV E
|
| |