Gene Information |
Locus tag | SeHA_C3137 |
Symbol | cse1 |
ID | 6489646 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | - |
Start bp | 3057204 |
End bp | 3058760 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642743282 |
Product | Cse1 family CRISPR-associated protein |
Protein accession | YP_002046901 |
Protein GI | 194447974 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 |
|
|
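The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the record does not state either convention, though both are consistent with its numbers). The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3057204
END_BP = 3058760
GENE_LENGTH_BP = 1557
PROTEIN_LENGTH_AA = 518


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)

print(gene_length == GENE_LENGTH_BP)        # coordinates agree with Gene Length
print(protein_length == PROTEIN_LENGTH_AA)  # Gene Length agrees with Protein Length
```

Under these assumptions the span 3057204 to 3058760 gives 1557 bp, which is 519 codons: 518 residues plus one stop codon.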
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.199062 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 0.768115 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACAATT TTTCACTTTT AACAACGCCC TGGCTCCCCG TCCGTTTCAA AGACGGTTCC ACGGGCAAGC TGGCCCCCGT CGATCTGGCG GATGAAAACG TGGTGGACAT CGCCGCAACG CGAGCAGATT TACAGGGAGC GGCCTGGCAG TTTCTGTTGG GATTGCTGCA ATGCAGTATC GCGCCGAAAA GATACAAAAA TTGGGAGGAT ATCTGGTTTG ATGGATTGCA TGCCGATGTG CTCCATAAGG CATTAGCACC GTTAGAACAC GCTTTTCAGT TTGGCGCGGA ATCCCCCTCG TTTATGCAGG ATTTTGAACC GTTAAGCGGC GAAAAAGTCT CTATTGCCTC ATTGTTGCCG GAAATACCTG GCGCGCAAAC CACGAAGTTC AATAAAGATC ATTTTGTCAA ACGCGGCGTA ACGGAACGTT TTTGTCCGCA CTGCGCGGCG CTGGCGCTGT TCTCGTTGCA GCTTAACGCG CCTGCGGGCG GCAAAGGCTA TCGTACCGGG CTGCGCGGCG GCGGGCCACT GACCACGCTG GTTGAATTGC AGGAATATCA GGGCGAGCGG CAAACGCCGC TCTGGCGCAA GCTGTGGCTC AACGTGATGC CGCAGGATAC TGCGGATCTG CCTTTACCAG ACCAGTGTGA TGCGACCGTT TTCCCGTGGC TTGCCGCGAC GCGGACCAGC GAGCAGGCGA ATGCCGTTAC CACGCCGGAG CAGGTCAATA AACTCCAGGC GTACTGGGGG ATGCCGCGTC GTATCCGCCT GGATTTTGCC ACCTTACAGT CAGGTTGCTG CGATATTTGC GGCGCTGAAA GCGATGAGCT TCTTGGCTTT ATGACCGTTA AGAACTACGG CGTTAACTAC GATGGCTGGC GTCATCCGCT GACGCCCTAT CGCGCCCCGG TAAAAGATCA AAACGCCTTC TTTTCCGTTA AACCGCAGCC CGGCGGCCTT ATCTGGCGCG ACTGGCTGGG ATTAAGTCAG AACAACCAGA CGGAAGCGAA TTACGAATCT CCCGCGCAGG TAGTCAAGGT GTTTAACGCC CGCTCGCTGA CTGACGTTAA AGCGGGGATC TGGGGCTTTG GCGCGGATTT CGACAATATG AAAATCCGCT GCTGGTATGA GCATCACTTC CCGTTGCTGA TGACGGAAGG TCTGATCCCT GATTTACGTA AGGCCGTGCA AACTGCGGCC CGCCTGTTGA GCCTGCTTCG CAGTGCGCTA AAAGAAGCGT GGTTCGCCAA TGCGAAGGAT GCCCGCGGTG ATTTCAGTTT TATCGACATT GATTTCTGGA ACCTGACGCA GGGACGTTTT CTCAACCTGA TTCACGATCT GGAAAACGGC CACAAGCCGG ACGAAAGGTT GAATAAATGG CAAAGAGAAC TTTGGCTGTT TACCCGTCAT TACTTCGATG ATCACGTCTT TACCAACCCC TACGAGAGCA GCGATCTGGA ACGCATCATG ACCGCGCGCA AGAAATATTT TACGACATCG GCGGAAAAAC AAAGCGCAAA AGCCGCCAAA GCAAAGAAAC AGGAGGCTGC TGAATGA
|
Protein sequence | MDNFSLLTTP WLPVRFKDGS TGKLAPVDLA DENVVDIAAT RADLQGAAWQ FLLGLLQCSI APKRYKNWED IWFDGLHADV LHKALAPLEH AFQFGAESPS FMQDFEPLSG EKVSIASLLP EIPGAQTTKF NKDHFVKRGV TERFCPHCAA LALFSLQLNA PAGGKGYRTG LRGGGPLTTL VELQEYQGER QTPLWRKLWL NVMPQDTADL PLPDQCDATV FPWLAATRTS EQANAVTTPE QVNKLQAYWG MPRRIRLDFA TLQSGCCDIC GAESDELLGF MTVKNYGVNY DGWRHPLTPY RAPVKDQNAF FSVKPQPGGL IWRDWLGLSQ NNQTEANYES PAQVVKVFNA RSLTDVKAGI WGFGADFDNM KIRCWYEHHF PLLMTEGLIP DLRKAVQTAA RLLSLLRSAL KEAWFANAKD ARGDFSFIDI DFWNLTQGRF LNLIHDLENG HKPDERLNKW QRELWLFTRH YFDDHVFTNP YESSDLERIM TARKKYFTTS AEKQSAKAAK AKKQEAAE
|
| |
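The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the gene sequence for comparison with the protein sequence. It is illustrative only: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for internal codons), and alternative start codons are not handled, which does not matter here because this gene starts with ATG. Only the first 30 bases and first 10 residues of the record are used as sample input.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G (standard assignments,
# shared by translation table 11 for non-initiator codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    dna = strip_blocks(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate complete codons; drop a single trailing stop codon if present."""
    dna = strip_blocks(seq)
    usable = len(dna) - len(dna) % 3  # ignore any incomplete final codon
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence and first 10 residues of the protein.
gene_prefix = "ATGGACAATT TTTCACTTTT AACAACGCCC"
protein_prefix = "MDNFSLLTTP"

print(translate(gene_prefix) == strip_blocks(protein_prefix))
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be compared in the same way.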