Gene SeHA_C3137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3137 
Symbolcse1 
ID6489646 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3057204 
End bp3058760 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content54% 
IMG OID642743282 
ProductCse1 family CRISPR-associated protein 
Protein accessionYP_002046901 
Protein GI194447974 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.199062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value0.768115 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAATT TTTCACTTTT AACAACGCCC TGGCTCCCCG TCCGTTTCAA AGACGGTTCC 
ACGGGCAAGC TGGCCCCCGT CGATCTGGCG GATGAAAACG TGGTGGACAT CGCCGCAACG
CGAGCAGATT TACAGGGAGC GGCCTGGCAG TTTCTGTTGG GATTGCTGCA ATGCAGTATC
GCGCCGAAAA GATACAAAAA TTGGGAGGAT ATCTGGTTTG ATGGATTGCA TGCCGATGTG
CTCCATAAGG CATTAGCACC GTTAGAACAC GCTTTTCAGT TTGGCGCGGA ATCCCCCTCG
TTTATGCAGG ATTTTGAACC GTTAAGCGGC GAAAAAGTCT CTATTGCCTC ATTGTTGCCG
GAAATACCTG GCGCGCAAAC CACGAAGTTC AATAAAGATC ATTTTGTCAA ACGCGGCGTA
ACGGAACGTT TTTGTCCGCA CTGCGCGGCG CTGGCGCTGT TCTCGTTGCA GCTTAACGCG
CCTGCGGGCG GCAAAGGCTA TCGTACCGGG CTGCGCGGCG GCGGGCCACT GACCACGCTG
GTTGAATTGC AGGAATATCA GGGCGAGCGG CAAACGCCGC TCTGGCGCAA GCTGTGGCTC
AACGTGATGC CGCAGGATAC TGCGGATCTG CCTTTACCAG ACCAGTGTGA TGCGACCGTT
TTCCCGTGGC TTGCCGCGAC GCGGACCAGC GAGCAGGCGA ATGCCGTTAC CACGCCGGAG
CAGGTCAATA AACTCCAGGC GTACTGGGGG ATGCCGCGTC GTATCCGCCT GGATTTTGCC
ACCTTACAGT CAGGTTGCTG CGATATTTGC GGCGCTGAAA GCGATGAGCT TCTTGGCTTT
ATGACCGTTA AGAACTACGG CGTTAACTAC GATGGCTGGC GTCATCCGCT GACGCCCTAT
CGCGCCCCGG TAAAAGATCA AAACGCCTTC TTTTCCGTTA AACCGCAGCC CGGCGGCCTT
ATCTGGCGCG ACTGGCTGGG ATTAAGTCAG AACAACCAGA CGGAAGCGAA TTACGAATCT
CCCGCGCAGG TAGTCAAGGT GTTTAACGCC CGCTCGCTGA CTGACGTTAA AGCGGGGATC
TGGGGCTTTG GCGCGGATTT CGACAATATG AAAATCCGCT GCTGGTATGA GCATCACTTC
CCGTTGCTGA TGACGGAAGG TCTGATCCCT GATTTACGTA AGGCCGTGCA AACTGCGGCC
CGCCTGTTGA GCCTGCTTCG CAGTGCGCTA AAAGAAGCGT GGTTCGCCAA TGCGAAGGAT
GCCCGCGGTG ATTTCAGTTT TATCGACATT GATTTCTGGA ACCTGACGCA GGGACGTTTT
CTCAACCTGA TTCACGATCT GGAAAACGGC CACAAGCCGG ACGAAAGGTT GAATAAATGG
CAAAGAGAAC TTTGGCTGTT TACCCGTCAT TACTTCGATG ATCACGTCTT TACCAACCCC
TACGAGAGCA GCGATCTGGA ACGCATCATG ACCGCGCGCA AGAAATATTT TACGACATCG
GCGGAAAAAC AAAGCGCAAA AGCCGCCAAA GCAAAGAAAC AGGAGGCTGC TGAATGA
 
Protein sequence
MDNFSLLTTP WLPVRFKDGS TGKLAPVDLA DENVVDIAAT RADLQGAAWQ FLLGLLQCSI 
APKRYKNWED IWFDGLHADV LHKALAPLEH AFQFGAESPS FMQDFEPLSG EKVSIASLLP
EIPGAQTTKF NKDHFVKRGV TERFCPHCAA LALFSLQLNA PAGGKGYRTG LRGGGPLTTL
VELQEYQGER QTPLWRKLWL NVMPQDTADL PLPDQCDATV FPWLAATRTS EQANAVTTPE
QVNKLQAYWG MPRRIRLDFA TLQSGCCDIC GAESDELLGF MTVKNYGVNY DGWRHPLTPY
RAPVKDQNAF FSVKPQPGGL IWRDWLGLSQ NNQTEANYES PAQVVKVFNA RSLTDVKAGI
WGFGADFDNM KIRCWYEHHF PLLMTEGLIP DLRKAVQTAA RLLSLLRSAL KEAWFANAKD
ARGDFSFIDI DFWNLTQGRF LNLIHDLENG HKPDERLNKW QRELWLFTRH YFDDHVFTNP
YESSDLERIM TARKKYFTTS AEKQSAKAAK AKKQEAAE