Gene SeHA_C3135 details

Gene Information

Locus tag: SeHA_C3135
Symbol: cse4
ID: 6491245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3055575
End bp: 3056633
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 53%
IMG OID: 642743280
Product: Cse4 family CRISPR-associated protein
Protein accession: YP_002046899
Protein GI: 194451697
COG category:
COG ID:
TIGRFAM ID: [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4
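
The coordinate and length fields in this record agree with one another, and the arithmetic can be checked with a minimal Python sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(3055575, 3056633)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1059 352"
```

Both results match the Gene Length (1059 bp) and Protein Length (352 aa) fields listed in the record.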


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAACGT TTATTCAGCT CCATTTACTG ACCGCTTACC CCGCCGCTAA CCTGAACCGT 
GATGATACCG GTGCGCCAAA AACCGTGGTG CTTGGCGGAG CAACACGTCT GCGTATCTCC
TCTCAGAGCC TGAAACGCGC CTGGCGTACA TCTGAGTTAT TTGAACAGGC ATTAGCTGGC
CATATTGGTA TTCGTACTGG TCGCATTGCT CGTGAGGCGG CGCAAATCCT CGTTGATAGC
GGCATTGACG CCAAAAAAGC GGTTGAGTAC GTCAAGAACA TCGCCAACTG CTTTGGCAAG
GTAAAAGAGG ATAAGAAACC CAAAGATGAG TTGACGAATG CTGAAACCGA GCAACTGGTG
CATATCAGCC CTGCTGAGTT TGAGGCCGTG AAAGCGCTGG CGCGCCGTCT GGCAGAAGAA
AAACGTCCGG CAACAGAAGA GGAAGCAGAA CTGTTACGTC ACGATCGCAT GGCCGTCGAT
ATTGCCATGT TTGGCCGGAT GTTAGCGAAG AAAACTGATT TTAACGTGGA AGCCGCCTGC
CAGGTCGCCC ACGCCTTCGG CGTCAGCGAA ACGATCATCG AAGACGATTT CTTTACCGCT
GTGGATGACC TACGCCAGGC ATCGGCAGAA GATGCAGGCG CAGGCCATCT CGGCGAAACC
GGCTTTGGCT CCGCGCTGTT TTACACCTAT ATCTGCATCG ACAAAGATCT GCTGGTGAAA
AACCTGAACG ACAATGAAGA ACTGGCAAAC AAAACGCTGC GCGCCTTTAC TGAAGCGGCG
CTGAAAGTGT CGCCGACCGG CAAACAGAAC AGCTTTGCCA GCCGTGCCTA TGCCTCGTGG
GCGCTGGCCG AAAAAGGCAC CGACCAACCA CGTTCACTGG CGGCCGCGTT TTATGAACCG
ATCAACGGTA CAGACCAATT GAACGTTGCG GTTAAGCGTA TTACATCGCT GCATAAGAAT
ATGAATAAGG TTTATGGCCA GCGGACTGAT ACCGCCAGTT TCGACGTGAT GAATCAGCAG
GGAAGCATGA AAGACGTGCT TGATTTCATC TGCGCGTAA
 
Protein sequence
MTTFIQLHLL TAYPAANLNR DDTGAPKTVV LGGATRLRIS SQSLKRAWRT SELFEQALAG 
HIGIRTGRIA REAAQILVDS GIDAKKAVEY VKNIANCFGK VKEDKKPKDE LTNAETEQLV
HISPAEFEAV KALARRLAEE KRPATEEEAE LLRHDRMAVD IAMFGRMLAK KTDFNVEAAC
QVAHAFGVSE TIIEDDFFTA VDDLRQASAE DAGAGHLGET GFGSALFYTY ICIDKDLLVK
NLNDNEELAN KTLRAFTEAA LKVSPTGKQN SFASRAYASW ALAEKGTDQP RSLAAAFYEP
INGTDQLNVA VKRITSLHKN MNKVYGQRTD TASFDVMNQQ GSMKDVLDFI CA
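
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It builds a codon table in plain Python rather than using a bioinformatics library, and the helper names `translate` and `gc_fraction` are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; because this gene starts with ATG, the sketch does not handle alternative start codons.

```python
from itertools import product

# Codon -> amino acid mapping in the conventional TCAG ordering.
# The amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGACAACGT TTATTCAGCT CCATTTACTG"))  # prints "MTTFIQLHLL"
print(translate("TGCGCGTAA"))                         # prints "CA"
```

The two outputs correspond to the first ten and last two residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports as a rounded percentage.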