Gene SeD_A3252 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3252 
Symbolcse4 
ID6873426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3123947 
End bp3125005 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content52% 
IMG OID642786267 
Productcrispr-associated protein, Cse4 family 
Protein accessionYP_002216908 
Protein GI198244668 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.633483 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value0.732205 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAACGT TTATTCAGCT CCATTTACTG ACCGCTTACC CCGCCGCTAA CCTGAACCGT 
GATGATACCG GTGCGCCAAA AACCGTGGTG CTTGGCGGAG CAACACGTCT GCGTATCTCC
TCTCAGAGCC TGAAACGCGC CTGGCGTACA TCTGAGTTAT TTGAACAGGC ATTAGCTGGC
CATATTGGTA TTCGTACTGG TCGCATTGCT CGTGAGGCGG CGCAAATCCT CGTTGATAGC
GGCATTGACG CTAAAAAAGC GGTTGAGTAT GTGGAAAAAA TTGCCAACTG TTTTGGCAAG
GTAAAGGCGG AAAAGAAACC AAAAGATGAA CTGACGAATG CTGAAACCGA GCAACTGGTG
CATATCAGCC CAGCTGAATT TGAGGGCGTA AAAGCGCTAG CGCACCGTCT GGCGGAAGAA
AAACGCGCGC CAAAAGAGGA AGAGCTTGCA CTGCTACGTA AAGATCGCAT GGCTGTCGAT
ATTGCCATGT TTGGCCGTAT GCTGGCGAAT AAGCCCGATT TTAACGTGGA AGCTGCCTGT
CAGGTCGCCC ACGCCTTCGG CGTCAGCGAA ACGATCGTCG AAGACGATTT CTTTACTGCT
GTGGATGACC TACGCCAGGC ATCGGCAGAA GATGCAGGCG CAGGCCATCT CGGCGAAACC
GGCTTTGGCT CCGCGCTGTT TTACACCTAT ATCTGCATCG ACAAAGATCT GCTGGTGAAA
AACCTGAACG GCAATGAAGA ACTGGCAAAC AAAACGCTGC GCGCCTTTAC TGAAGCGGCG
CTGAAAGTGT CGCCGACCGG CAAACAGAAC AGCTTTGCCA GCCGTGCCTA TGCCTCGTGG
GCGCTGGCGG AAAAAGGCAC CGACCAACCA CGTTCACTGG CGGCCGCGTT TTATGAACCG
ATCAACGGTA CAGACCAATT GAACGTTGCG GTTAAGCGTA TTACCGCGCT GCGTGAAAAT
ATGAATGCGG TCTATGCACA GGAGACGGCG TTCAAAGACT TTAACGTTAT GAATCAGCAG
GGAAGCATGA AAGACGTGCT TGATTTCATC TGCGCGTAA
 
Protein sequence
MTTFIQLHLL TAYPAANLNR DDTGAPKTVV LGGATRLRIS SQSLKRAWRT SELFEQALAG 
HIGIRTGRIA REAAQILVDS GIDAKKAVEY VEKIANCFGK VKAEKKPKDE LTNAETEQLV
HISPAEFEGV KALAHRLAEE KRAPKEEELA LLRKDRMAVD IAMFGRMLAN KPDFNVEAAC
QVAHAFGVSE TIVEDDFFTA VDDLRQASAE DAGAGHLGET GFGSALFYTY ICIDKDLLVK
NLNGNEELAN KTLRAFTEAA LKVSPTGKQN SFASRAYASW ALAEKGTDQP RSLAAAFYEP
INGTDQLNVA VKRITALREN MNAVYAQETA FKDFNVMNQQ GSMKDVLDFI CA