Gene SeD_A3254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3254 
Symbolcse1 
ID6874405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3125576 
End bp3127132 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content54% 
IMG OID642786269 
Productcrispr-associated protein, Cse1 family 
Protein accessionYP_002216910 
Protein GI198243398 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value0.552745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAATT TTTCACTTTT AACAACGCCC TGGCTCCCCG TCCGTTTCAA AGACGGTTCC 
ACGGGCAAGC TGGCCCCCGT CGATCTGGCG GATGAAAACG TGGTGGACAT CGCCGCAACG
CGAGCAGATT TACAGGGAGC GGCCTGGCAG TTTCTGTTGG GATTGCTGCA ATGCAGTATC
GCGCCGAAAA GATACAAAAA TTGGGAGGAT ATCTGGTTTG ATGGATTGCA TGCCGATGTG
CTCCATAAGG CATTAGCACC GTTAGAACAC GCTTTTCAGT TTGGCGCGGA ATCCCCCTCG
TTTATGCAGG ATTTTGAACC GTTAAGCGGC GAAAAAGTCT CTATTGCCTC ATTGTTGCCG
GAAATACCTG GCGCGCAAAC CACGAAGTTC AATAAAGATC ATTTTGTCAA ACGCGGCGTA
ACGGAACGTT TTTGTCCGCA CTGCGCGGCG CTGGCGCTGT TCTCGTTGCA GCTTAACGCG
CCTGCGGGCG GCAAAGGCTA TCGTACCGGG CTGCGCGGCG GCGGGCCACT GACCACGCTG
GTTGAATTGC AGGAATATCA GGGCGAGCGG CAAACGCCGA TCTGGCGCAA GCTGTGGCTC
AACGTGATGC CGCAGGATAC TGCGGATCTG CCTTTACCAG ACCAGTGTGA TGCGACCGTT
TTCCCGTGGC TTGCCGCGAC GCGGACCAGC GAGCAGGCGA ATGCCGTTAC CACGCCGGAG
CAGGTCAATA AACTCCAGGC GTACTGGGGG ATGCCGCGTC GTATCCGCCT GGATTTTGCC
ACCTTACAGT CAGGTTGCTG CGATATTTGC GGCGCTGAAA GCGATGAGCT TCTTGGCTTT
ATGACCGTCA AGAACTACGG CGTTAACTAC GATGGCTGGC GGCACCCGCT GACGCCTTAT
CGCGCCCCGG TAAAAGATCA AAACGCCTTC TTTTCCGTTA AACCGCAGCC CGGCGGCCTT
ATCTGGCGCG ACTGGCTGGG GTTAAGTCAG AACAACCAGA CGGAAGCGAA TTACGAATCT
CCCGCGCAGG TAGTCAAGGT GTTTAACGCC CGCTCGCTGA CTGACGTTAA AGCGGGGATC
CGGGGCTTTG GCGCGGATTT CGACAATATG AAAATCCGCT GCTGGTATGA GCATCACTTC
CCGTTGCTGA TGACGGAAGG TCTGATCCCT GATTTACGTA AGGCCGTGCA AACTGCGGCC
CGCCTGTTGA GCCTGCTTCG CAGTGCGCTA AAAGAAGCGT GGTTCACCAA TGCGAAGGAT
GCGCGGGGTG ATTTCAGTTT TATCGACATT GATTTCTGGA ACCTGACGCA GGGGCGCTTT
CTCAATCTGA TCCACGATCT GGAAAACGGA CACAAGCCGG ACGAAAGGCT GAATAAATGG
CAAAGAGAAC TTTGGCTGTT TACCCGTTGT TACTTCGATG ATCACGTCTT TACCAACCCC
TACGAGAGCA GCGATCTGGA GCGCATCATG AAGGCGCGCA AAAAATATTT TACTTCATCG
GCGGAAAAGC AAAGCGCAAA AGCCGCCAAA GCAAAGAAAC AGGAGGCTGC TGAATGA
 
Protein sequence
MDNFSLLTTP WLPVRFKDGS TGKLAPVDLA DENVVDIAAT RADLQGAAWQ FLLGLLQCSI 
APKRYKNWED IWFDGLHADV LHKALAPLEH AFQFGAESPS FMQDFEPLSG EKVSIASLLP
EIPGAQTTKF NKDHFVKRGV TERFCPHCAA LALFSLQLNA PAGGKGYRTG LRGGGPLTTL
VELQEYQGER QTPIWRKLWL NVMPQDTADL PLPDQCDATV FPWLAATRTS EQANAVTTPE
QVNKLQAYWG MPRRIRLDFA TLQSGCCDIC GAESDELLGF MTVKNYGVNY DGWRHPLTPY
RAPVKDQNAF FSVKPQPGGL IWRDWLGLSQ NNQTEANYES PAQVVKVFNA RSLTDVKAGI
RGFGADFDNM KIRCWYEHHF PLLMTEGLIP DLRKAVQTAA RLLSLLRSAL KEAWFTNAKD
ARGDFSFIDI DFWNLTQGRF LNLIHDLENG HKPDERLNKW QRELWLFTRC YFDDHVFTNP
YESSDLERIM KARKKYFTSS AEKQSAKAAK AKKQEAAE