Gene Csal_0229 details


Gene Information

Locus tag: Csal_0229
Symbol:
ID: 4027312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 260737
End bp: 261843
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 57%
IMG OID: 637965380
Product: CRISPR-associated Cse4 family protein
Protein accession: YP_572292
Protein GI: 92112364
COG category:
COG ID:
TIGRFAM ID: [TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4
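
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above; function names are hypothetical.
START_BP = 260737
END_BP = 261843


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1107 368
```

Both results agree with the Gene Length and Protein Length fields of the record.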


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCACT TCATTCAGTT GCATCTACTA ACTTCTTATC CGCCCTCCAA CCTCAACCGT 
GACGATCTGG GCCGTCCCAA GACGGCGTTC ATGGGTGGCG CGCGCCGTTT GCGCGTCTCC
TCCCAGAGTC TCAAGCGGAA TTGGCGCACC TCGACGCTCT TCGAGGAAGC GGTCGATGGT
CACAGGGGAA CGCGCACCAA GCGATTGGGT AAGCAGGTAT TCGATCGCAT GAAAGATGCT
GGTGCTGACG ACAAAATCGC CGGTAGTAGT GCTGAAAAGA TCGCGAGTAT CTTCGGTACG
CTGCGCAAGG TTGGAAAGGG TGAGAAGCAC GAATTCGAGA TCGAGCAGCT CGTTCATGTC
GGGCCGGAAG AGATTGCGGC GCTGGAAACA CTGGCCAATA CATTGGGGGC AGAAAAGCGC
GAGCCGACTG ACGAAGAGAT CACGAATGTT CTCCTGCGAC GCCCGACAGC CGTCGATATT
GCGCTCTTTG GACGTATGCT CACCGGGGAA AAAAATAAGC TCGTCAAATA CAGTGTAGAG
GCTGCCTGCC AGGTCGCGCA CGCTATTACC GTCCATGCCG CCGAGGTCGA GGACGACTAC
TTCACCGCAG TCGACGATCT CAATACCGGC GATGAGGATC GTGGCGCAGC GCATATCGGT
GAGGCCGGTT TTGCCGCCGG GCTGTTTTAT CTCTACCTCT GCATTGATCG CGATCTACTG
GTCGAGAATC TGCAAGGCAA TGCGGACTTG GCCGATCGCG CCATTGCTGC GCTGGTTGAA
TCCGCCGTCA AGGTTTCGCC CAAGGGCAAG CAGAACAGTT TCGGCTCTCG CGCCCACGCC
AGTTATCTGC TGGCAGAAAA AGGCGATCAG CAGCCGCGCT CGTTGTCGGC CTCCTTCCTG
CAGCCGGTCA ATGGTGAAGG ACAAGCGATC AAGGCCATCA GCAAACTGGA ACGCCAGGCT
CAGGCTTTCG ATGATGCTTA TGGTGCCGGC GCCGACAGTC GCTTTGTTCT CTCTGCCGAG
CCGGATTATG AAAAGCCTCC GCTGAAGGGT GACGTCCAAA CGGGAAATCT GCAGGACCTG
CTGACTTTCC TCAAGGGCGA TGACTAA
 
Protein sequence
MSHFIQLHLL TSYPPSNLNR DDLGRPKTAF MGGARRLRVS SQSLKRNWRT STLFEEAVDG 
HRGTRTKRLG KQVFDRMKDA GADDKIAGSS AEKIASIFGT LRKVGKGEKH EFEIEQLVHV
GPEEIAALET LANTLGAEKR EPTDEEITNV LLRRPTAVDI ALFGRMLTGE KNKLVKYSVE
AACQVAHAIT VHAAEVEDDY FTAVDDLNTG DEDRGAAHIG EAGFAAGLFY LYLCIDRDLL
VENLQGNADL ADRAIAALVE SAVKVSPKGK QNSFGSRAHA SYLLAEKGDQ QPRSLSASFL
QPVNGEGQAI KAISKLERQA QAFDDAYGAG ADSRFVLSAE PDYEKPPLKG DVQTGNLQDL
LTFLKGDD
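
The sequences above are printed in blocks of ten characters, so they need to be joined before any processing. The sketch below shows one way to do that, compute a GC percentage, and translate a coding sequence. It is an illustration under stated assumptions, not the method used to build this record: the helper names are hypothetical, the codon table holds the standard amino-acid assignments (which translation table 11 shares with the standard code), and alternative start codons are not handled because this gene begins with ATG.

```python
# Codon table in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 shares. All helper names are hypothetical.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises ValueError on any base other than A, C, G or T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        first, second, third = (BASES.index(base) for base in s[i:i + 3])
        residue = AMINO[16 * first + 4 * second + third]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last blocks of the gene sequence shown above.
print(translate("ATGAGCCACT TCATTCAGTT GCATCTACTA"))  # MSHFIQLHLL
print(translate("CTGACTTTCC TCAAGGGCGA TGACTAA"))     # LTFLKGDD
```

The two translated fragments match the first and last blocks of the protein sequence in the record. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field.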