Gene Csal_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_0232 
Symbol 
ID4027315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp263339 
End bp264259 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content60% 
IMG OID637965383 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_572295 
Protein GI92112367 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTTCA TTCCATTGAA GCCCATCCCT GTGAAAGATC GCATGTCGAT GATCTTCGTC 
GGGATGGGGC AGATCGACGT GCGTGATGGT GCCTTCGTCG TCATCGATGA TGTGAATGGC
GAGCGCATGC ACATTCCCGT GGGGTCTGTC GCGTGCTTGC TGCTCGAGCC CGGTACTCGC
GTGTCGCACG CTGCTGTCAA ACTCGCGTCA GTGGTGGGTA CGCTGTTGAT TTGGGTCGGT
GATGCCGGCG TGCGTCTATA TAGCGCGGGT CAGCCTGGCG GCGCGCGTTC GGACAAGCTG
CTCTATCAAG CGCAGTTGGC ACGGGATGAA AAACTGCGGC TGAAAGTCGT GCGCAAGATG
TTCGAGCTTC GTTTCGGCGA AGAGCCGCCC TCGCGGCGTA GCGTCGATCA ATTGCGAGGT
ATGGAAGGGG CTCGGGTACG CAAGACATAC CAGCTTCTTG CCAAGCAGTA TGGCGTCAAG
TGGCACGGGC GTCGCTATGA CCCGACTCAA TGGGATGCTT CCGATGTGGC CAACCAGTGC
CTTTCGGCCG CGACCGCCTG TCTTTACGGC ATCACGGAAG CCGCCATCCT GGCAGCCGGT
TATGCGCCGG CGATTGGCTT TCTGCATACC GGCAAACCCC TGAGCTTCGT TTACGACATC
GCCGATATCG TCAAATTCGA GACGGTGGTG CCGGCTGCGT TTCGCGTAGC GGCTCGTAAT
CCGCCGATGC CGGAGCGGGA AGTACGCGTT GCGTGTCGCG ATGCTTTCAA GCAAGCCCGC
TTATTGCAGC GGTTGATCCC CATGATCGAG GACGTATTGG CCGCTGGAGA AATCGAGCCT
CCACCTCCGC CGCCGGATGC GGTCCCACCG GCGATCCCGG AGCCTGAATC CGTCGGCGAT
GCAGGGCATC GGAGTCAATG A
 
Protein sequence
MGFIPLKPIP VKDRMSMIFV GMGQIDVRDG AFVVIDDVNG ERMHIPVGSV ACLLLEPGTR 
VSHAAVKLAS VVGTLLIWVG DAGVRLYSAG QPGGARSDKL LYQAQLARDE KLRLKVVRKM
FELRFGEEPP SRRSVDQLRG MEGARVRKTY QLLAKQYGVK WHGRRYDPTQ WDASDVANQC
LSAATACLYG ITEAAILAAG YAPAIGFLHT GKPLSFVYDI ADIVKFETVV PAAFRVAARN
PPMPEREVRV ACRDAFKQAR LLQRLIPMIE DVLAAGEIEP PPPPPDAVPP AIPEPESVGD
AGHRSQ