Gene SeHA_C3132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3132 
Symbolcas1 
ID6489479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3053269 
End bp3054189 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content56% 
IMG OID642743277 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002046896 
Protein GI194448225 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTTCG TGCCGCTCAA CCCGATCCCG TTAAAAGATC GAACCTCAAT GATCTTCCTC 
CAGTACGGTC AGATTGACGT GCTGGATGGG GCATTCGTGC TGATCGATAA AACGGGAATC
CGCACGCATA TTCCCGTTGG TTCGGTCGCT TGTATCATGC TGGAACCGGG AACGCGGGTT
TCCCATGCGG CTGTGCGTCT GGCATCGACG GTGGGAACGC TGTTGGTGTG GGTGGGCGAG
GCGGGAGTGC GGGTTTATTC CTCCGGACAA CCCGGCGGCG CACGAGCCGA TAAGTTGCTT
TATCAGGCAA AGCTGGCGTT AGATGATGAC CTGCGGCTGA AGGTGGTGCG CAAAATGTAT
GAACTGCGTT TTCGCGAACC GCCGCCCGCC CGTCGTTCCG TTGAGCAACT GCGCGGTATT
GAAGGATCCC GTGTGCGGGC GACCTATGCA TTGCTGGCGA AGCAGTATGG CGTGAAGTGG
CATGGTCGTA ACTATGATCC GAAAGACTGG GAGAAGGGGG ATGTCGTCAA CCGATGTATT
AGCGCGGCGA CATCGTGCCT GTACGGGATT TCAGAAGCGG CTATCCTGGC GGCGGGATAT
GCGCCAGCTA TCGGTTTTAT CCATAGTGGT AAGCCGCTTT CTTTTGTTTA TGACATTGCC
GATATCATCA AATTTGAATC GGTGGTGCCC AAAGCATTTG AGATCGCTGC TCGTCACCCG
GCGGAACCTG ATAAAGAAGT GCGCCTGGCC TGCCGTGATA TTTTTCGCAG TTCGAAGCTG
ACCGGAAAAT TGATCCCACT GATCGAAGAG GTGCTCGCTG CCGGTGAAAT TGAACCACCT
CAGCCTGCGC CGGATATGTT GCCGCCAGCA ATACCGGAAC CTGAATCACT GGGTGATAGC
GGCCATCGGG GGCATGGTTG A
 
Protein sequence
MTFVPLNPIP LKDRTSMIFL QYGQIDVLDG AFVLIDKTGI RTHIPVGSVA CIMLEPGTRV 
SHAAVRLAST VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALDDD LRLKVVRKMY
ELRFREPPPA RRSVEQLRGI EGSRVRATYA LLAKQYGVKW HGRNYDPKDW EKGDVVNRCI
SAATSCLYGI SEAAILAAGY APAIGFIHSG KPLSFVYDIA DIIKFESVVP KAFEIAARHP
AEPDKEVRLA CRDIFRSSKL TGKLIPLIEE VLAAGEIEPP QPAPDMLPPA IPEPESLGDS
GHRGHG