Gene SeAg_B3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeAg_B3066 
Symbolcas1 
ID6795440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Agona str. SL483 
KingdomBacteria 
Replicon accessionNC_011149 
Strand
Start bp2989621 
End bp2990541 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content55% 
IMG OID642777227 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_002147836 
Protein GI197248534 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03638] CRISPR-associated endonuclease Cas1, ECOLI subtype 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGTTCG TACCGCTCAA CCCGATCCCG TTAAAAGATC GAACCTCGAT GATCTTCCTC 
CAGTACGGTC AAATTGACGT GCTGGATGGG GCATTCGTGC TGATCGATAA AACGGGAGTC
CGCACGCACA TTCCCGTCGG TTCGGTCGCT TGTATCATGC TGGAACCGGG AACGCGGGTT
TCCCATGCGG CAGTGCATTT AGCATCAACG GTCGGCACCC TGTTGGTATG GGTGGGCGAG
GCGGGAGTGC GGGTCTATTC CTCCGGACAA CCCGGTGGCG CACGAGCCGA TAAGTTGCTT
TATCAGGCAA AGCTGGCGTT AGATGATGAC CTGCGGCTTA AAGTAGTCCG CAAAATGTAT
GAACTGCGTT TTCGTGAGCC GCCTCCCGCC CGTCGTTCCG TTGAGCAACT GCGCGGTATT
GAAGGATCCC GTGTGCGGGC GACCTATGCA TTACTGGCGA AGCAGTATGG CGTGAAATGG
CATGGTCGTA ACTACGATCC GAAAGACTGG GAGAAGGGGG ATGTCGTCAA CCGATGTATT
AGCGCGGCGA CATCGTGCCT GTACGGGATT TCAGAAGCGG CTATCCTGGC GGCGGGCTAT
GCGCCAGCTA TCGGTTTTAT CCATAGCGGT AAGCCGCTTT CTTTTGTTTA TGACATTGCC
GATATCATCA AATTTGAATC GGTGGTGCCC AAAGCATTTG AGATCGCCGC TCGTCACCCG
GCGGAACCTG ATAAAGAAGT GCGCCTGGCC TGCCGGGATA TTTTTCGCAG TTCGAAGCTG
ACCGGAAAAT TGATCCCACT GATCGAAGAG GTACTCGCTG CCGGTGAAAT TGAACCCCCT
CAGCCTGCGT CGGATATGCT GCCGCCAGCA ATACCGGAAC CTGAATCACT GGGTGATAGC
GGCCATCGGG GGCATGGTTG A
 
Protein sequence
MTFVPLNPIP LKDRTSMIFL QYGQIDVLDG AFVLIDKTGV RTHIPVGSVA CIMLEPGTRV 
SHAAVHLAST VGTLLVWVGE AGVRVYSSGQ PGGARADKLL YQAKLALDDD LRLKVVRKMY
ELRFREPPPA RRSVEQLRGI EGSRVRATYA LLAKQYGVKW HGRNYDPKDW EKGDVVNRCI
SAATSCLYGI SEAAILAAGY APAIGFIHSG KPLSFVYDIA DIIKFESVVP KAFEIAARHP
AEPDKEVRLA CRDIFRSSKL TGKLIPLIEE VLAAGEIEPP QPASDMLPPA IPEPESLGDS
GHRGHG