Gene STER_1476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_1476 
Symbol 
ID4437922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp1379106 
End bp1379975 
Gene Length870 bp 
Protein Length289 aa 
Translation table11 
GC content32% 
IMG OID639677080 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_820831 
Protein GI116628212 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.095696 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGTT GGAGAACAGT AGTCGTAAAC ATACATTCTA AACTGTCATA TAAGAATAAT 
CACCTTATTT TTAGAAACTC ATATAAAACT GAGATGATTC ATCTTTCTGA AATTGATATA
CTTTTATTGG AAACAACTGA CATTGTATTG ACTACTATGT TAGTGAAGAG ATTAGTTGAT
GAAAATATTC TAGTTATTTT CTGTGATGAT AAACGATTGC CGACAGCTTT TCTTACACCG
TATTATGCAC GTCATGATTC TAGTCTGCAA ATAGCTAGAC AAATTGCGTG GAAGGAAAAT
GTTAAATGTG AGGTATGGAC TGCTATTATT GCGCAAAAGA TATTGAATCA ATCTTACTAT
TTGGGAGAAT GTTCCTTCTT TGAGAAATCT CAATCTATTA TGGAATTATA TCATGGATTA
GAACGGTTTG ATCCTAGTAA TCGTGAGGGT CACTCTGCCA GAATTTATTT TAATACGCTT
TTCGGTAATG ATTTTACTAG GGAAAGTGAT AATGATATCA ATGCAGCGCT GGATTATGGT
TATACCTTGT TATTGAGTAT GTTTGCACGT GAGGTTGTCG TCTGTGGTTG TATGACACAA
ATTGGTTTAA AACATGCTAA CCAGTTTAAT CAATTTAACC TTGCTAGTGA TATTATGGAG
CCATTTAGGC CAATTATAGA TAGAATTGTT TATCAAAATC GACATAATAA TTTCGTCAAA
ATCAAAAAAG AACTTTTTTC GATCTTTTCA GAAACCTATC TCTATAATGG AAAAGAGATG
TATTTATCAA ACATTGTTAG TGATTATACA AAAAAAGTTA TAAAAGCGTT AAATCAGTTA
GGTGAGGAAA TTCCTGAATT TAGAATATGA
 
Protein sequence
MAGWRTVVVN IHSKLSYKNN HLIFRNSYKT EMIHLSEIDI LLLETTDIVL TTMLVKRLVD 
ENILVIFCDD KRLPTAFLTP YYARHDSSLQ IARQIAWKEN VKCEVWTAII AQKILNQSYY
LGECSFFEKS QSIMELYHGL ERFDPSNREG HSARIYFNTL FGNDFTRESD NDINAALDYG
YTLLLSMFAR EVVVCGCMTQ IGLKHANQFN QFNLASDIME PFRPIIDRIV YQNRHNNFVK
IKKELFSIFS ETYLYNGKEM YLSNIVSDYT KKVIKALNQL GEEIPEFRI