Gene STER_0747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_0747 
Symbol 
ID4438193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp677901 
End bp680891 
Gene Length2991 bp 
Protein Length996 aa 
Translation table11 
GC content45% 
IMG OID639676443 
Producttype I restriction-modification system restriction subunit 
Protein accessionYP_820197 
Protein GI116627578 
COG category[V] Defense mechanisms 
COG ID[COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases 
TIGRFAM ID[TIGR00348] type I site-specific deoxyribonuclease, HsdR family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCAG TTACTCCAGA ACTTGAGATA GAACGTCAAT TAATCGAGCA ATTAACCTCT 
GGTGAGAGCC AGTGGACCTA TCGCCCAGAC TTAAAAAATG AAGGTCAATT ATGGGATAAC
TTTTTTGAGA AGTTAGCGCA AAATAACGTT GCCCTCCTTG CTGACCATCC CTTGACTGAG
CAGGAGAAGC ACCAAATCAA GAACCAGCTT AACTTCGTTA ACTATTATGA AGCTGCTAAG
TGGTTGGCTG GTGAAAATGG TATTGCCAAA GTCGAGGTAC AGCGTGAAGA TGCTAGCCTT
GGGACTATCC GACTTTCTAT CATCTGGCGT GACAATATCG CTGCTGGAAA ATCCAGTTAC
GAAGTGGTCA ATCAAGTTGA ACGTGACAAG GCTTATCCAC AAGATCAGGA CCGTCGTTTA
GATACGACGC TGATGATTAA CGGACTTCCT CTTATCCATA TCGAGCTCAA GAGTCCTCGT
GTCGCCTTTT TGGATGCCTT TCATCAAATC AAAAAGTATG ACCGTGAAGG AAAGTTACGT
GGCATCTATT CTGCCCTTCA GATGTTTGTT GTGACTAATA AGGTTGATAC ACGTTATATT
GCAGCTGCAC GAGAGGATAA GCTTAATAAA CAATTCTTAA CATCTTGGGT AGACAAAGAT
AATAAACCCA TGACCAGTCT CATGGACTTT GCCCACGAAG TCCTGTCTAT TCCACGTGCC
CATCAGATGG TGATGCAGTA CTCCGTTATT GATGATAGTA AAAAAGCCCT TATCCTCTTA
CGTCCTTATC AAATTCATGC CATCGAGTCT GTTCAAGATG CCTCTCGCCG TCAAGAATCT
GGCTATATCT GGCATACCAC GGGTTCTGGT AAGACCCTGA CCTCCTACAA GGTGGCTCGC
AACCTCTTGC AGATTCCAGC CATTCAAAAG ACAATTTTTG TCGTCGACCG TAGGGACCTT
GACCAGCAGA CGACCTCGTC CTTTCTCTCT TATGCGGCTA ATGATATCAT TGACATCGAC
GAAACGGACA ATACTCATGA CTTGGTCAAG CGTTTGGCAG GAAATGATAA GCGAGTGATT
GTGACGACTA TCCAGAAAAT CACAACCATG ATGCGCAAGT TTGGAGAGGG TAAATACCAA
AAGGATTCGG AAAAAATCAA AGACCTTCGT GTGGCTTTCG TTGTGGATGA ATGTCACCGT
GCGGTCACAC CTCAGACTCA AAAGGATATC AAAGGCTTTT TCCACAATTC GCTCTGGTAT
GGCTTTACTG GTACACCGAT TTTCAAGGAA AATAAACGCA AGCAATTAGG CGATTTGGCA
CAAACTACCC ACCAACAATA TGGCGAACGC CTACACGAGT ATACGGTCAA AGAAGCCATC
CATGACGGGG CAGTCCTGGG ATTCAAGGTA GACTATCGAA ATACTATTAT TTCGCCGATT
CCTGAAAAAG ATCTTCCTGA CTCTGTTTAT GAAGATAAAG AACACATGCT GGAAGTACTG
GATGCTATCC TAAACAAGAG TTATCAACAG TTGGGCTTTC CAAATGGTGT GGGCAAGACC
TACGAGGCTA TCTTAACGGT TAAGTCCATT CCACAAGCTC AGGCTTATTA CAATCTGCTC
AAGAGTATCA AGGCAGGACA GGAACGTGTC AAGGTGTCAG AGCGAGCTAA GCGCGTGCTA
CCCGACTTTC CAAAGGTGAC GGTCACCTAC TCTGTGTCGG AAAATGAAGA AGAGTCTATC
GCCTACCAAG ACCACATGAA ACAGGTCATG GACGACTACA ATCAGGAGTA TGGGACCCAT
TTTAATATGG CTGATTTGCG TGGTTTTAAC ACGGATATCA ATAATCGGTT AGCAAGAAAG
TTAGACAAAT ACATTCCTCG CAATGAGCAG TTGGATTTGG TCATCGTCGT TGACCGTCTT
TTGACCGGTT TTGATGCCCC ATGTCTATCC ACACTCTTTA TCGACCGTAA GCCCATGCGT
CCACAGGATT TGATTCAAGC TTTTAGTCGC ACTAACCGCA TCTTTGACAG TAAAAAGCGT
TATGGCCATA TCATCACCTT CCAAAGACCA GAGGCCTTTA AAGAGGCTGT GGATAATGCT
CTTAAGCTCT ACTCAAACGG TGGTGAAAAT GACGTCACTG CCCCAGATTG GGATGAAGAA
AAAGCCAACT TCATCCAAGC CTGGATAGAC TTTCAAGTAA AAGTGGAAGA TGTGGAGAAC
TATGTCATCA CGATTGAGCA GGCCACTAGC CCACAAATTC GTCGCATTGC CAAGGCTTAC
CAAGCCTTTG ACAAGTATCT GGCATCCATC CGTGTCTATA ACGAATACGA TGAAGCGGCT
ATCTATGCGG AGACAGGACT ATCAGATGAA AAGCTCGAAA CCTATCTAGG AATCTATCAA
AATATTCTAT CAGAGCTCAA AAGTCGAGCC GAAGACGATG GTGATGATAA TCTCTTTGAT
ATCCACTACG AACTGGAATC GGTCCAAGTC GACGAAATTA ACTATGCTTA CATCCTGACC
TTGATTCAAA GTATGATTCA GCAGGAGGAG GATAGTCCAC AGGCCCTCAG TGACAAGGAT
ATCGAAACGG TTGATAACTA TATCCAGAGT CTGGAACGTA CCAATCCTAA ACTAGCCGAA
ATTATCCGAA ACCTGTGGAA AGAAGTCCAA GCCAATCCTA AAGATTACCA AGGACAATCT
ATCACTGGTG TCTTGGACAC TATGATTGAA GCCATTATCG ACGACCATCT CAAACGCTTT
GCCAGAGAAT GGTATGTCGG ACTCGATGAA CTTCGTTACT ACGTCCAACA CTATCGTAAA
GGTGCCAAGA AGCAATCAGG CGAAAGCCAG CTCACCAAGA GTCAGCGCTA CAAAGACTAC
AAGGCAGAAG TAGCAGATGC CCTCAACCCA CTCAGCTACA AAAAACAAAT AAAAGAAGCC
TACACCAAAC TCATTGAAGA GGTTATTGAA CCATTGAGAG TGGGGAGGTA G
 
Protein sequence
MPAVTPELEI ERQLIEQLTS GESQWTYRPD LKNEGQLWDN FFEKLAQNNV ALLADHPLTE 
QEKHQIKNQL NFVNYYEAAK WLAGENGIAK VEVQREDASL GTIRLSIIWR DNIAAGKSSY
EVVNQVERDK AYPQDQDRRL DTTLMINGLP LIHIELKSPR VAFLDAFHQI KKYDREGKLR
GIYSALQMFV VTNKVDTRYI AAAREDKLNK QFLTSWVDKD NKPMTSLMDF AHEVLSIPRA
HQMVMQYSVI DDSKKALILL RPYQIHAIES VQDASRRQES GYIWHTTGSG KTLTSYKVAR
NLLQIPAIQK TIFVVDRRDL DQQTTSSFLS YAANDIIDID ETDNTHDLVK RLAGNDKRVI
VTTIQKITTM MRKFGEGKYQ KDSEKIKDLR VAFVVDECHR AVTPQTQKDI KGFFHNSLWY
GFTGTPIFKE NKRKQLGDLA QTTHQQYGER LHEYTVKEAI HDGAVLGFKV DYRNTIISPI
PEKDLPDSVY EDKEHMLEVL DAILNKSYQQ LGFPNGVGKT YEAILTVKSI PQAQAYYNLL
KSIKAGQERV KVSERAKRVL PDFPKVTVTY SVSENEEESI AYQDHMKQVM DDYNQEYGTH
FNMADLRGFN TDINNRLARK LDKYIPRNEQ LDLVIVVDRL LTGFDAPCLS TLFIDRKPMR
PQDLIQAFSR TNRIFDSKKR YGHIITFQRP EAFKEAVDNA LKLYSNGGEN DVTAPDWDEE
KANFIQAWID FQVKVEDVEN YVITIEQATS PQIRRIAKAY QAFDKYLASI RVYNEYDEAA
IYAETGLSDE KLETYLGIYQ NILSELKSRA EDDGDDNLFD IHYELESVQV DEINYAYILT
LIQSMIQQEE DSPQALSDKD IETVDNYIQS LERTNPKLAE IIRNLWKEVQ ANPKDYQGQS
ITGVLDTMIE AIIDDHLKRF AREWYVGLDE LRYYVQHYRK GAKKQSGESQ LTKSQRYKDY
KAEVADALNP LSYKKQIKEA YTKLIEEVIE PLRVGR