Gene STER_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_0042 
Symbol 
ID4436839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp27036 
End bp28403 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content42% 
IMG OID639675808 
Productglucan binding protein 
Protein accessionYP_819611 
Protein GI116626992 
COG category[S] Function unknown 
COG ID[COG3883] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.634979 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC GCATTCTTTC AGCAGTGTTG GTAAGTGGTG TGACTTTATC AGCAGCTGCA 
TCAGTACATG CAGAGGATTA TGATTCGCAA ATCGCTGCTA CAAATAACGC AATTAGTAAC
CTTGCTTCTC AACAAGAGGC TGCGCAAGCA CAAGTTGCTA CAATCCAATC TCAAGTATCA
ACTCTCAGAA CACAAAAAAC TGAATTGGAA GCTAAAAATG CCGAACTTGA AAAAGTATCT
GCAGATTTGG AATCTGAAAT TCAAGAATTA TCAAGTAAGA TTGTAGCTCG TCAAGATTCT
TTGGCGAAAC AAGCTCGTAG TGCTCAACAA AATAACACTG CTACTAGCTA CATTAACTCT
ATCTTGAACT CTAAGTCAAT CTCTGAAGCT ATTACTCGTA TCACAGCTAT TTCAAAAGTT
GTTACAGCTA ATAATGATTT ATTGACTAAA CAAGAGTCAG ATCAAAAAGA GCTTGCTGCT
AAACAAGTAG AAAATCAAGC TGCAATTAAT ACTATTGCAG CTAATAAGTC AGAACTTGAA
ACAACAGAAG CTGGTTTGAC AACTCAACAA GCAGAACTTG AAGCAGCACA AGTTACTTTG
GCTGCTGAAT TGGCAACAGC TCAAAATGAG AAGACATCAC TTGTCTCTGC TAAATCAACT
GCTGAATCTG TAGCTGCCTC GACGGCTGCA TCAGTGGCAC AGTCTCAAGC AATTGCTGAA
TCAGAGGCAA CTGCACAAGT TGTAGATTCT TCAGAAGCAG CAACATCAGT AGCTTCTTCA
GAAGTAGCTG CTACTTCTGA AGCTGTAGCT CAGCCTTCAG AGGCACCAGT TTCTGAAACA
TCGACAGCTT CTGAGGCTGC ACAGGAGCCG GCTTCATCAG AAACATCAGA AGTACAACCA
GAATCAGCTG CACCAGCTGT TTCAGAAGCT CCTGTTAGCG TAGCACCTGT CGTAACATCA
GAAGCTGCAC CAGCTGCTTC AGAAGCTCCT GCACCAGCTG CTGAAACACA TAAAGTGTCA
GCAGCATCAA CTCCTAATAC ATATCCAGTT GGACAATGTA CTTGGGGTGT AAAATCATTG
GCTCCATGGG CTGGTAATAA TTGGGGGAAT GCTAAAAACT GGATTGCTAG TGCGCGAGCA
GCTGGTCATT CAGTAGGTAC AACTCCAGTA GCCGGTGCGA TTGCGGTATG GCCAAATGAT
GGTGGTGGTT ATGGTCACGT AGCTTATGTT ACATCAGCAT CAGGTGCAAA TTCAATTCAA
GTTATGGAAT CGAACTACGC TGGTAACATG TCAATCAGTA ACTACCGTGG TACATTTGAT
CCAACATCAT CAGCGCATGG TGGTTCTGTA TTTTATATTT ACCCATAA
 
Protein sequence
MKKRILSAVL VSGVTLSAAA SVHAEDYDSQ IAATNNAISN LASQQEAAQA QVATIQSQVS 
TLRTQKTELE AKNAELEKVS ADLESEIQEL SSKIVARQDS LAKQARSAQQ NNTATSYINS
ILNSKSISEA ITRITAISKV VTANNDLLTK QESDQKELAA KQVENQAAIN TIAANKSELE
TTEAGLTTQQ AELEAAQVTL AAELATAQNE KTSLVSAKST AESVAASTAA SVAQSQAIAE
SEATAQVVDS SEAATSVASS EVAATSEAVA QPSEAPVSET STASEAAQEP ASSETSEVQP
ESAAPAVSEA PVSVAPVVTS EAAPAASEAP APAAETHKVS AASTPNTYPV GQCTWGVKSL
APWAGNNWGN AKNWIASARA AGHSVGTTPV AGAIAVWPND GGGYGHVAYV TSASGANSIQ
VMESNYAGNM SISNYRGTFD PTSSAHGGSV FYIYP