Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0042 |
Symbol | |
ID | 4436839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 27036 |
End bp | 28403 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 639675808 |
Product | glucan binding protein |
Protein accession | YP_819611 |
Protein GI | 116626992 |
COG category | [S] Function unknown |
COG ID | [COG3883] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.634979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAC GCATTCTTTC AGCAGTGTTG GTAAGTGGTG TGACTTTATC AGCAGCTGCA TCAGTACATG CAGAGGATTA TGATTCGCAA ATCGCTGCTA CAAATAACGC AATTAGTAAC CTTGCTTCTC AACAAGAGGC TGCGCAAGCA CAAGTTGCTA CAATCCAATC TCAAGTATCA ACTCTCAGAA CACAAAAAAC TGAATTGGAA GCTAAAAATG CCGAACTTGA AAAAGTATCT GCAGATTTGG AATCTGAAAT TCAAGAATTA TCAAGTAAGA TTGTAGCTCG TCAAGATTCT TTGGCGAAAC AAGCTCGTAG TGCTCAACAA AATAACACTG CTACTAGCTA CATTAACTCT ATCTTGAACT CTAAGTCAAT CTCTGAAGCT ATTACTCGTA TCACAGCTAT TTCAAAAGTT GTTACAGCTA ATAATGATTT ATTGACTAAA CAAGAGTCAG ATCAAAAAGA GCTTGCTGCT AAACAAGTAG AAAATCAAGC TGCAATTAAT ACTATTGCAG CTAATAAGTC AGAACTTGAA ACAACAGAAG CTGGTTTGAC AACTCAACAA GCAGAACTTG AAGCAGCACA AGTTACTTTG GCTGCTGAAT TGGCAACAGC TCAAAATGAG AAGACATCAC TTGTCTCTGC TAAATCAACT GCTGAATCTG TAGCTGCCTC GACGGCTGCA TCAGTGGCAC AGTCTCAAGC AATTGCTGAA TCAGAGGCAA CTGCACAAGT TGTAGATTCT TCAGAAGCAG CAACATCAGT AGCTTCTTCA GAAGTAGCTG CTACTTCTGA AGCTGTAGCT CAGCCTTCAG AGGCACCAGT TTCTGAAACA TCGACAGCTT CTGAGGCTGC ACAGGAGCCG GCTTCATCAG AAACATCAGA AGTACAACCA GAATCAGCTG CACCAGCTGT TTCAGAAGCT CCTGTTAGCG TAGCACCTGT CGTAACATCA GAAGCTGCAC CAGCTGCTTC AGAAGCTCCT GCACCAGCTG CTGAAACACA TAAAGTGTCA GCAGCATCAA CTCCTAATAC ATATCCAGTT GGACAATGTA CTTGGGGTGT AAAATCATTG GCTCCATGGG CTGGTAATAA TTGGGGGAAT GCTAAAAACT GGATTGCTAG TGCGCGAGCA GCTGGTCATT CAGTAGGTAC AACTCCAGTA GCCGGTGCGA TTGCGGTATG GCCAAATGAT GGTGGTGGTT ATGGTCACGT AGCTTATGTT ACATCAGCAT CAGGTGCAAA TTCAATTCAA GTTATGGAAT CGAACTACGC TGGTAACATG TCAATCAGTA ACTACCGTGG TACATTTGAT CCAACATCAT CAGCGCATGG TGGTTCTGTA TTTTATATTT ACCCATAA
|
Protein sequence | MKKRILSAVL VSGVTLSAAA SVHAEDYDSQ IAATNNAISN LASQQEAAQA QVATIQSQVS TLRTQKTELE AKNAELEKVS ADLESEIQEL SSKIVARQDS LAKQARSAQQ NNTATSYINS ILNSKSISEA ITRITAISKV VTANNDLLTK QESDQKELAA KQVENQAAIN TIAANKSELE TTEAGLTTQQ AELEAAQVTL AAELATAQNE KTSLVSAKST AESVAASTAA SVAQSQAIAE SEATAQVVDS SEAATSVASS EVAATSEAVA QPSEAPVSET STASEAAQEP ASSETSEVQP ESAAPAVSEA PVSVAPVVTS EAAPAASEAP APAAETHKVS AASTPNTYPV GQCTWGVKSL APWAGNNWGN AKNWIASARA AGHSVGTTPV AGAIAVWPND GGGYGHVAYV TSASGANSIQ VMESNYAGNM SISNYRGTFD PTSSAHGGSV FYIYP
|
| |