Gene Information |
Locus tag | STER_1071 |
Symbol | |
ID | 4437285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | - |
Start bp | 993554 |
End bp | 995014 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 639676712 |
Product | transcriptional activator-exopolysaccharide biosynthesis |
Protein accession | YP_820466 |
Protein GI | 116627847 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
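The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 993554
END_BP = 995014


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1461
print(protein_length(length))  # 486
```

Both results match the reported Gene Length (1461 bp) and Protein Length (486 aa). The strand does not affect either number, since it only determines which strand of the replicon is read.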
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.903122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCGC GTACGAATCG TAAGCAAAAA CGTACGGGTA ATAGATCATG GGGGATGGTC AACGTTGGAT TGACCATTCT GTATGCTATT TTAGCATTGG TCTTATTATT CACCATGTTC AATTATAATT TCCTATCCTT TAGGTTTTTG AACATCATTA TCACCATTGG TTTGTTGGTA GTTCTTGCTA TTAGCATCTT CCTTCAGAAG ACTAAGAAAT CACCACTAGT GACAACAGTT GTACTGATTA TCTTCTCGCT AGTTTCTCTG GTTGGTATTT TTGGTTTTAA ACAAATGATT GACATCACTA ACCGTATGAA TCAGACGGCA GCATTTTCTG AAGTAGAAAT GAGCATCGTG GTTCCTAAGG AAAGTGACAT CAAAGATGTG AGCCAGCTTA CTAGCGTACA GGCACCTACT AAGGTTGATA AGAACAATAT CGAGATCTTG ATGTCAGCTC TCAAAAAAGA TAAAAAAGTT GATGTTAAAG TTGATGATGT TGCCTCATAT CAAGAAGCTT ATGATAATCT TAAGTCTGGC AAATCTAAAG CTATGGTCTT GAGTGGGTCT TATGCTAGCC TACTAGAGTC TGTCGATAGT AATTATGCTT CAAATCTAAA AACAATTTAT ACTTATAAAA TTAAAAAGAA GAATAGCAAC TCTGCAAACC AAGTAGATTC AAAAGTCTTC AATATTTATA TTAGTGGTAT TGATACCTAC GGTTCGATTT CAACAGTGTC ACGTTCAGAT GTCAATATCA TTATGACAGT AAACATGAAT ACACATAAGA TTCTCTTGAC GACTACTCCA CGTGATGCAT ACGTTAAGAT TCCTGGTGGT GGAGCAGACC AGTATGATAA ATTAACCCAC GCAGGTATTT ATGGCGTTGA AACATCTGAA CAAACTCTGG AAGATCTATA TGGTACTAAG ATTGATTACT ATGCCCGAAT TAACTTCACA TCTTTCCTTA AGTTGATTGA CCAACTTGGT GGTGTGACAG TCCATAATGA TCAAGCTTTC ACAAGTCTTC ATGGGAAGTT TGATTTCCCA GTTGGAGATA TCCAAATGAA TTCAGAGCAA GCACTTGGAT TTGTTCGTGA ACGCTATAGT TTAGATGGCG GAGATAATGA TCGTGGTAAA AACCAGGAAA AAGTCATTTC TGCGATTGTA AACAAGTTGG CTTCTCTAAA GTCTGTATCA AACTTTACTT CAATCGTTAA TAATCTCCAA GACTCTGTTC AGACAAATAT TTCTTTGGAT ACCATTAATG CTTTGGCTAA TACACAACTT GATTCAGGTT CTAAATTTAC GGTGACTTCT CAAGCAGTAA CAGGTACAGG TTCAACCGGA CAATTGATCT CTTATGCGAT GCCAAATTCT AGTCTTTACA TGATGAAACT AGATAATTCG AGTGTGGAAA GTGCCTCTCA AGCTATCAAA AAATTGATGG AGGAAAAATA A
|
Protein sequence | MSSRTNRKQK RTGNRSWGMV NVGLTILYAI LALVLLFTMF NYNFLSFRFL NIIITIGLLV VLAISIFLQK TKKSPLVTTV VLIIFSLVSL VGIFGFKQMI DITNRMNQTA AFSEVEMSIV VPKESDIKDV SQLTSVQAPT KVDKNNIEIL MSALKKDKKV DVKVDDVASY QEAYDNLKSG KSKAMVLSGS YASLLESVDS NYASNLKTIY TYKIKKKNSN SANQVDSKVF NIYISGIDTY GSISTVSRSD VNIIMTVNMN THKILLTTTP RDAYVKIPGG GADQYDKLTH AGIYGVETSE QTLEDLYGTK IDYYARINFT SFLKLIDQLG GVTVHNDQAF TSLHGKFDFP VGDIQMNSEQ ALGFVRERYS LDGGDNDRGK NQEKVISAIV NKLASLKSVS NFTSIVNNLQ DSVQTNISLD TINALANTQL DSGSKFTVTS QAVTGTGSTG QLISYAMPNS SLYMMKLDNS SVESASQAIK KLMEEK
|
| |
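The relationship between the gene sequence and the protein sequence can be sketched as follows. The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in coding orientation already and no reverse complement is applied here. Translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), so the standard assignments are used. The helper names `clean`, `translate`, and `gc_percent` are illustrative, and only the first and last codons of the record are included rather than the full sequence.

```python
# Codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate the 10-letter groups and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 21 bases of the gene sequence in the record above.
first_codons = "ATGAGTTCGC GTACGAATCG TAAGCAAAAA"
last_codons = "AAATTGATGG AGGAAAAATA A"
print(translate(first_codons))  # MSSRTNRKQK
print(translate(last_codons))   # KLMEEK
```

The two outputs match the first ten and last six residues of the listed protein sequence. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.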