Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_0576 |
Symbol | |
ID | 4438437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | + |
Start bp | 519935 |
End bp | 522925 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 639676285 |
Product | hypothetical protein |
Protein accession | YP_820042 |
Protein GI | 116627423 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain [TIGR01168] Gram-positive signal peptide, YSIRK family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGGGA AACAACAAGA TTTTCGAACA GAGAAGTACA TTCGTTACGG TATTCGTAAG TTCAGCTTTG GAGCAGCATC AGTAGCAATT GCTGTTGGTT TAATGTTCTT AGGTAATGGC GTAGTATCAG CGACGGAAGT ACAATCAGCT GAAACAGCTA TTACAACATC TGCTTCTAGT CAAGGTGACA AGGAGACGGA AAAGGCTGAG CCTAAAGCAG AAACAGTAGA AACTGTGAAG CCAGAAACTG TGAGTCCAGA GACTATAAAG CCAGAAACTG TGAATCCAGA GACTGTGAAG CCAGAAACTG TGAATCCAGA GACTGTGAAT TCAGAGACTA TGAAGCCAGA AACTGTGAAT CCAGAAACTG AAAAAGCTAC AGAAGTGAAA TCAGAAGTTG CAGCTACAAA GGCAGCAACT AAAGCAACTC TAGCGAATAC AGATGCTAGT CAAGCCGATG TAGATGCACA AGCAGAAACT GTATCAGCTT TGAGCACAGC TGTAACTGAA TCAAATACTG CTGAAAAGAA ATTAAGACCA ACTGAAATTT ATACAACTAA TGGTACTCCA GGTACCGCCG GCACTGCAGT TAAGACAAAT GCAAAGTATG CAAATTCTGT GGAAACAGTG GATAATAACA AGCGTGGTAA TAGAGAGAAT ACAGATTCTA TTAACCAAAG ACGAAGAGTC AAAAGAAGTG CAATAGATAC AACTAATTTT GCTCAAAACT ACGATAGAAA GGTATTGTCA GGTTCAGTTA CCTATCCGGG TTTCTCAGTT AGTGATCCAG ATTATCCATC TGGAATGTGG ATTGATCCAG ATAAATCTCA TTACAGTTAT GAATGGTTGC AAGCAAAAAT AAAGGGAAAA AATCATGGAA ACCAGATTGT TTTTTCAACT AATCGAACGG GCGATGGAAT TGTTTATGTA ACGGAATTAT CGGAATCAAA TAAGGTTTTA AAGCAATACA CTTTAAAGCA AAATACTAAG GTAAAATCTG CTGTATTTGA TAATACCACC TATTATAATG ATGTTTATAT TGCAATGGTC CACTCTGATG GCTTTGATAA ATTTGGCAAC AGTTTTGTTG CTAACAGTTT TGTTACTAAA TATAGAGCTA GTACATTTGA TAATTACTAC TACAGTACAC TTGCTTATTT TGTACCAAAA TTGATCACAC AAACAACCTA TTTTGTTGAT AAAGATGGCA AGCAAATCAC CGACAGTAAT GGTAATCCGG TTGTTCCTTA TACACAAAAA GGGTTAGTTG GCCAAAAATA TACGACGAGT CCCATACTTA TAAATGGTTA CTATGCTATT GCACCTGCTA ATTCAAATGG CGTTATGTCA CCATACGGAG AGATTGGTGC TAGTTATGTT AAAGACTTTC ATGATGGAGT AGTTATTACT TACACTCAGA CTGGTTCAGA TGGAACAATG TCTGCATCAA TTGCTCGATA TGGAATAGTT CTCAAAAAAT GGACAAATTT AAAGCCTTCA GATCCTACGG AAACGTATAG TGATGATCGA TTTGTTGTTG GGTCATACGG AATTAAAAAT CCATATATCC CACAAACCTC TGATATTAAG TATGTTTACA ACAAGCTCGG CAACTGGATT GTCTCTAATC CTGATGGCTC GACAAAATCA ATTATTTATC CCAATGATCC TAGTGACCCT ACTAAGATTG CTGATTCATC TGTGCCTGGT TACCCGGTTA TTGATTTTCT TCCTGGCTAT ACACCTAAAG ACCATATGGG GACTCCTTTG GTGCCAGTTG ATCCCGATGA TCATACTAAG GGCTATATTC CACCAACACC TGAAAACCCA ACAGAAGATA CTGTAATCAC TTATGAGAAA GATTCTCAAA AAGCAACAGT AACTTATGTT GTAGAAGGAA CAGGAACAGT ACTTCACACA GATAACCTAG AAGGTAAATC TGGAGAACCA ATTGATTACT CAACAGTAGC TAAGCTTGCT GAACTTAAAG CTCTAGGGTA TGACCTCGTA ACTGATGGAT TCACAACAGC TACAGATAAG AACTTCGATA AAGATACGAA AGTAGATCAA AGCTTTGTAG TAACGGTTAA ACCACACGTT GAACCAATCA AACCAGTAGA TCCAGAAAAT CCAAATGATC CAAATAAACC AAAACCAGGT GATCCAATCG ATCCTAACAA CCCTGATGGA CCAAAATGGA CAGAAGATCT CATTAAACAG ATTGATACAA CTCGCCACGT GAACCGTACA ATCACATCCG TTAACGAAAA AGGTGAGGAA GTAGCTCAGA AAGTAACAGA TAAAGTTACA TTCACTCGTG AAGGTAAGAT TAATTCTGTA ACAGGTGAAA TCACTTACGG CAACTGGACA GCTAAAGATG GAGATACAAC ATTCGATAAA GTTGAATCAC CAGTAGTTCA AGGCTACATC CTGAAAGATG CTAAACAAAA AGAAGTAGCA GCTACAACAG GATTAACGGT AGACTCTAAA GATGAAAATA TCGAAGTTGT CTACACTAAA CTTGGTTCAT GGGTACCAAA AGTACCAGAA GGATTCGAGG AACCAAAACT TGATAAACCT CAATATCCAA ACGATCCAAC AGATCCAACA AAACCAGGAA CACCAACAAC AGTGATTCCT CAAGTGCCAG GAACAACTCC AAAAGATTCA AATGGAAACC CACTGAAACC AGTAGATCCA AACGATCCAA GTAAAGGATA TGTTCCACCA ACACCTGAAA ACCCAACAGA AGATACTGTA ATCAACTATG TTCCAGTACC ACAACCTGTA AGACCTACAG ATAATGGAGA TAATAATGGA ACACCAACAA CTGCAGCTCA ACCAGCATCA CCTTCTACAC CTCAATATAT GGATGGCAAA CGTGAACTTC CAAATACAGG TACAGAAGAT AATGCTAGCC TAGCAGCACT TGGACTTCTC GGAGTATTGA GTGGATTTGG TCTTGTAGCT CATAAGAAGA AAGAAGATTA A
|
Protein sequence | MRGKQQDFRT EKYIRYGIRK FSFGAASVAI AVGLMFLGNG VVSATEVQSA ETAITTSASS QGDKETEKAE PKAETVETVK PETVSPETIK PETVNPETVK PETVNPETVN SETMKPETVN PETEKATEVK SEVAATKAAT KATLANTDAS QADVDAQAET VSALSTAVTE SNTAEKKLRP TEIYTTNGTP GTAGTAVKTN AKYANSVETV DNNKRGNREN TDSINQRRRV KRSAIDTTNF AQNYDRKVLS GSVTYPGFSV SDPDYPSGMW IDPDKSHYSY EWLQAKIKGK NHGNQIVFST NRTGDGIVYV TELSESNKVL KQYTLKQNTK VKSAVFDNTT YYNDVYIAMV HSDGFDKFGN SFVANSFVTK YRASTFDNYY YSTLAYFVPK LITQTTYFVD KDGKQITDSN GNPVVPYTQK GLVGQKYTTS PILINGYYAI APANSNGVMS PYGEIGASYV KDFHDGVVIT YTQTGSDGTM SASIARYGIV LKKWTNLKPS DPTETYSDDR FVVGSYGIKN PYIPQTSDIK YVYNKLGNWI VSNPDGSTKS IIYPNDPSDP TKIADSSVPG YPVIDFLPGY TPKDHMGTPL VPVDPDDHTK GYIPPTPENP TEDTVITYEK DSQKATVTYV VEGTGTVLHT DNLEGKSGEP IDYSTVAKLA ELKALGYDLV TDGFTTATDK NFDKDTKVDQ SFVVTVKPHV EPIKPVDPEN PNDPNKPKPG DPIDPNNPDG PKWTEDLIKQ IDTTRHVNRT ITSVNEKGEE VAQKVTDKVT FTREGKINSV TGEITYGNWT AKDGDTTFDK VESPVVQGYI LKDAKQKEVA ATTGLTVDSK DENIEVVYTK LGSWVPKVPE GFEEPKLDKP QYPNDPTDPT KPGTPTTVIP QVPGTTPKDS NGNPLKPVDP NDPSKGYVPP TPENPTEDTV INYVPVPQPV RPTDNGDNNG TPTTAAQPAS PSTPQYMDGK RELPNTGTED NASLAALGLL GVLSGFGLVA HKKKED
|
| |