Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | STER_1023 |
Symbol | |
ID | 4438594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus thermophilus LMD-9 |
Kingdom | Bacteria |
Replicon accession | NC_008532 |
Strand | - |
Start bp | 946300 |
End bp | 947505 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 639676674 |
Product | hypothetical protein |
Protein accession | YP_820428 |
Protein GI | 116627809 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence [TIGR01412] Tat-translocated enzyme [TIGR01413] Dyp-type peroxidase family |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATA AAAAATTTTT AGACCAAAAA ATGGACCGTC GTGAATTTCT TAAAAAATCA GGTATTGGAG GGGCTGGGCT TGCACTTGGT CTTTCTGGTG CATCTGCTTT TTTTGCTAAT CAGGATCGTT CAAGTAAAAA AGCCCTAGAT GGGGATGAAG ATATTAGCTT TTTTGGTAAG CACCAGGCTG GGATTACGAC TCCCATGCAG AAGGCTTGCT ACTTGGTGGT GCTAGATCTT CATACAACCG ATAAAAAAGA AGTCATCCAG CTTTTTAAAG ACTGGACCGA TTATAGTAGT AAATTGGTCG AAGGAGAGTT AGTCAAAAAA GACGGTTCTA ATGCCCTCTT GCCTCCTATG GATACAGGCG AAACCGTGGG ACTCAATCCC TATCGCCTTA GCCTGACTTT TGGAGTTTCG GCTGATTTTC TTAAAAAGCT TGGCCTAGAA TCCAAGCGTC CTAAGCTCTT CCGTGATTTA CCTCCATTTC CAAAGGAGCA GTTGCAGGAC AAGTATACGG GTGGAGATAT CGTCATTCAA GCCTGTGCAG ATGATGAACA AGTAGCCTTC CATGCTGTCC GCAATCTGAT TCGCAAAGGT CGTAATAAAA TCACTATGAA GTGGAGCAAG TCAGGTTTTG CAGCTATTGG TGACCGTAAG GAAACGCCTC GCAATCTCTT TGGTTTCAAG GATGGAACTG CTAATGTAAC GAAGGAAAAG GAATTCGACA AGGTTGTCTG GGCTGATAGT AAGGATTGGA TGAAGGGTGG TTCTTATATG GCTCTTCGCC TGGTCCAGAT GCACTTGGAA ACTTGGGATC GTACCAATTT GCAGGAACAG GAAAATACCT TTGGTCGTTA CAAGGAATCA GGCGCTCCTT TTGGTAAGAA AGATGAGTTT GATGAAGTAG ATTTATCTAA ACTTCCCGTA GATTCCCATG TGCGTTTGGC CAAAGAAGTA AATCTTCCTA TCTTACGTCG TTCCTATTCC TATTCAGATG GCATTGATGA AAGAACGGGT CAGTTTGATG CAGGTTTGAT ATTCATTGCC TACCAGAAGG ACCCAGACCG TTTTGTCAAA ATACAGACCA ATCTTGGAGC TGTAGACAAG ATGAATGAGT ATATCACCCA TATCGGAAGC GGGCTCTTTG CTTGTTTTGC TGGCGTGGAG AAAGGAGGCT ACCTTGGTCA AGCACTCTTT GAATAA
|
Protein sequence | MTDKKFLDQK MDRREFLKKS GIGGAGLALG LSGASAFFAN QDRSSKKALD GDEDISFFGK HQAGITTPMQ KACYLVVLDL HTTDKKEVIQ LFKDWTDYSS KLVEGELVKK DGSNALLPPM DTGETVGLNP YRLSLTFGVS ADFLKKLGLE SKRPKLFRDL PPFPKEQLQD KYTGGDIVIQ ACADDEQVAF HAVRNLIRKG RNKITMKWSK SGFAAIGDRK ETPRNLFGFK DGTANVTKEK EFDKVVWADS KDWMKGGSYM ALRLVQMHLE TWDRTNLQEQ ENTFGRYKES GAPFGKKDEF DEVDLSKLPV DSHVRLAKEV NLPILRRSYS YSDGIDERTG QFDAGLIFIA YQKDPDRFVK IQTNLGAVDK MNEYITHIGS GLFACFAGVE KGGYLGQALF E
|
| |