Gene STER_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_1023 
Symbol 
ID4438594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp946300 
End bp947505 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content43% 
IMG OID639676674 
Producthypothetical protein 
Protein accessionYP_820428 
Protein GI116627809 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGATA AAAAATTTTT AGACCAAAAA ATGGACCGTC GTGAATTTCT TAAAAAATCA 
GGTATTGGAG GGGCTGGGCT TGCACTTGGT CTTTCTGGTG CATCTGCTTT TTTTGCTAAT
CAGGATCGTT CAAGTAAAAA AGCCCTAGAT GGGGATGAAG ATATTAGCTT TTTTGGTAAG
CACCAGGCTG GGATTACGAC TCCCATGCAG AAGGCTTGCT ACTTGGTGGT GCTAGATCTT
CATACAACCG ATAAAAAAGA AGTCATCCAG CTTTTTAAAG ACTGGACCGA TTATAGTAGT
AAATTGGTCG AAGGAGAGTT AGTCAAAAAA GACGGTTCTA ATGCCCTCTT GCCTCCTATG
GATACAGGCG AAACCGTGGG ACTCAATCCC TATCGCCTTA GCCTGACTTT TGGAGTTTCG
GCTGATTTTC TTAAAAAGCT TGGCCTAGAA TCCAAGCGTC CTAAGCTCTT CCGTGATTTA
CCTCCATTTC CAAAGGAGCA GTTGCAGGAC AAGTATACGG GTGGAGATAT CGTCATTCAA
GCCTGTGCAG ATGATGAACA AGTAGCCTTC CATGCTGTCC GCAATCTGAT TCGCAAAGGT
CGTAATAAAA TCACTATGAA GTGGAGCAAG TCAGGTTTTG CAGCTATTGG TGACCGTAAG
GAAACGCCTC GCAATCTCTT TGGTTTCAAG GATGGAACTG CTAATGTAAC GAAGGAAAAG
GAATTCGACA AGGTTGTCTG GGCTGATAGT AAGGATTGGA TGAAGGGTGG TTCTTATATG
GCTCTTCGCC TGGTCCAGAT GCACTTGGAA ACTTGGGATC GTACCAATTT GCAGGAACAG
GAAAATACCT TTGGTCGTTA CAAGGAATCA GGCGCTCCTT TTGGTAAGAA AGATGAGTTT
GATGAAGTAG ATTTATCTAA ACTTCCCGTA GATTCCCATG TGCGTTTGGC CAAAGAAGTA
AATCTTCCTA TCTTACGTCG TTCCTATTCC TATTCAGATG GCATTGATGA AAGAACGGGT
CAGTTTGATG CAGGTTTGAT ATTCATTGCC TACCAGAAGG ACCCAGACCG TTTTGTCAAA
ATACAGACCA ATCTTGGAGC TGTAGACAAG ATGAATGAGT ATATCACCCA TATCGGAAGC
GGGCTCTTTG CTTGTTTTGC TGGCGTGGAG AAAGGAGGCT ACCTTGGTCA AGCACTCTTT
GAATAA
 
Protein sequence
MTDKKFLDQK MDRREFLKKS GIGGAGLALG LSGASAFFAN QDRSSKKALD GDEDISFFGK 
HQAGITTPMQ KACYLVVLDL HTTDKKEVIQ LFKDWTDYSS KLVEGELVKK DGSNALLPPM
DTGETVGLNP YRLSLTFGVS ADFLKKLGLE SKRPKLFRDL PPFPKEQLQD KYTGGDIVIQ
ACADDEQVAF HAVRNLIRKG RNKITMKWSK SGFAAIGDRK ETPRNLFGFK DGTANVTKEK
EFDKVVWADS KDWMKGGSYM ALRLVQMHLE TWDRTNLQEQ ENTFGRYKES GAPFGKKDEF
DEVDLSKLPV DSHVRLAKEV NLPILRRSYS YSDGIDERTG QFDAGLIFIA YQKDPDRFVK
IQTNLGAVDK MNEYITHIGS GLFACFAGVE KGGYLGQALF E