Gene STER_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_0478 
Symbol 
ID4438514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp424226 
End bp425725 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content45% 
IMG OID639676199 
Productsurface antigen 
Protein accessionYP_819956 
Protein GI116627337 
COG category[R] General function prediction only 
COG ID[COG3942] Surface antigen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTATCAA AATCTAAAAC TACAAAGGCA CTTCTTTACT CGACTGCAGC ACTTTCGCTT 
TTTGCTGCTA GCCACGTACA TGCCGATGAA ACTTCTCACT GGACAGCACG TTCAGTAGAT
CAAATCAAGG CAGATATCTC TGTAAATGAT AATCAACAAA CTTACACTGT TCAATATGGT
GATACTTTAG GTAGTATTGT TGAAGCGATG GGAATCGATA TGAATGTTTT GGCTAATATC
AAAGAAATTA CAAACATTGA CTTGATTTTC CCTGGAACAG TTTTGACAAC AACATACAAC
GCCGATAATC AAGCAGTATC AGTTAAAGTT GAAACACCAT CTTCAGAAAC TTCTGACACA
CCTGTAGTAG CAGAATCTAA CTTGACAACT AACGAAGTGA CTGTAAATGG TCAATCTGTA
GTAGCATCTG ATTTGTCAGC TCCAGTTGAA ACTGTTAGCT TGACAGCTAC TCAAGCACCT
GCTAAAGAAG AATCAACACA AGTTGTTTCA GAAGTAACAG AGGCTATCGC ATCAGCATCA
GATACTCCAG CTTACGCAGA TACTGAACAA CCAGTTGCAG ACGCTATTGA TCATGTTACT
TCATCAGCTG AAGAAACACT TGCAGAAGAG GAAGCTCCAG CAACTGAAAC ATCTGCACAA
GCTGAAACAA CTGAAGTAGC AGCAACATCA GAAGCTGCAT CAGAAGCTGC ATCAGATGCG
CCAGCAGAAC AACTAGCAGC TGCATCAGAG GCACCAGAGA GCTCAGAAGT GCCAGCAGAA
CAACTAGCAG CTGCATCAGA GGCACCAGAG AGCTCAGAAG CGCCAGCAGA ACAACCAGCA
GCTGCACCAG AGAGCTCAGA AGCGCCAGCA GAACAACCAG CAGCAACATC AGAAGCTGCA
TCAGAAGCTC CTGCTAGCGT AGTACCTGTC GCAACATCAG AAGCTGTATC AGAAGCACCA
GCTGTATCAG AAGTGCCAGC AGAACAACTA GCAGCTGCAT CAGAGGCACC AGAGAGCTCA
GAAGTGCCAG CAGAACAACC AGCAGCTGCA CCAGAGAGCT CAGAAGCGCC AGCAGAACAA
CCAGCAGCAA CATCAGAAGC TGCACCAGCT ACATCAGAAG CTCCAGCAGA ACAACTAGCA
GCGACATCAG AAGCTGCATC AACTCCTAAT ACATATCCAG TTGGACAATG TACTTGGGGT
GCGAAATCAT TGGCTCCATG GGCTGGTAAT AATTGGGGTA ATGCTAAAGA CTGGATTGCT
AGTGCGCAAG CAGCTGGTCA CTCAGTAGGT ACAACTCCAG TAGCCGGTGC GATTGCGGTA
TGGCCAAATG ATGGTGGTGG TTATGGTCAC GTAGCTTATG TTACATCAGC ATCAGGTGTA
AATTCAATTC AAGTTATGGA ATCGAACTAT GCTGGTAACA TGTTAATCGG TAACTACCGT
GGTACATTTG ATCCAACATC ATCAGCGCAT GGTGGTTCTG TATATTATAT TTATCCATAA
 
Protein sequence
MLSKSKTTKA LLYSTAALSL FAASHVHADE TSHWTARSVD QIKADISVND NQQTYTVQYG 
DTLGSIVEAM GIDMNVLANI KEITNIDLIF PGTVLTTTYN ADNQAVSVKV ETPSSETSDT
PVVAESNLTT NEVTVNGQSV VASDLSAPVE TVSLTATQAP AKEESTQVVS EVTEAIASAS
DTPAYADTEQ PVADAIDHVT SSAEETLAEE EAPATETSAQ AETTEVAATS EAASEAASDA
PAEQLAAASE APESSEVPAE QLAAASEAPE SSEAPAEQPA AAPESSEAPA EQPAATSEAA
SEAPASVVPV ATSEAVSEAP AVSEVPAEQL AAASEAPESS EVPAEQPAAA PESSEAPAEQ
PAATSEAAPA TSEAPAEQLA ATSEAASTPN TYPVGQCTWG AKSLAPWAGN NWGNAKDWIA
SAQAAGHSVG TTPVAGAIAV WPNDGGGYGH VAYVTSASGV NSIQVMESNY AGNMLIGNYR
GTFDPTSSAH GGSVYYIYP