Gene STER_1554 details

Gene Information

Locus tag: STER_1554
Symbol:
ID: 4437828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus thermophilus LMD-9
Kingdom: Bacteria
Replicon accession: NC_008532
Strand:
Start bp: 1456810
End bp: 1458165
Gene Length: 1356 bp
Protein Length: 451 aa
Translation table: 11
GC content: 44%
IMG OID: 639677152
Product: anthranilate synthase component I
Protein accession: YP_820902
Protein GI: 116628283
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
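
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon; both assumptions are consistent with the listed values but are not stated by the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1456810, 1458165)
print(length)                     # 1356
print(protein_length_aa(length))  # 451
```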


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0288667
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAA TTTTACCAGC CGATACCTTA ACACCAATCT TGGCTTATAT GCGTGTTCAA 
GGGGAACACA AGGTTATCCT TGAATCTATT CCTCGTGAGA AGGAAAATGC ACGTTTTTCT
ATTATTGCCT ACAATCCGGT CTTTGAAGTA ACCTTCAAGG ATGGTGTTCT TTATGAAAAT
GGTAAGGCGA TTGATCAGGA TCCTTTCGAA TACTTGGACC AAGTAACAGT CAAGGGCATC
AAGTCTGACC TACCTTTCGC AGGTGGTGCT ATCGGATTTG CAGGTTATGA TATGATTGGT
CTCTATGAAA ATATCGGGGA GATTCCTGAA GATACGATTG GGACACCTGA TATGCATTTC
TTCATCTATG AGTCTTATTT GATTTTTGAT CACAAGAAGG AAAAGGTTTA TGTGGTTGAG
GACAACATCT ACTCTGGCCG TGACAACGAT GCGGTGCGTC AAGCTCTTGG TCAGGTGGTA
ACGATCCTAC AGACTCAGGC GCCAAACGAG TTTACACCTC AGGCCTTGCA AGCTTTGCAA
TTTTCGAATC ATATCGAAAA AGAGGTCTTC ATGGATATGG TGGCTAAGGC TAAGAAACTC
ATTCGTGAGG GAGATATGTT CCAATGTGTG CTTAGTCAAC GCTTTTCAGC GGACTTTGAG
GGAGATCCTT TGGATTACTA CCGTAACTTG CGCGTGACCA ACCCATCAAA CTACCTTTAT
TTCTATGATT TCGGAGATTA TCAGGTGATT GGTGCCAGCC CAGAGAGCCT GGTTTCAGTG
AAAAATGGAG AGGTGTTCAC CAATCCGATT GCTGGAACTC GCCCTCGTGG TGCCAATGAG
GATGAAGATG CTGCCTTGGC GGATGAACTC TCACATGATG TTAAAGAAAC TGCGGAACAC
CGTATGTTGG TTGACTTGGG ACGTAATGAT ATTGGTAAGA TTGCCAAAAA TGGTACGGTC
AAGGTGACCA AGTATATGGA GGTTGAGTAT TTCCGCTATG TGATGCACCT TACTAGTGTG
GTTAAGGGGC AACTCTTACC GGAGTTGACG TCCCTTGATG CTCTAAAGTC AACCATACCA
GCTGGAACTG TGTCCGGGGC GCCTAAGATT CGTGCCATGC GTCGTATCTA TGAGCTTGAG
CAGGAAAAAC GTGGTATTTA CGCGGGAGCT ATCGGTTATT TGTCTGCAAC AGGAGATATG
GACTTTGCTA TTGCTATCCG TACTATGATT CTCAAAAATC AAAAAGCCTA TGTTCAGGCA
GGTGCAGGTG TTGTCTATGA CAGTGTTCCT GAAAATGAAT TTTTTGAAAC GATTAATAAG
GCGAAAGCTA TGACAAGAAT AGGAGATGTC CAATGA
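
The gene sequence is laid out in space-separated blocks of 10 bases, 60 per line. As a rough sketch of how this layout might be joined into one string and how a GC percentage such as the one in the record could be recomputed, the helpers below (hypothetical names, not part of the source database) strip whitespace and count G and C bases. How the record rounds its GC value is not stated.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a small example.
first_line = "ATGAGAAAAA TTTTACCAGC CGATACCTTA ACACCAATCT TGGCTTATAT GCGTGTTCAA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing all sequence lines joined together to `clean_sequence` gives the full gene, whose length can be compared with the Gene Length field.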
 
Protein sequence
MRKILPADTL TPILAYMRVQ GEHKVILESI PREKENARFS IIAYNPVFEV TFKDGVLYEN 
GKAIDQDPFE YLDQVTVKGI KSDLPFAGGA IGFAGYDMIG LYENIGEIPE DTIGTPDMHF
FIYESYLIFD HKKEKVYVVE DNIYSGRDND AVRQALGQVV TILQTQAPNE FTPQALQALQ
FSNHIEKEVF MDMVAKAKKL IREGDMFQCV LSQRFSADFE GDPLDYYRNL RVTNPSNYLY
FYDFGDYQVI GASPESLVSV KNGEVFTNPI AGTRPRGANE DEDAALADEL SHDVKETAEH
RMLVDLGRND IGKIAKNGTV KVTKYMEVEY FRYVMHLTSV VKGQLLPELT SLDALKSTIP
AGTVSGAPKI RAMRRIYELE QEKRGIYAGA IGYLSATGDM DFAIAIRTMI LKNQKAYVQA
GAGVVYDSVP ENEFFETINK AKAMTRIGDV Q
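
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the compact NCBI layout (bases ordered T, C, A, G) and translates up to the first stop codon. It is an illustration under stated assumptions, not the database's own pipeline: it does not handle the alternative start codons that table 11 allows (this gene begins with ATG, so that does not matter here), and `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same codon-to-residue assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Start of the gene and its final line (which begins in frame at base 1321).
print(translate("ATGAGAAAAA TTTTACCAGC C"))                  # MRKILPA
print(translate("GCGAAAGCTA TGACAAGAAT AGGAGATGTC CAATGA"))  # AKAMTRIGDVQ
```

Both outputs match the corresponding ends of the protein sequence listed above.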