Gene STER_1704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSTER_1704 
Symbol 
ID4437667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus thermophilus LMD-9 
KingdomBacteria 
Replicon accessionNC_008532 
Strand
Start bp1593900 
End bp1594931 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content39% 
IMG OID639677294 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_821043 
Protein GI116628424 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTTA AAGCAGTAAG TGAAAAAATT GATTTTGATG CAGTTAAAGA ATCAACTACA 
TTGACAGGAG AAGCCCTTGC CAAGAAGCAA GCTCGTGATA AAGAGCTTGA GGCAATCATC
AAGGGTGAGG ACAACCGTAC ACTTTTGGTG ATTGGTCCAT GTTCATCTGA TAATGAGGAT
GCAGTTCTTG ATTATGCACA CCGTTTGGCT AAACTTCAAG AAGAAGTGAA AGATAAAGTA
TTTATGGTTA TGCGTGTTTA TACTGCTAAA CCTCGTACAA ACGGTGATGG GTATAAAGGG
CTTGTGCACC AACCTGATGC TGAAGGAAAA CCGAATCTAA TTAATGGTAT CAAAGCTGTA
CGTAATCTTC ACTACCGTGT TATCACAGAA ACTGGTATCA CAACTGCTGA CGAAATGCTT
TACCCGGAAA ACCTTCCTTT GGTAGATGAC CTTGTGTCCT ACATTGCCAT TGGTGCTCGT
TCTGTAGAAG ATCAACAACA CCGTTTCGTC GCGTCGGGTA TTGACGTACC AACTGGAATG
AAAAATCCAA CATCAGGTAA CCTTAACATT ATGTTTAATG GTATCTATGC TGCACAAAAC
AAACAAGACT TCCTCTTTAA CGGTGAAGAA GTTCAGACAT CTGGAAATCC GTTAGCACAC
GCTATCCTTC GTGGATCAAC AAATGAATAC GGGAAGAATA TTCCTAATTT CTACTATGAT
GATATCCTTG ATACTATTAA CACCTATGAA AAGATGGGAC TTCAAAATCC ATTTATCGTG
ATTGATACGA ACCACGATAA TTCTGGTAAA CGTTACTTGG AGCAAGTACG TATTGTACGT
CAGACCTTGA TTAACCGTGA TTGGAATGAA AAAATCAATA AGTTTGCACG TGGATTTATG
ATCGAGTCTT ATCTTGAAGA TGGTCGTCAG GACACACCAG AAGTTTATGG AAAATCAATT
ACTGACCCAT GTCTTGGTTG GGATAATACT GAGCAATTGA TTCGTGAAAT TCACGCAACA
CTTTCAAAAT AA
 
Protein sequence
MVFKAVSEKI DFDAVKESTT LTGEALAKKQ ARDKELEAII KGEDNRTLLV IGPCSSDNED 
AVLDYAHRLA KLQEEVKDKV FMVMRVYTAK PRTNGDGYKG LVHQPDAEGK PNLINGIKAV
RNLHYRVITE TGITTADEML YPENLPLVDD LVSYIAIGAR SVEDQQHRFV ASGIDVPTGM
KNPTSGNLNI MFNGIYAAQN KQDFLFNGEE VQTSGNPLAH AILRGSTNEY GKNIPNFYYD
DILDTINTYE KMGLQNPFIV IDTNHDNSGK RYLEQVRIVR QTLINRDWNE KINKFARGFM
IESYLEDGRQ DTPEVYGKSI TDPCLGWDNT EQLIREIHAT LSK