Gene ECH74115_5026 details

Gene Information

Locus tag: ECH74115_5026
Symbol: gltS
ID: 6970578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4675111
End bp: 4676316
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 54%
IMG OID: 643388707
Product: sodium/glutamate symporter
Protein accession: YP_002273134
Protein GI: 209397994
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0786] Na+/glutamate symporter
TIGRFAM ID: [TIGR00210] sodium--glutamate symport carrier (gltS)
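
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the record itself does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4675111
end_bp = 4676316

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1206 bp) and Protein Length (401 aa) fields, and the length is a whole number of codons.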


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCATC TCGATACTTT AGCAACGCTT GTTGCCGCAA CGCTGACGTT GTTGCTCGGG 
CGTAAGTTGG TCCATTCCGT CTCCTTTTTG AAGAAATACA CCATACCGGA ACCTGTTGCG
GGTGGTTTGT TGGTGGCGCT GGCGCTACTA GTACTGAAAA AAAGCATGGG CTGGGAAGTC
AACTTTGATA TGTCCCTGCG CGATCCGTTA ATGCTGGCTT TCTTCGCCAC CATTGGCCTG
AACGCCAACA TTGCCAGTTT GCGTGCCGGT GGGCGTGTGG TTGGCATCTT CTTGATTGTG
GTTGTTGGTC TGTTGGTGAT GCAAAATGCC ATTGGCATTG GTATGGCTAG CCTGTTAGGG
CTTGATCCGC TGATGGGGCT GTTGGCCGGT TCTATTACGC TTTCCGGCGG TCACGGTACG
GGCGCTGCGT GGAGTAAATT GTTCATTGAA CGTTATGGCT TCACCAATGC GACGGAAGTG
GCGATGGCCT GTGCAACGTT CGGTCTGGTG CTGGGCGGCT TGATTGGCGG TCCGGTGGCG
CGCTATCTGG TGAAACACTC CACCACGCCG AACGGTATTC CGGATGACCA GGAAGTCCCG
ACGGCGTTTG AAAAGCCGGA TGTGGGCCGC ATGATCACCT CGCTGGTGCT GATTGAAACT
ATCGCGCTGA TTGCTATCTG CCTGACGGTG GGGAAAATTG TTGCGCAACT TTTGGCTGGC
ACTGCTTTTG AACTGCCGAC CTTCGTCTGT GTACTGTTTG TTGGCGTGAT TCTGAGCAAC
GGTCTGTCAA TGATGGGCTT TTACCGCGTC TTTGAGCGTG CGGTATCCGT GCTGGGTAAC
GTAAGCCTGT CGTTGTTCCT GGCGATGGCG CTGATGGGGC TGAAACTGTG GGAGCTGGCT
TCGCTGGCGC TGCCGATGCT GGCGATTCTG GTGGTACAGA CCATCTTCAT GGCGTTGTAT
GCCATCTTCG TTACCTGGCG CATGATGGGC AAAAACTACG ATGCGGCAGT GCTGGCTGCG
GGTCACTGTG GTTTTGGCCT CGGTGCAACG CCAACGGCAA TCGCCAACAT GCAGGCGATC
ACTGAACGCT TTGGCCCGTC GCACATGGCG TTTTTGGTGG TGCCGATGGT CGGTGCGTTC
TTTATCGATA TCGTCAATGC GCTGGTGATT AAGCTGTATT TAATGTTGCC GATTTTTGCC
GGTTAA
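
The GC content field in the gene information is a percentage of G and C bases. How the database computed or rounded that figure is not stated, so the following is only a minimal sketch of one common way to compute it from a sequence laid out in space-separated blocks like the one above. The function name `gc_percent` is hypothetical, and only the first row of the gene sequence is used as sample input, so the printed value is not the whole-gene figure.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) between 10-base blocks is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGTTTCATC TCGATACTTT AGCAACGCTT GTTGCCGCAA CGCTGACGTT GTTGCTCGGG"
print(f"{gc_percent(first_row):.1f}%")
```

Passing the full 1206 bp sequence instead of `first_row` would give the whole-gene percentage to compare against the reported field.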
 
Protein sequence
MFHLDTLATL VAATLTLLLG RKLVHSVSFL KKYTIPEPVA GGLLVALALL VLKKSMGWEV 
NFDMSLRDPL MLAFFATIGL NANIASLRAG GRVVGIFLIV VVGLLVMQNA IGIGMASLLG
LDPLMGLLAG SITLSGGHGT GAAWSKLFIE RYGFTNATEV AMACATFGLV LGGLIGGPVA
RYLVKHSTTP NGIPDDQEVP TAFEKPDVGR MITSLVLIET IALIAICLTV GKIVAQLLAG
TAFELPTFVC VLFVGVILSN GLSMMGFYRV FERAVSVLGN VSLSLFLAMA LMGLKLWELA
SLALPMLAIL VVQTIFMALY AIFVTWRMMG KNYDAAVLAA GHCGFGLGAT PTAIANMQAI
TERFGPSHMA FLVVPMVGAF FIDIVNALVI KLYLMLPIFA G
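
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below illustrates that step on the first and last codons of the gene. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, not part of any tool referenced by this record.

```python
from itertools import product

# Compact codon table: bases in TCAG order, amino acids in matching order.
# Amino acid assignments are the same in the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon.

    Whitespace is ignored. Codons containing anything other than
    A, C, G or T raise KeyError in this sketch.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTTCATC TCGATACTTT AGCAACGCTT"))  # MFHLDTLATL
# Last 6 bases of the gene sequence: one codon plus the stop codon.
print(translate("GGTTAA"))  # G
```

The first output matches the first ten residues of the protein sequence, and the second matches its final residue followed by the stop codon.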