Gene EcSMS35_3988 details


Gene Information

Locus tag: EcSMS35_3988
Symbol: gltS
ID: 6145498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4065349
End bp: 4066554
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 54%
IMG OID: 641618814
Product: sodium/glutamate symporter
Protein accession: YP_001745953
Protein GI: 170682521
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0786] Na+/glutamate symporter
TIGRFAM ID: [TIGR00210] sodium--glutamate symport carrier (gltS)
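
The Start bp, End bp, Gene Length and Protein Length values listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted as a residue. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information entries above.
START_BP = 4065349
END_BP = 4066554


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1206
print(protein_length(length_bp))  # 401
```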


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCATC TCGATACTTT AGCAACGCTT GTTGCCGCAA CGCTGACGTT GCTGCTCGGG 
CGCAAGTTGG TCCATTCCGT CTCCTTTTTG AAGAAATACA CCATACCGGA ACCTGTTGCG
GGTGGTTTAT TAGTGGCGCT GGCGCTGCTG ATACTGAAAA AAAGCATGGG CTGGGAAGTC
AACTTTGATA TGTCCCTGCG CGATCCGTTG ATGCTGGCTT TCTTCGCCAC CATTGGCCTG
AACGCCAACA TTGCCAGTTT GCGTGCCGGT GGGCGTGTGG TTGGCATCTT CTTGATTGTG
GTTGTTGGCC TGTTGGTGAT GCAAAATGCC ATTGGCATTG GTATGGCTAG CCTGTTAGGG
CTTGATCCGC TGATGGGGCT GTTGGCCGGT TCTATTACGC TTTCCGGCGG TCACGGTACG
GGAGCTGCAT GGAGTAAATT GTTCATTGAA CGTTATGGCT TCACCAATGC GACAGAAGTG
GCGATGGCCT GTGCAACGTT CGGTTTGGTG CTGGGCGGCT TGATTGGCGG TCCGGTGGCA
CGCTATCTGG TGAAACACTC CACCACACCG AACGGTATTC CGGATGACCA GGAAGTTCCG
ACCGCGTTCG AAAAGCCGGA TGTGGGCCGC ATGATCACCT CGCTGGTGTT GATTGAAACT
ATCGCGCTGA TTGCTATCTG CCTGACGGTG GGGAAAATTG TTGCGCAACT TTTGGCTGGT
ACTGCTTTTG AACTACCGAC CTTCGTCTGT GTACTATTTG TTGGCGTGAT TCTGAGCAAC
GGTCTGTCAA TGATGGGCTT TTACCGCGTC TTTGAGCGAG CGGTATCCGT GCTGGGTAAC
GTAAGCCTGT CGTTGTTCCT GGCGATGGCG TTGATGGGGC TGAAACTGTG GGAGCTGGCT
TCGCTGGCGC TGCCGATGCT GGCGATTCTG GTGGTACAGA CCATCTTCAT GGCGTTGTAT
GCCATCTTCG TTACCTGGCG CATGATGGGC AAAAACTACG ATGCGGCAGT GCTGGCTGCG
GGTCACTGTG GCTTTGGCCT TGGCGCAACG CCAACAGCAA TCGCCAACAT GCAGGCGATC
ACTGAACGCT TTGGCCCGTC GCACATGGCG TTCCTGGTAG TGCCGATGGT CGGTGCGTTC
TTTATCGATA TCGTCAATGC GCTGGTGATT AAGCTGTATT TGATGTTGCC GATTTTTGCC
GGTTAA
 
Protein sequence
MFHLDTLATL VAATLTLLLG RKLVHSVSFL KKYTIPEPVA GGLLVALALL ILKKSMGWEV 
NFDMSLRDPL MLAFFATIGL NANIASLRAG GRVVGIFLIV VVGLLVMQNA IGIGMASLLG
LDPLMGLLAG SITLSGGHGT GAAWSKLFIE RYGFTNATEV AMACATFGLV LGGLIGGPVA
RYLVKHSTTP NGIPDDQEVP TAFEKPDVGR MITSLVLIET IALIAICLTV GKIVAQLLAG
TAFELPTFVC VLFVGVILSN GLSMMGFYRV FERAVSVLGN VSLSLFLAMA LMGLKLWELA
SLALPMLAIL VVQTIFMALY AIFVTWRMMG KNYDAAVLAA GHCGFGLGAT PTAIANMQAI
TERFGPSHMA FLVVPMVGAF FIDIVNALVI KLYLMLPIFA G
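
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first 60 nucleotides of the gene. It is a minimal example, not the pipeline that produced this record. It assumes the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs only in which codons may act as start codons, and this gene starts with ATG). The helper `translate` and the constant names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "ATGTTTCATC TCGATACTTT AGCAACGCTT GTTGCCGCAA CGCTGACGTT GCTGCTCGGG"
print(translate(first_line))  # MFHLDTLATLVAATLTLLLG
```

Passing the full gene sequence to `translate` in the same way should give a string that can be compared with the protein sequence after removing the spaces between its 10-residue groups.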