Gene EcHS_A3864 details

Gene Information

Locus tag: EcHS_A3864
Symbol: gltS
ID: 5593010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3857043
End bp: 3858248
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 54%
IMG OID: 640922974
Product: sodium/glutamate symporter
Protein accession: YP_001460452
Protein GI: 157163134
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0786] Na+/glutamate symporter
TIGRFAM ID: [TIGR00210] sodium--glutamate symport carrier (gltS)
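
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is only an illustration: the variable names are made up for this example, and it assumes that the start and end positions are inclusive and that the gene length includes the stop codon.

```python
# Values copied from the Gene Information record (variable names are illustrative).
start_bp = 3857043
end_bp = 3858248
gene_length_bp = 1206
protein_length_aa = 401

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, so the number of
# codons is one more than the number of amino acids.
codons, remainder = divmod(gene_length_bp, 3)

print(span == gene_length_bp)           # coordinates agree with the gene length
print(remainder == 0)                   # length is a whole number of codons
print(codons - 1 == protein_length_aa)  # codons minus the stop match the protein length
```

Under those assumptions all three checks hold for this record: 3858248 - 3857043 + 1 = 1206, and 1206 / 3 - 1 = 401.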


Plasmid Coverage information

Num covering plasmid clones: 67
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTCATC TCGATACTTT AGCAACGCTT GTTGCCGCAA CGCTGACGTT GCTGCTCGGG 
CGTAAGTTGG TCCATTCCGT CTCCTTTTTG AAGAAATACA CCATACCGGA ACCTGTTGCG
GGTGGTTTGT TGGTGGCGCT AGCGCTACTG GTACTGAAAA AAAGCATGGG CTGGGAAGTC
AACTTTGATA TGTCCCTGCG CGATCCGTTA ATGCTGGCTT TCTTCGCCAC CATTGGCCTG
AACGCCAACA TTGCCAGTTT GCGTGCCGGT GGGCGTGTGG TTGGCATCTT CTTGATTGTG
GTTGTTGGTC TGTTGGTGAT GCAAAATGCC ATTGGCATTG GTATGGCTAG CCTGTTAGGG
CTTGATCCGC TGATGGGGCT GTTGGCCGGT TCTATTACGC TTTCCGGCGG TCACGGTACG
GGCGCTGCGT GGAGTAAATT GTTCATTGAA CGTTATGGCT TCACCAATGC GACAGAAGTG
GCGATGGCCT GTGCAACGTT CGGTTTGGTG CTGGGCGGCT TGATTGGCGG TCCGGTAGCG
CGCTATCTGG TGAAACACTC CACCACGCCG AACGGTATTC CGGATGACCA GGAAGTCCCG
ACGGCGTTTG AAAAGCCGGA TGTGGGCCGC ATGATCACCT CGTTGGTGCT GATTGAAACT
ATCGCGCTGA TTGCTATCTG CCTGACGGTG GGGAAAATTG TTGCGCAACT TTTGGCTGGC
ACTGCTTTTG AACTGCCGAC CTTCGTCTGT GTACTGTTTG TTGGCGTGAT TCTGAGCAAC
GGTCTGTCAA TGATGGGCTT TTACCGCGTC TTTGAGCGTG CGGTATCCGT GCTGGGTAAC
GTAAGCTTGT CGTTGTTCCT GGCGATGGCG TTGATGGGGC TGAAACTGTG GGAGCTGGCT
TCGCTGGCGC TGCCGATGCT GGCGATTCTG GTGGTACAGA CCATCTTCAT GGCGTTGTAT
GCCATCTTCG TTACCTGGCG CATGATGGGC AAAAACTACG ATGCGGCAGT GCTGGCTGCG
GGTCACTGTG GTTTTGGCCT CGGTGCAACG CCAACGGCAA TCGCCAACAT GCAGGCGATC
ACTGAACGTT TTGGCCCGTC GCACATGGCG TTTTTGGTGG TGCCGATGGT CGGTGCGTTC
TTTATCGATA TCGTCAATGC GCTGGTAATT AAGTTGTATT TGATGTTGCC GATTTTTGCC
GGTTAA
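
The gene sequence is laid out in space-separated blocks of 10 bases, so the spacing has to be removed before measuring length or GC content. A minimal sketch follows; the helper names clean_sequence and gc_fraction are hypothetical and not part of any database tooling. Only the first line of the sequence is used here, so the printed GC figure applies to that line and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the 10-base blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence; paste the full block to get
# the whole-gene length and GC content.
first_line = "ATGTTTCATC TCGATACTTT AGCAACGCTT GTTGCCGCAA CGCTGACGTT GCTGCTCGGG"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on this line
print(f"{gc_fraction(seq):.0%}")   # GC content of this line only
```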
 
Protein sequence
MFHLDTLATL VAATLTLLLG RKLVHSVSFL KKYTIPEPVA GGLLVALALL VLKKSMGWEV 
NFDMSLRDPL MLAFFATIGL NANIASLRAG GRVVGIFLIV VVGLLVMQNA IGIGMASLLG
LDPLMGLLAG SITLSGGHGT GAAWSKLFIE RYGFTNATEV AMACATFGLV LGGLIGGPVA
RYLVKHSTTP NGIPDDQEVP TAFEKPDVGR MITSLVLIET IALIAICLTV GKIVAQLLAG
TAFELPTFVC VLFVGVILSN GLSMMGFYRV FERAVSVLGN VSLSLFLAMA LMGLKLWELA
SLALPMLAIL VVQTIFMALY AIFVTWRMMG KNYDAAVLAA GHCGFGLGAT PTAIANMQAI
TERFGPSHMA FLVVPMVGAF FIDIVNALVI KLYLMLPIFA G
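
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 and stops at the first stop codon. It is a simplification: the function name translate is chosen for illustration, unknown codons are written as X, and the alternative start codons that table 11 permits are not handled (this gene starts with ATG, so that does not matter here).

```python
from itertools import product

# Codon order T, C, A, G at each position, paired with the amino-acid
# string used for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGTTTCATC TCGATACTTT AGCAACGCTT"))  # MFHLDTLATL
```

The output matches the first block of the protein sequence listed above.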