Gene ECH74115_4404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4404 
SymbolsstT 
ID6966847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4082659 
End bp4083903 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content55% 
IMG OID643388125 
Productserine/threonine transporter SstT 
Protein accessionYP_002272562 
Protein GI209396321 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3633] Na+/serine symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.912253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value0.786267 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACGC AACGTTCACC GGGGCTATTC CGGCGTCTGG CTCATGGCAG CCTGGTAAAA 
CAAATCCTGG TCGGCCTTGT TCTGGGGATT CTTCTGGCAT GGATCTCAAA ACCCGCGGCG
GAAGCTGTTG GTCTGTTAGG TACTTTGTTC GTCGGCGCAC TGAAAGCCGT TGCCCCCATC
CTGGTGTTGA TGCTGGTGAT GGCATCTATT GCTAACCACC AGCACGGGCA GAAAACCAAT
ATCCGCCCTA TTTTGTTCCT CTATCTGCTG GGTACCTTCT CTGCTGCTCT GGCCGCAGTC
GTTTTCAGCT TCGCCTTCCC TTCTACCCTG CACTTATCCA GTAGCGCGGG TGATATTTCG
CCACCGTCAG GCATTGTAGA AGTGATGCGC GGGCTGGTAA TGAGCATGGT TTCCAACCCC
ATCGACGCGC TGCTGAAAGG TAACTACATC GGGATCCTGG TGTGGGCAAT TGGCCTCGGT
TTCGCACTGC GTCACGGTAA CGAGACCACC AAAAACCTGG TCAACGATAT GTCGAATGCC
GTCACCTTTA TGGTGAAACT GGTCATTCGC TTCGCACCGA TTGGTATTTT TGGTCTGGTT
TCTTCTACCC TGGCAACCAC CGGTTTCTCC ACACTGTGGG GCTACGCGCA ACTGCTGGTC
GTGCTGGTTG GCTGTATGTT ACTGGTGGCG CTGGTGGTTA ACCCATTGCT GGTGTGGTGG
AAAATTCGTC GTAACCCGTT CCCGCTGGTG CTGCTGTGCC TGCGCGAAAG CGGCGTGTAT
GCCTTCTTCA CCCGCAGCTC TGCGGCGAAC ATTCCGGTGA ATATGGCACT GTGTGAAAAG
CTGAATCTGG ATCGCGATAC CTATTCCGTT TCTATTCCGC TGGGAGCCAC CATCAATATG
GCGGGCGCAG CAATCACCAT TACCGTGTTG ACGCTGGCTG CGGTTAATAC GCTGGGTATT
CCGGTCGATC TGCCCACAGC GCTGCTGTTG AGCGTAGTGG CTTCTCTGTG TGCCTGTGGC
GCATCCGGCG TGGCGGGGGG TTCTTTGCTG CTGATCCCAC TGGCCTGTAA TATGTTCGGT
ATTTCGAACG ATATCGCCAT GCAGGTGGTT GCCGTCGGCT TTATTATCGG CGTATTGCAG
GACTCTTGCG AAACCGCGCT GAACTCTTCA ACTGACGTGC TGTTCACTGC GGCAGCTTGC
CAGGCGGAAG ACGATCGTCT GGCAAATAGC GCCCTGCGTA ATTAA
 
Protein sequence
MTTQRSPGLF RRLAHGSLVK QILVGLVLGI LLAWISKPAA EAVGLLGTLF VGALKAVAPI 
LVLMLVMASI ANHQHGQKTN IRPILFLYLL GTFSAALAAV VFSFAFPSTL HLSSSAGDIS
PPSGIVEVMR GLVMSMVSNP IDALLKGNYI GILVWAIGLG FALRHGNETT KNLVNDMSNA
VTFMVKLVIR FAPIGIFGLV SSTLATTGFS TLWGYAQLLV VLVGCMLLVA LVVNPLLVWW
KIRRNPFPLV LLCLRESGVY AFFTRSSAAN IPVNMALCEK LNLDRDTYSV SIPLGATINM
AGAAITITVL TLAAVNTLGI PVDLPTALLL SVVASLCACG ASGVAGGSLL LIPLACNMFG
ISNDIAMQVV AVGFIIGVLQ DSCETALNSS TDVLFTAAAC QAEDDRLANS ALRN