Gene EcSMS35_3457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3457 
Symbolmtr 
ID6143895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3531329 
End bp3532573 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content54% 
IMG OID641618286 
Producttryptophan permease 
Protein accessionYP_001745435 
Protein GI170683709 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.813995 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACAC TAACCACCAC CCAAACGTCA CCGTCGCTGC TTGGCGGCGT GGTGATTATC 
GGCGGCACCA TTATTGGCGC AGGGATGTTT TCTCTGCCAG TGGTCATGTC CGGGGCGTGG
TTCTTCTGGT CAATGGCGGC GCTGATCTTT ACCTGGTTCT GTATGCTGCA TTCCGGCTTG
ATGATTCTGG AAGCTAACCT GAATTATCGA ATCGGTTCGA GCTTTGACAC CATCACCAAA
GACCTGCTCG GCAAAGGCTG GAACGTAGTC AACGGCATTT CCATTGCCTT TGTGCTCTAT
ATCCTGACTT ACGCCTATAT TTCTGCCAGC GGTTCGATTC TGCATCACAC CTTCGCGGAG
ATGTCGCTGA ACGTCCCTGC ACGGGCAGCT GGATTTGGTT TTGCACTACT GGTGGCGTTT
GTGGTGTGGT TGAGCACCAA AGCGGTCAGC CGGATGACGG CGATTGTGCT GGGGGCGAAA
GTCATTACGT TTTTCCTCAC CTTCGGTAGC CTGCTGGGAC ATGTGCAGCC AACGACCTTG
TTCAACGTTG CCGAAAGCAA TGCGTCTTAT GCGCCGTATC TGCTGATGAC ACTGCCATTC
TGTCTGGCAT CTTTTGGTTA TCACGGTAAC GTGCCGAGCC TGATGAAGTA TTACGGCAAA
GATCCGAAAA CCATCGTGAA ATGCCTGGTG TACGGTACGC TGATGGCGCT GGCGCTGTAT
ACCATCTGGT TGCTGGCGAC GATGGGCAAC ATCCCTCGTC CGGAGTTTAT CGGCATCGCC
GAGAAGGGCG GTAATATTGA TGTGCTGGTA CAGGCGTTAA GCGGCGTGCT GAACAGCCGT
AGCCTGGATC TGCTGCTGGT TGTGTTCTCA AACTTTGCGG TAGCGAGTTC GTTCCTCGGC
GTTACGCTGG GTTTGTTTGA CTATCTGGCA GATCTGTTTG GTTTCGATGA CTCGGCTATG
GGCCGCTTGA AAACAGCGTT GCTGACCTTT GCCCCGCCTG TTGTGGGTGG CCTGCTGTTT
CCTAACGGAT TCCTGTACGC CATTGGTTAT GCTGGCTTAG CGGCTACCAT CTGGGCGGCA
ATTGTTCCGG CGCTGTTGGC CCGCGCATCG CGTAAACGCT TTGGTAGCCC GAAATTCCGC
GTCTGGGGCG GCAAGCCGAT GATTATGCTG ATTCTGGTAT TTGGCGTTGG CAACGCACTG
GTCCATATCT TATCGAGCTT TAATTTGCTG CCGGTGTATC AGTAA
 
Protein sequence
MATLTTTQTS PSLLGGVVII GGTIIGAGMF SLPVVMSGAW FFWSMAALIF TWFCMLHSGL 
MILEANLNYR IGSSFDTITK DLLGKGWNVV NGISIAFVLY ILTYAYISAS GSILHHTFAE
MSLNVPARAA GFGFALLVAF VVWLSTKAVS RMTAIVLGAK VITFFLTFGS LLGHVQPTTL
FNVAESNASY APYLLMTLPF CLASFGYHGN VPSLMKYYGK DPKTIVKCLV YGTLMALALY
TIWLLATMGN IPRPEFIGIA EKGGNIDVLV QALSGVLNSR SLDLLLVVFS NFAVASSFLG
VTLGLFDYLA DLFGFDDSAM GRLKTALLTF APPVVGGLLF PNGFLYAIGY AGLAATIWAA
IVPALLARAS RKRFGSPKFR VWGGKPMIML ILVFGVGNAL VHILSSFNLL PVYQ