Gene EcSMS35_2688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2688 
SymbolhcaT 
ID6142858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2762041 
End bp2763180 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content56% 
IMG OID641617560 
Productputative 3-phenylpropionic acid transporter 
Protein accessionYP_001744725 
Protein GI170684196 
COG category 
COG ID 
TIGRFAM ID[TIGR00882] oligosaccharide:H+ symporter
[TIGR00902] phenyl proprionate permease family protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTTGC AATCCACGCG CTGGTTGGCG CTCGGCTATT TCACATACTT TTTTAGTTAC 
GGCATTTTTC TACCTTTCTG GAGCGTCTGG CTTAAAGGTA TTGGTTTAAC GCCAGAAACC
ATCGGCCTGT TATTGGGGGC GGGTCTGGTT GCCCGTTTCC TCGGGAGTTT GCTCATCGCG
CCCCGCGTCA GCGATCCTTC CCGCCTGATT TCCGCCTTGC GCGTGCTGGC ACTGCTGACA
CTTCTCTTTG CTGTCGCCTT CTGGGCGGGG GCGCACGTAG CGTGGCTGAT GCTGGTGATG
ATTGGCTTTA ACCTCTTTTT CTCACCGCTG GTACCGTTGA CCGATGCACT GGCGAATACG
TGGCAAAAGC AGTTCCCGCT TGATTACGGC AAAGTGAGAC TGTGGGGCTC GGTAGCGTTT
GTCATTGGCT CGGCGCTGAC GGGCAAACTG GTCAGTATGT TTGATTATCG GGTGATCCTC
GCGCTGTTGA CGTTGGGCGT GGCATCCATG CTGCTCGGCT TTCTCATCCG TCCGACGATT
CAGCCACAAG GGGCAAGCCG CCAGCAGGAG AGCACCGGTT GGTCTGCGTG GTTGGCGTTG
GTTCGCCAGA ACTGGCGCTT TCTGGCCTGC GTTTGTTTAT TGCAGGGGGC ACATGCGGCC
TATTACGGTT TTAGCGCCAT TTACTGGCAG GCAGCTGGCT ACTCGGCCTC GGCGGTGGGC
TATTTGTGGT CGCTGGGCGT GGTGGCAGAA GTCATTATCT TTGCGCTGAG TAATAAGCTT
TTCCGCCGTT GTAGCGCCCG TGATATGTTG CTGATCTCGG CAGTATGTGG GGTATTGCGT
TGGGGAATTA TGGGGGCAAC CACTGAGCTA CTGTGGTTGA TTATGGTGCA AATTTTGCAT
TGCGGCACCT TCACCGTGTG CCATCTGGCC GCTATGCGCT ACATTGCTGC TCGCCAGGGT
AGCGAAGTCA TCCGTTTACA GGCGGTTTAC TCTGCCGTCG CGATGGGCGG CAGTATCGCC
ATCATGACCG TTTTCGCCGG TTTCCTGTAT CAATATCTCG GCCACGGCGT GTTCTGGGTG
ATGGCGCTGG TGGCGCTTCC GGCAATGTTT TTGCGCCCGA AAGTTGTTCC CTCATGCTGA
 
Protein sequence
MVLQSTRWLA LGYFTYFFSY GIFLPFWSVW LKGIGLTPET IGLLLGAGLV ARFLGSLLIA 
PRVSDPSRLI SALRVLALLT LLFAVAFWAG AHVAWLMLVM IGFNLFFSPL VPLTDALANT
WQKQFPLDYG KVRLWGSVAF VIGSALTGKL VSMFDYRVIL ALLTLGVASM LLGFLIRPTI
QPQGASRQQE STGWSAWLAL VRQNWRFLAC VCLLQGAHAA YYGFSAIYWQ AAGYSASAVG
YLWSLGVVAE VIIFALSNKL FRRCSARDML LISAVCGVLR WGIMGATTEL LWLIMVQILH
CGTFTVCHLA AMRYIAARQG SEVIRLQAVY SAVAMGGSIA IMTVFAGFLY QYLGHGVFWV
MALVALPAMF LRPKVVPSC