Gene ECH74115_3767 details


Gene Information

Locus tag: ECH74115_3767
Symbol: hcaT
ID: 6971295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3491325
End bp: 3492464
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 56%
IMG OID: 643387558
Product: putative 3-phenylpropionic acid transporter
Protein accession: YP_002272011
Protein GI: 209399133
COG category:
COG ID:
TIGRFAM ID: [TIGR00882] oligosaccharide:H+ symporter
            [TIGR00902] phenyl proprionate permease family protein
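
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 3491325
END_BP = 3492464


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1140
print(protein_length(gene_length(START_BP, END_BP)))  # 379
```

Both results match the Gene Length and Protein Length fields of the record.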


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTTTGC AATCCACGCG CTGGTTGGCG CTCGGCTATT TCACATACTT TTTTAGTTAC 
GGCATTTTTC TACCTTTCTG GAGCGTCTGG CTTAAAGGGA TTGGTTTAAC GCCAGAAACC
ATCGGCCTGT TATTGGGGGC AGGTCTGGTT GCCCGTTTTC TCGGGAGTTT GCTCATCGCG
CCCCGCGTCA GCGATCCTTC TCGCCTGATT TCCGCCTTGC GCGTGCTGGC ACTGCTGACA
CTTCTCTTTG CTGTCGCCTT CTGGGCGGGG GCGCACGTAG CGTGGCTGAT GCTGGTGATG
ATTGGCTTTA ACCTCTTTTT CTCACCGCTG GTACCGTTGA CCGATGCACT GGCGAATACG
TGGCAAAAGC AGTTCCCGCT TGATTACGGC AAAGTGCGAC TGTGGGGCTC GGTGGCGTTT
GTCATTGGCT CGGCGCTGAC GGGCAAACTG GTCACTATGT TTGATTATCG GGTGATCCTC
GTGCTGTTGA CGTTGGGCGT GGCATCCATG CTGCTCGGCT TTCTCATCCG TCCGACGATT
CAGCCACAAG GGGCAAGCCG CCAGCAGGAG AGCACCGGTT GGTCAGCGTG GTTGGCGCTG
GTTCGCCAGA ACTGGCGCTT TCTGGCCTGC GTTTGTTTAT TGCAGGGGGC ACATGCGGCC
TATTACGGTT TTAGCGCCAT TTACTGGCAG GCAGCTGGCT ACTCGGCCTC GGCGGTGGGG
TATTTGTGGT CGCTGGGCGT GGTGGCGGAA GTCATTATCT TTGCACTGAG TAATAAACTT
TTCCGCCGTT GTAGTGCACG CGATATGCTG TTGATCTCAG CGATTTGCGG CGTAGTGCGC
TGGGGCATTA TGGGAGCAAC TACGGCGTTG CCGTGGTTGA TAGTGGTGCA AATTCTGCAT
TGCGGCACCT TCACGGTCTG CCACCTGGCC GCCATGCGCT ATATTGCTGC TCGCCAGGGT
AGCGAAGTCA TCCGTTTACA GGCGGTTTAC TCTGCCGTCG CGATGGGCGG CAGTATCGCT
ATCATGACCG TTTTCGCCGG TTTCCTGTAT CAATATCTGG GCCACGGCGT GTTCTGGGTG
ATGGCGCTGG TGGCGCTTCC GGCAATGTTT TTGCGCCCGA AAGTTGTTCC CTCATGCTGA
 
Protein sequence
MVLQSTRWLA LGYFTYFFSY GIFLPFWSVW LKGIGLTPET IGLLLGAGLV ARFLGSLLIA 
PRVSDPSRLI SALRVLALLT LLFAVAFWAG AHVAWLMLVM IGFNLFFSPL VPLTDALANT
WQKQFPLDYG KVRLWGSVAF VIGSALTGKL VTMFDYRVIL VLLTLGVASM LLGFLIRPTI
QPQGASRQQE STGWSAWLAL VRQNWRFLAC VCLLQGAHAA YYGFSAIYWQ AAGYSASAVG
YLWSLGVVAE VIIFALSNKL FRRCSARDML LISAICGVVR WGIMGATTAL PWLIVVQILH
CGTFTVCHLA AMRYIAARQG SEVIRLQAVY SAVAMGGSIA IMTVFAGFLY QYLGHGVFWV
MALVALPAMF LRPKVVPSC
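
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, for stripping the block formatting, computing the GC fraction, and translating the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model). The names `clean`, `gc_fraction`, and `translate` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
# (first base varies slowest, each base in T, C, A, G order).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGTTTTGC AATCCACGCG CTGGTTGGCG"))  # MVLQSTRWLA
```

The first thirty bases translate to MVLQSTRWLA, which is the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the reported GC content.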