Gene EcolC_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1141 
Symbol 
ID6068051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1245077 
End bp1246216 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content57% 
IMG OID641600557 
Productputative 3-phenylpropionic acid transporter 
Protein accessionYP_001724135 
Protein GI170019181 
COG category 
COG ID 
TIGRFAM ID[TIGR00882] oligosaccharide:H+ symporter
[TIGR00902] phenyl proprionate permease family protein 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTTGC AATCCACGCG CTGGTTGGCG CTCGGCTATT TCACATACTT TTTTAGTTAC 
GGCATTTTTC TACCTTTCTG GAGCGTCTGG CTTAAAGGGA TTGGTTTAAC GCCAGAAACC
ATCGGCCTGT TATTGGGGGC AGGTCTGGTT GCCCGTTTTC TCGGGAGTTT GCTCATCGCG
CCCCGCGTCA GCGATCCTTC CCGCCTGATT TCCGCCTTGC GCGTGCTGGC ACTGCTGACA
CTTCTCTTTG CTGTCGCCTT CTGGGCGGGG GCGCACGTAG CGTGGCTGAT GCTGGTGATG
ATTGGCTTTA ACCTCTTTTT CTCACCGCTG GTACCGTTGA CCGATGCACT GGCGAATACG
TGGCAAAAGC AGTTCCCGCT TGATTACGGC AAAGTGCGAC TGTGGGGCTC GGTGGCGTTT
GTCATTGGCT CGGCGCTGAC GGGCAAACTG GTCACTATGT TTGATTATCG GGTGATCCTC
GCGCTGTTGA CGTTGGGCGT GGCATCCATG CTGCTCGGCT TTCTCATCCG TCCGACGATT
CAGCCACAAG GGGCAAGCCG CCAGCAGGAG AGCACCGGTT GGTCAGCGTG GTTGGCGCTG
GTTCGCCAGA ACTGGCGCTT TCTGGCCTGC GTTTGTTTAT TGCAGGGGGC ACATGCGGCC
TATTACGGTT TTAGCGCCAT TTACTGGCAG GCAGCTGGCT ACTCGGCCTC GGCGGTGGGG
TATTTGTGGT CGCTGGGCGT GGTGGCGGAA GTCATTATCT TTGCGCTGAG TAATAAACTT
TTCCGCCGTT GTAGTGCACG CGATATGCTG TTGATCTCGG CGATTTGCGG CGTAGTGCGC
TGGGGCATTA TGGGAGCAAC TACGGCGTTG CCGTGGTTGA TAGTGGTGCA AATTCTGCAT
TGCGGCACCT TCACGGTCTG CCACCTGGCC GCCATGCGTT ATATTGCTGC TCGCCAGGGT
AGCGAAGTCA TCCGTTTACA GGCGGTTTAC TCTGCCGTCG CGATGGGCGG CAGTATCGCT
ATCATGACCG TTTTCGCCGG TTTCCTGTAT CAATATCTGG GCCACGGCGT GTTCTGGGTA
ATGGCGCTGG TGGCGCTGCC GGCAATGTTT TTGCGCCCGA AAGTTGTTCC CTCATGCTGA
 
Protein sequence
MVLQSTRWLA LGYFTYFFSY GIFLPFWSVW LKGIGLTPET IGLLLGAGLV ARFLGSLLIA 
PRVSDPSRLI SALRVLALLT LLFAVAFWAG AHVAWLMLVM IGFNLFFSPL VPLTDALANT
WQKQFPLDYG KVRLWGSVAF VIGSALTGKL VTMFDYRVIL ALLTLGVASM LLGFLIRPTI
QPQGASRQQE STGWSAWLAL VRQNWRFLAC VCLLQGAHAA YYGFSAIYWQ AAGYSASAVG
YLWSLGVVAE VIIFALSNKL FRRCSARDML LISAICGVVR WGIMGATTAL PWLIVVQILH
CGTFTVCHLA AMRYIAARQG SEVIRLQAVY SAVAMGGSIA IMTVFAGFLY QYLGHGVFWV
MALVALPAMF LRPKVVPSC