Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2688 |
Symbol | hcaT |
ID | 6142858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2762041 |
End bp | 2763180 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641617560 |
Product | putative 3-phenylpropionic acid transporter |
Protein accession | YP_001744725 |
Protein GI | 170684196 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00882] oligosaccharide:H+ symporter [TIGR00902] phenyl proprionate permease family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTTTGC AATCCACGCG CTGGTTGGCG CTCGGCTATT TCACATACTT TTTTAGTTAC GGCATTTTTC TACCTTTCTG GAGCGTCTGG CTTAAAGGTA TTGGTTTAAC GCCAGAAACC ATCGGCCTGT TATTGGGGGC GGGTCTGGTT GCCCGTTTCC TCGGGAGTTT GCTCATCGCG CCCCGCGTCA GCGATCCTTC CCGCCTGATT TCCGCCTTGC GCGTGCTGGC ACTGCTGACA CTTCTCTTTG CTGTCGCCTT CTGGGCGGGG GCGCACGTAG CGTGGCTGAT GCTGGTGATG ATTGGCTTTA ACCTCTTTTT CTCACCGCTG GTACCGTTGA CCGATGCACT GGCGAATACG TGGCAAAAGC AGTTCCCGCT TGATTACGGC AAAGTGAGAC TGTGGGGCTC GGTAGCGTTT GTCATTGGCT CGGCGCTGAC GGGCAAACTG GTCAGTATGT TTGATTATCG GGTGATCCTC GCGCTGTTGA CGTTGGGCGT GGCATCCATG CTGCTCGGCT TTCTCATCCG TCCGACGATT CAGCCACAAG GGGCAAGCCG CCAGCAGGAG AGCACCGGTT GGTCTGCGTG GTTGGCGTTG GTTCGCCAGA ACTGGCGCTT TCTGGCCTGC GTTTGTTTAT TGCAGGGGGC ACATGCGGCC TATTACGGTT TTAGCGCCAT TTACTGGCAG GCAGCTGGCT ACTCGGCCTC GGCGGTGGGC TATTTGTGGT CGCTGGGCGT GGTGGCAGAA GTCATTATCT TTGCGCTGAG TAATAAGCTT TTCCGCCGTT GTAGCGCCCG TGATATGTTG CTGATCTCGG CAGTATGTGG GGTATTGCGT TGGGGAATTA TGGGGGCAAC CACTGAGCTA CTGTGGTTGA TTATGGTGCA AATTTTGCAT TGCGGCACCT TCACCGTGTG CCATCTGGCC GCTATGCGCT ACATTGCTGC TCGCCAGGGT AGCGAAGTCA TCCGTTTACA GGCGGTTTAC TCTGCCGTCG CGATGGGCGG CAGTATCGCC ATCATGACCG TTTTCGCCGG TTTCCTGTAT CAATATCTCG GCCACGGCGT GTTCTGGGTG ATGGCGCTGG TGGCGCTTCC GGCAATGTTT TTGCGCCCGA AAGTTGTTCC CTCATGCTGA
|
Protein sequence | MVLQSTRWLA LGYFTYFFSY GIFLPFWSVW LKGIGLTPET IGLLLGAGLV ARFLGSLLIA PRVSDPSRLI SALRVLALLT LLFAVAFWAG AHVAWLMLVM IGFNLFFSPL VPLTDALANT WQKQFPLDYG KVRLWGSVAF VIGSALTGKL VSMFDYRVIL ALLTLGVASM LLGFLIRPTI QPQGASRQQE STGWSAWLAL VRQNWRFLAC VCLLQGAHAA YYGFSAIYWQ AAGYSASAVG YLWSLGVVAE VIIFALSNKL FRRCSARDML LISAVCGVLR WGIMGATTEL LWLIMVQILH CGTFTVCHLA AMRYIAARQG SEVIRLQAVY SAVAMGGSIA IMTVFAGFLY QYLGHGVFWV MALVALPAMF LRPKVVPSC
|
| |