Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1518 |
Symbol | |
ID | 5594565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1525335 |
End bp | 1526603 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920672 |
Product | benzoate transporter |
Protein accession | YP_001458228 |
Protein GI | 157160910 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3135] Uncharacterized protein involved in benzoate metabolism |
TIGRFAM ID | [TIGR00843] benzoate transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 48 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGACTCC TCCGGCAAAA CGGAAGTTTA TCACTTGTGC GTTATAACGG ACAAATGCTA CGGTGCCTGT ACGCTATAAC GCACGAGGTG ACTATGCGTC TGTTTTCTAT TCCTCCACCC ACGCTACTGG CGGGGTTTCT GGCGGTATTA ATTGGCTACG CCAGTTCAGC GGCAATAATC TGGCAAGCAG CGATTGTCGC CGGAGCCACC ACTGCACAAA TCTCTGGCTG GATGACGGCG CTGGGGCTGG CAATGGGCGT CAGTACGCTG ACTCTGACAT TATGGTATCG CGTACCTGTT CTCACCGCAT GGTCAACGCC TGGCGCGGCT TTGTTGGTCA CCGGATTGCA GGGACTAACA CTTAACGAAG CCATCGGCGT TTTTATTGTC ACCAACGCGC TAATAGTCCT CTGCGGCATA ACGGGACTCT TTGCTCGTCT GATGCGCATT ATTCCGCACT CGCTTGCGGC GGCAATGCTT GCCGGGATTT TATTACGCTT TGGTTTACAG GCGTTTGCCA GTCTGGACGG TCAATTTACG TTGTGTGGAA GTATGTTGCT GGTATGGCTG GCAACCAAGG CCGTTGCGCC GCGCTATGCG GTAATTGCCG CGATGATTAT TGGGATCGTG ATCGTCATCG CGCAAGGTGA CGTTGTCACA ACTGATGTTG TCTTTAAACC CGTTCTCCCC ACTTATATTA CCCCTGATTT TTCGTTTGCT CACAGCCTGA GCGTTGCACT CCCCCTTTTT CTGGTGACGA TGGCATCGCA AAACGCACCG GGTATCGCAG CAATGAAAGC AGCTGGATAT TCGGCTCCTG TTTCGCCATT AATTGTATTT ACTGGATTGC TGGCACTGGT TTTTTCCCCT TTCGGCGTTT ATTCCGTCGG TATTGCGGCA ATCACCGCGG CTATTTGCCA AAGCCCGGAA GCGCATCCGG ATAAAGATCA ACGTTGGCTG GCCGCTGCCG TTGCAGGCAT TTTCTATTTG CTCGCAGGTC TGTTTGGTAG TGCCATTACC GGGATGATGG CTGCCCTGCC CGTAAGTTGG ATCCAGATGC TGGCAGGTCT GGCGCTGTTA AGTACCATCG GCGGCAGTTT GTATCAGGCG CTGCATAATG AGCGTGAGCG AGACGCGGCG GTGGTGGCAT TTCTGATAAC GGCAAGTGGA TTGACGCTGG TCGGGATTGG TTCTGCGTTT TGGGGATTAA TTGCCGGAGG CGTTTGTTAC GTGGTGTTGA ATTTAATCGC TGACAGAAAC CGATATTGA
|
Protein sequence | MRLLRQNGSL SLVRYNGQML RCLYAITHEV TMRLFSIPPP TLLAGFLAVL IGYASSAAII WQAAIVAGAT TAQISGWMTA LGLAMGVSTL TLTLWYRVPV LTAWSTPGAA LLVTGLQGLT LNEAIGVFIV TNALIVLCGI TGLFARLMRI IPHSLAAAML AGILLRFGLQ AFASLDGQFT LCGSMLLVWL ATKAVAPRYA VIAAMIIGIV IVIAQGDVVT TDVVFKPVLP TYITPDFSFA HSLSVALPLF LVTMASQNAP GIAAMKAAGY SAPVSPLIVF TGLLALVFSP FGVYSVGIAA ITAAICQSPE AHPDKDQRWL AAAVAGIFYL LAGLFGSAIT GMMAALPVSW IQMLAGLALL STIGGSLYQA LHNERERDAA VVAFLITASG LTLVGIGSAF WGLIAGGVCY VVLNLIADRN RY
|
| |