Gene Information |
Locus tag | EcSMS35_1742 |
Symbol | |
ID | 6144930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1745063 |
End bp | 1746055 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616617 |
Product | benzoate transporter |
Protein accession | YP_001743795 |
Protein GI | 170684188 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3135] Uncharacterized protein involved in benzoate metabolism |
TIGRFAM ID | [TIGR00843] benzoate transporter |
|
|
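The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1745063
end_bp = 1746055
reported_gene_length = 993      # bp
reported_protein_length = 330   # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions, 993 bp corresponds to 331 codons: 330 residues plus one stop codon.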
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000016088 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 2.55307e-18 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGACTCC TCCGGCAAAA CGGAAGTTTA TCACTTGTGC GTTATAACGG ACAAATGCTA CGGTGCCTGT ACGCTATAAC GCACGAGGTG ACTATGCGTC TGTTTTCTAT TCCTCCACCC ACGCTACTGG CGGGGTTTCT GGCGGTATTA ATTGGCTACG CCAGTTCAGC GGCAATAATC TGGCAAGCAG CGATTGTCGC CGGAGCCACC ACTGCACAAA TCTCTGGCTG GATGACGGCG CTGGGGCTGG CAATGGGCGT CAGTACGCTG GCTCTGACAT TATGGTATCG CGTACCTGTT CTCACCGCAT GGTCAACGCC TGGCGCGGCT TTGTTGGTCA CCGGATTGCA GGGACTAACA CTTAACGAAA CCATCGGCGT TTTTATTGTC ACCAACGCGT TAATAGTCCT CTGCGGCATA ACGGGACTCT TTGCTCATCT GATGCGCATT ATTCCGCACT CGCTTGCGGC GGCAATGCTT GCCGGGATTT TATTACGCTT TGGTTTACAG GCGTTTGCGA GCCTGGATGG TCAATTTACG TTGTGTGGCA GTATGTTGCT GGTATGGCTG GCAACCAGGG CCGTTGCGCC GCGCTATGCG GTAATTGCCG CGATGATTAT TGGGGTCGTG ATCGTTATCG CGCAAGGTGA CATTGTCACA ACTGATGTTG TCTTTAAACC CGTTCTCCCC ACTTATATTA GCCCTGATTT TTCGTTTGCT CACAGCCTGA GCGTTGCACT CCCCCTTTTT CTGGTGACGA TGGCATCGCA AAACGCACCG GGTATCGCAG CAATGAAAGC AGCCGGATAT TCGGCTCCTG TTTCGCCATT AATTGTATTT ACTGGATTGC TGGCACTGGT TTTTTCCCCT TTCGGCGTTT ATTCCGTCGG TATTGCGGCA ATCACCGCGG CTATTTGCCA AAGCCCGGAA GCGCATCCGG ATAAAGATCA ACGTTGGCTG GCCGCTGCCG TTGCAGTAAT GCCAGTCAGT TAA
|
Protein sequence | MRLLRQNGSL SLVRYNGQML RCLYAITHEV TMRLFSIPPP TLLAGFLAVL IGYASSAAII WQAAIVAGAT TAQISGWMTA LGLAMGVSTL ALTLWYRVPV LTAWSTPGAA LLVTGLQGLT LNETIGVFIV TNALIVLCGI TGLFAHLMRI IPHSLAAAML AGILLRFGLQ AFASLDGQFT LCGSMLLVWL ATRAVAPRYA VIAAMIIGVV IVIAQGDIVT TDVVFKPVLP TYISPDFSFA HSLSVALPLF LVTMASQNAP GIAAMKAAGY SAPVSPLIVF TGLLALVFSP FGVYSVGIAA ITAAICQSPE AHPDKDQRWL AAAVAVMPVS
|
| |