Gene EcSMS35_1742 details


Gene Information

Locus tag: EcSMS35_1742
Symbol:
ID: 6144930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1745063
End bp: 1746055
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 52%
IMG OID: 641616617
Product: benzoate transporter
Protein accession: YP_001743795
Protein GI: 170684188
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3135] Uncharacterized protein involved in benzoate metabolism
TIGRFAM ID: [TIGR00843] benzoate transporter
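The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted as a residue. Neither convention is stated in this record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1745063, 1746055)
print(length, protein_length_aa(length))
```

Under those assumptions the script prints 993 and 330, matching the listed gene and protein lengths.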


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000016088
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 2.55307e-18
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGACTCC TCCGGCAAAA CGGAAGTTTA TCACTTGTGC GTTATAACGG ACAAATGCTA 
CGGTGCCTGT ACGCTATAAC GCACGAGGTG ACTATGCGTC TGTTTTCTAT TCCTCCACCC
ACGCTACTGG CGGGGTTTCT GGCGGTATTA ATTGGCTACG CCAGTTCAGC GGCAATAATC
TGGCAAGCAG CGATTGTCGC CGGAGCCACC ACTGCACAAA TCTCTGGCTG GATGACGGCG
CTGGGGCTGG CAATGGGCGT CAGTACGCTG GCTCTGACAT TATGGTATCG CGTACCTGTT
CTCACCGCAT GGTCAACGCC TGGCGCGGCT TTGTTGGTCA CCGGATTGCA GGGACTAACA
CTTAACGAAA CCATCGGCGT TTTTATTGTC ACCAACGCGT TAATAGTCCT CTGCGGCATA
ACGGGACTCT TTGCTCATCT GATGCGCATT ATTCCGCACT CGCTTGCGGC GGCAATGCTT
GCCGGGATTT TATTACGCTT TGGTTTACAG GCGTTTGCGA GCCTGGATGG TCAATTTACG
TTGTGTGGCA GTATGTTGCT GGTATGGCTG GCAACCAGGG CCGTTGCGCC GCGCTATGCG
GTAATTGCCG CGATGATTAT TGGGGTCGTG ATCGTTATCG CGCAAGGTGA CATTGTCACA
ACTGATGTTG TCTTTAAACC CGTTCTCCCC ACTTATATTA GCCCTGATTT TTCGTTTGCT
CACAGCCTGA GCGTTGCACT CCCCCTTTTT CTGGTGACGA TGGCATCGCA AAACGCACCG
GGTATCGCAG CAATGAAAGC AGCCGGATAT TCGGCTCCTG TTTCGCCATT AATTGTATTT
ACTGGATTGC TGGCACTGGT TTTTTCCCCT TTCGGCGTTT ATTCCGTCGG TATTGCGGCA
ATCACCGCGG CTATTTGCCA AAGCCCGGAA GCGCATCCGG ATAAAGATCA ACGTTGGCTG
GCCGCTGCCG TTGCAGTAAT GCCAGTCAGT TAA
 
Protein sequence
MRLLRQNGSL SLVRYNGQML RCLYAITHEV TMRLFSIPPP TLLAGFLAVL IGYASSAAII 
WQAAIVAGAT TAQISGWMTA LGLAMGVSTL ALTLWYRVPV LTAWSTPGAA LLVTGLQGLT
LNETIGVFIV TNALIVLCGI TGLFAHLMRI IPHSLAAAML AGILLRFGLQ AFASLDGQFT
LCGSMLLVWL ATRAVAPRYA VIAAMIIGVV IVIAQGDIVT TDVVFKPVLP TYISPDFSFA
HSLSVALPLF LVTMASQNAP GIAAMKAAGY SAPVSPLIVF TGLLALVFSP FGVYSVGIAA
ITAAICQSPE AHPDKDQRWL AAAVAVMPVS
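
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon map from the compact NCBI-style amino-acid string, which for translation table 11 has the same codon-to-residue assignments as the standard code (table 11 differs in which codons may act as starts). Because this gene starts with ATG, the sketch ignores alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical, and non-ACGT characters are not handled.

```python
from itertools import product

# Codon order follows the NCBI genetic-code listing: bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCGACTCC TCCGGCAAAA CGGAAGTTTA"))
```

For the first 30 bases this prints MRLLRQNGSL, which agrees with the start of the listed protein sequence.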