Gene EcolC_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2226 
Symbol 
ID6066830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2445261 
End bp2446436 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content53% 
IMG OID641601632 
Productbenzoate transporter 
Protein accessionYP_001725191 
Protein GI170020237 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3135] Uncharacterized protein involved in benzoate metabolism 
TIGRFAM ID[TIGR00843] benzoate transporter 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.711061 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTGT TTTCTATTCC TCCACCCACG CTACTGGCGG GGTTTCTGGC GGTATTAATT 
GGCTACGCCA GTTCAGCGGC AATAATCTGG CAAGCAGCGA TTGTCGCCGG AGCCACCACT
GCACAAATCT CTGGCTGGAT GACGGCGCTG GGGCTGGCAA TGGGCGTCAG TACGCTGACT
CTGACATTAT GGTATCGCGT ACCTGTTCTC ACCGCATGGT CAACGCCTGG CGCGGCTTTG
TTGGTCACCG GATTGCAGGG ACTAACACTT AACGAAGCCA TCGGCGTTTT TATTGTCACC
AACGCGCTAA TAGTCCTCTG CGGCATAACG GGACTCTTTG CTCGTCTGAT GCGCATTATT
CCGCACTCGC TTGCGGCGGC AATGCTTGCC GGGATTTTAT TACGCTTTGG TTTACAGGCG
TTTGCCAGTC TGGACGGTCA ATTTACGTTG TGTGGAAGTA TGTTGCTGGT ATGGCTGGCA
ACCAAGGCCG TTGCGCCGCG CTATGCGGTA ATTGCCGCGA TGATTATTGG GATCGTGATC
GTCATCGCGC AAGGTGACGT TGTCACAACT GATGTTGTCT TTAAACCCGT TCTCCCCACT
TATATTACCC CTGATTTTTC GTTTGCTCAC AGCCTGAGCG TTGCACTCCC CCTTTTTCTG
GTGACGATGG CATCGCAAAA CGCACCGGGT ATCGCAGCAA TGAAAGCAGC TGGATATTCG
GCTCCTGTTT CGCCATTAAT TGTATTTACT GGATTGCTGG CACTGGTTTT TTCCCCTTTC
GGCGTTTATT CCGTCGGTAT TGCGGCAATC ACCGCGGCTA TTTGCCAAAG CCCGGAAGCG
CATCCGGATA AAGATCAACG TTGGCTGGCC GCTGCCGTTG CAGGCATTTT CTATTTGCTC
GCAGGTCTGT TTGGTAGTGC CATTACCGGG ATGATGGCTG CCCTGCCCGT AAGTTGGATC
CAGATGCTGG CAGGTCTGGC GCTGTTAAGT ACCATCGGCG GCAGTTTGTA TCAGGCGCTG
CATAATGAGC GTGAGCGAGA CGCGGCGGTG GTGGCATTTC TGGTAACGGC AAGTGGATTG
ACGCTGGTCG GGATTGGTTC TGCGTTTTGG GGATTAATTG CCGGAGGCGT TTGTTACGTG
GTGTTGAATT TAATCGCTGA CAGAAACCGA TATTGA
 
Protein sequence
MRLFSIPPPT LLAGFLAVLI GYASSAAIIW QAAIVAGATT AQISGWMTAL GLAMGVSTLT 
LTLWYRVPVL TAWSTPGAAL LVTGLQGLTL NEAIGVFIVT NALIVLCGIT GLFARLMRII
PHSLAAAMLA GILLRFGLQA FASLDGQFTL CGSMLLVWLA TKAVAPRYAV IAAMIIGIVI
VIAQGDVVTT DVVFKPVLPT YITPDFSFAH SLSVALPLFL VTMASQNAPG IAAMKAAGYS
APVSPLIVFT GLLALVFSPF GVYSVGIAAI TAAICQSPEA HPDKDQRWLA AAVAGIFYLL
AGLFGSAITG MMAALPVSWI QMLAGLALLS TIGGSLYQAL HNERERDAAV VAFLVTASGL
TLVGIGSAFW GLIAGGVCYV VLNLIADRNR Y