Gene ECH74115_3372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3372 
Symbol 
ID6969819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3116691 
End bp3117881 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content50% 
IMG OID643387181 
Producttransporter, major facilitator family 
Protein accessionYP_002271644 
Protein GI209400975 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAGA AGTTATGGAC GAAGGATTTT TGGGCAATAA CCATCATCAG CTTTATTATT 
TTCTTCGTCT TTTATGTTTT ACTAACATTG TTGCCAATTT ATATCTCTGA CCGCTTGCAT
GCCTCTCCTG ATAAAGCAGG TTTGTTGGTG ACTTTATTTT TAATTGCGGC GATTGTTATT
CGACCCTTTG CCGGGCAATG GGTGGGTAAA TATTCGAATA AAACTATTCT GGTGCTCTCT
TCTCTGGCCT TTTTGGTGGT CACTGCGCTG TATCCTTTTT GCCACTCAAT AGAATCACTG
CTTTTTATTA GGGTGCTTCA TGGTATTACC TTCGGGGTTA TCACAACGGT AAAGGGAACG
ATTTCCGCGC GGCTGATCCC GGCCTCCCGA CGTGGGGAGG GCATCAGTTT TTTCTCTCTG
GCAATGGGGC TGGCAATGGT GGTCGGGCCG TGGATTGGCC TGAATATGGC GCGCTGGGAG
GCCTTTAATA TGGCTTTCTG GTTATGTACT GGCGTTGCGG CGGTGGGGAT TATCCTGTCG
CTGATTATGA CCGTGCCGCC GGTTATCAGC CATGCCGACG GTTCAAAGCC AAAGATGGGC
TTCGCCGCCA TGTTCGATCG CGCGGCATTG CCGTTTGCCA TGGTTACATT CTTTATGACC
TTTTCGTATG CCGGGGTTTC TGCCTTTCTG GCGCTTTACG CCCGCGAACT TAATCTGATG
TCGGCGGCCA GTAATTTCCT GCTCTGCTAC GCCATCTTCC TGATGATCTG CCGTACCTTC
ACCGGCAATG TTTGCGACAA AAAAGGCCCG AAATATGTGG TTTACCCCTG CCTGCTGTTC
TTTACGGTTG GGCTGGTGGT TCTCGGCTAC ACCCAGGGCA GCGTAATGAT GGTCGTTTCT
GGCGCGTTGA TTGGTATCGG GTATGGTTCC GTGACGCCAG TTTTTCAGAC GCAGATTATC
AGTTCAGTGG AACCGCATAA AATCGGTGTC GCAAACTCCC TCTTCTTCAA TGCGATGGAT
GCAGGCCTGG CGCTGGGAGC CTGTGTGATG GGGATGATGG TTGCACATAC TGGCTACCGA
ATGATTTATC TGCTGGGCGC ACTATTAGTG GTAGTGGCTG GTGGAGTCTA TGCGCTGCAA
ATGAAGGGAA AAAGCGGTGT CGCGCTAGTA GTGGCAAAAG AAATTCATTA A
 
Protein sequence
MKEKLWTKDF WAITIISFII FFVFYVLLTL LPIYISDRLH ASPDKAGLLV TLFLIAAIVI 
RPFAGQWVGK YSNKTILVLS SLAFLVVTAL YPFCHSIESL LFIRVLHGIT FGVITTVKGT
ISARLIPASR RGEGISFFSL AMGLAMVVGP WIGLNMARWE AFNMAFWLCT GVAAVGIILS
LIMTVPPVIS HADGSKPKMG FAAMFDRAAL PFAMVTFFMT FSYAGVSAFL ALYARELNLM
SAASNFLLCY AIFLMICRTF TGNVCDKKGP KYVVYPCLLF FTVGLVVLGY TQGSVMMVVS
GALIGIGYGS VTPVFQTQII SSVEPHKIGV ANSLFFNAMD AGLALGACVM GMMVAHTGYR
MIYLLGALLV VVAGGVYALQ MKGKSGVALV VAKEIH