Gene ECH74115_5320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5320 
Symbol 
ID6967929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4961161 
End bp4962426 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content45% 
IMG OID643388981 
Producttransporter, major facilitator family 
Protein accessionYP_002273390 
Protein GI209396239 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.581076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.378409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACGA AAAAGAAATG GGCGTTATTT AGTCTATTAA CACTGTGTGG CGGTACAATT 
TATAAATTAC CGTCGCTGAA AGATGCGTTT TATATCCCGA TGCAGGAATA TTTCCATTTG
ACCAATGGTC AAATTGGTAA TGCTATGTCG GTAAACTCAT TTGTTACCAC AGTGGGCTTT
TTTCTGTCTA TTTATTTTGC CGATAAACTA CCGCGCAGAT ACACCATGTC ATTCTCACTC
ATTGCGACAG GATTACTGGG TGTTTATTTG ACGACAATGC CGGGGTATTG GGGCATCCTC
TTTGTCTGGG CGCTATTTGG CGTTACTTGC GACATGATGA ACTGGCCGGT CTTGCTCAAG
TCGGTAAGTC GATTGGGCAA TAGCGAACAA CAAGGTCGGT TGTTTGGCTT CTTCGAAACA
GGGCGTGGCA TTGTCGATAC CGTGGTGGCA TTTTCTGCGT TGGCAGTATT TACCTGGTTT
GGCAGTGGCT TATTAGGTTT TAAAGCAGGC ATCTGGTTCT ATTCCCTTAT TGTGATTGCC
GTAGGCATTA CTATTTTCTT TGTCCTGAAT GACAAAGAAG AGGCACCGTC CGTTGAGGTG
AAAAAAGAAG ACGGGGCATC GCAAAACACC AGTATGACCT CGGTGCTGAA AGACAAAACT
ATCTGGCTTA TCGCTTTCAA CGTCTTCTTC GTTTACGCGG TTTACTGTGG CCTGACATTC
TTCATTCCAT TCCTGAAAAA CATCTATCTA TTGCCCGTTG CGCTGGTGGG GGCTTACGGC
ATCATTAACC AATACTGTCT GAAAATGATT GGTGGACCGA TTGGTGGCAT GATTTCAGAT
AAAATCCTGA AATCGCCGAG TAAATATCTA TGCTACACCT TTATCATCAG TACCGCTGCG
CTCGTACTGT TGATTATGCT GCCGCACGAA AGTATGCCGG TCTATTTAGG GATGGCATGT
ACGCTGGGCT TTGGCGCGAT AGTCTTTACA CAGCGAGCCG TATTTTTTGC ACCTATCGGC
GAAGCAAAAA TTGCTGAAAA TAAAACAGGC GCGGCGATGG CGTTGGGTAG CTTTATTGGT
TACGCCCCGG CGATGTTCTG CTTCAGTCTG TATGGCTACA TTCTGGATTT AAATCCGGGG
ATTATTGGCT ACAAAATCGT GTTTGGCATT ATGGCCTGCT TCGCATTCTG TGGTGCGGTG
GTTTCCGTAA TGCTGGTTAA GCGTATTAGC CAACGTAAGA AAGAGATGCT GGCGGCTGAA
GCTTAA
 
Protein sequence
MLTKKKWALF SLLTLCGGTI YKLPSLKDAF YIPMQEYFHL TNGQIGNAMS VNSFVTTVGF 
FLSIYFADKL PRRYTMSFSL IATGLLGVYL TTMPGYWGIL FVWALFGVTC DMMNWPVLLK
SVSRLGNSEQ QGRLFGFFET GRGIVDTVVA FSALAVFTWF GSGLLGFKAG IWFYSLIVIA
VGITIFFVLN DKEEAPSVEV KKEDGASQNT SMTSVLKDKT IWLIAFNVFF VYAVYCGLTF
FIPFLKNIYL LPVALVGAYG IINQYCLKMI GGPIGGMISD KILKSPSKYL CYTFIISTAA
LVLLIMLPHE SMPVYLGMAC TLGFGAIVFT QRAVFFAPIG EAKIAENKTG AAMALGSFIG
YAPAMFCFSL YGYILDLNPG IIGYKIVFGI MACFAFCGAV VSVMLVKRIS QRKKEMLAAE
A