Gene ECH74115_5844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5844 
Symbol 
ID6968143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5495559 
End bp5496737 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content54% 
IMG OID643389466 
Producttransporter, major facilitator family 
Protein accessionYP_002273858 
Protein GI209396495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.482212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGA CATCGCATCC CGTAGAACGT TTTTCTTTCA GCACCGCGTT ATTCGGGATG 
CTGGTTCTGA CCTTAGGTAT GGGTTTAGGC CGCTTTCTCT ATACGCCGAT GCTGCCAGTC
ATGCTGGCGG AAGGCGAGTT TTCATTTAGC GAACTCTCAT GGATCGCCAG TGGTAACTAT
GCCGGGTATC TGGCGGGGAG CCTGCTGTTT TCATTCGGCG CGTTTCATTT ACCCTCACGC
CTGCGCCCGT TCCTGTTAGC TTCCGCCCTC GCAACCGGAT TATTAATCCT CGCGATGGCG
TGGCTGCCGC CGTTTCTTCT GGTTTTCATC ATTCGCTTTC TGGCGGGAGT CGCCAGCGCC
GGGATGTTGA TTTTCGGCTC AACACTCATC ATGCAACATA CTCGCCATCC CTTTGTCCTT
GCGGCGCTAT TTTCTGGTGT TGGCGTCGGC ATCGCTCTGG GTAATGAATA TGTGCTGGCA
GGCCTGCATT TTGCCCTCTC TTCACAAACG TTGTGGCAAG GTGCCGGAGC ACTTTCTGCC
ATTATATTGC TTGCTCTGGC GCTGCTCATC CCGTCGAATA AACACGTTAT CCCGCCAGCG
CCATTGGCAA AAATCGCGCA ACAACCCATG AGCTGGTGGT TACTGGCGAT TCTGTATGGT
CTGGCGGGTT TTGGTTATAT CATCGTCGCC ACCTACCTGC CGCTCATGGC GAAAGACGCG
GGCCAGCCTG TGTTGACGGC TCACCTCTGG ACACTGGTTG GCTTGTCGAT TGTCCCAGGT
TGCTTTGGCT GGCTGTGGGC AGCCAAACGG TGGGGAGCAT TACCTTGCCT GACCGCGAAT
TTGCTGGTGC AGGCGATCTG CGTGCTGTTA ACCCTCGCCA GCAGCTCTCC TTTATTACTC
ATCATCAGCA GTATTGGTTT TGGCGGCACC TTTATGGGAA CGACCTCGCT GGTGATGACC
ATCGCCCGCC AGCTTAGCGT GCCGGGAAAT CTTAACCTTT TGGGCTTTGT GACACTCATT
TATGGTATCG GGCAAATTCT TGGCCCGGCG CTGACCAGTA TGCTCGGCAA CGGAACGTCG
GCGCTCGCCA GCGCCACACT CTGCGGCGCA GCGGCGCTAT TTATCGCAGC ATTAATCTGC
GGGATGCAAA TATTCAAATT GCATACGAAT TATTCTTAA
 
Protein sequence
MNSTSHPVER FSFSTALFGM LVLTLGMGLG RFLYTPMLPV MLAEGEFSFS ELSWIASGNY 
AGYLAGSLLF SFGAFHLPSR LRPFLLASAL ATGLLILAMA WLPPFLLVFI IRFLAGVASA
GMLIFGSTLI MQHTRHPFVL AALFSGVGVG IALGNEYVLA GLHFALSSQT LWQGAGALSA
IILLALALLI PSNKHVIPPA PLAKIAQQPM SWWLLAILYG LAGFGYIIVA TYLPLMAKDA
GQPVLTAHLW TLVGLSIVPG CFGWLWAAKR WGALPCLTAN LLVQAICVLL TLASSSPLLL
IISSIGFGGT FMGTTSLVMT IARQLSVPGN LNLLGFVTLI YGIGQILGPA LTSMLGNGTS
ALASATLCGA AALFIAALIC GMQIFKLHTN YS