Gene ECH74115_5657 details


Gene Information

Locus tag: ECH74115_5657
Symbol:
ID: 6969605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5298612
End bp: 5300063
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 52%
IMG OID: 643389290
Product: inner membrane protein YjeH
Protein accession: YP_002273686
Protein GI: 209400815
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID:
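
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5298612
end_bp = 5300063
gene_length_bp = 1452
protein_length_aa = 483

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
computed_protein_length = computed_length // 3 - 1

print(computed_length == gene_length_bp)             # compares against Gene Length
print(computed_protein_length == protein_length_aa)  # compares against Protein Length
```

Under those assumptions both comparisons hold for the values listed here.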


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.0960639
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGGAC TCAAACAAGA ACTGGGGCTG GCCCAGGGCA TTGGCCTGCT ATCGACGTCA 
TTATTAGGCA CTGGCGTGTT TGCCGTTCCT GCGTTAGCTG CACTGGTAGC GGGCAATAAC
AGCCTGTGGG CGTGGCCCAT TTTGATTATC TTAGTGTTCC CGATTGCGAT TGTGTTTGCG
ATTCTGGGTC GCCACTATCC CAGCGCAGGC GGCGTCGCGC ACTTCGTCGG TATGGCGTTT
GGTTCGCGGC TTGAGCGAGT CACCGGCTGG CTGTTTTTAT CGGTCATTCC CGTGGGTTTG
CCTGCCGCGC TACAAATTGC TGCCGGATTC GGCCAGGCAA TGTTTGGCTG GCATAGCTGG
CAACTGTTGT TGGCAGAACT CGGTACGCTG GCGTTGGTGT GGTATATCGG TACTCGCGGT
GCCAGTTCCA GTGCAAATCT ACAAACAGTT ATTGCCGGAC TTATCGTCGC ACTGATTGTC
GCTATCTGGT GGGCGGGCGA TATCAAACCT GCGAGTATCC CCTTCCCCGC GCCAGGAAAT
ATCGAACTTA CCGGGTTATT TGCTGCGTTA TCAGTGATGT TCTGGTGTTT TGTCGGTCTG
GAAGCATTTG CCCATCTTGC CTCGGAATTT AAAAATCCAG AACGTGATTT TCCTCGTGCT
TTGATGATTG GTCTGCTGCT GGCAGGATTA GTCTACTGGG GCTGTACGGT AGTCGTCTTA
CACTTCGACG CCTATGGTGA ACAAATGGCC GCGGCAGCAT CGCTTCCAAA AATTGTAGTA
CAACTGTTCG GTGTAGGAGC GTTATGGATT GCCTGCGTAA TTGGCTATCT GGCCTGCTTT
GCCAGTCTCA ACATTTATAT ACAGAGCTTC GCCCGCCTGG TCTGGTCGCA GGCGCAACAT
AATCCTGACC ACTACCTGGC ACGCCTCTCT TCTCGCCATA TCCCGAATAA TGCCCTCAAT
GCGGTGCTCG GCTGCTGCGT GGTGAGCACG TTGGTGATTC ATGCTTTAGA GATCAATCTG
GACGCTCTTA TTATTTATGC CAATGGCATC TTTATTATGA TTTATCTGTT ATGCATGCTG
GCAGGCTGTA AATTATTGCA AGGTCGTTAT CGACTACTGG CGGTGGTTGG CGGGCTGTTA
TGCGTTCTGT TACTGGCAAT GATCGGCTGG AAAAGTCTCT ATGCGCTGAT CATGCTGGCG
GGGTTATGGC TGTTTCTGCC AAAACGAAAA ACGCCGGAAA ATGGCATAAC CACATCATCC
GGCGTTTCGA CATTAATCCT GGCGATCGTC TTTATGATCA AGGCGGTCGC GGTCATCATC
CTTTCGCTGG TACTCACCAT CAAAAGTATT ACCGCCACCG GTCCCGGCGC TAAAACCGCC
GCCTGGCATG CGAGAAAAGC GCAAATGCGG CATCAACTTC ACTGTCAGAT GCTTTTGCAC
CGGCGGCAAT AA
 
Protein sequence
MSGLKQELGL AQGIGLLSTS LLGTGVFAVP ALAALVAGNN SLWAWPILII LVFPIAIVFA 
ILGRHYPSAG GVAHFVGMAF GSRLERVTGW LFLSVIPVGL PAALQIAAGF GQAMFGWHSW
QLLLAELGTL ALVWYIGTRG ASSSANLQTV IAGLIVALIV AIWWAGDIKP ASIPFPAPGN
IELTGLFAAL SVMFWCFVGL EAFAHLASEF KNPERDFPRA LMIGLLLAGL VYWGCTVVVL
HFDAYGEQMA AAASLPKIVV QLFGVGALWI ACVIGYLACF ASLNIYIQSF ARLVWSQAQH
NPDHYLARLS SRHIPNNALN AVLGCCVVST LVIHALEINL DALIIYANGI FIMIYLLCML
AGCKLLQGRY RLLAVVGGLL CVLLLAMIGW KSLYALIMLA GLWLFLPKRK TPENGITTSS
GVSTLILAIV FMIKAVAVII LSLVLTIKSI TATGPGAKTA AWHARKAQMR HQLHCQMLLH
RRQ
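
The protein sequence follows from the gene sequence by codon-wise translation. As a minimal sketch, the snippet below translates the first and last lines of the gene sequence. It builds the codon table from the amino-acid string that NCBI lists for translation table 11, which has the same amino-acid assignments as the standard code. The `translate` helper is hypothetical and is not part of any tool behind this record. It also ignores alternative start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence shown above.
first_line = "ATGAGTGGAC TCAAACAAGA ACTGGGGCTG GCCCAGGGCA TTGGCCTGCT ATCGACGTCA"
last_line = "CGGCGGCAAT AA"

print(translate(first_line))  # MSGLKQELGLAQGIGLLSTS
print(translate(last_line))   # RRQ
```

The two outputs correspond to the first 20 residues and the final three residues of the protein sequence listed above. Each full line of the gene sequence holds 60 bases, so every line begins in frame.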