Gene ECH74115_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2159 
SymbolydfJ 
ID6971089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2071499 
End bp2072782 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content47% 
IMG OID643386054 
Productinner membrane metabolite transport protein ydfJ 
Protein accessionYP_002270543 
Protein GI209399099 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0943239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCC AGTTATATTC GCTCGGCGCA GCGTTAGTGT TTCATGAAAT ATTTTTTCCT 
GAATCATCAA CGGCAATGGC GTTAATTCTG GCAATGGGAA CCTACGGTGC AGGTTATGTG
GCGCGTATTG TCGGAGCATT TATTTTCGGC AAAATGGGCG ACAGAATAGG GCGTAAAAAA
GTGCTCTTTA TTACCATCAC CATGATGGGG ATCTGTACCA CCTTAATTGG AGTGTTACCG
ACCTATGCAC AGATTGGTGT TTTTGCCCCC ATCTTGCTGG TGACGCTGCG TATTATTCAG
GGATTGGGTG CAGGTGCGGA AATTTCCGGT GCCGGTACGA TGCTGGCGGA ATATGCGCCA
AAAGGTAAGC GCGGAATTAT CTCCTCATTT GTGGCTATGG GAACTAACTG CGGAACCTTG
AGCGCAACGG CAATCTGGGC CTTTATGTTC TTCATTCTCA GTAAAGAGGA ACTGCTGGCG
TGGGGATGGC GTATACCGTT CCTGGCGAGC GTTGTCGTGA TGGTCTTTGC TATCTGGTTG
CGTATGAATC TGAAAGAAAG CCCGGTCTTT GAGAAGGTTA ACGACAGCAA CCAACCGACA
GCAAAACCTG CACCTGCTGG TAGCATATTC CAGAGCAAAT CCTTCTGGCT GGCAACAGGG
CTGCGTTTTG GTCAGGCAGG TAACTCCGGG TTAATTCAGA CTTTCCTTGC AGGCTATTTA
GTACAGACGT TATTGTTTAA CAAAGCAATT CCAACAGATG CATTGATGAT CAGTTCGATT
CCCGGCTTTA TGACCATTCC GTTCCTTGGT TGGTTATCCG ATAAAATTGG TCGCCGGATC
CCGTATATTA TTATGAATAC CTCCGCGATT GTGCTGGCAT GGCCAATGCT TTCTATCATC
GTAGATAAAA GCTATGCCCC GAGCACCATT ATGGTTGCAC TGATTGTGAT TCATAACTGT
GCGGTGCTGG GATTATTTGC TCTGGAAAAC ATTACCATGG CAGAAATGTT CGGCTGTAAA
AACCGCTTTA CCCGGATGGC TATTTCTAAA GAAATTGGTG GTCTTATCGC TTCCGGTTTT
GGTCCTATCC TGGCGGGTAT TTTCTGCACC ATGACGGAAT CCTGGTATCC GATCGCCATT
ATGATCATGG CATATTCAGT GATTGGTTTA ATCTCTGCGC TGAAAATGCC AGAAGTGAAA
GACCGTGATT TAAGTGCGCT GGAAGACGCT GCGGAAGATC AACCGCGTGT TGTAAGAGCT
GCGCAACCTT CCAGAAGTCT GTAA
 
Protein sequence
MDFQLYSLGA ALVFHEIFFP ESSTAMALIL AMGTYGAGYV ARIVGAFIFG KMGDRIGRKK 
VLFITITMMG ICTTLIGVLP TYAQIGVFAP ILLVTLRIIQ GLGAGAEISG AGTMLAEYAP
KGKRGIISSF VAMGTNCGTL SATAIWAFMF FILSKEELLA WGWRIPFLAS VVVMVFAIWL
RMNLKESPVF EKVNDSNQPT AKPAPAGSIF QSKSFWLATG LRFGQAGNSG LIQTFLAGYL
VQTLLFNKAI PTDALMISSI PGFMTIPFLG WLSDKIGRRI PYIIMNTSAI VLAWPMLSII
VDKSYAPSTI MVALIVIHNC AVLGLFALEN ITMAEMFGCK NRFTRMAISK EIGGLIASGF
GPILAGIFCT MTESWYPIAI MIMAYSVIGL ISALKMPEVK DRDLSALEDA AEDQPRVVRA
AQPSRSL