Gene ECH74115_4941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4941 
SymbolxylA 
ID6969880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4578718 
End bp4580040 
Gene Length1323 bp 
Protein Length440 aa 
Translation table11 
GC content50% 
IMG OID643388624 
Productxylose isomerase 
Protein accessionYP_002273051 
Protein GI209400679 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2115] Xylose isomerase 
TIGRFAM ID[TIGR02630] xylose isomerase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0307022 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAGCCT ATTTTGACCA GCTCGATCGC GTTCGTTATG AAGGCTCAAA ATCCTCAAAC 
CCGTTAGCAT TCCGTCACTA CAATCCCGAC GAACTGGTGT TGGGCAAGCG TATGGAAGAG
CACTTGCGAT TTGCCGCCTG CTACTGGCAC ACCTTCTGCT GGAACGGGGC GGATATGTTT
GGTGTGGGGG CGTTTAATCG TCCGTGGCAG CAGCCTGGTG AGGCACTGGC GTTGGCGAAG
CGTAAAGCAG ATGTCGCATT TGAGTTTTTC CACAAGTTAC ATGTGCCATT TTATTGCTTC
CACGATGTGG ATGTTTCTCC TGAGGGCGCG TCGTTAAAAG AGTACATCAA TAATTTTGCG
CAAATGGTTG ATGTTCTGGC AGGCAAGCAA GAAGAGAGCG GCGTGAAGCT GCTGTGGGGA
ACCGCTAACT GCTTTACAAA CCCTCGCTAT GGCGCGGGTG CGGCGACGAA CCCAGATCCA
GAAGTCTTCA GCTGGGCGGC AACGCAAGTT GTTACAGCGA TGGAAGCAAC CCATAAATTG
GGTGGTGAAA ACTATGTCCT GTGGGGCGGT CGTGAAGGTT ACGAAACGCT GTTAAATACC
GATTTGCGTC AGGAGCGTGA ACAACTGGGC CGCTTTATGC AAATGGTGGT TGAGCATAAA
CATAAAATCG GCTTCCAGGG CACGTTGCTT ATCGAACCGA AACCGCAAGA ACCGACTAAA
CATCAATATG ATTACGATGC TGCGACAGTC TATGGCTTCC TGAAACAGTT TGGTCTGGAA
AAAGAGATTA AACTGAACAT TGAAGCTAAC CACGCGACGC TGGCAGGTCA CTCTTTCCAT
CATGAAATAG CCACCGCCAT TGCGCTTGGC CTGTTCGGTT CTGTCGACGC CAACCGTGGC
GATGCGCAAC TGGGCTGGGA CACCGACCAG TTCCCGAACA GTGTGGAAGA GAATGCGCTG
GTGATGTATG AAATTCTCAA AGCAGGCGGT TTCACCACCG GTGGTCTGAA CTTCGATGCC
AAAGTACGTC GTCAAAGTAC TGATAAATAT GATCTGTTTT ACGGTCATAT CGGCGCGATG
GATACGATGG CACTGGCGCT GAAAATTGCA GCGTGCATGA TTGAAGATGG CGAGCTGGAT
AAACGCATCG CGCAGCGTTA TTCCGGCTGG AATAGCGAAT TGGGCCAGCA AATCCTGAAA
GGCCAAATGT CACTGGCAGA TTTAGCCAAA TATGCTCAGG AACATAATTT GTCTCCGGTG
CATCAGAGTG GTCGCCAGGA GCAACTGGAA AATCTGGTAA ATCATTATCT GTTCGACAAA
TAA
 
Protein sequence
MQAYFDQLDR VRYEGSKSSN PLAFRHYNPD ELVLGKRMEE HLRFAACYWH TFCWNGADMF 
GVGAFNRPWQ QPGEALALAK RKADVAFEFF HKLHVPFYCF HDVDVSPEGA SLKEYINNFA
QMVDVLAGKQ EESGVKLLWG TANCFTNPRY GAGAATNPDP EVFSWAATQV VTAMEATHKL
GGENYVLWGG REGYETLLNT DLRQEREQLG RFMQMVVEHK HKIGFQGTLL IEPKPQEPTK
HQYDYDAATV YGFLKQFGLE KEIKLNIEAN HATLAGHSFH HEIATAIALG LFGSVDANRG
DAQLGWDTDQ FPNSVEENAL VMYEILKAGG FTTGGLNFDA KVRRQSTDKY DLFYGHIGAM
DTMALALKIA ACMIEDGELD KRIAQRYSGW NSELGQQILK GQMSLADLAK YAQEHNLSPV
HQSGRQEQLE NLVNHYLFDK