Gene ECH74115_5029 details


Gene Information

Locus tag: ECH74115_5029
Symbol: yicI
ID: 6967102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4679870
End bp: 4682188
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 52%
IMG OID: 643388710
Product: alpha-xylosidase YicI
Protein accession: YP_002273137
Protein GI: 209396989
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
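
The start, end, gene length and protein length fields in this record are arithmetically related. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the stated length of 2319 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative only.

```python
# Values copied from the "Start bp" and "End bp" fields of this record.
START_BP = 4679870
END_BP = 4682188


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2319
print(protein_length(length_bp))  # 772
```

Both results agree with the Gene Length and Protein Length fields listed above.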


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA GCGACGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT 
CAGGTGTTCG AGGTTGAACA GCAGGGTAAT GAAATGGTGG TCTATGCTGC CCCCCGTGAT
GTGCGTGAAC GAACCTGGCA GCTTGATACG CCTTTATTTA CGCTGCGCTT TTTCTCCCCA
CAGGAAGGTA TTGTCGGTGT ACGGATTGAG CATTTTCAGG GGGCGCTGAA TAACGGTCCT
CATTATCCGC TCAATATTTT GCAGGACGTG AAGGTCACAA TCGAAAACAC AGAACGTTAC
GCTGAGTTTA AAAGTGGCAA CTTAAGCGCG CGTGTCAGCA AAGGTGAGTT CTGGTCACTG
GATTTTCTGC GCAACGGCGA ACGTATTACC GGTAGTCAGG TGAAAAATAA TGGCTACGTG
CAGGACACGA ATAATCAACG AAATTATATG TTTGAGCGGC TGGATCTTGG CGTTGGCGAA
ACAGTTTACG GTCTGGGAGA GCGCTTTACT GCCCTGGTGC GCAATGGCCA GACGGTAGAG
ACCTGGAACC GGGACGGCGG CACAAGTACT GAACAGGCGT ATAAAAATAT CCCGTTCTAT
ATGACTAACC GTGGCTATGG GGTACTGGTC AATCATCCTC AATGCGTCTC TTTTGAAGTG
GGATCGGAGA AAGTCTCCAA AGTGCAGTTC AGCGTTGAGA GTGAATATCT CGAATACTTT
GTTATCGACG GCCCGACGCC GAAAGCGGTA CTTGATCGTT ATACCCGTTT TACTGGTCGT
CCGGCGCTGC CGCCCGCGTG GTCCTTCGGC CTGTGGCTAA CCACTTCATT TACCACCAAC
TACGACGAAG CGACGGTAAA CAGCTTTATC GATGGTATGG CGGGACGCAA TCTGCCACTG
CATGTTTTTC ACTTTGACTG TTTCTGGATG AAAGCCTTCC AGTGGTGCGA TTTTGAGTGG
GACCCGCTGA CTTTCCCGGA CCCGGAAGGG ATGATCCGCC GCCTGAAAGC GAAAGGGCTA
AAAATCTGCG TCTGGATTAA CCCCTATATC GGGCAAAAAT CCCCTGTCTT TAAAGAGTTA
CAAGAGAAAG GCTATTTACT CAAACGCCCG GACGGTTCGT TATGGCAGTG GGATAAATGG
CAGCCAGGTC TGGCGATTTA TGACTTTACC AATCCGGATG CCTGCAAATG GTACGCCGAC
AAACTGAAAG GTCTGGTCGC GATGGGCGTT GATTGCTTTA AGACCGACTT TGGCGAACGT
ATTCCAATCG ATGTTCAGTG GTTTGACGGT TCCGATCCGC AGAAAATGCA TAACCATTAT
GCGTACATCT ACAACGAACT GGTGTGGAAC GTGCTCAAGG ACACCGTTGG CGAGGAAGAA
GCCGTCTTGT TTGCTCGCTC GGCCTCCGTT GGTGCGCAGA AATTCCCGGT ACACTGGGGT
GGTGACTGTT ACGCTAACTA CGAATCAATG GCGGAAAGCC TGCGCGGTGG TTTGTCTATT
GGCCTTTCAG GTTTTGGTTT CTGGAGCCAC GATATCGGCG GCTTTGAGAA TACCGCTCTG
GCGCACGTTT ACAAACGCTG GTGCGCGTTT GGTTTGCTCT CCAGCCATAG CCGTTTACAT
GGCAGCAAAT CTTATCGTGT GCCGTGGGCC TACGATGATG AGTCCTGTGA TGTGGTGCGC
TTCTTCACGC AACTGAAATG CCGCATGATG CCGTATCTGT ATCGTGAAGC TGCTCGTGCG
AACGCGCGGG GTACGCCGAT GATGCGGGCC ATGATGATGG AGTTCCCGGA CGATCCGGCT
TGTGATTACC TTGACCGTCA ATACATGTTA GGCGACAACG TGATGGTTGC TCCGGTGTTC
AGTGAAGCGG GCGATGTGCA GTTCTACTTG CCGGAAGGTC GCTGGACACA TCTGTGGCAC
AACGATGAAC TCGATGGTAG TCGCTGGCAT AAACAGCAGC ACAGCTTCCT GAGTCTGCCC
GTTTATGTGC GTGATAACAC CCTACTGGCG CTGGGCAACA ACGAGCAACG TCCCGATTAC
GCGTGGCACG AAGGCACGGC ATTCCAGCTC TTTAATCTAC AAGACGGGCA TGAAGCCATC
TGTGAAGTGC CCGCTGCTGA CGGTTCCGTT CTTTTCACCC TGAAAGCGGC GCGTACTGGC
AACACGATTA CTGTGACTGG TGCGGGCGAG GCGAAGAACT GGACACTGTG CTTGCGCAAT
ATTGTGAAAG TAAATGGTCT GCAAGGTGGT TCGCAGGCTG AAAGTGAGCA GGGGCTGGTG
GTGACGCCTC AAGGGAATGC GCTGACAATT ACGTTGTAA
 
Protein sequence
MKISDGNWLI QPGLNLIHPL QVFEVEQQGN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGIVGVRIE HFQGALNNGP HYPLNILQDV KVTIENTERY AEFKSGNLSA RVSKGEFWSL
DFLRNGERIT GSQVKNNGYV QDTNNQRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQAYKNIPFY MTNRGYGVLV NHPQCVSFEV GSEKVSKVQF SVESEYLEYF
VIDGPTPKAV LDRYTRFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAGRNLPL
HVFHFDCFWM KAFQWCDFEW DPLTFPDPEG MIRRLKAKGL KICVWINPYI GQKSPVFKEL
QEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPDACKWYAD KLKGLVAMGV DCFKTDFGER
IPIDVQWFDG SDPQKMHNHY AYIYNELVWN VLKDTVGEEE AVLFARSASV GAQKFPVHWG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAL AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTQLKCRMM PYLYREAARA NARGTPMMRA MMMEFPDDPA
CDYLDRQYML GDNVMVAPVF SEAGDVQFYL PEGRWTHLWH NDELDGSRWH KQQHSFLSLP
VYVRDNTLLA LGNNEQRPDY AWHEGTAFQL FNLQDGHEAI CEVPAADGSV LFTLKAARTG
NTITVTGAGE AKNWTLCLRN IVKVNGLQGG SQAESEQGLV VTPQGNALTI TL