Gene EcSMS35_3999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3999 
SymbolyicI 
ID6144075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4077337 
End bp4079655 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content52% 
IMG OID641618824 
Productalpha-xylosidase YicI 
Protein accessionYP_001745963 
Protein GI170680791 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATTA GCGACGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT 
CAGGTGTTCG AGGTTGAACA GCAGGGTAAT GAAATGGTGG TCTATGCTGC CCCCCGTGAT
GTGCGTGAAC GTACCTGGCA GCTTGATACG CCTTTATTTA CGCTGCGCTT TTTCTCCCCA
CAGGAAGGTA TTGTCGGTGT ACGGATTGAG CATTTTCAGG GGGCGCTGAA TAACGGTCCT
CATTACCCGC TCAATATTTT GCAGGACGTG AAGGTCACAA TCGAAAACAC AGAACGTTAC
GCTGAGTTTA AAAGTGGCAA CTTAAGTGCG CGTGTCAGCA AAGGTGAGTT CTGGTCACTG
GATTTTCTGC GCAACGGCGA ACGTATTACC GGTAGTCAGG TGAAAAATAA TGGCTACGTG
CAGGACACGA ATAATCAACG AAATTATATG TTTGAGCGGC TGGATCTTGG CGTTGGCGAA
ACAGTTTACG GTCTGGGAGA GCGCTTTACT GCCCTGGTGC GCAATGGCCA AACGGTAGAG
ACCTGGAACC GGGACGGCGG CACAAGTACT GAACAGGCGT ATAAAAATAT TCCGTTCTAT
ATGACTAACC GTGGTTATGG GGTACTGGTC AATCATCCTC AATGCGTCTC TTTTGAAGTG
GGATCGGAGA AAGTCTCCAA AGTGCAGTTC AGCGTTGAGA GTGAATATCT CGAATACTTT
GTTATCGACG GCCCGACGCC GAAAGCGGTA CTTGATCGTT ATACCCGTTT TACTGGTCGT
CCGGCGCTGC CGCCCGCGTG GTCCTTCGGT CTGTGGCTAA CCACTTCATT TACCACCAAC
TACGACGAAG CGACGGTAAA CAGCTTTATC GATGGTATGG CGGAACGCAA TCTGCCGCTG
CATGTTTTCC ACTTTGACTG TTTCTGGATG AAAGCCTTCC AGTGGTGCGA TTTTGAGTGG
GACCCGCTGA CTTTCCCGGA CCCGGAAGGG ATGATCCGCC GTCTGAAAGC GAAAGGGCTA
AAAATCTGCG TCTGGATTAA CCCTTATATC GGGCAAAAAT CCCCTGTCTT TAAAGAGTTA
CAAGAGAAAG GTTATTTACT CAAACGCCCG GACGGTTCGC TGTGGCAGTG GGATAAATGG
CAGCCAGGTC TGGCGATTTA TGACTTTACC AATCCGGATG CCTGCAAATG GTACGCCGAC
AAACTGAAAG GTCTGGTCGC GATGGGCGTT GATTGCTTTA AGACCGACTT TGGCGAACGT
ATTCCAACCG ATGTTCAATG GTTTGACGGT TCCGATCCGC AGAAAATGCA TAACCATTAT
GCGTACATCT ACAACGAACT GGTGTGGAAC GTGCTCAAGG ACACCGTTGG TGAGGAAGAA
GCCGTCTTGT TTGCCCGCTC GGCCTCCGTT GGTGCGCAGA AATTTCCGGT ACACTGGGGT
GGCGACTGTT ACGCTAACTA CGAATCAATG GCGGAAAGCC TGCGCGGTGG TTTGTCTATT
GGCCTTTCAG GTTTTGGCTT CTGGAGCCAC GATATCGGCG GCTTTGAAAA TACCGCTCCG
GCGCACGTTT ACAAACGCTG GTGCGCGTTT GGTTTGCTCT CCAGCCATAG CCGTTTACAC
GGCAGCAAAT CTTATCGTGT GCCGTGGGCC TATGATGATG AGTCCTGTGA TGTGGTGCGC
TTCTTCACGC AACTGAAATG CCGCATGATG CCGTATCTGT ATCGTGAGGC TGCTCGTGCG
AACGCGCGGG GTACGCCGAT GATGCGGGCC ATGATGATGG AGTTCCCGGA CGATCCGGCT
TGTGATTACC TTGACCGTCA ATACATGTTA GGCGACAACG TGATGGTTGC TCCGGTGTTC
ACTGAATCGG GCGATGTGCA GTTCTACTTG CCGGAAGGTC GCTGGACACA CCTGTGGCAC
AACGATGAAC TCGATGGTAG TCGCTGGCAT AAACAGCAGC ACGGCTTCCT GAGTCTGCCC
GTTTATGTGC GTGATAACAC CCTACTGGCG CTGGGCAACA ACGAGCAACG TCCCGATTAC
GAGTGGCACG AAGGCACGGC ATTCCACCTC TTTAATCTGC AAGACGGGCA TGAAGCCATC
TGTGAAGTGC CCGCTGCCGA TGGTTCCGTT CTTTTCACCC TGAAAGCGGC GCGTACTGGC
AACACAATTA CTGTGAATGG TACGGGCGAG GCGAAGAACT GGACGCTGTG CTTGCGCAAT
GTTGTGAAAG TAAATGGTCT GCAAGGCGGT TCGCAGGCTG AAAGTGAGCT GGGGCTGGTG
GTGACGCCTC AAGGGAATGC GCTGACAATT ACGTTGTAA
 
Protein sequence
MKISDGNWLI QPGLNLIHPL QVFEVEQQGN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGIVGVRIE HFQGALNNGP HYPLNILQDV KVTIENTERY AEFKSGNLSA RVSKGEFWSL
DFLRNGERIT GSQVKNNGYV QDTNNQRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQAYKNIPFY MTNRGYGVLV NHPQCVSFEV GSEKVSKVQF SVESEYLEYF
VIDGPTPKAV LDRYTRFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL
HVFHFDCFWM KAFQWCDFEW DPLTFPDPEG MIRRLKAKGL KICVWINPYI GQKSPVFKEL
QEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPDACKWYAD KLKGLVAMGV DCFKTDFGER
IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKDTVGEEE AVLFARSASV GAQKFPVHWG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTQLKCRMM PYLYREAARA NARGTPMMRA MMMEFPDDPA
CDYLDRQYML GDNVMVAPVF TESGDVQFYL PEGRWTHLWH NDELDGSRWH KQQHGFLSLP
VYVRDNTLLA LGNNEQRPDY EWHEGTAFHL FNLQDGHEAI CEVPAADGSV LFTLKAARTG
NTITVNGTGE AKNWTLCLRN VVKVNGLQGG SQAESELGLV VTPQGNALTI TL