Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3999 |
Symbol | yicI |
ID | 6144075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4077337 |
End bp | 4079655 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618824 |
Product | alpha-xylosidase YicI |
Protein accession | YP_001745963 |
Protein GI | 170680791 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA GCGACGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT CAGGTGTTCG AGGTTGAACA GCAGGGTAAT GAAATGGTGG TCTATGCTGC CCCCCGTGAT GTGCGTGAAC GTACCTGGCA GCTTGATACG CCTTTATTTA CGCTGCGCTT TTTCTCCCCA CAGGAAGGTA TTGTCGGTGT ACGGATTGAG CATTTTCAGG GGGCGCTGAA TAACGGTCCT CATTACCCGC TCAATATTTT GCAGGACGTG AAGGTCACAA TCGAAAACAC AGAACGTTAC GCTGAGTTTA AAAGTGGCAA CTTAAGTGCG CGTGTCAGCA AAGGTGAGTT CTGGTCACTG GATTTTCTGC GCAACGGCGA ACGTATTACC GGTAGTCAGG TGAAAAATAA TGGCTACGTG CAGGACACGA ATAATCAACG AAATTATATG TTTGAGCGGC TGGATCTTGG CGTTGGCGAA ACAGTTTACG GTCTGGGAGA GCGCTTTACT GCCCTGGTGC GCAATGGCCA AACGGTAGAG ACCTGGAACC GGGACGGCGG CACAAGTACT GAACAGGCGT ATAAAAATAT TCCGTTCTAT ATGACTAACC GTGGTTATGG GGTACTGGTC AATCATCCTC AATGCGTCTC TTTTGAAGTG GGATCGGAGA AAGTCTCCAA AGTGCAGTTC AGCGTTGAGA GTGAATATCT CGAATACTTT GTTATCGACG GCCCGACGCC GAAAGCGGTA CTTGATCGTT ATACCCGTTT TACTGGTCGT CCGGCGCTGC CGCCCGCGTG GTCCTTCGGT CTGTGGCTAA CCACTTCATT TACCACCAAC TACGACGAAG CGACGGTAAA CAGCTTTATC GATGGTATGG CGGAACGCAA TCTGCCGCTG CATGTTTTCC ACTTTGACTG TTTCTGGATG AAAGCCTTCC AGTGGTGCGA TTTTGAGTGG GACCCGCTGA CTTTCCCGGA CCCGGAAGGG ATGATCCGCC GTCTGAAAGC GAAAGGGCTA AAAATCTGCG TCTGGATTAA CCCTTATATC GGGCAAAAAT CCCCTGTCTT TAAAGAGTTA CAAGAGAAAG GTTATTTACT CAAACGCCCG GACGGTTCGC TGTGGCAGTG GGATAAATGG CAGCCAGGTC TGGCGATTTA TGACTTTACC AATCCGGATG CCTGCAAATG GTACGCCGAC AAACTGAAAG GTCTGGTCGC GATGGGCGTT GATTGCTTTA AGACCGACTT TGGCGAACGT ATTCCAACCG ATGTTCAATG GTTTGACGGT TCCGATCCGC AGAAAATGCA TAACCATTAT GCGTACATCT ACAACGAACT GGTGTGGAAC GTGCTCAAGG ACACCGTTGG TGAGGAAGAA GCCGTCTTGT TTGCCCGCTC GGCCTCCGTT GGTGCGCAGA AATTTCCGGT ACACTGGGGT GGCGACTGTT ACGCTAACTA CGAATCAATG GCGGAAAGCC TGCGCGGTGG TTTGTCTATT GGCCTTTCAG GTTTTGGCTT CTGGAGCCAC GATATCGGCG GCTTTGAAAA TACCGCTCCG GCGCACGTTT ACAAACGCTG GTGCGCGTTT GGTTTGCTCT CCAGCCATAG CCGTTTACAC GGCAGCAAAT CTTATCGTGT GCCGTGGGCC TATGATGATG AGTCCTGTGA TGTGGTGCGC TTCTTCACGC AACTGAAATG CCGCATGATG CCGTATCTGT ATCGTGAGGC TGCTCGTGCG AACGCGCGGG GTACGCCGAT GATGCGGGCC ATGATGATGG AGTTCCCGGA CGATCCGGCT TGTGATTACC TTGACCGTCA ATACATGTTA GGCGACAACG TGATGGTTGC TCCGGTGTTC ACTGAATCGG GCGATGTGCA GTTCTACTTG CCGGAAGGTC GCTGGACACA CCTGTGGCAC AACGATGAAC TCGATGGTAG TCGCTGGCAT AAACAGCAGC ACGGCTTCCT GAGTCTGCCC GTTTATGTGC GTGATAACAC CCTACTGGCG CTGGGCAACA ACGAGCAACG TCCCGATTAC GAGTGGCACG AAGGCACGGC ATTCCACCTC TTTAATCTGC AAGACGGGCA TGAAGCCATC TGTGAAGTGC CCGCTGCCGA TGGTTCCGTT CTTTTCACCC TGAAAGCGGC GCGTACTGGC AACACAATTA CTGTGAATGG TACGGGCGAG GCGAAGAACT GGACGCTGTG CTTGCGCAAT GTTGTGAAAG TAAATGGTCT GCAAGGCGGT TCGCAGGCTG AAAGTGAGCT GGGGCTGGTG GTGACGCCTC AAGGGAATGC GCTGACAATT ACGTTGTAA
|
Protein sequence | MKISDGNWLI QPGLNLIHPL QVFEVEQQGN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP QEGIVGVRIE HFQGALNNGP HYPLNILQDV KVTIENTERY AEFKSGNLSA RVSKGEFWSL DFLRNGERIT GSQVKNNGYV QDTNNQRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE TWNRDGGTST EQAYKNIPFY MTNRGYGVLV NHPQCVSFEV GSEKVSKVQF SVESEYLEYF VIDGPTPKAV LDRYTRFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL HVFHFDCFWM KAFQWCDFEW DPLTFPDPEG MIRRLKAKGL KICVWINPYI GQKSPVFKEL QEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPDACKWYAD KLKGLVAMGV DCFKTDFGER IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKDTVGEEE AVLFARSASV GAQKFPVHWG GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH GSKSYRVPWA YDDESCDVVR FFTQLKCRMM PYLYREAARA NARGTPMMRA MMMEFPDDPA CDYLDRQYML GDNVMVAPVF TESGDVQFYL PEGRWTHLWH NDELDGSRWH KQQHGFLSLP VYVRDNTLLA LGNNEQRPDY EWHEGTAFHL FNLQDGHEAI CEVPAADGSV LFTLKAARTG NTITVNGTGE AKNWTLCLRN VVKVNGLQGG SQAESELGLV VTPQGNALTI TL
|
| |