Gene SbBS512_E4107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4107 
SymbolyicI 
ID6272069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3832439 
End bp3834730 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content52% 
IMG OID641727937 
Productalpha-xylosidase YicI 
Protein accessionYP_001882369 
Protein GI187732405 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA GCGATGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT 
CAGGTGTTCG AGGTTGAACA GCAGGATAAT GAAATGGTGG TCTATGCTGC CCCCCGTGAT
GTGCGTGAAC GTACCTGGCA GCTTGATACG CCTTTATTTA CGCTGCGCTT TTTCTCCCCA
CAGGAAGGTA TTGTCGGTGT GCGGATTGAG CATTTTCAGG GAGCGCTGAA TAACGGTCCT
CATTATCCGC TCAATATTTT GCAGGACGTG AAGGTCACAA TCGAAAACAC AGAACGTTAT
GCTGAGTTTA AAAGTGGCAA CTTAAGCGCG CGTGTCAGCA AAGGTGAGTT CTGGTCACTC
GATTTTCTGC GCAACGGTGA ACGTATTACC GGTCAGGACA CGAATAATCA ACGAAATTAT
ATGTTTGAGC GGCTTGATCT TGGCGTTGGC GAAACAGTTT ATGGTCTGGG AGAGCGTTTT
ACTGCCCTGG TGCGCAATGG TCAGACGGTA GAGACCTGGA ACCGGGACGG CGGCACAAGT
ACTGAACAGG CGTATAAAAA TATCCCGTTC TATATGACTA ACCGTGGCTA TGGGGTGCTG
GTCAATCACC CCCAGAGTGT CTCTTTTGAA GTGGGATCGG AGAAAGTCTC CAAAGTGCAG
TTCAGCGTTG AGAGCGAATA TCTCGAATAC TTTGTTATCG ACGGTCCGAC GCCGAAAGCG
GTACTTGATC GTTATACCCG TTTTACTGGT CGTCCGGCGC TGCCGCCCGC ATGGTCCTTC
GGCCTGTGGC TAACCACTTC ATTTACCACC AACTACGACG AAGCGACGGT AAACAGCTTT
ATCGATGGTA TGGCGGAACG CAATCTGCCG CTGCATGTTT TCCACTTTGA CTGTTTCTGG
ATGAAAGCCT TCCAGTGGTG CGATTTTGAG TGGGACCCGC TGACTTTCCC GGACCCGGAA
GGGATGATCC GCCGCCTGAA AGCGAAAGGG CTGAAAATCT GCGTCTGGAT TAACCCCTAT
ATCGGTCAAA AATCCCCCGT CTTTAAAGAG TTACAAGAGA AAGGCTATTT ACTCAAACGC
CCGGACGGTT CGCTATGGCA GTGGGATAAA TGGCAGCCAG GTCTGGCGAT TTATGACTTT
ACCAATCCGG ATGCCTGCAA ATGGTACGCC GACAAACTGA AAGGTCTGGT CGCGATGGGC
GTTGATTGCT TTAAGACCGA CTTTGGCGAA CGTATCCCAA CTGATGTTCA GTGGTTTGAC
GGTTCCGATC CGCAGAAAAT GCATAACCAT TATGCGCACA TCTACAACGA ACTGGTGTGG
AACGTGCTCA AGGACACCGT TGGTGAGGAA GAAGCTGTCT TGTTTGCCCG CTCGGCCTCC
GTCGGTGCGC AGAAATTCCC GGTACACTGG GGTGGCGATT GTTACGCTAA CTACGAATCA
ATGGCGGAAA GCCTGCGCGG TGGTTTGTCT ATTGGCCTTT CAGGTTTTGG CTTCTGGAGC
CACTATATCG GCGGCTTTGA AAATACCGCT CCGGCGCACG TTTACAAATG CTGGTGCGCG
TTTGGTTTGC TCTCCAGCCA TAGCCGTTTA CACGGTAGCA AATCTTATCG TGTGCCGTGG
GCCTACGATG ATGAGTCCTG TGATGTGGTG CGCTTCTTCA CGCAACTGAA ATGCCGCATG
ATGCCGTATC TGTATCGTGA AGCTGCGCGT GCAAACGCGC GGGGTACGCC GATGATGCGG
GCCATGATGA TGGAGTTCCC GGACGATCCG GCTTATGATT ACCTTGACCG TCAATACATG
TTAGGCGACA ACGTGATGGT TGCGCCGGTG TTCACTGAAG CGGGCGATGT GCAGTTCTAC
CTGCCGGAAG GTCGCTGGAC ACACCTGTGG CACAACGATG AGCTCGACGG TTGTTGCTGG
CATAAACAGC AGCACAGCTT CCTGAGTCTG CCCATTTATG TGCGTGATAA CACCCTACTG
GCGCTGGGCA ACAACGATCA ACGTCCCGAT TACGCGTGGC ACGAAGGCAC GGCATTCCAC
CTCTTTAATC TGCAAGACGG GCATGAAGCC GTCTGTGAAG TGCCCGCTGC TGACGGATCG
GTGATCTTTA CTTTAAAAGC AGCACGTACT GGCAACACGA TTACTGTGAC TGGTGCGGGC
GAGGCGAAGA ACTGGACACT GTGCCTGCGC AATGTTGTGA AAGTAAATGG TCTGCAAGAC
GGTTCGCAGG CTGAAAGTGA GCAGGGGCTG GTGGTGAAGC CTCAAGGGAA TGCGCTGACA
ATTACGTTGT AA
 
Protein sequence
MKISDGNWLI QPGLNLIHPL QVFEVEQQDN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGIVGVRIE HFQGALNNGP HYPLNILQDV KVTIENTERY AEFKSGNLSA RVSKGEFWSL
DFLRNGERIT GQDTNNQRNY MFERLDLGVG ETVYGLGERF TALVRNGQTV ETWNRDGGTS
TEQAYKNIPF YMTNRGYGVL VNHPQSVSFE VGSEKVSKVQ FSVESEYLEY FVIDGPTPKA
VLDRYTRFTG RPALPPAWSF GLWLTTSFTT NYDEATVNSF IDGMAERNLP LHVFHFDCFW
MKAFQWCDFE WDPLTFPDPE GMIRRLKAKG LKICVWINPY IGQKSPVFKE LQEKGYLLKR
PDGSLWQWDK WQPGLAIYDF TNPDACKWYA DKLKGLVAMG VDCFKTDFGE RIPTDVQWFD
GSDPQKMHNH YAHIYNELVW NVLKDTVGEE EAVLFARSAS VGAQKFPVHW GGDCYANYES
MAESLRGGLS IGLSGFGFWS HYIGGFENTA PAHVYKCWCA FGLLSSHSRL HGSKSYRVPW
AYDDESCDVV RFFTQLKCRM MPYLYREAAR ANARGTPMMR AMMMEFPDDP AYDYLDRQYM
LGDNVMVAPV FTEAGDVQFY LPEGRWTHLW HNDELDGCCW HKQQHSFLSL PIYVRDNTLL
ALGNNDQRPD YAWHEGTAFH LFNLQDGHEA VCEVPAADGS VIFTLKAART GNTITVTGAG
EAKNWTLCLR NVVKVNGLQD GSQAESEQGL VVKPQGNALT ITL