Gene EcHS_A3867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3867 
SymbolyicI 
ID5592797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3861802 
End bp3864120 
Gene Length2319 bp 
Protein Length772 aa 
Translation table11 
GC content52% 
IMG OID640922977 
Productalpha-xylosidase YicI 
Protein accessionYP_001460455 
Protein GI157163137 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTA GCGATGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT 
CAGGTGTTCG AGGTTGAACA GCAGGATAAT GAAATGGTGG TCTATGCTGC CCCCCGTGAT
GTACGTGAAC GTACCTGGCA GCTTGACACG CCTTTATTTA CGCTGCGCTT TTTCTCCCCA
CAGGAAGGTA TTGTCGGTGT GCGGATTGAG CATTTTCAGG GGGCGCTGAA TAACGGTCCT
CATTATCCGC TCAATATTTT GCAGGACGTG AAGGTCACAA TCGAAAACAC AGAACGTTAT
GCTGAGTTTA AAAGTGGCAA CTTAAGCGCG CGTGTCAGCA AAGGTGAGTT CTGGTCACTG
GATTTTCTGC GCAACGGCGA ACGTATTACC GGTAGTCAGG TGAAAAATAA TGGCTACGTG
CAGGACACGA ATAATCAACG CAATTATATG TTTGAGCGGC TTGATCTTGG CGTTGGCGAA
ACAGTTTACG GTCTGGGAGA GCGCTTTACT GCCCTGGTGC GCAATGGCCA GACGGTAGAG
ACCTGGAACC GGGACGGCGG CACAAGTACT GAACAGGTGT ATAAAAATAT CCCGTTCTAC
ATGACTAACC GTGGTTATGG GGTACTGGTC AATCATCCCC AGTGTGTCTC TTTTGAAGTG
GGATCGGAGA AAGTCTCCAA AGTGCAGTTC AGCGTTGAGA GTGAATATCT CGAATACTTT
GTTATCGACG GCCCGACGCC GAAAGCGGTA CTTGATCGTT ATACCCGCTT TACTGGTCGT
CCGGCGCTGC CGCCCGCGTG GTCCTTCGGC CTGTGGCTAA CCACTTCATT TACCACCAAC
TACGACGAAG CGACGGTAAA CAGCTTTATC GATGGTATGG CGGAACGCAA TCTGCCGCTG
CATGTTTTCC ACTTTGACTG TTTCTGGATG AAAGCCTTCC AGTGGTGCGA TTTTGAGTGG
GACCCGCTGA CTTTCCCTGA CCCGGAAGGG ATGATCCGCC GCCTGAAAGC GAAAGGGCTG
AAAATCTGCA TCTGGATTAA CCCCTATATC GGTCAAAAAT CCCCCGTCTT TAAAGAGTTA
CAAGAGAAAG GCTATTTACT CAAACGCCCG GACGGTTCGC TATGGCAGTG GGATAAATGG
CAGCCAGGTC TGGCGATTTA TGACTTTACC AATCCGGATG CCTGCAAATG GTACGCCGAC
AAACTGAAAG GTCTGGTCGC GATGGGCGTT GATTGCTTTA AGACCGACTT TGGCGAACGT
ATCCCAACTG ATGTTCAGTG GTTTGACGGT TCCGATCCGC AGAAAATGCA TAACCATTAT
GCGTACATCT ACAACGAACT GGTGTGGAAC GTGCTCAAGG ACACCGTTGG TGAGGAAGAA
GCTGTCTTGT TTGCCCGCTC GGCCTCCGTC GGTGCGCAGA AATTCCCGGT ACACTGGGGT
GGCGATTGTT ACGCTAACTA CGAATCAATG GCGGAAAGCC TGCGCGGTGG TTTGTCTATT
GGCCTTTCAG GTTTTGGCTT CTGGAGCCAC GATATCGGCG GCTTTGAAAA TACCGCTCCG
GCGCACGTTT ACAAACGCTG GTGCGCGTTT GGTTTGCTCT CCAGCCATAG CCGTTTACAC
GGTAGCAAAT CTTATCGTGT GCCGTGGGCC TACGATGATG AGTCCTGTGA TGTGGTGCGC
TTCTTCACGC AACTGAAATG CCGCATGATG CCGTATCTGT ATCGTGAAGC TGCGCGTGCG
AACGCGCGGG GTACGCCGAT GATGCGGGCC ATGATGATGG AGTTCCCGGA CGATCCGGCT
TGTGATTACC TTGACCGTCA ATACATGTTA GGCGACAACG TGATGGTTGC GCCGGTGTTC
ACTGAAGCGG GCGATGTGCA GTTCTACCTG CCGGAAGGTC GCTGGACACA CCTGTGGCAC
AACGATGAAC TCGACGGTAG TCGCTGGCAT AAACAGCAGC ACGGCTTCCT GAGTCTGCCC
GTTTATGTGC GTGATAACAC TCTACTGGCG CTGGGCAACA ACGATCAACG TCCCGATTAC
GTGTGGCACG AAGGCACGGC ATTCCACCTC TTTAATCTGC AAGACGGGCA TGAAGCCGTC
TGTGAAGTGC CCGCTGCTGA CGGATCGGTG ATCTTTACTT TAAAAGCAGC ACGTACTGGC
AACACGATTA CTGTGACTGG TGCGGGCGAG GCGAAGAACT GGACACTGTG CCTGCGCAAT
GTTGTGAAAG TAAATGGTCT GCAAGACGGT TCGCAGGCTG AAAGTGAGCA GGGGCTGGTG
GTGAAGCCTC AAGGGAATGC GCTGACAATT ACGTTGTAA
 
Protein sequence
MKISDGNWLI QPGLNLIHPL QVFEVEQQDN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGIVGVRIE HFQGALNNGP HYPLNILQDV KVTIENTERY AEFKSGNLSA RVSKGEFWSL
DFLRNGERIT GSQVKNNGYV QDTNNQRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQVYKNIPFY MTNRGYGVLV NHPQCVSFEV GSEKVSKVQF SVESEYLEYF
VIDGPTPKAV LDRYTRFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL
HVFHFDCFWM KAFQWCDFEW DPLTFPDPEG MIRRLKAKGL KICIWINPYI GQKSPVFKEL
QEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPDACKWYAD KLKGLVAMGV DCFKTDFGER
IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKDTVGEEE AVLFARSASV GAQKFPVHWG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTQLKCRMM PYLYREAARA NARGTPMMRA MMMEFPDDPA
CDYLDRQYML GDNVMVAPVF TEAGDVQFYL PEGRWTHLWH NDELDGSRWH KQQHGFLSLP
VYVRDNTLLA LGNNDQRPDY VWHEGTAFHL FNLQDGHEAV CEVPAADGSV IFTLKAARTG
NTITVTGAGE AKNWTLCLRN VVKVNGLQDG SQAESEQGLV VKPQGNALTI TL