Gene EcolC_0055 details


Gene Information

Locus tag: EcolC_0055
Symbol:
ID: 6068428
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 57236
End bp: 59554
Gene Length: 2319 bp
Protein Length: 772 aa
Translation table: 11
GC content: 52%
IMG OID: 641599459
Product: alpha-xylosidase YicI
Protein accession: YP_001723068
Protein GI: 170018114
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
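
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 57236
END_BP = 59554
GENE_LENGTH_BP = 2319
PROTEIN_LENGTH_AA = 772


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Amino-acid-encoding codons, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks pass for this record: 59554 - 57236 + 1 = 2319 and 2319 / 3 - 1 = 772.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```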


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.353636
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTA GCGATGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT 
CAGGTGTTCG AGGTTGAACA GCAGGATAAT GAAATGGTGG TCTATGCTGC CCCCCGTGAT
GTGCGTGAAC GTACCTGGCA GCTTGATACG CCTTTATTTA CGCTGCGCTT TTTCTCCCCA
CAGGAAGGTA TTGTCGGTGT GCGGATTGAG CATTTTCAGG GGGCGCTGAA TAACGGTCCT
CATTATCCGC TCAATATTTT GCAGGACGTG AAGGTCACAA TCGAAAACAC AGAACGTTAT
GCTGAGTTTA AAAGTGGCAA CTTAAGCGCG CGTGTCAGCA AAGGTGAGTT CTGGTCACTG
GATTTTCTGC GCAACGGCGA ACGTATTACC GGTAGTCAGG TGAAAAATAA TGGCTACGTG
CAGGACACGA ATAATCAACG CAATTATATG TTTGAGCGGC TTGATCTTGG CGTTGGCGAA
ACAGTTTACG GTCTGGGAGA GCGCTTTACT GCCCTGGTGC GCAATGGCCA GACGGTAGAG
ACCTGGAACC GGGACGGCGG CACAAGTACT GAACAGGCGT ATAAAAATAT CCCGTTCTAC
ATGACTAACC GTGGTTATGG GGTACTGGTC AATCATCCCC AGTGTGTCTC TTTTGAAGTG
GGATCGGAGA AAGTCTCCAA AGTGCAGTTC AGCGTTGAGA GTGAATATCT CGAATACTTT
GTTATCGACG GCCCGACGCC GAAAGCGGTA CTTGATCGTT ATACCCGCTT TACTGGTCGT
CCGGCGCTGC CGCCCGCGTG GTCCTTCGGC CTGTGGCTAA CCACTTCATT TACCACCAAC
TACGACGAAG CGACGGTAAA CAGCTTTATC GATGGTATGG CGGAACGCAA TCTGCCGCTG
CATGTTTTCC ACTTTGACTG TTTCTGGATG AAAGCCTTCC AGTGGTGCGA TTTTGAGTGG
GACCCGCTGA CTTTCCCTGA CCCGGAAGGG ATGATCCGCC GCCTGAAAGC GAAAGGGCTG
AAAATCTGCG TCTGGATTAA CCCCTATATC GGTCAAAAAT CCCCCGTCTT TAAAGAGTTA
CAAGAGAAAG GCTATTTACT CAAACGCCCG GACGGTTCGC TATGGCAGTG GGATAAATGG
CAGCCAGGTC TGGCGATTTA TGACTTTACC AATCCGGATG CCTGCAAATG GTACGCCGAC
AAACTGAAAG GTCTGGTCGC GATGGGCGTT GATTGCTTTA AGACCGACTT TGGCGAACGT
ATCCCAACTG ATGTTCAGTG GTTTGACGGT TCCGATCCGC AGAAAATGCA TAACCATTAT
GCGTACATCT ACAACGAACT GGTGTGGAAC GTGCTCAAGG ACACCGTTGG TGAGGAAGAA
GCTGTCTTGT TTGCCCGCTC GGCCTCCGTC GGTGCGCAGA AATTCCCGGT ACACTGGGGT
GGCGATTGTT ACGCTAACTA CGAATCAATG GCGGAAAGCC TGCGCGGTGG TTTGTCTATT
GGCCTTTCAG GTTTTGGCTT CTGGAGCCAC GATATCGGCG GCTTTGAAAA TACCGCTCCG
GCGCACGTTT ACAAACGCTG GTGCGCGTTT GGTTTGCTCT CCAGCCATAG CCGTTTACAC
GGTAGCAAAT CTTATCGTGT GCCGTGGGCC TACGATGATG AGTCCTGTGA TGTGGTGCGC
TTCTTCACGC AACTGAAATG CCGCATGATG CCGTATCTGT ATCGTGAAGC TGCGCGTGCG
AACGCGCGGG GTACGCCGAT GATGCGGGCC ATGATGATGG AGTTCCCGGA CGATCCGGCT
TGTGATTACC TTGACCGTCA ATACATGTTA GGCGACAACG TGATGGTTGC GCCGGTGTTC
ACTGAAGCGG GCGATGTGCA GTTCTACCTG CCGGAAGGTC GCTGGACACA CCTGTGGCAC
AACGATGAAC TCGACGGTAG TCGCTGGCAT AAACAGCAGC ACGGCTTCCT GAGTCTGCCC
GTTTATGTGC GTGATAACAC TCTACTGGCG CTGGGCAACA ACGATCAACG TCCCGATTAC
GTGTGGCACG AAGGCACGGC ATTCCACCTC TTTAATCTGC AAGACGGGCA TGAAGCCGTC
TGTGAAGTGC CCGCTGCTGA CGGATCGGTG ATCTTTACTT TAAAAGCAGC ACGTACTGGC
AACACGATTA CTGTGACTGG TGCGGGCGAG GCGAAGAACT GGACACTGTG CCTGCGCAAT
GTTGTGAAAG TAAATGGTCT GCAAGACGGT TCGCAGGCTG AAAGTGAGCA GGGGCTGGTG
GTGAAGCCTC AAGGGAATGC GCTGACAATT ACGTTGTAA
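
The gene sequence is laid out in 10-base groups, so the spacing has to be removed before computing anything from it. The following is a minimal sketch of how one might do that and then compute a GC fraction or check for a terminal stop codon; the helper names are hypothetical, and the stop codon set assumes translation table 11 as stated in the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for the 10-base display layout."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if seq is a whole number of codons and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last display lines of the gene sequence, copied from the record.
first_line = clean_sequence(
    "ATGAAAATTA GCGATGGAAA CTGGTTGATT CAACCTGGCC TCAATTTGAT TCACCCGCTT"
)
last_line = clean_sequence("GTGAAGCCTC AAGGGAATGC GCTGACAATT ACGTTGTAA")

print(first_line.startswith("ATG"), ends_with_stop(last_line))
print(round(gc_fraction(first_line), 3))
```

Running `gc_fraction` on the full cleaned sequence, rather than on one display line, gives the value to compare against the GC content field.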
 
Protein sequence
MKISDGNWLI QPGLNLIHPL QVFEVEQQDN EMVVYAAPRD VRERTWQLDT PLFTLRFFSP 
QEGIVGVRIE HFQGALNNGP HYPLNILQDV KVTIENTERY AEFKSGNLSA RVSKGEFWSL
DFLRNGERIT GSQVKNNGYV QDTNNQRNYM FERLDLGVGE TVYGLGERFT ALVRNGQTVE
TWNRDGGTST EQAYKNIPFY MTNRGYGVLV NHPQCVSFEV GSEKVSKVQF SVESEYLEYF
VIDGPTPKAV LDRYTRFTGR PALPPAWSFG LWLTTSFTTN YDEATVNSFI DGMAERNLPL
HVFHFDCFWM KAFQWCDFEW DPLTFPDPEG MIRRLKAKGL KICVWINPYI GQKSPVFKEL
QEKGYLLKRP DGSLWQWDKW QPGLAIYDFT NPDACKWYAD KLKGLVAMGV DCFKTDFGER
IPTDVQWFDG SDPQKMHNHY AYIYNELVWN VLKDTVGEEE AVLFARSASV GAQKFPVHWG
GDCYANYESM AESLRGGLSI GLSGFGFWSH DIGGFENTAP AHVYKRWCAF GLLSSHSRLH
GSKSYRVPWA YDDESCDVVR FFTQLKCRMM PYLYREAARA NARGTPMMRA MMMEFPDDPA
CDYLDRQYML GDNVMVAPVF TEAGDVQFYL PEGRWTHLWH NDELDGSRWH KQQHGFLSLP
VYVRDNTLLA LGNNDQRPDY VWHEGTAFHL FNLQDGHEAV CEVPAADGSV IFTLKAARTG
NTITVTGAGE AKNWTLCLRN VVKVNGLQDG SQAESEQGLV VKPQGNALTI TL
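
The protein sequence can be spot-checked against the gene sequence by translating a few codons. The following is a minimal sketch using a hand-built codon table; it assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA string codon by codon; stops appear as '*'."""
    seq = "".join(seq.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence and its final three codons.
print(translate("ATGAAAATTA GCGATGGAAA CTGGTTGATT"))  # MKISDGNWLI
print(translate("ACGTTGTAA"))  # TL*
```

The two results match the first ten residues and the last two residues of the protein sequence above, followed by the stop codon.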