Gene EcSMS35_4263 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4263 
Symbol 
ID6144713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4361872 
End bp4363908 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content54% 
IMG OID641619084 
Productalpha-glucosidase 
Protein accessionYP_001746208 
Protein GI170679942 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATACGC CACGCCCACA GCTATTAGAT TTTCAATTTC ATCAGAATAA CGACAGTTTT 
ACTCTACGTT TTCAAGACCG CCTTATTTTA ATCCATAGCA AAGATAATCC TTGCTTATCG
ATTGGCTCAG GTATAGCGGA TATCGATATG TTCCGCGGTA ATTTCAGCAT TAAAGATAAA
TTGCAGGAGA AAATTGCGCT TACCGACGCC ATCGTCAGCC AGTCACCGGA TGGTTGGTTA
ATCCATTTCA GCCGTGGTTC TGACATTAGC GCCACGCTGC GTATCTCTGC CGACGATCAG
GGCCGTTTAT TGCTGGAACT ACAAAACGAC AACCATCACC ACAACCGCAT CTGGCTGCGT
CTTGCCGCAC AACCAGAGGA CCATATCTAC GGCTGCGGCG AACAATTCTC CTACTTTGAT
CTGCGCGGCA AACCGTTCCC GCTATGGACC AGTGAACAAG GCGTTGGTCG CAACAAACAA
ACCTATGTCA CCTGGCAGGC CGACTGCAAA GAGAACGCGG GCGGCGACTA TTACTGGACT
TTCTTCCCGC AGCCTACGTT TGTCAGCACG CAGAAGTATT ACTGCCATGT TGATAACAGT
TGCTATATGA ACTTCGACTT TAGTGCCCCG GAATACCATG AACTGGCGCT GTGGGAAGAG
AAAGCAACAC TGCGTTTTGA ATGTGCTGAC ACATACATCT CCCTGTTAGA AAAATTAACC
GCCCTGCTGG GACGCCAGCC AGAACTGCCC GACTGGATTT ATGACGGGGT AACGCTCGGC
ATTCAGGGCG GGACTGAAGT GTGCCAGAAG AAACTGGACA CCATGCGTAA CGCGGGCGTG
AAGGTCAACG GCATCTGGGC GCAGGACTGG TCCGGGATCC GTATGACCTC GTTTGGTAAA
CGTGTGATGT GGAACTGGAA GTGGAACAGC GAAAACTATC CGCAACTGGA TTCGCGTATT
AAACAGTGGA ACAAAGAAGG CGTGCAGTTC CTCGCCTACA TCAACCCGTA TGTTGCCAGC
GATAAGGACC TCTGCGCCGA GGCGGCTTCA CACGGCTATC TGGCAAAAGA TGCCTCTGGC
GGCGACTATC TGGTGGAGTT TGGCGAGTTC TATGGCGGCG TTGTCGATCT CACCAATCCA
GAAGCCTACG CCTGGTTCAA GGAAGTGATC AAAAAGAACA TGATTGAACT TGGCTGCGGC
GGCTGGATGG CTGACTTCGG CGAGTATCTG CCCACCGACA CGTACCTGCA TAACGGCGTC
AGCGCGGAAA TTATGCATAA CGCCTGGCCT GCGCTGTGGG CGAAATGTAA CTACGAAGCC
CTTGAAGAAA CAGGCAAGCT CGGCGAGCTC CTCTTCTTTA TGCGCGCCGG TTCTACCGGT
AGCCAGAAAT ACTCCACCAT GATGTGGGCG GGCGACCAGA ACGTTGACTG GAGTCTGGAC
GATGGACTGG CGTCAGTCGT CCCGGCGGCG CTGTCTCTGG CAATGACCGG ACATGGCCTG
CATCACAGTG ATATTGGCGG TTACACCACC CTGTTTGAGA TGAAGCGCAG CAAAGAGCTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTC ACGCCGATGA TGCGCACCCA CGAAGGTAAC
CGTCCTGGCG ACAACTGGCA GTTTGATAGC GACGCGGAAA CTATCGCCCA TTTTGCCCGT
ATGACCACCG TCTTCACCAC CCTGAAACCT TACCTGAAAG AGGCCGTCGC GCTGAATGCG
AAGTCCGGCC TGCCGGTTAT GCGCCCGCTG TTCCTGCATT ACGAAGACGA CGCGCAGACC
TACACCCTGA AATATCAGTA CCTGTTAGGC CGCGACATTC TGGTCGCTCC GGTGCATGAA
GAAGGCCGTA GCGACTGGAC GCTCTATCTG CCGGAGGATA ACTGGGTCCA TGCCTGGACG
GGTGAAGCGT TCCGGGGCGG GGAAGTTACC GTTAATGCGC CCATCGGTAA ACCGCCGGTC
TTTTATCGCG CCGGTAGCGA ATGGGCGGCA CTGTTCGCCA CGTTAAAAAG TATCTAA
 
Protein sequence
MDTPRPQLLD FQFHQNNDSF TLRFQDRLIL IHSKDNPCLS IGSGIADIDM FRGNFSIKDK 
LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLRISADDQ GRLLLELQND NHHHNRIWLR
LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWEE KATLRFECAD TYISLLEKLT
ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS ENYPQLDSRI KQWNKEGVQF LAYINPYVAS DKDLCAEAAS HGYLAKDASG
GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV
SAEIMHNAWP ALWAKCNYEA LEETGKLGEL LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDS DAETIAHFAR MTTVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAQT
YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV
FYRAGSEWAA LFATLKSI