Gene EcE24377A_4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4401 
Symbol 
ID5590243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4392303 
End bp4394339 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content53% 
IMG OID640928018 
Productalpha-glucosidase 
Protein accessionYP_001465362 
Protein GI157159043 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACGC CACGTCCACA GTTATTAGAT TTTCAATTTC ATCAGAATAA CGACAGTTTT 
ACCCTACATT TTCAACAACG TCTTATTTTA ACCCATAGCA AAGATAATCC TTGTTTATGG
ATTGGCTCAG GTATAGCGGA TATCGATATG TTCCGCGGTA ATTTCAGCAT TAAAGATAAA
CTACAGGAGA AAATTGCGCT TACCGACGCC ATCGTCAGCC AGTCACCGGA TGGTTGGTTA
ATTCATTTCA GCCGTGGTTC TGACATTAGC GCCACGCTGA ATATCTCTGC CGACGATCAG
GGCCGTTTAT TGCTGGAACT ACAAAACGAC AACCTTAACC ACAACCGTAT CTGGCTGCGC
CTTGCCGCTC AACCAGAGGA CCATATCTAC GGCTGCGGCG AACAGTTTTC CTACTTCGAT
CTGCGTGGCA AACCGTTCCC GCTATGGACC AGTGAACAAG GCGTTGGTCG CAACAAACAA
ACCTATGTCA CCTGGCAGGC CGACTGCAAA GAAAATGCGG GCGGCGACTA TTACTGGACT
TTCTTCCCAC AGCCTACGTT TGTCAGCACG CAGAAGTATT ACTGCCATGT TGATAACAGT
TGCTATATGA ACTTCGACTT TAGTGCCCCG GAATACCATG AACTGGCGCT GTGGGAAGAC
AAAGCAACGC TGCGTTTTGA ATGTGCTGAC ACATACATTT CCCTGCTGGA AAAATTAACC
GCCCTGCTGG GACGCCAGCC AGAACTGCCC GACTGGATTT ATGACGGAGT AACGCTCGGC
ATTCAGGGCG GGACGGAAGT GTGCCAGAAG AAACTGGACA CCATGCGTAA CGCGGGCGTG
AAGGTCAACG GCATCTGGGC GCAGGACTGG TCCGGTATTC GTATGACCTC TTTTGGCAAA
CGCGTGATGT GGAACTGGAA GTGGAACAGC GAAAACTACC CGCAACTGGA TTCACGCATT
AAGCAGTGGA ATCAGGAGGG CGTGCAGTTC CTGGCCTATA TCAACCCGTA TGTTGCCAGC
GATAAAGATC TCTGCGAAGA AGCGGCACAA CACGGCTATC TGGCAAAAGA TGCCTCTGGC
GGTGACTATC TGGTGGAGTT TGGCGAGTTT TACGGCGGCG TTGTCGATCT CACTAATCCA
GAAGCCTACG CCTGGTTCAA GGAAGTGATC AAAAAGAACA TGATTGAACT CGGCTGCGGC
GGCTGGATGG CTGACTTCGG CGAGTATCTG CCCACCGACA CGTACCTGCA TAACGGCGTC
AGTGCCGAAA TTATGCATAA CGCCTGGCCT GCGCTGTGGG CGAAGTGTAA CTACGAAGCC
CTTGAAGAAA CGGGCAAGCT CGGCGAGATC CTTTTCTTTA TGCGCGCCGG TTCTACCGGT
AGCCAGAAAT ACTCCACCAT GATGTGGGCG GGCGACCAGA ACGTCGACTG GAGTCTCGAC
GATGGCCTGG CGTCGGTTGT CCCGGCGGCG CTGTCGCTGG CAATGACCGG ACATGGCCTG
CACCACAGCG ACATTGGCGG TTACACCACC CTGTTTGAGA TGAAGCGCAG CAAAGAGCTG
CTGCTGCGCT GGTGCGATTT CAGCGCCTTC ACGCCGATGA TGCGCACCCA CGAAGGTAAC
CGTCCTGGCG ACAACTGGCA GTTTGACGGC GACGCAGAAA CTATCGCCCA TTTTGCCCGT
ATGACCTCCG TCTTCACCAC CCTGAAACCT TACCTGAAAG AGGCTGTCGC GCTGAATGCG
AAGTCCGGCC TGCCGGTTAT GCGCCCGCTG TTCCTGCATT ACGAAGACGA TGCGCACACT
TACACCCTGA AATATCAGTA CCTGTTAGGT CGCGACATTC TGGTCGCTCC GGTGCATGAA
GAAGGCCGTA GCGACTGGAC GCTCTATCTG CCGGAGGATA ACTGGGTCCA CGCCTGGACG
GGTGAAGCGT TCCGGGGCGG GGAAGTTACC GTTAATGCGC CCATCGGCAA GCCGCCGGTC
TTTTATCGCG CCGATAGCGA ATGGGCGGCA CTGTTCGCGT CGTTAAAAAG CATCTAA
 
Protein sequence
MDTPRPQLLD FQFHQNNDSF TLHFQQRLIL THSKDNPCLW IGSGIADIDM FRGNFSIKDK 
LQEKIALTDA IVSQSPDGWL IHFSRGSDIS ATLNISADDQ GRLLLELQND NLNHNRIWLR
LAAQPEDHIY GCGEQFSYFD LRGKPFPLWT SEQGVGRNKQ TYVTWQADCK ENAGGDYYWT
FFPQPTFVST QKYYCHVDNS CYMNFDFSAP EYHELALWED KATLRFECAD TYISLLEKLT
ALLGRQPELP DWIYDGVTLG IQGGTEVCQK KLDTMRNAGV KVNGIWAQDW SGIRMTSFGK
RVMWNWKWNS ENYPQLDSRI KQWNQEGVQF LAYINPYVAS DKDLCEEAAQ HGYLAKDASG
GDYLVEFGEF YGGVVDLTNP EAYAWFKEVI KKNMIELGCG GWMADFGEYL PTDTYLHNGV
SAEIMHNAWP ALWAKCNYEA LEETGKLGEI LFFMRAGSTG SQKYSTMMWA GDQNVDWSLD
DGLASVVPAA LSLAMTGHGL HHSDIGGYTT LFEMKRSKEL LLRWCDFSAF TPMMRTHEGN
RPGDNWQFDG DAETIAHFAR MTSVFTTLKP YLKEAVALNA KSGLPVMRPL FLHYEDDAHT
YTLKYQYLLG RDILVAPVHE EGRSDWTLYL PEDNWVHAWT GEAFRGGEVT VNAPIGKPPV
FYRADSEWAA LFASLKSI