Gene EcE24377A_4068 details


Gene Information

Locus tag: EcE24377A_4068
Symbol: malS
ID: 5588951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 4050118
End bp: 4052148
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 53%
IMG OID: 640927687
Product: periplasmic alpha-amylase precursor
Protein accession: YP_001465047
Protein GI: 157158372
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
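
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (an assumption)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4050118, 4052148)
print(length)                     # 2031
print(protein_length_aa(length))  # 676
```

Both results agree with the Gene Length (2031 bp) and Protein Length (676 aa) fields in the record.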


Plasmid Coverage information

Num covering plasmid clones: 44
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG 
ACTTCTCCGG GGTTCCCTGC CTTTAGCGAA CAGGGAACGG GAACATTTGT CAGCCACGCG
CAGTTGCCCA AAGGTACGCG TCCACTCACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT
GCAGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT
CAATGGCGAT TGTTCAGAGA CGGCAAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG
CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAACCGG TAGCAAACCT GGTCCGTGAA
TGCCCGAAAT GGGATGGATT ACCGCTCACG CTGGATGTCA GCGCCACTTT CCCGGAAGGA
GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAACG
TTACAACCCG CTGCTACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACGCC
CCTGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA
AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT
GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG
TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC
GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG
ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA
CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG
CTGGCGGATA TGCAGGAATA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAGGTGAAA
AAAACGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC
TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC
TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG
CTGGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC
TATAAAAACA AAACGGATAC TCACGCTAAA GTCATCGAAG GCTTTACACC TCGCGATTAC
TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT
ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCGCCGCG
CTTCGCGAAT GGAAAAAAGC TTACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG
ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT
GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGATTG TCTGGCGCAG
ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC
CTCTCGTCGC ATGATACCCG TCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA
CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGCGATG AATCCTCGCG TCCGTTCGGT
CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC
GGTAAATCTG CCGCCAACGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT
CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTCGCTGA AGCAGGGCTA CGGCTTTGTT
CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A
 
Protein sequence
MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP 
ADAIKLNQML SLQPCSNTPP QWRLFRDGKY TLQIDTRSGT PTLMISIQNA AEPVANLVRE
CPKWDGLPLT LDVSATFPEG AAVRDYYSQQ IAIVKNGQIT LQPAATSNGL LLLERAETDA
PAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ
LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA
HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KTLGERWSDW KPAAGQTWHS
FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF
YKNKTDTHAK VIEGFTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASAA
LREWKKAYPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ
MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG
PTGSDPLQGT RSDMNWQDVS GKSAANVAHW QKISQFRARH PAIGAGKQTT LSLKQGYGFV
REHGDDKVLV IWAGQQ
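
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the DNA so it can be compared with the listed protein. It is an illustration only: the helper names are hypothetical, and the codon table is the standard one, whose codon-to-amino-acid assignments are shared by translation table 11 (the two differ in permitted start codons, not in the assignments).

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-amino-acid assignments ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from the record above.
first_line = "ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG"
dna = clean_sequence(first_line)
print(translate(dna))  # MKLAACFLTLLPGFAVAASW
```

The printed residues match the first two blocks of the protein sequence (MKLAACFLTL LPGFAVAASW). Checking the 53% GC content would require passing the full cleaned gene sequence to gc_fraction rather than only the first line.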