Gene EcSMS35_3894 details

Gene Information

Locus tag: EcSMS35_3894
Symbol: malS
ID: 6142972
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3963567
End bp: 3965597
Gene Length: 2031 bp
Protein Length: 676 aa
Translation table: 11
GC content: 53%
IMG OID: 641618720
Product: periplasmic alpha-amylase precursor
Protein accession: YP_001745859
Protein GI: 170684113
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
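
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 3963567
end_bp = 3965597
gene_length_bp = 2031
protein_length_aa = 676

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts one codon per residue plus one stop codon.
expected_bp = (protein_length_aa + 1) * 3

print(span_bp == gene_length_bp)      # True
print(expected_bp == gene_length_bp)  # True
```

Under these assumptions, both checks agree with the reported 2031 bp.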


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 64
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG 
ACTTCTCCGG GGTTTCCCGC CTTTAGCGAA CAGGGGACGG GAACATTTGT CAGCCACGCA
CAGTTGCCCA AAGGTACGCG TCCACTAACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT
GCGGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT
CAATGGCGAT TGTTCAGGGA CGGCGAATAT ACGCTGCAAC TAGACACCCG CTCCGGTACA
CCAACATTGA TGATTTCCAT CCAGAACGCC GTCGAACCGG TAGCAAGCCT GGTCCGTGAA
TGCCCGAAAT GGGATGGATT ACCGCTCACA CTGGATGTCA GCGCCACTTT TCCGGAGGGA
GCCGCCGTCC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAACG
TTACAACCCG CTGCCACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACGCC
TCTGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA
AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCAGAAATT
GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG
CTGGGCGTCA ATGCTTTATG GATAAGTGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC
GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG
ACGAATCTTG ATGCCAATAT GGGCCGCGAA GCCGATCTAC GGACGCTGGT TGATAGCGCG
CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG TTACGCCACA
CTGGCGGATA TGCAGGAGTT TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA
AAAACGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC
TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC
TGGATCAGAA CCGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG
CTGGCCTTTT TGCCGGATAT CAAAACCGAA TCAACGACCG CTTCTGGTCT GCCGGTGTTC
TATAAAAACA AAACGGATAC CCACGCTAAA GTCATCGACG GCTTTACCCC TCGCGATTAC
TTAACCCACT GGTTAAGTCA GTGGGTTCGC GACTATGGGA TTGATGGTTT TCGGGTCGAC
ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCGCCGCG
CTTCGCGAAT GGAAAAAAGC TAACTCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG
ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT
GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGATTG TCTGGCGCAG
ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG ATTTCAACGT GTTGAGCTAC
CTCTCATCGC ATGATACCCG CCTGTTCCGC GAAGGTGGTA ATAAAGCAGC AGAGTTATTG
TTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGTGATG AATCCTCGCG TCCGTTCGGC
CCCACCGGTT CGGATCCGCT GCAAGGTACG CGTTCGGATA TGAACTGGCA GGATGTTAGC
GGTAAATCTG CCGCCAGCGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT
CCCGCAATTG GCGAGGGCAA ACAAACGACA CTTTCGATGA AGCAGGGCTA CGGCTTTGTT
CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A
 
Protein sequence
MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP 
ADAIKLNQML SLQPCSNTPP QWRLFRDGEY TLQLDTRSGT PTLMISIQNA VEPVASLVRE
CPKWDGLPLT LDVSATFPEG AAVRDYYSQQ IAIVKNGQIT LQPAATSNGL LLLERAETDA
SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ
LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGRE ADLRTLVDSA
HQRGIRILFD VVMNHTGYAT LADMQEFQFG ALYLSGDEVK KTLGERWSDW KPAAGQTWHS
FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF
YKNKTDTHAK VIDGFTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASAA
LREWKKANSD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ
MDTTWQQMAE KLQDFNVLSY LSSHDTRLFR EGGNKAAELL LLAPGAVQIF YGDESSRPFG
PTGSDPLQGT RSDMNWQDVS GKSAASVAHW QKISQFRARH PAIGEGKQTT LSMKQGYGFV
REHGDDKVLV IWAGQQ
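
The gene and protein sequences above are related through translation table 11. The following is a minimal sketch of how the 10-base blocks can be joined and translated. It is not the tool that produced this record. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is built from the standard amino-acid assignment string, which table 11 shares with table 1 (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon order T, C, A, G with the third base varying fastest.
BASES = "TCAG"
# Amino-acid assignments shared by translation tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the record above.
first_row = "ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG"
last_row = "CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A"

print(translate(first_row))  # MKLAACFLTLLPGFAVAASW
print(translate(last_row))   # REHGDDKVLVIWAGQQ
```

The two printed fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the reported GC content.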