Gene EcHS_A3774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3774 
SymbolmalS 
ID5592852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3766800 
End bp3768830 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content53% 
IMG OID640922888 
Productperiplasmic alpha-amylase precursor 
Protein accessionYP_001460366 
Protein GI157163048 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones65 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG 
ACTTCTCCGG GGTTTCCCGC CTTTAGCGAA CAGGGGACAG GAACATTTGT CAGCCACGCG
CAGTTGCCCA AAGGTACGCG TCCACTAACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT
GCGGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT
CAATGGCGAT TGTTCAGGGA CGGCGAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG
CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAACCGG TAGCAAGCCT GGTCCGTGAA
TGCCCGAAAT GGGATGGATT ACCGCTCACA GTGGATGTCA GCGCCACTTT CCCGGAAGGA
GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAATG
TTACAACCCG CTGCCACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACACA
TCCGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA
AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT
GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG
TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC
GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG
ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA
CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG
CTGGCGGATA TGCAGGAGTA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA
AAATCGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC
TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC
TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG
CTAGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC
TATAAAAACA AAATGGATAC CCACGCCAAA GCCATTGACG GCTATACGCC GCGCGATTAC
TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT
ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCGCCGCG
CTTCGCGAAT GGAAAAAAGC TAACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG
ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT
GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGACTG TCTGGCGCAG
ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC
CTCTCGTCGC ATGATACCCG CCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA
CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGTGATG AATCCTCGCG TCCGTTCGGT
CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC
GGTAAATCTG CCGCCAGCGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT
CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTTGCTGA AGCAGGGCTA CGGCTTTGTT
CGTGAGCATG GCGACGATAA AGTGCTGGTC GTCTGGGCAG GGCAACAGTA A
 
Protein sequence
MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP 
ADAIKLNQML SLQPCSNTPP QWRLFRDGEY TLQIDTRSGT PTLMISIQNA AEPVASLVRE
CPKWDGLPLT VDVSATFPEG AAVRDYYSQQ IAIVKNGQIM LQPAATSNGL LLLERAETDT
SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ
LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA
HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KSLGERWSDW KPAAGQTWHS
FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF
YKNKMDTHAK AIDGYTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASAA
LREWKKANPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ
MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG
PTGSDPLQGT RSDMNWQDVS GKSAASVAHW QKISQFRARH PAIGAGKQTT LLLKQGYGFV
REHGDDKVLV VWAGQQ