Gene ECD_03423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03423 
SymbolmalS 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3599649 
End bp3601679 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content53% 
IMG OID 
Productperiplasmic alpha-amylase precursor 
Protein accessionACT45222 
Protein GI253979552 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCG CCGCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG 
ACTTCTCCGG GGTTTCCCGC CTTTAGCGAA CAGGGGACAG GAACATTTGT CAGCCACGCG
CAGTTGCCCA AAGGTACGCG TCCACTAACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT
GCGGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT
CAATGGCGAT TGTTCAGGGA CGGCGAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG
CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAACCGG TAGCAAGCCT GGTCCGTGAA
TGCCCGAAAT GGGATGGATT ACCGCTCACA GTGGATGTCA GCGCCACTTT CCCGGAAGGA
GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAATG
TTACAACCCG CTGCCACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACACA
TCCGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA
AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT
GGCACTTTTC ACGGCGGCGA TTTACGCGGC CTGACCAACA AACTGGATTA CCTCCAGCAG
TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC
GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG
ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA
CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG
CTGGCGGATA TGCAGGAGTA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA
AAATCGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC
TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC
TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG
CTAGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC
TATAAAAACA AAATGGATAC CCACGCCAAA GCCATTGACG GCTATACGCC GCGCGATTAC
TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT
ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCACCGCG
CTTCGCGAAT GGAAAAAAGC TAACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG
ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT
GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGACTG TCTGGCGCAG
ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC
CTCTCGTCGC ATGATACCCG CCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA
CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGTGATG AATCCTCGCG TCCGTTCGGT
CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC
GGTAAATCTG CCGCCAGCGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT
CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTTGCTGA AGCAGGGCTA CGGCTTTGTT
CGTGAGCATG GCGACGATAA AGTGCTGGTC GTCTGGGCAG GGCAACAGTA A
 
Protein sequence
MKLAACFLTL LPGFAVAASW TSPGFPAFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP 
ADAIKLNQML SLQPCSNTPP QWRLFRDGEY TLQIDTRSGT PTLMISIQNA AEPVASLVRE
CPKWDGLPLT VDVSATFPEG AAVRDYYSQQ IAIVKNGQIM LQPAATSNGL LLLERAETDT
SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRG LTNKLDYLQQ
LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA
HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KSLGERWSDW KPAAGQTWHS
FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF
YKNKMDTHAK AIDGYTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASTA
LREWKKANPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ
MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG
PTGSDPLQGT RSDMNWQDVS GKSAASVAHW QKISQFRARH PAIGAGKQTT LLLKQGYGFV
REHGDDKVLV VWAGQQ