Gene SbBS512_E3951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E3951 
SymbolmalS 
ID6270624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3684229 
End bp3686259 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content52% 
IMG OID641727800 
Productperiplasmic alpha-amylase precursor 
Protein accessionYP_001882233 
Protein GI187734117 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTCG CCTCCTGTTT TCTGACACTC CTTCCTGGCT TCGCCGTTGC CGCCAGCTGG 
ACTTCTCCGG GGTTCCCTGT CTTTAGCGAA CAGGGAACGG GAACATTTGT CAGCCACGCG
CAGTTGCCCA AAGGTACGCG TCCACTCACG CTAAATTTTG ACCAACAGTG CTGGCAGCCT
GCAGATGCGA TAAAACTCAA TCAGATGCTT TCCCTGCAAC CTTGTAGCAA CACGCCGCCT
CAATGGCGAT TGTTCAGGGA CGGCAAATAT ACGCTGCAAA TAGACACCCG CTCCGGTACG
CCAACATTGA TGATTTCCAT CCAGAACGCC GCCGAAGCGG TAGCAAACCT GGTCCGTGAA
TGCCCGAAAT GGGATGGATT ACCGCTCACG CTGGATGTCA GCGCCACTTT CCCGGAAGGA
GCCGCCGTAC GGGATTATTA CAGCCAGCAA ATTGCGATAG TGAAGAACGG TCAAATAACG
TTACAACCCG CTGCTACCAG CAACGGTTTA CTCCTGCTGG AACGGGCAGA AACTGACACA
TCCGCCCCTT TCGACTGGCA TAACGCCACG GTTTACTTTG TGCTGACAGA TCGTTTCGAA
AACGGCGATC CCAGTAATGA CCAGAGTTAC GGACGTCATA AAGACGGTAT GGCGGAAATT
GGCACTTTTC ACGGCGGCGA TTTACGCGCC CTGATCAATA AACTGGATTA CCTCCAGCAG
TTGGGCGTTA ATGCTTTATG GATAAGCGCC CCATTTGAGC AAATTCACGG CTGGGTCGGC
GGCGGTACAA AAGGCGATTT CCCGCATTAT GCCTACCACG GTTATTACAC ACAGGACTGG
ACGAATCTTG ATGCCAATAT GGGCAACGAA GCCGATCTAC GGACGCTGGT TGATAGCGCA
CATCAGCGCG GTATTCGTAT TCTCTTTGAT GTCGTGATGA ACCACACCGG CTATGCCACG
CTGGCGGATA TGCAGGAGTA TCAGTTTGGC GCGTTATATC TTTCTGGTGA CGAAGTGAAA
AAAACGCTGG GTGAACGCTG GAGCGACTGG AAACCTGCCG CCGGGCAAAC CTGGCATAGC
TTTAACGATT ACATTAATTT CAGCGACAAA ACAGGCTGGG ATAAATGGTG GGGAAAAAAC
TGGATCAGAA CGGATATCGG CGATTACGAC AATCCTGGAT TCGACGATCT CACTATGTCG
CTGGCCTTTT TGCCGGATAT CAAAACCGAA TCAACTACCG CTTCTGGTCT GCCGGTGTTC
TATAAAAACA AAACGGATAC TCACGCTAAA GTCATCGAAG GCTTTACACC TCGCGATTAC
TTAACCCACT GGTTAAGTCA GTGGGTCCGC GACTATGGGA TTGATGGTTT TCGGGTCGAT
ACCGCCAAAC ATGTTGAGTT GCCCGCCTGG CAGCAACTGA AAACCGAAGC CAGCTCCGCG
CTTCGCGAAT GGAAAAAAAC TAACCCCGAC AAAGCATTAG ATGACAAACC TTTCTGGATG
ACCGGTGAAG CCTGGGGCCA CGGCGTGATG CAAAGTGACT ACTATCGCCA CGGCTTCGAT
GCGATGATCA ATTTCGATTA TCAGGAGCAG GCGGCGAAAG CAGTCGATTG TCTGGCGCAG
ATGGATACGA CCTGGCAGCA AATGGCGGAG AAATTGCAGG GTTTCAACGT GTTGAGCTAC
CTCTCGTCGC ATGATACCCG TCTGTTCCGT GAAGGGGGCG ACAAAGCAGC AGAGTTATTA
CTATTAGCGC CAGGCGCGGT ACAAATCTTT TATGGCGATG AATCCTCGCG TCCGTTCGGT
CCTACAGGTT CTGATCCGCT GCAAGGTACA CGTTCGGATA TGAACTGGCA GGATGTTAGC
GGTAAATCTG CCGCCAACGT CGCGCACTGG CAGAAAATCA GCCAGTTCCG CGCCCGCCAT
CCCGCAATTG GCGCGGGCAA ACAAACGACA CTTTCGCTGA AGCAGGGCTA CGGCTTTGTT
CGTGAGCATG GCGACGATAA AGTGCTGGTC ATCTGGGCTG GGCAACAGTG A
 
Protein sequence
MKLASCFLTL LPGFAVAASW TSPGFPVFSE QGTGTFVSHA QLPKGTRPLT LNFDQQCWQP 
ADAIKLNQML SLQPCSNTPP QWRLFRDGKY TLQIDTRSGT PTLMISIQNA AEAVANLVRE
CPKWDGLPLT LDVSATFPEG AAVRDYYSQQ IAIVKNGQIT LQPAATSNGL LLLERAETDT
SAPFDWHNAT VYFVLTDRFE NGDPSNDQSY GRHKDGMAEI GTFHGGDLRA LINKLDYLQQ
LGVNALWISA PFEQIHGWVG GGTKGDFPHY AYHGYYTQDW TNLDANMGNE ADLRTLVDSA
HQRGIRILFD VVMNHTGYAT LADMQEYQFG ALYLSGDEVK KTLGERWSDW KPAAGQTWHS
FNDYINFSDK TGWDKWWGKN WIRTDIGDYD NPGFDDLTMS LAFLPDIKTE STTASGLPVF
YKNKTDTHAK VIEGFTPRDY LTHWLSQWVR DYGIDGFRVD TAKHVELPAW QQLKTEASSA
LREWKKTNPD KALDDKPFWM TGEAWGHGVM QSDYYRHGFD AMINFDYQEQ AAKAVDCLAQ
MDTTWQQMAE KLQGFNVLSY LSSHDTRLFR EGGDKAAELL LLAPGAVQIF YGDESSRPFG
PTGSDPLQGT RSDMNWQDVS GKSAANVAHW QKISQFRARH PAIGAGKQTT LSLKQGYGFV
REHGDDKVLV IWAGQQ