Gene EcSMS35_3698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3698 
SymbolmalP 
ID6146740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3757601 
End bp3759994 
Gene Length2394 bp 
Protein Length797 aa 
Translation table11 
GC content52% 
IMG OID641618525 
Productmaltodextrin phosphorylase 
Protein accessionYP_001745665 
Protein GI170683923 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0058] Glucan phosphorylase 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR02093] glycogen/starch/alpha-glucan phosphorylases 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.411664 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAAC CTATTTTTAA CGATAAGCAA TTTCAGGAAG CGCTTTCACG TCAGTGGCAG 
CGTTATGGCT TAAATTCTGC GGCTGAAATG ACCCCTCGTC AGTGGTGGCT GGCAGTGAGC
GAAGCACTGG CCGAAATGCT GCGTGCTCAG CCATTTGCCA AACCGGTGGC GAATCAGCGA
CATGTTAACT ACATCTCAAT GGAGTTTTTG ATTGGTCGCC TGACGGGCAA CAACCTGTTG
AATCTCGGCT GGTATCAGGA TGTACAGGAT TCGTTGAAGG CTTATGACAT CAACCTGACT
GACCTGCTGG AAGAAGAGAT CGACCCGGCG CTGGGTAACG GTGGTCTGGG ACGTCTGGCG
GCGTGCTTCC TCGACTCAAT GGCAACTGTC GGGCAGTCAG CTACTGGCTA CGGCCTTAAT
TATCAATATG GCTTGTTCCG CCAGTCATTT GTCGATGGCA AACAGGTTGA AGCGCCGGAT
GACTGGCATC GCAGTAACTA CCCGTGGTTC CGCCACAATG AAGCACTGGA TGTGCAGGTA
GGAATTGGCG GTAAAGTGAC GAAAGACGGA CGCTGGGAGC CGGAATTTAC CATTACCGGT
CAAGCGTGGG ATCTCCCCGT TGTCGGCTAT CGTAATGGCG TGGCACAGCC ACTGCGTCTG
TGGCAGGCGA CGCATGCGCA TCCGTTTGAT CTGACTAAAT TTAACGACGG TGATTTCCTG
CGCGCCGAAC AGCAGGGCAT CAACGCGGAA AAACTGACCA AAGTTCTCTA TCCAAACGAC
AACCATACTG CCGGTAAAAA GCTGCGCCTG ATGCAGCAAT ACTTCCAGTG TGCCTGTTCG
GTAGCGGATA TTTTGCGTCG CCATCATCTG GCGGGGCGTA AACTGCACGA ACTGGCGGAT
TACGAAGTTA TTCAGCTGAA CGATACCCAC CCAACAATCG CGATTCCAGA ACTGCTGCGC
GTGCTGATCG ATGAGCACCA GATGAGCTGG GATGACGCCT GGGCTATCAC CAGTAAAACT
TTCGCTTACA CCAACCATAC CCTGATGCCA GAAGCGCTGG AACGCTGGGA TGTGAAACTG
GTGAAAGGCT TACTGCCGCG CCACATGCAG ATTATTAACG AAATTAATAC ACGCTTTAAA
ACGCTGGTGG AAAAAAACTG GCCGGGCGAT GAAAAAGTGT GGGCCAAACT GGCGGTGGTA
CACGACAAAC AAGTGCATAT GGCGAACCTG TGTGTGGTTG GCGGTTTCGC GGTGAACGGT
GTTGCGGCGC TGCACTCGGA TCTGGTGGTG AAAGATCTGT TCCCGGAATA TCACCAGCTA
TGGCCGAACA AATTCCATAA CGTCACCAAC GGTATTACCC CACGTCGCTG GATCAAACAG
TGCAACCCGG CACTGGCGGC TCTGTTGGAT AAATCACTGA AAAAAGAGTG GGCTAACGAT
CTCGATCAAC TGATCAATCT GGAAAAATTC GCTGATGATG CGAAATTCCG TCAGCAATAT
CGCGAGATCA AGCAGGCGAA TAAAGTCCGT CTGGCAGAGT TTGTGAAAGT TCGTACCGGT
ATTGAGATCA ATCCACAGGC GATTTTCGAT ATTCAGATCA AACGTCTGCA TGAGTACAAA
CGCCAGCACC TGAATCTGCT GCATATTCTG GCGTTGTACA AAGAAATTCG TGAAAACCCG
CAGGCTGATC GCGTACCGCG CGTCTTCCTC TTTGGCGCGA AAGCGGCACC GGGCTACTAC
CTGGCGAAAA ACATTATCTT TGCGATCAAC AAAGTGGCTG ACGTGATCAA CAACGATCCG
CAGGTTGGCG ACAAATTGAA GGTGGTGTTC CTGCCGGATT ATTGCGTTTC GGCGGCGGAA
AAACTGATCC CGGCGGCGGA TATCTCCGAA CAAATTTCGA CGGCAGGTAA AGAAGCTTCC
GGTACCGGCA ATATGAAACT GGCGCTCAAT GGTGCGCTTA CTGTCGGTAC GCTGGATGGT
GCGAACGTTG AAATCGCCGA GAAAGTTGGT GAAGAAAATA TCTTTATTTT TGGTCATACC
GTGGAACAAG TGAAGGCAAT TCTGGCCAAA GGCTACGACC CGGTGAAATG GCGGAAGAAA
GATAAGGTGC TGGACGCAGT ATTGAAAGAG CTGGAAAGCG GCAAATACAG CGACGGCGAT
AAGCATGCCT TCGACCAGAT GCTGCACAGT ATCGGCAAAC AGGGCGGCGA TCCGTATCTG
GTGATGGCGG ATTTCGCAGC CTATGTAGAG GCACAAAAGC AGGTGGATGT GCTGTACCGC
GACCAGGAGG CCTGGACCCG CGCGGCGATC CTCAATACCG CCCGCTGCGG TATGTTTAGC
TCGGATCGCT CTATTCGCGA TTATCAGGCT CGTATCTGGC AGGCAAAACG CTAA
 
Protein sequence
MSQPIFNDKQ FQEALSRQWQ RYGLNSAAEM TPRQWWLAVS EALAEMLRAQ PFAKPVANQR 
HVNYISMEFL IGRLTGNNLL NLGWYQDVQD SLKAYDINLT DLLEEEIDPA LGNGGLGRLA
ACFLDSMATV GQSATGYGLN YQYGLFRQSF VDGKQVEAPD DWHRSNYPWF RHNEALDVQV
GIGGKVTKDG RWEPEFTITG QAWDLPVVGY RNGVAQPLRL WQATHAHPFD LTKFNDGDFL
RAEQQGINAE KLTKVLYPND NHTAGKKLRL MQQYFQCACS VADILRRHHL AGRKLHELAD
YEVIQLNDTH PTIAIPELLR VLIDEHQMSW DDAWAITSKT FAYTNHTLMP EALERWDVKL
VKGLLPRHMQ IINEINTRFK TLVEKNWPGD EKVWAKLAVV HDKQVHMANL CVVGGFAVNG
VAALHSDLVV KDLFPEYHQL WPNKFHNVTN GITPRRWIKQ CNPALAALLD KSLKKEWAND
LDQLINLEKF ADDAKFRQQY REIKQANKVR LAEFVKVRTG IEINPQAIFD IQIKRLHEYK
RQHLNLLHIL ALYKEIRENP QADRVPRVFL FGAKAAPGYY LAKNIIFAIN KVADVINNDP
QVGDKLKVVF LPDYCVSAAE KLIPAADISE QISTAGKEAS GTGNMKLALN GALTVGTLDG
ANVEIAEKVG EENIFIFGHT VEQVKAILAK GYDPVKWRKK DKVLDAVLKE LESGKYSDGD
KHAFDQMLHS IGKQGGDPYL VMADFAAYVE AQKQVDVLYR DQEAWTRAAI LNTARCGMFS
SDRSIRDYQA RIWQAKR