Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3698 |
Symbol | malP |
ID | 6146740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3757601 |
End bp | 3759994 |
Gene Length | 2394 bp |
Protein Length | 797 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641618525 |
Product | maltodextrin phosphorylase |
Protein accession | YP_001745665 |
Protein GI | 170683923 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0058] Glucan phosphorylase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR02093] glycogen/starch/alpha-glucan phosphorylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.411664 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCACAAC CTATTTTTAA CGATAAGCAA TTTCAGGAAG CGCTTTCACG TCAGTGGCAG CGTTATGGCT TAAATTCTGC GGCTGAAATG ACCCCTCGTC AGTGGTGGCT GGCAGTGAGC GAAGCACTGG CCGAAATGCT GCGTGCTCAG CCATTTGCCA AACCGGTGGC GAATCAGCGA CATGTTAACT ACATCTCAAT GGAGTTTTTG ATTGGTCGCC TGACGGGCAA CAACCTGTTG AATCTCGGCT GGTATCAGGA TGTACAGGAT TCGTTGAAGG CTTATGACAT CAACCTGACT GACCTGCTGG AAGAAGAGAT CGACCCGGCG CTGGGTAACG GTGGTCTGGG ACGTCTGGCG GCGTGCTTCC TCGACTCAAT GGCAACTGTC GGGCAGTCAG CTACTGGCTA CGGCCTTAAT TATCAATATG GCTTGTTCCG CCAGTCATTT GTCGATGGCA AACAGGTTGA AGCGCCGGAT GACTGGCATC GCAGTAACTA CCCGTGGTTC CGCCACAATG AAGCACTGGA TGTGCAGGTA GGAATTGGCG GTAAAGTGAC GAAAGACGGA CGCTGGGAGC CGGAATTTAC CATTACCGGT CAAGCGTGGG ATCTCCCCGT TGTCGGCTAT CGTAATGGCG TGGCACAGCC ACTGCGTCTG TGGCAGGCGA CGCATGCGCA TCCGTTTGAT CTGACTAAAT TTAACGACGG TGATTTCCTG CGCGCCGAAC AGCAGGGCAT CAACGCGGAA AAACTGACCA AAGTTCTCTA TCCAAACGAC AACCATACTG CCGGTAAAAA GCTGCGCCTG ATGCAGCAAT ACTTCCAGTG TGCCTGTTCG GTAGCGGATA TTTTGCGTCG CCATCATCTG GCGGGGCGTA AACTGCACGA ACTGGCGGAT TACGAAGTTA TTCAGCTGAA CGATACCCAC CCAACAATCG CGATTCCAGA ACTGCTGCGC GTGCTGATCG ATGAGCACCA GATGAGCTGG GATGACGCCT GGGCTATCAC CAGTAAAACT TTCGCTTACA CCAACCATAC CCTGATGCCA GAAGCGCTGG AACGCTGGGA TGTGAAACTG GTGAAAGGCT TACTGCCGCG CCACATGCAG ATTATTAACG AAATTAATAC ACGCTTTAAA ACGCTGGTGG AAAAAAACTG GCCGGGCGAT GAAAAAGTGT GGGCCAAACT GGCGGTGGTA CACGACAAAC AAGTGCATAT GGCGAACCTG TGTGTGGTTG GCGGTTTCGC GGTGAACGGT GTTGCGGCGC TGCACTCGGA TCTGGTGGTG AAAGATCTGT TCCCGGAATA TCACCAGCTA TGGCCGAACA AATTCCATAA CGTCACCAAC GGTATTACCC CACGTCGCTG GATCAAACAG TGCAACCCGG CACTGGCGGC TCTGTTGGAT AAATCACTGA AAAAAGAGTG GGCTAACGAT CTCGATCAAC TGATCAATCT GGAAAAATTC GCTGATGATG CGAAATTCCG TCAGCAATAT CGCGAGATCA AGCAGGCGAA TAAAGTCCGT CTGGCAGAGT TTGTGAAAGT TCGTACCGGT ATTGAGATCA ATCCACAGGC GATTTTCGAT ATTCAGATCA AACGTCTGCA TGAGTACAAA CGCCAGCACC TGAATCTGCT GCATATTCTG GCGTTGTACA AAGAAATTCG TGAAAACCCG CAGGCTGATC GCGTACCGCG CGTCTTCCTC TTTGGCGCGA AAGCGGCACC GGGCTACTAC CTGGCGAAAA ACATTATCTT TGCGATCAAC AAAGTGGCTG ACGTGATCAA CAACGATCCG CAGGTTGGCG ACAAATTGAA GGTGGTGTTC CTGCCGGATT ATTGCGTTTC GGCGGCGGAA AAACTGATCC CGGCGGCGGA TATCTCCGAA CAAATTTCGA CGGCAGGTAA AGAAGCTTCC GGTACCGGCA ATATGAAACT GGCGCTCAAT GGTGCGCTTA CTGTCGGTAC GCTGGATGGT GCGAACGTTG AAATCGCCGA GAAAGTTGGT GAAGAAAATA TCTTTATTTT TGGTCATACC GTGGAACAAG TGAAGGCAAT TCTGGCCAAA GGCTACGACC CGGTGAAATG GCGGAAGAAA GATAAGGTGC TGGACGCAGT ATTGAAAGAG CTGGAAAGCG GCAAATACAG CGACGGCGAT AAGCATGCCT TCGACCAGAT GCTGCACAGT ATCGGCAAAC AGGGCGGCGA TCCGTATCTG GTGATGGCGG ATTTCGCAGC CTATGTAGAG GCACAAAAGC AGGTGGATGT GCTGTACCGC GACCAGGAGG CCTGGACCCG CGCGGCGATC CTCAATACCG CCCGCTGCGG TATGTTTAGC TCGGATCGCT CTATTCGCGA TTATCAGGCT CGTATCTGGC AGGCAAAACG CTAA
|
Protein sequence | MSQPIFNDKQ FQEALSRQWQ RYGLNSAAEM TPRQWWLAVS EALAEMLRAQ PFAKPVANQR HVNYISMEFL IGRLTGNNLL NLGWYQDVQD SLKAYDINLT DLLEEEIDPA LGNGGLGRLA ACFLDSMATV GQSATGYGLN YQYGLFRQSF VDGKQVEAPD DWHRSNYPWF RHNEALDVQV GIGGKVTKDG RWEPEFTITG QAWDLPVVGY RNGVAQPLRL WQATHAHPFD LTKFNDGDFL RAEQQGINAE KLTKVLYPND NHTAGKKLRL MQQYFQCACS VADILRRHHL AGRKLHELAD YEVIQLNDTH PTIAIPELLR VLIDEHQMSW DDAWAITSKT FAYTNHTLMP EALERWDVKL VKGLLPRHMQ IINEINTRFK TLVEKNWPGD EKVWAKLAVV HDKQVHMANL CVVGGFAVNG VAALHSDLVV KDLFPEYHQL WPNKFHNVTN GITPRRWIKQ CNPALAALLD KSLKKEWAND LDQLINLEKF ADDAKFRQQY REIKQANKVR LAEFVKVRTG IEINPQAIFD IQIKRLHEYK RQHLNLLHIL ALYKEIRENP QADRVPRVFL FGAKAAPGYY LAKNIIFAIN KVADVINNDP QVGDKLKVVF LPDYCVSAAE KLIPAADISE QISTAGKEAS GTGNMKLALN GALTVGTLDG ANVEIAEKVG EENIFIFGHT VEQVKAILAK GYDPVKWRKK DKVLDAVLKE LESGKYSDGD KHAFDQMLHS IGKQGGDPYL VMADFAAYVE AQKQVDVLYR DQEAWTRAAI LNTARCGMFS SDRSIRDYQA RIWQAKR
|
| |