Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4496 |
Symbol | malE |
ID | 6143609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4592779 |
End bp | 4593969 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641619312 |
Product | maltose ABC transporter periplasmic protein |
Protein accession | YP_001746424 |
Protein GI | 170679620 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAA AAACAGGTGC ACGCATCCTC GCATTATCCG CATTAACGAC GATGATGTTT TCCGCCTCGG CTCTCGCCAA AATCGAAGAA GGTAAACTGG TAATCTGGAT TAACGGCGAT AAAGGCTATA ACGGTCTCGC TGAAGTCGGT AAGAAATTCG AGAAAGATAC CGGAATTAAA GTCACCGTCG AGCATCCGGA TAAACTGGAA GAGAAATTCC CTCAGGTTGC GGCAACTGGC GATGGCCCTG ACATTATCTT CTGGGCGCAC GACCGCTTTG GTGGCTACGC TCAATCTGGC CTGTTGGCTG AAATCACCCC GGACAAAGCG TTCCAGGACA AGCTGTATCC GTTTACCTGG GATGCCGTTC GTTACAACGG CAAGCTGATT GCTTACCCGA TTGCTGTCGA AGCGTTATCG CTGATTTATA ACAAAGATCT GCTGCCGAAC CCGCCAAAAA CCTGGGAAGA GATCCCGGCG CTGGATAAAG AACTGAAAGC GAAAGGTAAG AGCGCGCTGA TGTTCAACCT GCAAGAACCA TACTTCACCT GGCCGCTGAT TGCGGCTGAC GGGGGTTATG CGTTCAAGTA TGAAAACGGC AAGTACGACA TTAAAGACGT GGGCGTGGAT AACGCTGGCG CGAAAGCGGG TCTGACCTTC CTGGTTGACC TGATTAAAAA CAAACACATG AATGCAGACA CCGATTACTC CATCGCAGAA GCTGCCTTTA ATAAAGGCGA AACAGCGATG ACCATCAACG GCCCGTGGGC ATGGTCCAAC ATCGACACCA GCAAAGTGAA TTATGGTGTA ACGGTACTGC CGACCTTCAA GGGTCAACCA TCCAAACCGT TCGTTGGCGT GCTGAGCGCC GGTATTAACG CCGCCAGTCC GAACAAAGAG CTGGCGAAAG AGTTCCTCGA AAACTATCTG CTGACTGACG AAGGTCTGGA AGCGGTCAAT AAAGATAAAC CGCTGGGTGC AGTGGCGCTG AAGTCTTACG AGGAAGAGTT GGCGAAAGAT CCACGTATTG CCGCCACCAT GGAAAACGCC CAGAAAGGTG AAATCATGCC GAACATCCCG CAGATGTCCG CTTTCTGGTA TGCCGTGCGT ACTGCGGTGA TCAACGCCGC CAGCGGTCGT CAGACTGTCG ATGAAGCCCT GAAAGACGCG CAGACTCGTA TCACCAAGTA A
|
Protein sequence | MKIKTGARIL ALSALTTMMF SASALAKIEE GKLVIWINGD KGYNGLAEVG KKFEKDTGIK VTVEHPDKLE EKFPQVAATG DGPDIIFWAH DRFGGYAQSG LLAEITPDKA FQDKLYPFTW DAVRYNGKLI AYPIAVEALS LIYNKDLLPN PPKTWEEIPA LDKELKAKGK SALMFNLQEP YFTWPLIAAD GGYAFKYENG KYDIKDVGVD NAGAKAGLTF LVDLIKNKHM NADTDYSIAE AAFNKGETAM TINGPWAWSN IDTSKVNYGV TVLPTFKGQP SKPFVGVLSA GINAASPNKE LAKEFLENYL LTDEGLEAVN KDKPLGAVAL KSYEEELAKD PRIAATMENA QKGEIMPNIP QMSAFWYAVR TAVINAASGR QTVDEALKDA QTRITK
|
| |