Gene Information |
Locus tag | EcHS_A4274 |
Symbol | malE |
ID | 5594200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4276973 |
End bp | 4278163 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640923376 |
Product | maltose ABC transporter periplasmic protein |
Protein accession | YP_001460821 |
Protein GI | 157163503 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
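The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 4276973
END_BP = 4278163


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1191 396
```

Both values match the "Gene Length" and "Protein Length" fields in the record.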
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAACAGGTGC ACGCATCCTC GCATTATCCG CATTAACGAC GATGATGTTT TCCGCCTCGG CTCTCGCCAA AATCGAAGAA GGTAAACTGG TAATCTGGAT TAACGGCGAT AAAGGCTATA ACGGTCTCGC TGAAGTCGGT AAGAAATTCG AGAAAGATAC CGGAATTAAA GTCACCGTTG AGCATCCGGA TAAACTGGAA GAGAAATTCC CTCAGGTTGC GGCAACTGGC GATGGCCCTG ACATTATCTT CTGGGCACAC GACCGCTTTG GTGGCTACGC TCAATCTGGC CTGTTGGCTG AAATCACCCC GGACAAAGCG TTCCAGGACA AGCTGTATCC GTTTACCTGG GATGCCGTAC GTTACAACGG CAAGCTGATT GCTTACCCGA TCGCTGTTGA AGCGTTATCG CTGATTTATA ACAAAGATCT GCTGCCGAAC CCACCAAAAA CCTGGGAAGA GATCCCGGCG CTGGATAAAG AACTGAAAGC GAAAGGTAAG AGCGCGCTGA TGTTCAACCT GCAAGAACCG TACTTCACCT GGCCGCTGAT TGCTGCTGAC GGGGGTTATG CGTTCAAGTA TGAAAACGGC AAGTACGATA TTAAAGACGT GGGCGTGGAT AACGCTGGCG CGAAAGCGGG TCTGACCTTC CTGGTTGACC TTATTAAAAA CAAACACATG AATGCAGACA CCGATTACTC CATCGCAGAA GCTGCCTTTA ATAAAGGCGA AACAGCGATG ACCATCAACG GCCCGTGGGC ATGGTCCAAC ATCGACACCA GCAAAGTGAA TTATGGTGTA ACGGTACTGC CGACCTTCAA GGGTCAACCA TCTAAACCGT TCGTTGGCGT GCTGAGCGCC GGTATTAACG CCGCCAGTCC GAACAAAGAG CTGGCGAAAG AGTTCCTCGA AAACTATCTG CTGACTGACG AAGGTCTGGA AGCGGTCAAT AAAGACAAAC CGCTGGGTGC AGTGGCGCTG AAGTCTTACG AGGAAGAGTT GGCGAAAGAT CCACGTATTG CCGCCACCAT GGAAAACGCC CAGAAAGGTG AAATCATGCC GAACATCCCG CAGATGTCCG CTTTCTGGTA TGCCGTGCGT ACTGCGGTGA TCAACGCCGC CAGCGGTCGT CAGACTGTCG ATGAAGCCCT GAAAGACGCG CAGACTCGTA TCACCAAGTA A
|
Protein sequence | MKIKTGARIL ALSALTTMMF SASALAKIEE GKLVIWINGD KGYNGLAEVG KKFEKDTGIK VTVEHPDKLE EKFPQVAATG DGPDIIFWAH DRFGGYAQSG LLAEITPDKA FQDKLYPFTW DAVRYNGKLI AYPIAVEALS LIYNKDLLPN PPKTWEEIPA LDKELKAKGK SALMFNLQEP YFTWPLIAAD GGYAFKYENG KYDIKDVGVD NAGAKAGLTF LVDLIKNKHM NADTDYSIAE AAFNKGETAM TINGPWAWSN IDTSKVNYGV TVLPTFKGQP SKPFVGVLSA GINAASPNKE LAKEFLENYL LTDEGLEAVN KDKPLGAVAL KSYEEELAKD PRIAATMENA QKGEIMPNIP QMSAFWYAVR TAVINAASGR QTVDEALKDA QTRITK
|
| |
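The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two tables differ in which start codons are permitted), so a standard codon table is used here. Although the record lists the strand as "-", the sequence as displayed begins with ATG and ends with TAA, so the sketch assumes it is already given in coding orientation. The helper name `translate` is illustrative, and only the first 30 and last 12 bases of the gene sequence are used.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGAAAATAA AAACAGGTGC ACGCATCCTC"))  # MKIKTGARIL

# Last 12 bases of the gene sequence, including the TAA stop codon.
print(translate("ATCACCAAGTAA"))  # ITK
```

The two outputs agree with the first ten and last three residues of the protein sequence in the record. Passing the full gene sequence to the same function would be the natural way to compare against all 396 residues.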