Gene Information |
Locus tag | EcSMS35_1579 |
Symbol | malI |
ID | 6145867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1563484 |
End bp | 1564512 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641616456 |
Product | DNA-binding transcriptional repressor MalI |
Protein accession | YP_001743634 |
Protein GI | 170681507 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 70 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACCG CCAAAAAAAT AACCATTCAT GATGTTGCGC TGGCTGCGGG CGTGTCGGTA AGTACCGTTT CGCTGGTGCT AAGTGGCAAA GGGCGAATCT CTACCGCCAC AGGAGAACGC GTTAACGCCG CCATTGAAGA GCTGGGATTT GTGCGCAATC GTCAGGCGTC GGCGCTGCGC GGCGGGCAAA GCGGCGTCAT TGGTTTGATC GTCCGTGATT TATCTGCGCC GTTTTACGCC GAATTAACGG CCGGATTGAC GGAAGCTCTG GAAGCGCAGG GACGGATGGT TTTTTTGCTT CACGGCGGTA AAGACGGCGA GCAGCTGGCA CAACGGTTTT CACTGTTACT GAATCAGGGG GTCGATGGTG TGGTCATTGC CGGGGCTGCA GGAAGCAGCG ATGACCTGCG ACGGATGGCA GAAGAAAAAG CTATCCCGGT TATTTTCGCT TCCCGTGCCA GTTATCTTGA TGATGTTGAT ACGGTTCGCC CGGACAACAT GCAGGCTGCA CAGTTGTTGA CGGAGCATCT CATTCGCAAT GGGCATCAGC GTATCGCCTG GCTGGGAGGG CAAAGTTCCT CATTAACCCG GGCAGAACGG GTGGGGGGCT ATTGTGCGAC TCTACTAAAA TTTGGCCTGC CGTTTCACAG CGATTGGGTG CTGGAGTGCA CTTCCAGCCA GAAGCAAGCC GCGGAAGCTA TCACGGCGCT TTTACGTCAT AACCCGACCA TCAGCGCCGT GGTTTGCTAT AACGAAACTA TTGCGATGGG TGCATGGTTT GGTTTGCTGA AAGCAGGCAG GCAAAGCGGG GAAAGCGGAG TCGATCGTTA CTTTGAGCAA CAGGTTTCGC TGGCGGCATT TACCGATGCA ACGCCTACCA CACTTGATGA TATACCCGTT ACCTGGGCCA GCACGCCAGC GCGGGAACTT GGTACCACAC TTGCGGATCG CATGATGCAA AAAATCACCC ATGAAGAGAC GCATTCACGC AATCTTATTA TTCCCGCCCG GCTCATTGCG GCGAAATAA
|
Protein sequence | MATAKKITIH DVALAAGVSV STVSLVLSGK GRISTATGER VNAAIEELGF VRNRQASALR GGQSGVIGLI VRDLSAPFYA ELTAGLTEAL EAQGRMVFLL HGGKDGEQLA QRFSLLLNQG VDGVVIAGAA GSSDDLRRMA EEKAIPVIFA SRASYLDDVD TVRPDNMQAA QLLTEHLIRN GHQRIAWLGG QSSSLTRAER VGGYCATLLK FGLPFHSDWV LECTSSQKQA AEAITALLRH NPTISAVVCY NETIAMGAWF GLLKAGRQSG ESGVDRYFEQ QVSLAAFTDA TPTTLDDIPV TWASTPAREL GTTLADRMMQ KITHEETHSR NLIIPARLIA AK
|
| |
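The coordinate, length, and GC fields in this record can be cross-checked with a short script. The following is a minimal sketch, not part of the database itself. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the listed sequence ends in TAA). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string.

    Whitespace is removed first, so the 10-base grouping used in the
    'Gene sequence' field can be pasted in unchanged.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Values taken from the 'Start bp' and 'End bp' fields above.
length = gene_length(1563484, 1564512)
print(length)                  # 1029, matching 'Gene Length'
print(protein_length(length))  # 342, matching 'Protein Length'
```

Passing the full 'Gene sequence' string to `gc_percent` gives a value that can be compared with the rounded 'GC content' field.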