Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2332 |
Symbol | malI |
ID | 6970553 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2204076 |
End bp | 2205104 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643386206 |
Product | DNA-binding transcriptional repressor MalI |
Protein accession | YP_002270690 |
Protein GI | 209397923 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTACCG CCAAAAAAAT AACCATTCAT GATGTTGCGC TGGCTGCGGG CGTGTCGGTA AGTACCGTTT CGCTGGTGCT TAGTGGCAAA GGGCGAATCT CTACCGCCAC AGGAGAACGC GTTAACGCCG CCATTGAAGA GCTGGGATTT GTGCGCAATC GCCAGGCGTC GGCGCTGCGC GGTGGGCAAA GCGGCGTCAT TGGTTTGATC GTCCGTGATT TATCTGCGCC GTTTTACGCC GAATTGACGG CCGGATTGAC GGAAGCTCTG GAAGCGCAGG GACGGATGGT TTTTTTGGTT CACGGCGGTA AAGACGGCGA GCAGCTGGCA CAGCGGTTTT CACTGTTACT GAATCAGGGT GTCGATGGTG TGGTAATTGC CGGGGCTGCA GGAAGCAGCG ATGACCTGCG ACGGATGGCA GAAGAAAAAG CTATCCCGGT GATTTTCGCT TCCCGTGCCA GTTATCTTGA TGATGTTGAT ACGGTTCGCC CGGACAACAT GCAGGCTGCA CAGTTGTTGA CGGAGCATCT CATTCGCAAT GGGCATCAGC GGATCGCCTG GCTGGGAGGG CAAAGTTCCT CATTAACCCG TGCAGAACGG GTGGGGGGCT ATTGTGCAAC TCTACTAAAA TTTGGCCTGC CGTTTCACAG CGATTGGGTG TTGGAGTGCA CTTCCAGCCA GAAGCAAGCC GCGGAAGCTA TCACGGCGCT TTTACGTCAT AACCCGACCA TCAGTGCCGT GGTTTGCTAT AACGAAACTA TTGCGATGGG GGCATGGTTT GGTTTGCTGA AAGCAGGGCG GCAAAGCGGG GAAAGCGGAG TCGATCGTTA CTTTGAGCAA CAGGTTTCGC TGGCGGCATT TACCGATGCG ACACCAACCA CACTTGATGA TATACCCGTT ACCTGGGCCA GCACGCCTGC GCGGGAACTT GGTACCACAC TTGCGGATCG CATGATGCAA AAAATCACCC ATGAAGAGAC GCATTCACGC AATCTTATTA TTCCCGCCCG GCTCATTGCG GCGAAATAA
|
Protein sequence | MATAKKITIH DVALAAGVSV STVSLVLSGK GRISTATGER VNAAIEELGF VRNRQASALR GGQSGVIGLI VRDLSAPFYA ELTAGLTEAL EAQGRMVFLV HGGKDGEQLA QRFSLLLNQG VDGVVIAGAA GSSDDLRRMA EEKAIPVIFA SRASYLDDVD TVRPDNMQAA QLLTEHLIRN GHQRIAWLGG QSSSLTRAER VGGYCATLLK FGLPFHSDWV LECTSSQKQA AEAITALLRH NPTISAVVCY NETIAMGAWF GLLKAGRQSG ESGVDRYFEQ QVSLAAFTDA TPTTLDDIPV TWASTPAREL GTTLADRMMQ KITHEETHSR NLIIPARLIA AK
|
| |