Gene Information |
Locus tag | EcHS_A0410 |
Symbol | mhpR |
ID | 5593988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 432093 |
End bp | 433040 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640919595 |
Product | DNA-binding transcriptional activator MhpR |
Protein accession | YP_001457180 |
Protein GI | 157159862 |
COG category | [K] Transcription |
COG ID | [COG1414] Transcriptional regulator |
TIGRFAM ID | |
|
|
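The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 432093
end_bp = 433040
reported_gene_length = 948     # bp
reported_protein_length = 315  # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases after splitting into codons:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows of the record.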
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.00000385619 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTTT ATTGTGCGCT CAGTATAGGA AGGGTATTTT CGGCTACAAT CAAAACATGC CCGAATGTGC ACCAGGTGCA CAACGTTGTT TTAACTATAG AAATGTCAAT TAATATGCAG AACAATGAGC AGACGGAATA CAAAACCGTG CGCGGCTTAA CACGCGGTCT AATGTTATTA AATATGTTAA ATAAACTTGA TGGCGGTGCC AGCGTCGGGC TGCTGGCGGA ACTCAGCGGC CTGCATCGCA CCACTGTGCG GCGACTGCTG GAGACGCTGC AGGAAGAGGG ATATGTCCGC CGTAGCCCCT CCGATGACAG TTTTCGTCTG ACCATCAAAG TGCGGCAATT AAGCGAAGGA TTTCGTGACG AACAGTGGAT TTCTGCACTG GCGGCCCCGC TGCTGGGCGA TCTGTTGCGC GAAGTGGTAT GGCCGACAGA TGTGTCCACG CTGGATGTTG ATGCAATGGT GGTACGCGAA ACCACCCACC GTTTCAGCCG CTTATCCTTT CACCGGGCAA TGGTCGGGCG ACGTTTACCG CTCCTGAAAA CCGCCTCGGG CCTGACCTGG CTGGCCTTTT GCCCGGAACA AGAACGCAAG GAATTAATCG AAATGTTAGC CGCCCGCCCC GGTGATGACT ATCAACTGGC ACGAGAACCG TTAAAGCTGC AAGCCATTCT GGCGCGCGCG CGCAAAGAGG GTTACGGACA GAACTACCGC GGCTGGGATC AGGAGGAGAA GATCGCCTCT ATCGCCGTAC CGCTGCGCAG TGAACAACGG GTGATTGGCT GTCTGAATCT GGTGTATATG GCGAGCGCAA TGACCATTGA ACAGGCAGCG GAAAAGCATC TTCCGGCGCT ACAACGGGTA GCAAAACAGA TCGAAGAAGG GGTTGAATCG CAGGCTATTC TGGTGGCCGG ACGGCGAAGC GGCGTGCATT TACGTTGA
|
Protein sequence | MIFYCALSIG RVFSATIKTC PNVHQVHNVV LTIEMSINMQ NNEQTEYKTV RGLTRGLMLL NMLNKLDGGA SVGLLAELSG LHRTTVRRLL ETLQEEGYVR RSPSDDSFRL TIKVRQLSEG FRDEQWISAL AAPLLGDLLR EVVWPTDVST LDVDAMVVRE TTHRFSRLSF HRAMVGRRLP LLKTASGLTW LAFCPEQERK ELIEMLAARP GDDYQLAREP LKLQAILARA RKEGYGQNYR GWDQEEKIAS IAVPLRSEQR VIGCLNLVYM ASAMTIEQAA EKHLPALQRV AKQIEEGVES QAILVAGRRS GVHLR
|
| |
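As a rough check that the gene and protein sequences above correspond, the sketch below translates the first 30 and last 18 bases of the listed gene sequence and compares them with the two ends of the listed protein. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA, although the gene lies on the minus strand). It relies on the fact that translation table 11 uses the same codon assignments as the standard genetic code and differs only in the permitted start codons. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and are not part of the database.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_percent(seq):
    """Return the percentage of G and C bases in a DNA string."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Fragments copied from the start and end of the gene sequence above.
first_30 = "ATGATTTTTT ATTGTGCGCT CAGTATAGGA"
last_18 = "GGCGTGCATT TACGTTGA"

print(translate(first_30))  # compare with the protein's first ten residues
print(translate(last_18))   # compare with the protein's last five residues plus stop
```

Passing the complete gene sequence to `translate` and `gc_percent` would allow the same comparison against the full protein sequence and the reported GC content.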