Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4584 |
Symbol | melR |
ID | 6145149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4685683 |
End bp | 4686591 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619400 |
Product | DNA-binding transcriptional regulator MelR |
Protein accession | YP_001746512 |
Protein GI | 170681448 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.781071 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACAG ATACGTTTAT GTGCAGCAGC GACGAAAAGC AGACCCGTAG CCCACTGTCG CTGTATTCAG AATACCAGCG GATGGAAATT GAGTTTCGCG CACCACATAT CATGCCCACC AGCCACTGGC ATGGTCAGGT CGAAGTGAAT GTGCCTTTCG ATGGCGATGT GGAATACCTG ATCAACAATG AAAAAGTGAA TATCAATCAG GGTCATATCA CGCTGTTCTG GGCCTGCACA CCGCACCAAC TAACAGATAC CGGAACCTGT CAGAGCATGG CGATTTTTAA TCTGCCGATG CATCTGTTTC TCTCCTGGCC GCTGGATAAA GACCTGATTA ACCACGTTAC TCACGGCATG GTGATCAAAT CACTGGCGAC ACAGCAACTT AGCCCGTTTG AAGTGCGCCG CTGGCAGCAG GAATTGAACA GTCCGAACGA GCAAATTCGC CAGCTCGCCA TTGATGAAAT TGGCCTGATG CTCAAGCGAT TTAGCCTCTC TGGCTGGGAA CCGATTCTGG TCAATAAAAC CTCGCGCACA CACAAAAACA GCGTCTCGCG CCATGCGCAA TTTTATGTCA GCCAGATGCT GGGCTTTATT GCCGAAAACT ATGATCAGGC GCTGACCATC AACGATGTGG CTGAGCACGT CAAACTTAAC GCCAACTATG CAATGGGGAT ATTCCAGCGG GTCATGCAAT TGACAATGAA ACAGTACATT ACCGCGATGC GCATCAACCA CGTTCGCGCG TTACTGAGCG ATACCGATAA AAGTATTCTC GATATTGCCC TGACGGCAGG CTTTCGTTCG AGTAGCCGTT TTTACAGCAC GTTCGGCAAA TATGTCGGCA TGTCGCCGCA ACAATACCGC AAACTTAGCC AACAACGCCG CCAGGTGTTT CCCGGCTAA
|
Protein sequence | MNTDTFMCSS DEKQTRSPLS LYSEYQRMEI EFRAPHIMPT SHWHGQVEVN VPFDGDVEYL INNEKVNINQ GHITLFWACT PHQLTDTGTC QSMAIFNLPM HLFLSWPLDK DLINHVTHGM VIKSLATQQL SPFEVRRWQQ ELNSPNEQIR QLAIDEIGLM LKRFSLSGWE PILVNKTSRT HKNSVSRHAQ FYVSQMLGFI AENYDQALTI NDVAEHVKLN ANYAMGIFQR VMQLTMKQYI TAMRINHVRA LLSDTDKSIL DIALTAGFRS SSRFYSTFGK YVGMSPQQYR KLSQQRRQVF PG
|
| |