Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4678 |
Symbol | |
ID | 5318135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1192719 |
End bp | 1193549 |
Gene Length | 831 bp |
Protein Length | 276 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776476 |
Product | MerR family transcriptional regulator |
Protein accession | YP_001313408 |
Protein GI | 150376812 |
COG category | [K] Transcription [S] Function unknown |
COG ID | [COG0789] Predicted transcriptional regulators [COG1917] Uncharacterized conserved protein, contains double-stranded beta-helix domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.825044 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.111013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCGGTG TTGGTGCCAT CCGCTACAAG GTTGCCGAGG CGGCAAGGCT CGCAGGCGTT TCGGCCTCGA CACTCAGATT GTGGGAAACC CAAGGTCTCG TCGTGCCCGA GCGCTCGGCG ACGGGTCACC GGCAATATAC GGATGCCGAT CTTGCCCGGT TGAAGCGTAT TTCCTGGTTC CGCTCCGAAC GCGGTCTCAA CCCGGCTGCG ATCAGGGAGG CTCTCGAGGC GGAAAATGCG GCCGCTGACG ATGAAAACGG CTTGGCGGTG CTGAGCGACG AGAATGCCGA GCTTCAGGTC GGACGGAAGC TACGCAGCCT GAGACATACC GCCGGTAAGA CTCTGGAGCA GGTGGCAGGC GACATCGGTA TCGCCGCCTC CGTGTTATCG ACGCTGGAGA GGACATCACA GGGTGTGTCA GTTGCCGTTC TGCACAATCT GGCGGAATAT TTCGACACCA CCGTCTCAAG TCTCTCCGGC GAAGAAGAGA CAAAGGCACG GGCTCTCGTG CGGGCAGGCG AATGGCGAAA CTGGCCGCGC ACGACACCGG GCGTTACGGT GCAGTTGCTT GCCGAAGGCA AGAACCAGAT GGATTGCCAT CGCTTCGTTC TGGCGCCGGG CGCATCGAGC GAGGGCGCCT ACCGGCATGA GGGAGAGGAA TTCGTTTATG TCCTCTCCGG GCGTGTGGAG TTCGTGCTGG ATTCGGATCA GTTCTACGAC CTTCACCCGG GCGATTCCCT CTACTTCGAG AGCCGCCGCC GCCATGCCTG GTCGAACAGG CACGACGGCG AAACCGTATT GTTGTGGATC AACACGCCGC CGACATTTTG A
|
Protein sequence | MGGVGAIRYK VAEAARLAGV SASTLRLWET QGLVVPERSA TGHRQYTDAD LARLKRISWF RSERGLNPAA IREALEAENA AADDENGLAV LSDENAELQV GRKLRSLRHT AGKTLEQVAG DIGIAASVLS TLERTSQGVS VAVLHNLAEY FDTTVSSLSG EEETKARALV RAGEWRNWPR TTPGVTVQLL AEGKNQMDCH RFVLAPGASS EGAYRHEGEE FVYVLSGRVE FVLDSDQFYD LHPGDSLYFE SRRRHAWSNR HDGETVLLWI NTPPTF
|
| |