Gene Information |
Locus tag | Smed_5883 |
Symbol | |
ID | 5320185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 846809 |
End bp | 847879 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640777578 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001314510 |
Protein GI | 150377915 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
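The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based and inclusive, and if the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(846809, 847879)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both printed values match the Gene Length and Protein Length rows of the record.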
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.92391 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGACCAGA ACCCGCCCCG ATCCACATCC GTCACCGTTG CCGACGTCGC ACGCAAGGCA GGCGTCTCCA AGGCGACGGC AGCGCGCGTG CTGGGCGGCT ACGGAACGGT GAGCGACCCG GTGCGCGACG CCGTGACAGC CGCTGCCCGG GCACTCGATT ACCGTCCGAA CGAGCTCGCC CGCAGCATGA CGACGGGCCG CTCCGGCACG ATCGGCGTCG TGGTGGGCGA CATAGAGAAC CCATTTTTCA GTCTTGCCAT GCGGGGCATC ACCGACGTGG CCCGCCAGGC CGGCTTTACG GTCATCCTCA TAAATTCCGG CGAGGACGTG GCTGTCGAGA AGGCGGCCAT TCGCACCCTG CTGGCCAAGC GCGTGGACGG CTTGATCGTC TCGCCGGCCA AGGAAAGCAA TGTCGATCAC CTTCAGGAAG CCGCCCGCTC GGGCCGGCCG CTGGCGCTGC TCGACCGCGG CAGTGAGACG CTCGACGTTG ACACCGTTAT TGCCGATGAC AGACACGCCG CCGAAGGCAT CACGCGACGG CTCATTGCGC TCGGCCATCG CCGCATCGCC TATATCACTG CGTGCGACAC ACCGGATCAT GTTTTCCGCG TGCCCTCAGA CGTAAATACG GGCTCGGTGC GCCGGCGCGT CGAAGGTTTT CTTGGCGTCT GCCGGGAGGC TGGCCTTCAG GGAATGGAAG GCTGGGTGCG TGTGGGCGCG ATCACGCCGG ACCATACGCG GGGCATCGTC TCGGCGATGT TGCAGTCGAG CGAGCGCCCG ACCGCGATCA TCGCCTCCGA CAGCGTGATC GGCCTCGAAG TTTTCAAGAC CAGCCGCGCA GCCGGCATTG CTATCCCGGA CGAGCTGTCG CTCGTATCGT TCCATGACGC CGATTGGACC TCGGTCACCT CGCCCCCTGT GACGGTGGTG AGGCAACCCG TCTATCGCCT GGGCGAAACA GCCGCGAAAC TGCTGGTCGA GCGGCTGAAC GGATATGAAG CAAGTGCCCG CCGAGTCGTG CTGCAAACCG AACTCATCGA ACGGGCTTCC GTCGCCGACG CGCCGGCATG A
Protein sequence | MDQNPPRSTS VTVADVARKA GVSKATAARV LGGYGTVSDP VRDAVTAAAR ALDYRPNELA RSMTTGRSGT IGVVVGDIEN PFFSLAMRGI TDVARQAGFT VILINSGEDV AVEKAAIRTL LAKRVDGLIV SPAKESNVDH LQEAARSGRP LALLDRGSET LDVDTVIADD RHAAEGITRR LIALGHRRIA YITACDTPDH VFRVPSDVNT GSVRRRVEGF LGVCREAGLQ GMEGWVRVGA ITPDHTRGIV SAMLQSSERP TAIIASDSVI GLEVFKTSRA AGIAIPDELS LVSFHDADWT SVTSPPVTVV RQPVYRLGET AAKLLVERLN GYEASARRVV LQTELIERAS VADAPA
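The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any downstream use. The following is a minimal sketch of that cleanup together with a simple GC fraction calculation. The function names are hypothetical, and the example input is only the first two blocks of the gene sequence, so its GC fraction is not expected to equal the whole-gene value listed in the record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
excerpt = clean_sequence("ATGGACCAGA ACCCGCCCCG")
print(excerpt)               # ATGGACCAGAACCCGCCCCG
print(len(excerpt))          # 20
print(gc_fraction(excerpt))  # 0.7
```

Applying the same functions to the full Gene sequence field would allow the result to be compared against the Gene Length and GC content rows.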