Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3579 |
Symbol | |
ID | 5318083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 6785 |
End bp | 7819 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640775394 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001312327 |
Protein GI | 150375731 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGAAA GACCCTCGAG AAGACTCCGC CAAGCCGACA TCGCCGCCCA TGCCGGCGTA TCGGTTTCTA CCGTCTCGCG CGTGCTCGCC AACGAACCCG GCATCAGCGA AGATGTCCGG GTACAGATCT TCAAGGTCGC AAGCGAGCTC GGCTACCCAC TCAAAGCCGG CACTGCAGCT GGTTCACGCG CACTGGCATT GATCGCAAGC AACGGCGTTA CCGGCGGCTT GAGCGCTTTT TACCAGGGCA TCGTCGATGG CTTGCGCTCA GAGGCAGCCG CGCAGGGCAT GTCGTTCGAC ATCCGCCTCA TCAACGAGAT GAAGGCGACG CCGCAAGTCG TTGGCGAACA TCTGGAATCA GTCGGGGCGC AAGGGCTCTT TCTGGTCGGG ATCGACCCCA GCGAGGCACT TGGCGACTGG CTCGTGGAAA GCCGGTTGCC CGTCGTCCTC GTCAATGGCG TCGATCCACA ATTGCGCTTC GACGGCATCT CGCCGCCAAA CTTCTTCGGC GCCTTCGCTG CCACGCGGAT GCTGCTGGAT GCCGGGCACA GGCGCATCCT CCACCTGACC GGATCGCATC GCCATACGAT CCGCGAGCGT GTGCGCGGCT TCGAAGCGGC CATCGCCTGC GCTGAAGGCT GCGCGGCGCG CATCGTCCGC CTGCCCTTCG AGACCAATTC GAGCGCGGAA GCCCATGCGG CAACGCTCGA TGCGCTCGCC GTGGACGGGA ATTTCACCGC GGCCTTCTGC ATGAACGATT TCATCGCAGT GGGCGTTCTC GAGGCGGTCA CCGAACTTGG CCGCCGCGTA CCGGATGATT TCGCCATTAT CGGGTTCGAC GACCTGCCCT GCGCCGAGAT GGCCAATCCG CGCCTTTCGA CGATGCATGT CGACCGTGCA GCGCTCGGCC GGGAAGCGGT CGGCATGATG CAGTTTCGCT TCGCTCATCC GGACGTGCCC GCGCGGCATG TCTCTCACGC GGTCACCCCG GTGCCCGGCG GCACGATTGC ACGAAGGACG ACGCATGACC TATGA
|
Protein sequence | MIERPSRRLR QADIAAHAGV SVSTVSRVLA NEPGISEDVR VQIFKVASEL GYPLKAGTAA GSRALALIAS NGVTGGLSAF YQGIVDGLRS EAAAQGMSFD IRLINEMKAT PQVVGEHLES VGAQGLFLVG IDPSEALGDW LVESRLPVVL VNGVDPQLRF DGISPPNFFG AFAATRMLLD AGHRRILHLT GSHRHTIRER VRGFEAAIAC AEGCAARIVR LPFETNSSAE AHAATLDALA VDGNFTAAFC MNDFIAVGVL EAVTELGRRV PDDFAIIGFD DLPCAEMANP RLSTMHVDRA ALGREAVGMM QFRFAHPDVP ARHVSHAVTP VPGGTIARRT THDL
|
| |