Gene Information |
Locus tag | Smed_4740 |
Symbol | |
ID | 5319108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1262311 |
End bp | 1263294 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640776538 |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_001313470 |
Protein GI | 150376874 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
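The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Both are assumptions about the record's conventions, not something the record states. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1262311
END_BP = 1263294
GENE_LENGTH_BP = 984
PROTEIN_LENGTH_AA = 327


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # 1263294 - 1262311 + 1 = 984 bp; 984 / 3 = 328 codons, minus the stop = 327 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the Start bp, End bp, Gene Length and Protein Length fields agree with one another.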
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00130889 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACAGGA TTTCACGCCG CGCGTTCATG CTTGCCGCGA CCGCGGCTGG CGCAATCGCC GTCGCCGGGA CTGTCGCATT CGCGGAACTG CCGAAGCTCG CGCAGAAGGA GACCTACAAG GTCGGCTTCG CGCAAACGGA ATCCAACAAC CCCTGGCGCA TCGCACAGAC CAACAGCATG AAGGCCGAGG CCGAGAAGCT CGGGCATCAG CTGGTCTATA CCGATGCTGC CGGATCCGCT GCCAAGCAGG TGGCCGACGT CAACTCGATG ATCGCCCAGG GGGTCGACCT GATCTTCCTC GCACCGCGGG AAGAAAAACC GCTCATCCCC GCCGTCATGG CCGCCAAGAA GGCCGGCATC CCCGTCATCC TGCTCGACCG AAGTGTTGAT CCGTCGCTTG CCAAGGCGGG CGAGGACTAC GTCACCTTCA TCGGCTCGGA TTTCATAGAG GAAGGCAAGC GCATCGCCGA GTGGCTGATC AAGAACGCCA ACGGCAAGTC GAAGATCATC GAGCTCGAAG GGACCACCGG TTCGTCTCCG GCCAACGACC GCAAGAAAGG CTTCGACGAG ACGATCAAAA CGGCAGGCGG CTTTGAGATC GTCGCATCGC AGTCGGGCGA TTTCGCCCGG GACAAGGGCC GGCAGGTTGC CGAAGCCCTG TTGCAGGCGC ACCCGGATGC CGACATCGTT TACGCGCATA ACGACGAAAT GGCGATCGGC GCCATCGCTG CCATCGAGGC GGCCGGCAAG GTCCCAGGCA AGGATGTGCT GGTATTGTCG ATCGACGGCG GCAAGGAAGC GGTTCAGGCG GTCATCGACG GCAAGATCGC AGCAGTCGTC GAATGCAATC CGCGCTTCGG ACCCAAGGCC TTCGAAACGA TGCTGCGTTA CGCCAAGGGC GAGAAGATCG ACCCGCTGGT CATAAACGAG GACAAGTTCT ACGACTCGAC CAACGCGGCT GCCGAACTCG CCAACGCGTA CTGA
|
Protein sequence | MNRISRRAFM LAATAAGAIA VAGTVAFAEL PKLAQKETYK VGFAQTESNN PWRIAQTNSM KAEAEKLGHQ LVYTDAAGSA AKQVADVNSM IAQGVDLIFL APREEKPLIP AVMAAKKAGI PVILLDRSVD PSLAKAGEDY VTFIGSDFIE EGKRIAEWLI KNANGKSKII ELEGTTGSSP ANDRKKGFDE TIKTAGGFEI VASQSGDFAR DKGRQVAEAL LQAHPDADIV YAHNDEMAIG AIAAIEAAGK VPGKDVLVLS IDGGKEAVQA VIDGKIAAVV ECNPRFGPKA FETMLRYAKG EKIDPLVINE DKFYDSTNAA AELANAY
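The relationship between the two sequence fields, and the GC content field, can be reproduced with a short script. This is a minimal sketch, not the database's own method. It assumes the gene sequence is listed in coding orientation (it starts with ATG even though the strand is "-"), strips the spaces that group the sequence into blocks of 10, and uses the amino-acid assignments of translation table 11, which match the standard code. Table 11 also allows alternative start codons such as GTG and TTG. They are not handled here because this gene starts with ATG. The helper names are hypothetical.

```python
# Codon table built from the NCBI base ordering T, C, A, G.
# Amino-acid assignments for translation table 11 (same as the standard code).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDDDEEEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    prefix = "ATGAACAGGA TTTCACGCCG CGCGTTCATG"
    print(translate(prefix))  # MNRISRRAFM, the first block of the protein sequence
```

Passing the full Gene sequence field to `translate` and `gc_fraction` gives values to compare with the Protein sequence and GC content fields.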