Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2950 |
Symbol | |
ID | 5323827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 3098606 |
End bp | 3099913 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791901 |
Product | extracellular solute-binding protein |
Protein accession | YP_001328614 |
Protein GI | 150398147 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.110397 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGATGT TTGAACGCGG GTTGGCGTCC ATGCACGCCG CTAAACTAAC GGCGCTTGCC GCAGCCGCCG GCATGACGCT TCTCTTCGGT CCCGAAACGG CATCGGCCGA AACCGTCGTA AAATGGCTGC ATCTGGAGAC GGTTCCCGCC TATCTGAAGC AATGGGAGGA CATCGCCGCC AAGTACGAAA CCGAACATCC CGGCGTGGAT GTTCAGCTCC AATTCCTGGA AAACGAGGCT TTCAAGGCGA AGTTGCCGAC GCTGCTGCAA TCGGACGACG CTCCCCATTT CTTCTACAGC TGGGGCGGCG GAGTGCTGAA GCAGCAGGCC GAGACCGGCG CACTCAAGGA CCTGACGGAA GCAATGCGTG CCGATGGCGG CGCCTGGGAG AAGAGCTACA ACCCGGCGGC AGTCAAGGGC TTCACCTTTG AGGATCGAAT TTATGCGGTT CCCTTCAAAA TGGGAACGAT CAGCTTCTTC TACAATAAGG AGCTGTTCCA GAAGGGCGGC GTCAAGGCCG AGGACATCAA GAGCTGGGAT GATTTCCTCA CAGCGGTGAA AACGCTGAAG GAGGCCGGGA TCACGCCGAT CGCCGGCGGC GGCGGGGACA AATGGCCGCT GCACTTCTAC TGGAGCTATC TCGTGATGCG CAATGGCGGC CAGCAGGTAT TCGAGGATGC CAAGAACAAC GAGGGCGAGG GTTTCTTGCA CCCGGCGATC CTAAAGGCCG GTGAACAACT CGCCGAACTC GGCAAGCTCG AGCCGTTCCA GGGCGGCTAT CTCGGGGCGA ACTGGCCGCA AACGCTCGGC CTGTTCGGTG ACGGCAAGGC GGCGATGATC CTGAGCTTCG AAACCACCGA AGCCACCCAG CGCGCCAATT CCGGCGACGG CAAGGGCCTG GCACCCGAGA ACATCGGTCG CTTCCCCTTC CCGGCCGTCG AGGGCGGGGC AGGTGCGGCG ACCGATACGC TCGGAGGCCT CAACGGCTGG GCGGTAACCA AGAATGCCCC GCCTGAGGCG CTCGATTTCC TGCGCTATCT CACCAATGCT GAGAACGAGA GGCTTATGGC AAGTACCGGC ATGATCGTAC CGGTGGCCGT GGGCGCGGAA GAGGGCATCA CCAACCCGCT GGTGCGTGCT TCGGCCGACC AGCTTGCGGC CTCGACATGG CACCAGAACT ATTTCGACCA GGATCTCGGC CCCTCGGTCG GCCGCGTCGT GAACGACGCA TCCGTCGAGA TCCTCTCCGG GCAGATGTCC TCCGAGGAGG GAGCCCGGAT GATCCAGGAC GCACGCGAGC TGGAATGA
|
Protein sequence | MMMFERGLAS MHAAKLTALA AAAGMTLLFG PETASAETVV KWLHLETVPA YLKQWEDIAA KYETEHPGVD VQLQFLENEA FKAKLPTLLQ SDDAPHFFYS WGGGVLKQQA ETGALKDLTE AMRADGGAWE KSYNPAAVKG FTFEDRIYAV PFKMGTISFF YNKELFQKGG VKAEDIKSWD DFLTAVKTLK EAGITPIAGG GGDKWPLHFY WSYLVMRNGG QQVFEDAKNN EGEGFLHPAI LKAGEQLAEL GKLEPFQGGY LGANWPQTLG LFGDGKAAMI LSFETTEATQ RANSGDGKGL APENIGRFPF PAVEGGAGAA TDTLGGLNGW AVTKNAPPEA LDFLRYLTNA ENERLMASTG MIVPVAVGAE EGITNPLVRA SADQLAASTW HQNYFDQDLG PSVGRVVNDA SVEILSGQMS SEEGARMIQD ARELE
|
| |