Gene Information |
Locus tag | Smed_1728 |
Symbol | |
ID | 5322586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 1808944 |
End bp | 1810179 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790666 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327398 |
Protein GI | 150396931 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
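
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a stop codon (both are assumptions about this record's conventions, not stated in it). The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1808944, 1810179)
print(length)                  # 1236
print(protein_length(length))  # 411
```

Under those assumptions the results agree with the listed Gene Length (1236 bp) and Protein Length (411 aa).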
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00458236 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.504509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAATTAC GTTTTCTAGC CGCCGCTTTA GGCGCAACCG CTGCCTTGCC CTTCGGTGCG GCCAACGCGA CCGATCTCGA GGTCACGCAT TGGTGGACTT CCGGCGGTGA GGCGGCCGCG GTTGCCGAGC TCGCGAAAGC TTTCGATGCA ACCGGCAACA AGTGGGTCGA CGGCGCGATC GCCGGCTCCG GCGGAACGGC ACGCCCGATC ATGATCAGCC GCATTACCGG GGGCGATCCG ATGGGCGCAA CCCAGTTCAA CCACGGCCGG CAGGCGGAAG AGCTGGTGCA GGCGGGCCTG ATGCGCGACC TTACCGACAT CGCCACGCAG GAAAACTGGA AGGAGATCGT GAAGCCGTCG AGCCTGCTTG ACTCCTGCAC GATCGAAGGG AAGATCTACT GCGCGCCTGT CAATATCCAT TCCTGGCAGT GGCTGTGGCT CTCTAACGCC GCGTTCAAGA AGGCGGGCGT TGAAGTCCCG AAAAACTGGG ACGAGTTCGT GGCCGCGGCT CCAGCGCTGG AGAAGGCCGG AATCGTTCCG CTCGCCGTCG GCGGACAGCC GTGGCAGGCA AACGGTGCCT TCGACGTGCT GATGGTTGCG ATCGCAGGCA AGGACACCTT TGAAAAGGTC TTTGCCGAGA AGGACGCCGA AGTGGCAGCC GGACCGGAAA TTGCCAAGGT CTTCAAGGCA GCCGACGATG CCCGTCGCAT GTCGAAGGGC ACCAACGTTC AGGACTGGAA CCAGGCGACG AATATGGTCA TCACCGGCAA GGCCGGTGGG CAGATCATGG GCGACTGGGC CCAGGGCGAG TTCCAGCTCG CGGGACAGAA AGCCGGCGTC GACTACACCT GCCTGCCGGG TCTCGGCGTG AACGAGGTGA TTTCGACGGG TGGCGATGCG TTCTACTTCC CGCTTATCGA AGATGAGGAA AAGTCGAAGG CGCAGGGAGT GCTGGCATCG ACCTTGCTTA AGCCGGAAAC GCAGGTGGCC TTCAACCTGA AGAAGGGCTC GCTGCCGGTG CGCGGCGATG TCGATCTTGC GGCCGCCAAC GACTGCATGA AGAAGGGTCT CGATATCCTG GCCAAGGGCA ATGTGATCCA GGGCACGGAT CAGCTTCTGT CGGCCGATAG CCAGAAGCAG AAAGAGGACC TCTTCTCGGA GTTCTTCGCG AATCACTCAA TGACGCCGGA AGACGCGCAG AAGCGTTTCG CCGACATCAT CGCGTCCGCG GATTGA
Protein sequence | MKLRFLAAAL GATAALPFGA ANATDLEVTH WWTSGGEAAA VAELAKAFDA TGNKWVDGAI AGSGGTARPI MISRITGGDP MGATQFNHGR QAEELVQAGL MRDLTDIATQ ENWKEIVKPS SLLDSCTIEG KIYCAPVNIH SWQWLWLSNA AFKKAGVEVP KNWDEFVAAA PALEKAGIVP LAVGGQPWQA NGAFDVLMVA IAGKDTFEKV FAEKDAEVAA GPEIAKVFKA ADDARRMSKG TNVQDWNQAT NMVITGKAGG QIMGDWAQGE FQLAGQKAGV DYTCLPGLGV NEVISTGGDA FYFPLIEDEE KSKAQGVLAS TLLKPETQVA FNLKKGSLPV RGDVDLAAAN DCMKKGLDIL AKGNVIQGTD QLLSADSQKQ KEDLFSEFFA NHSMTPEDAQ KRFADIIASA D
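
The sequences above are printed in space-separated blocks of ten characters, so they need the spacing removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first three blocks of the gene sequence, so it does not reproduce the 62% figure reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the block spacing (any whitespace) and upper-case the sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAAATTAC GTTTTCTAGC CGCCGCTTTA"
print(len(clean_sequence(first_blocks)))  # 30
print(gc_percent(first_blocks))           # 40.0
```

Passing the full Gene sequence field to `gc_percent` would give a value to compare against the GC content field.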