Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5080 |
Symbol | |
ID | 5319382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 27372 |
End bp | 28880 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640776860 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313792 |
Protein GI | 150377197 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0526207 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAACA AATCTGTTCG CGTGATTGCG GGCGCCATGG TGGTCGCCGG ATGGGCCGGC TATGCCGCGG CGCAGAATGC CTCGGAAATC AAGATCGTCC TGCCGGAACA GCCGGCCAAT CTCGAGCCCT GCGGTACCAT CATCACCAAT GTCGGCCAGA TACTGAGCCG CAACGTGGTC GAGCCGCTGA CGATCATCGA TCCGAAAAGC GGCCAGCCAA CATCCGGTCT TGCGACCGAG TGGAAGCAAA CGGATCCGAA CACGTGGCAG CTCAAGCTGC GCGAAGGCGT CAAATTCCAG GATGGTGCCG CCTTCAATGC AGAGGCGGTC AAATTCTCGA TCGAGCGCAT GACCGGCGGC AAGCTGACCT GCAGCAACAT TGCCAAATTC GGCAATGCCA AGCTCACCGT CACGCCGATC GACGACCTCA CGGTCGAGAT CAAATCGGAT ACGCCGCAGC CTATTTTGCC GACGCTGCTC AGCGTGGTCA TGATCGTCTC GCCGAACACC CCGGCGGACA AGGCCGTGAA CGATCCGGTC GGAACCGGTC CTTTCAAGCT CTCGAGCTTT ACGCCACAGA CTGTCGTGTT GGAAGCCTTT GACGGCTACT GGGGCGAGAA GCCGGCCATT GCCAGGGCGA GCTATGTCTG GCGCCCGGAA TCCTCGATCC GTGCCGCCAT GGTGGAGACC GGCGAGGCCG ATCTGACGCC GTCCATCGCC ATCCAGGATG CCACCAACCC GGAAACGGAC TTCGCCTATC TGAACTCGGA GACGACAGCG ATCCGCATCG ATGCCGGGTT CGCTCCGCTC GACGACGTGC GGATTCGCAA GGCGCTGAAC CTTGCGATCG ACTGGAATGG TCTTGCGCAG CTTTTCGGCG AGGACGTGCA GCGTGCTTCG CAGATGGTTG TCACCGGCAT CAACGGTCAT GACGACAAGC TGGCGCCCTG GGCCTTCGAT GCCGAAAAGG CCCGTGCGCT GATCGCCGAG GCCAAGGCTG CGGGCGTACC GGTCGATACC GAAATCGAAC TGATCGGCCG CAACGGAATT TATCCCAACG GTACGGAAGC CATGGAAGCC ATGATGGCCA TGTGGCAGGA TGTCGGTCTG AATGTGAAGC TGACGATGCT CGACGTGAAC GATTGGCTCC GCTACCTGCA GAAGCCTTTC CCGGAAAGCC GCGGGCCGAA CCTTTTGCAG ATGATGCATG ACAACAACAA GGGCGACGCC GCCTTCACCG TTCCGATCTT CTATACGTCG GGCGGAAGCT ACTCGACCTT CAACGATGCG GCGTTCGACA AGGAGATCGC CGATGCCATG GCTGCCACCG GCGAGGACCG TACGGCCAAG TTCAAGGCGA TCTTCGCGAA GGTGCATGAG GAGCTTGCGG TCGATATCCC GATGTTCCAC ATGATCGGCT ACACCCGGGT GGGCAGCCGT CTGGAGTGGA AGCCGGACAT CACGACCAAC AGCGAGATCC CGCTGGCCAA TATCGGCCTC AAGGATTAA
|
Protein sequence | MGNKSVRVIA GAMVVAGWAG YAAAQNASEI KIVLPEQPAN LEPCGTIITN VGQILSRNVV EPLTIIDPKS GQPTSGLATE WKQTDPNTWQ LKLREGVKFQ DGAAFNAEAV KFSIERMTGG KLTCSNIAKF GNAKLTVTPI DDLTVEIKSD TPQPILPTLL SVVMIVSPNT PADKAVNDPV GTGPFKLSSF TPQTVVLEAF DGYWGEKPAI ARASYVWRPE SSIRAAMVET GEADLTPSIA IQDATNPETD FAYLNSETTA IRIDAGFAPL DDVRIRKALN LAIDWNGLAQ LFGEDVQRAS QMVVTGINGH DDKLAPWAFD AEKARALIAE AKAAGVPVDT EIELIGRNGI YPNGTEAMEA MMAMWQDVGL NVKLTMLDVN DWLRYLQKPF PESRGPNLLQ MMHDNNKGDA AFTVPIFYTS GGSYSTFNDA AFDKEIADAM AATGEDRTAK FKAIFAKVHE ELAVDIPMFH MIGYTRVGSR LEWKPDITTN SEIPLANIGL KD
|
| |