Gene Information |
Locus tag | Smed_3985 |
Symbol | |
ID | 5317911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 436358 |
End bp | 437857 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640775793 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312726 |
Protein GI | 150376130 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
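The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the reported 1500 bp), and that the final codon of this unspliced CDS is a stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(436358, 437857)
print(length, protein_length(length))  # 1500 499
```

Both results agree with the Gene Length and Protein Length rows of this record.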
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAT CGCTGCTTTT GGGAGTCCTC ATGCTGGGCT CCGCCCTATC CCCCGCTTTC GGGGAATCGG GACCGATCAA GATCGTCCTA CCGGAAGAGG CAGATCTGCT TGAGCCTTGC ATGGCCACGC GTTCCAATAT CGGCCGCATC ATCATGCAGA ACGTCAGCGA GACGCTGACC GAGCTTGACG TTCGCGGCGG CAAGGGCCTG ATGCCGCGTC TGGCGGAAAG CTGGGAGCAG AAGGAGGATG GCAGCTGGCG CTTCAACTTG CGCAAGGGCG TCAAGTTCTC CGACGGCACC GACTTCAACG CCAAAGACGT CAAGTACAGC TTCGACCGCG TGATGAGCGA CAAGAACGCC TGCGAGTCGC GCCGCTACTT CGGCGGCATG AACATCAAGA TCGACGTCGT CGATGACGCC ACCATCGATT TCACCGTCGA TCCCGTTCAG CCCATCCTGC CGCTTCTCCT GTCGTTGCTG ACGATCGTAC CGGAAGAGAC TCCGATGGAA TTCGTTCGCG AGCCTGTCGG GACCGGCCCC TACGAGCTGA CTGACTGGAC GCCCGGCCAG CAGATCGTGC TCACTGCCCG CGACGACTAC TGGGGCGAGA AGCCGGAGGT CACCGAGGCC ACCTATCTGT TCCGCTCCGA TCCTGCGGTC CGCGCCGCAA TGGTCCAGAC GGGTGAAGCA GACCTTTCGC CCTCCATATC GCAGACCGAG GCCACGAATC CTGCCACGGA TTTCTCCTAT CTCGACAGTG AGACCGTTTA CCTGCGCATC GATCACAATA TCGAGCCGCT GAACGATGTC CGCGTGCGCC GCGCTCTGAA CCTTGCTATC GACCGCGAAG CTTTCCTCGG GACGCTGGTG CCCGACAGCG CCGTTCTTGC CACCGCTATC GTTCCGCCGC CGACGCTCGG CTGGAACCCG GACGTCAAGG TCTTCCCCTA TGATCCCGAT CAGGCAAAGA AGCTCATCGA GGAAGCCAAG GCCGACGGCG TCAAGGTGGA CACGCCGATC ACCATCATCG CGCGCACGGC GAACTTCCCG AACGTGACCG AAATCATGGA GGCTATCCAG GCGCAGTTGC AGGAGGTCGG GCTTCAGGTC GAGCTCAAGT TCGTTGAGGT CGCCGAGCAT GAGGTCTATT ATTCAAAGCC TTTCAAGGAC GGGCGCGGCC CGCAGCTCGT TGCGGCGATG CACGACAACT CGAAGGGCGA TCCCTCCTTC TCGATGTTCT TCAAATACGA CTCCGAAGGC ACGCAGTCGG GCTTTGCCGA TCCGAAGGTC GACGACCTGA TCGCCCGAGC AAATGCAGCC GTCGGTGACG AACGCGCAAA GCTCTGGTCC GAGCTCATCG CCTATGTGCA CGACGAGGTG GTGGCAGATG TGCTTCTGTT CCACATGGTC GGCTTCTCGC GGGTATCGGA GCGGCTGGAC TTCAAGCCGA CAATGGCGAC GAATTCCATG CTGCAGCTCT CCGAGATCAA GATCAAATGA
|
Protein sequence | MKRSLLLGVL MLGSALSPAF GESGPIKIVL PEEADLLEPC MATRSNIGRI IMQNVSETLT ELDVRGGKGL MPRLAESWEQ KEDGSWRFNL RKGVKFSDGT DFNAKDVKYS FDRVMSDKNA CESRRYFGGM NIKIDVVDDA TIDFTVDPVQ PILPLLLSLL TIVPEETPME FVREPVGTGP YELTDWTPGQ QIVLTARDDY WGEKPEVTEA TYLFRSDPAV RAAMVQTGEA DLSPSISQTE ATNPATDFSY LDSETVYLRI DHNIEPLNDV RVRRALNLAI DREAFLGTLV PDSAVLATAI VPPPTLGWNP DVKVFPYDPD QAKKLIEEAK ADGVKVDTPI TIIARTANFP NVTEIMEAIQ AQLQEVGLQV ELKFVEVAEH EVYYSKPFKD GRGPQLVAAM HDNSKGDPSF SMFFKYDSEG TQSGFADPKV DDLIARANAA VGDERAKLWS ELIAYVHDEV VADVLLFHMV GFSRVSERLD FKPTMATNSM LQLSEIKIK
|
| |
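The GC content reported for this gene can in principle be recomputed from the gene sequence as printed, which is split into space-separated blocks of 10 bases. The sketch below is a minimal illustration; `gc_percent` is a hypothetical helper, and the example call uses only the first two blocks of the sequence rather than the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence: 7 of 20 bases are G or C.
print(gc_percent("ATGAAAAGAT CGCTGCTTTT"))  # 35.0
```

Passing the complete gene sequence string to the same function would give a figure to compare against the rounded GC content value listed in the gene information table.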