Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5555 |
Symbol | |
ID | 5319857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 520226 |
End bp | 522154 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640777304 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314236 |
Protein GI | 150377641 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.140103 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0740019 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGAAT TCAGCCGTCG CGCGTTTCTC TTTTCGAGCA CCGCCGCCGT TCTACTGCCT GTAATGCCAA TGATTGCACT GGCCAACTCG GCCAAAGAGC CTGCCATGTT GGAGGCGCTG GCCAAGACGG GCAGCCTGCC AGCCGTTGCG GACCGCTTGC CGCTCAACCC GATGGTCGTC ACCCCGCTGG ATCGGGTGGG GACCCATGGA GGCGACTGGA ACAGTGCAAT CGTTGGCGGG GGATCCCTGT CGATGTTGTT CCGCTATCAG GCTTACGAGC CTCTGTTAAG GTATGCGCCG GATTGGTCGG GCGTGGTGCC AAACGTTGCC GAACATTACG AAAGCAATGC CGACGCGACT GAGTTCACGT TCAGGCTGCG CAAGGGCATG AAATGGTCGG ATGGCGAGCC CTTCACCACG GAAGACATCC TGTTCTGGTA CGAGGATATC TTCAACTACG AGGGGCTCAA CGATGTCGGG CAGAACCATC TGCGCGCCGG AGGCAAGAAG GCGCGCTTCG AAGCTGTCGA CGACGTCACG TTCAAGGTGA TCTTCGCAGC ACCCAATGGA CTTTTTCCCC TCCGGCTCGC ATGGGCGAAC GACGATCAGA CGACGCGGGC ACCGAAGCAT TACCTCAAGC AGTTCCACAT CAAGTACAAT CCAAATGCCG AAGAGGAAGC CAAGACCAAG GGCGCCTCGG GATGGATCCA GCTTTTCCAG CGGGAAGCTG GTCTCGTCGT AGACAACGAA TTCTTCCAGA ACTCGCAGCG CCCGGTCATT CATGCCTGGA AGGTGGCCAT CGCGCCCGGT CAGAGTACCG ACCGTGCCGT TGCCGAGCGA AATCCCTACT ACTGGAAAGT CGATACCGAG GGCAATCAAC TGCCTTATCT GGATCGGATC GTCTACCAGA TGGTTTCCGA TCCGCAGGTC CTACTTCTGA AGGCGATGCA GGGCGAGATC GATTTGATGG ATCAGTATAT TGCCACGCCT GCCAATCGTG CCGTGCTGTA CGATTCCCAA GAGCAGGGCA GATTTGGATT CTACACGCTG ACCTCAACCG AAACAAACGA GATGGTTTTC CAGCTCAATC TCAACCACCC CAATGAGGTG AAGCGCAAGC TCTACAACAA CAAGGACTTC AGGGCAGCGC TCTCAATGGC GCTCGATCGC CAGGCCATCA TCGATACCGT GTTCATCGGA CAGGGAACGA TCTCGCAGCC TGCCGTGCGA GCGGACGATC CGCTCTACAA CGAACGCCTC GCAACGCAGT ACACGCAATA CGACCCCAAT CGCGCCAACG CTCTCCTGGA CAAGATCCTG CCGAGCAAGG ATAGTGAAGG TTTCCGTCTC GATGAAGGCG GAAAACGGGT ATCGATCATT TTTGAAATCG ATCAGGCGCG CGCCACCTTC CTCGACATCT TCCAACTTGC TCTGCCGATG TTCCGGGCTG TCGGCGTTGA TGTCCAGATG AGAAGCATGG ACCGCTCGCT TTGGGAAGTG CGCGTGCGGC AGGGTATCGA GTATGACGCG ACAGCCCATC GCTTTGGCGG CAATGGCGGC ATCGCGGCAA TCCTTGACCC GCGTTATTTC ATTCCCAATA CGACAGAAGC GCTGTACGCG AAAGGTTGGC AACTCTGGTA TCGAGATTCG CAATCCCAGG GTGCGGTGGA GCCACCGCAG CCCGTCAGAA ACGCCTTGGC TCTCTACGAT CGGGTGCTCG CTTCGGCCGA TCCCGATGTG CAAAAGAAGC TCATGGCCGA GATTCTGGAG ATCGCTGCCG ACCAGTTCTA TGTGTTCGGA ATCTGCCTGC CCGCCGACAG CTATGGGGTG GTTAAAAACG ACATGCAGAA CGTCCCCGAG GCGATGCCGA ACTCCTGGGG ATATCCGACG CCCGGACCTG TCAATCCCGA GACTTTCTTC AAGGTCTGA
|
Protein sequence | MQEFSRRAFL FSSTAAVLLP VMPMIALANS AKEPAMLEAL AKTGSLPAVA DRLPLNPMVV TPLDRVGTHG GDWNSAIVGG GSLSMLFRYQ AYEPLLRYAP DWSGVVPNVA EHYESNADAT EFTFRLRKGM KWSDGEPFTT EDILFWYEDI FNYEGLNDVG QNHLRAGGKK ARFEAVDDVT FKVIFAAPNG LFPLRLAWAN DDQTTRAPKH YLKQFHIKYN PNAEEEAKTK GASGWIQLFQ REAGLVVDNE FFQNSQRPVI HAWKVAIAPG QSTDRAVAER NPYYWKVDTE GNQLPYLDRI VYQMVSDPQV LLLKAMQGEI DLMDQYIATP ANRAVLYDSQ EQGRFGFYTL TSTETNEMVF QLNLNHPNEV KRKLYNNKDF RAALSMALDR QAIIDTVFIG QGTISQPAVR ADDPLYNERL ATQYTQYDPN RANALLDKIL PSKDSEGFRL DEGGKRVSII FEIDQARATF LDIFQLALPM FRAVGVDVQM RSMDRSLWEV RVRQGIEYDA TAHRFGGNGG IAAILDPRYF IPNTTEALYA KGWQLWYRDS QSQGAVEPPQ PVRNALALYD RVLASADPDV QKKLMAEILE IAADQFYVFG ICLPADSYGV VKNDMQNVPE AMPNSWGYPT PGPVNPETFF KV
|
| |