Gene Information |
Locus tag | Smed_5416 |
Symbol | |
ID | 5319718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 380168 |
End bp | 381772 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640777182 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314114 |
Protein GI | 150377519 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
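The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length includes the stop codon. Neither convention is stated in this record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
start_bp, end_bp = 380168, 381772

length = gene_length_bp(start_bp, end_bp)
print(length)                              # 1605, matches Gene Length
print(expected_protein_length_aa(length))  # 534, matches Protein Length
```

Under these assumptions the reported Gene Length (1605 bp) and Protein Length (534 aa) are consistent with the coordinates.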
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.604179 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCAAATAC GAAGCAGGTT TTGTTCACAT TGGCTTGCCG CATTGCTGCT CTCGCTCACC TTCTGGAGCG GTCTTCCAGG GTCTGGTCAT GCGCAGGAGG AGCTTCGGAT CGCAACATCG TACAAGCTGA TGACACTCGA TCCGCATTAC GCAAATCTCA ACGAAAACAC CTCGCTGCTC TCCCATATCT ACGAGCGTCT GGTCTACCAG GATGACCGTC TCGATCTGAA ACCGGGTCTG GCCGTCTCCT GGCGCGCGGT GTCGGACAGG CAGTGGGAGT TCAAGCTTCG CGAGAATGTC CGCTTCCATG ACGGCTCGCC CTTGACGGCG GACGACGTCG TCTACACGAT CGAGCGCATA CGGGATTTCC TCAAATCCCC GAGCGGCGGC TTCCGGTCCT ATGTGACCGG CATCGAATCG GTCTCGGCAC CCGATCCTCT CACCGTCGTC ATCGACACCA AGGGCAATAT CCCCAACCTG CCGCTGTCGT TCTCGTCGAT CTTTGTGATG AACCGGCCGG CACAGGGGTT CCAGACCACC GAAGAGCTCA ATGCCGGCAG GGCCGGCCAC CCTCCGGTCG GCACAGGGCC GTATACATTC GAAAGCTGGA GTTCCGGCGA GGTGCTGAAG CTCGCCAGGA ACGATGATTA CTGGGGCGGC AGGCCTGGCT GGCCGCAGGT CACGTTTCGG GTCATCGAAA GCCCGGCTGC CCGCGTGGCG GCACTCAGCA CCGGCGAAGT CGACCTGGCG GATGCTATTC CCGCACGCGA CGTTGCCTCT TTGAAGCAGC GCGGCGCCAG GATAGCCAGC GTCGGCGCGG CGCGGATCAA CTTCCTGCAG TTCGACGTGG AGCGAGACAG GCTTCCCGGC GTGACCGATA AGTCCGGCGA GCCGATCGCC AATCCGTTCA AGGACGCCTT GGTCCGTCGT GCGCTCGCCA TGGCCACCGA TCGCGGAATT CTGGTCGACA AGATCCTCTC GGGCTATGGC ACGGCCGCAG CCCAGCTCTT TCCCGGCGGC TTGCCGGGTA CCTCGGAAAC CTTGCAGCCG GAGGCTCCGA AGTATGACGA AGCCAAGGCG CTTCTCGCAA AGGCTGGTTT CCCTGACGGT TTCAACCTCA TTCTCGCCGG ACCTGCCGGG CGTTATCCCG GCGACGGCGA GAGCCTTCAG GCGATTGCGC AAAGCTGGGC CCGCATCGGA GTAAAGGTGC AGCCGGCGGC GGCGCCGTTT TCGGTTTTCA ATACAAAGCG TGCCGCCGGC GACTATGCCG TCTGGTACGG CGGCGCTTCC GGCGAAGCGG TGGACATCAT CCTCCACGCT CTGCTGGCCT CACCGGACCC TGAAAGCGGG AACGGCGCCT TGAACTTCGG GCATTATCGC AACCAGGCTT TCGACGCGAT GCTCGCAAGG GCGGAAAGCA TCCAGGAGGG CCCTGAGCGC AACAAGGCGC TCGCCGAAGC GACCGAGTTC GTGATGGCCG ATCAGCCGAT CATACCGCTT TACCACTTCC ATCACATCGT CGGCTACGGC CCGCGCGTTG CCTCCTATGC GATGCATCCC CGCGGCTGGA CCACGGCGAT GCAGACGCTT GCCGCGACGG AGTAA
Protein sequence | MQIRSRFCSH WLAALLLSLT FWSGLPGSGH AQEELRIATS YKLMTLDPHY ANLNENTSLL SHIYERLVYQ DDRLDLKPGL AVSWRAVSDR QWEFKLRENV RFHDGSPLTA DDVVYTIERI RDFLKSPSGG FRSYVTGIES VSAPDPLTVV IDTKGNIPNL PLSFSSIFVM NRPAQGFQTT EELNAGRAGH PPVGTGPYTF ESWSSGEVLK LARNDDYWGG RPGWPQVTFR VIESPAARVA ALSTGEVDLA DAIPARDVAS LKQRGARIAS VGAARINFLQ FDVERDRLPG VTDKSGEPIA NPFKDALVRR ALAMATDRGI LVDKILSGYG TAAAQLFPGG LPGTSETLQP EAPKYDEAKA LLAKAGFPDG FNLILAGPAG RYPGDGESLQ AIAQSWARIG VKVQPAAAPF SVFNTKRAAG DYAVWYGGAS GEAVDIILHA LLASPDPESG NGALNFGHYR NQAFDAMLAR AESIQEGPER NKALAEATEF VMADQPIIPL YHFHHIVGYG PRVASYAMHP RGWTTAMQTL AATE
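The relationship between the two sequences can be illustrated with a minimal translation sketch. It is not the method the database uses. It builds the standard codon assignments, which translation table 11 shares for elongation. Table 11 also permits alternative start codons, which this sketch ignores because the gene starts with ATG. The names `translate` and `gc_fraction` are hypothetical helpers. Whitespace is stripped so the 10-base blocks shown above can be pasted directly.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCAAATAC GAAGCAGGTT TTGTTCACAT"))  # MQIRSRFCSH
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the reported GC content.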