Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_5037 |
Symbol | |
ID | 5319086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1556637 |
End bp | 1558241 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776818 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313750 |
Protein GI | 150377154 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.4089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.963466 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAATTC AAAGCAGGTT TTGTTCACAT CGGCTTGCCG CATTGCTGAT CTCGCTCACC TTCTGGAGCG GTCTTCCAGG GCCTGGCCAT GCGCAGGAGG AGCTCCGGAT CGCGACATCG TACAAGCTGA TGACACTCGA TCCGCATTAC GCAAATCTCA ACGAAAACAC CTCGCTGCTC TCCCATATCT ACGAACGTCT GGTCTACCAG GATGACCGTC TCGATCTGCA ACCGGGTCTG GCCGTCTCCT GGCGCGCGGT GTCGGACAGG CAGTGGGAGT TCAAGCTTCG CGAGAATGTC CGCTTCCATG ACGGCTCGCC CTTGACGGCG GACGACGTCG TCTACACGAT CGAGCGCATA CGCGATTTCC TCAAATCCCC GAGCGGCGGC TTCCGGTCCT ATCTGACCGG CATCGAATCG GTGTCGGCAC CCGATCCCCT CACCGTCGTC ATCAACACCA AGGGCAATAT CCCCAACCTG CCGCTGTCGT TCTCGTCGAT CTTTGTGATG AACCGGCCGG CACAGGGGTT CCAGACTACC GAAGAGCTCA ATGCCGGCAG GGCCGGCCAC CCTCCGGTCG GCACAGGGCC GTATACATTC GAAAGCTGGA GTTCCGGCGA GGTGCTGAAG CTCGCCAGGA ACGATGATTA CTGGGGCGGC AGGCCTGCAT GGCCGCAGGT AACGTTTCGG GTCATCGAAA GCCCGGCTGC CCGCGTGGCG GCACTCAGCA CCGGCGAAGT CGACCTGGCG GATGCTATTC CCGCACGCGA CGTTGCCTCT TTGAAGCAGC GCGGCGCCAG GATAGCCAGC GTCGGCGCGG CGCGGATCAA CTTCCTGCAG TTCGACGTGG AGCGAGACAG GCTTCCCGGC GTGACCGATA AGTCCGGCGA GCCGATCGCC AATCCGTTCA AGGACGCCTT GGTCCGTCGT GCGCTCGCCA TGGCCACCGA TCGCGGAATT CTGGTCGACA AGATCCTCTC GGGCTATGGC ACGGCCGCAG CCCAGCTCTT TCCCGGCGGC TTGCCGGGTA CCTCGGAAAC CTTGCAGCCG GAGGCTCCGA AGTATGACGA AGCCAAGGCG CTTCTCGCAA AGGCTGGTTT CCCTGACGGT TTCAACCTCA TTCTCGCCGG ACCTGCCGGG CGTTATCCCG GCGACGGCGA GAGCCTTCAG GCGATTGCGC AAAGCTGGGC CCGCATCGGA GTAAAGGTGC AGCCGGCGGC GGCGCCGTTT TCGGTTTTCA ATACAAAGCG TGCCGCCGGC GACTATGCCG TCTGGTACGG CGGCGCTTCC GGCGAAGCGG TGGACATCAT CCTCCACGCT CTGCTGGCCT CACCGGACCC TGAAAGCGGG AACGGCGCCT TGAACTTCGG GCATTATCGC AACCAGGCTT TCGACGCGAT GCTCGCAAGG GCGGAAAGCA TCCAGGAGGG CCCTGAGCGC AACAAGGCGC TCGCCGAAGC GACCGAGTTC GTGATGGCCG ATCAGCCGAT CATACCGCTT TACCACTTCC ATCACATCGT CGGCTACGGC CCGCGCGTTG CCTCCTATGC GATGCATCCC CGCGGCTGGA CCACGGCGAT GCAGACGCTT GCCGCGACGG AGTAA
|
Protein sequence | MQIQSRFCSH RLAALLISLT FWSGLPGPGH AQEELRIATS YKLMTLDPHY ANLNENTSLL SHIYERLVYQ DDRLDLQPGL AVSWRAVSDR QWEFKLRENV RFHDGSPLTA DDVVYTIERI RDFLKSPSGG FRSYLTGIES VSAPDPLTVV INTKGNIPNL PLSFSSIFVM NRPAQGFQTT EELNAGRAGH PPVGTGPYTF ESWSSGEVLK LARNDDYWGG RPAWPQVTFR VIESPAARVA ALSTGEVDLA DAIPARDVAS LKQRGARIAS VGAARINFLQ FDVERDRLPG VTDKSGEPIA NPFKDALVRR ALAMATDRGI LVDKILSGYG TAAAQLFPGG LPGTSETLQP EAPKYDEAKA LLAKAGFPDG FNLILAGPAG RYPGDGESLQ AIAQSWARIG VKVQPAAAPF SVFNTKRAAG DYAVWYGGAS GEAVDIILHA LLASPDPESG NGALNFGHYR NQAFDAMLAR AESIQEGPER NKALAEATEF VMADQPIIPL YHFHHIVGYG PRVASYAMHP RGWTTAMQTL AATE
|
| |