Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_1397 |
Symbol | |
ID | 5322248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1472542 |
End bp | 1474362 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640790339 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327078 |
Protein GI | 150396611 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.982775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.886089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGTTGC GCGCAGTTCT TTCCGGCCTG GTGATGCTCG TGGCAGCCAG CATATTTGCC AACGGTGCTT TGGCCGCGCC CGTGCATGCG ATCGCCATGT ATGGCGAACC AGCCTTGCCA GCCGACTTCA AGCACTTCCC TTACGTCAAC CCGGAGGTGA AGAAAGGCGG GAGGATTGCC TATGGCGTCG TCGGCACGTT CGACAACCTC AACCCCTTCA TCCTGAAGAG CATGCGCACG ACTGCGCGCG GCATGTGGGA TCCCGGTTTC GGCAATCTGG TCCACGAGTC CCTGATGCAG CGCTCGCAGG ACGAGCCCTT CACCATGTAT GGCCTGCTCG CCGAGACCGT CGAATGGGAC GATGACCGTA CTTTCATCCA GTTCAACCTC AATCCCAAGG CGCGCTGGGC CGACGGTCAG CCGGTAACCG CAGAAGACGT GATCTTCACC TTCGAATTGA TGCGCGACAA AGGTCGCGCA CCCTTCAGCA ACCGCCTCTC CAAGGTGGCG AAGATGGAAA AGGTCGGGGA GCGAAGCGTG CGCTTCACTT TCACCGAGGA TGCCGACCGA GAGGTTCCGC TTTTGCTTGG GCTTTCACCC GTGCTGCCGA AACATGCGGT CGACGTCGAG GCGTTCGATC GAACGAGCCT CAAACCGCCG CTCGGCTCCG GCCCTTACCG CGTCGCGGAA GTCAGGCCCG GAGAACGAAT CGTCTATCGC CGCAATCCCG ATTATTGGGC CAAGGATCTG CCGTCCAAGG TCGGTCTCGA CAATTTCGAC GAGATCTCGG TTGAGTACTT TCTTCAGGAG AACACGCTTT TCGAAGCCTT CAAGAAGGGC GTCGTAGACA TCTATCCGGA AGGAAGCGCC ACCAAATGGG CTCGCGCCTA TGATTTCCCC GCGGTCCGCA GCGGCGACGT GATCAAGGAA ACATTCAAGC CGAAGACGCC CTCCGGAATG CTCGGCTTTG TCTTCAACAC GCGCAGGCCT GTATTCGACA ACATCAAGCT CAGACAGGGC CTGGCTCTCG TCTTCGACTT CGAGTGGGTC AACAAGAACC TCTTCGACGG TGCCTATACC CGCACGCAAA GCTACTGGCA GAATTCGTCG CTTTCCTTCC TGGGCGTAGC AGCCGACAAT CGTGAACTTG ACCTCATGGG AGATGTCAGG GAGCGCATCA ATCCGGCGAT CCTCGACGGC AGCTATAGGC TCCCCGTTAC GGACGGCTCG GGTCGCGATC GCAACGTGCT TCGGGAGGCA GTGACGCTGC TGCGTGAAGC GGGCTACTCG ATCAAGGACG GCAAGATGGT CGATGGCAAC GGCACGCCGC TTGCTTTCGA GATCATGAGC CAGAATGCCG GACAGGAGAA GATCGCCCTC GCCTACCAGC GCTTTCTGGC CCCGCTCGGC ATTGTTGCAA GGGTTCGGAC GGTCGATGAT TCGCAATATC AGTTGCGCAG CCAGTCCTTC GATTACGACG TCATCATAAA GTCGTTTCCG TCGTCGCTCT CCCCGGGTCT CGAACAGATC AACCGCTGGA GCTCGCGAAC ACGGGACAGG CAAGGTAGCG ACAATTTCGC CGGGGTCGCC GACAAGGACG TCGACAAACT GATCAACAAC ATACTCCAGG CACGAAACCC CGATGATTTC ACCGCCGCCG TTCGCGCGCA CGACCGGCTG CTGGTCAACA ATTCCTATGT GGTTCCGCTT TACCACCTCG ATGCGCAATG GATCGCCCGG TGGAAGCGTA TCGGCCGGCC AACGGCTGAG CCGCTCTATG GCTATCAGTT GCCGAGCTGG TGGGACGAGC GGGTCCAGTA A
|
Protein sequence | MKLRAVLSGL VMLVAASIFA NGALAAPVHA IAMYGEPALP ADFKHFPYVN PEVKKGGRIA YGVVGTFDNL NPFILKSMRT TARGMWDPGF GNLVHESLMQ RSQDEPFTMY GLLAETVEWD DDRTFIQFNL NPKARWADGQ PVTAEDVIFT FELMRDKGRA PFSNRLSKVA KMEKVGERSV RFTFTEDADR EVPLLLGLSP VLPKHAVDVE AFDRTSLKPP LGSGPYRVAE VRPGERIVYR RNPDYWAKDL PSKVGLDNFD EISVEYFLQE NTLFEAFKKG VVDIYPEGSA TKWARAYDFP AVRSGDVIKE TFKPKTPSGM LGFVFNTRRP VFDNIKLRQG LALVFDFEWV NKNLFDGAYT RTQSYWQNSS LSFLGVAADN RELDLMGDVR ERINPAILDG SYRLPVTDGS GRDRNVLREA VTLLREAGYS IKDGKMVDGN GTPLAFEIMS QNAGQEKIAL AYQRFLAPLG IVARVRTVDD SQYQLRSQSF DYDVIIKSFP SSLSPGLEQI NRWSSRTRDR QGSDNFAGVA DKDVDKLINN ILQARNPDDF TAAVRAHDRL LVNNSYVVPL YHLDAQWIAR WKRIGRPTAE PLYGYQLPSW WDERVQ
|
| |