Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2183 |
Symbol | |
ID | 5323043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2257047 |
End bp | 2258636 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640791121 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327851 |
Protein GI | 150397384 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.226598 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAT TCACGAAGCG TCCCGATGGC CTCATCGTCC CGGCCGGCAT CAACCGGCGC GGCTTTCTCG CGGGCACAGC AGCGCTTGGC ATCGCCGCAG GGCTCGGCAG TTCACTCGTA GGCGGTGAAG CGCGGGCGCA GGAACCGAAG CGAGGCGGCC ATCTGAAGCT CGGCCTCGAC GGCGGCGCGA CAACCGATTC ACTCGACCCG GCAAGCTACA GCGGTTCCGT ATCCTTCGTA ATCGGCCATC TCTGGGGCGA CACACTGGTC GAATCCCATC CCGAGACCGG CGCTCCGCTG CCGTCAATCG CTTCCTCCTG GGAGTCGTCG GCCGACGCTT CGGTCTGGAC GTTCAAGATT CGAAAGGACA TTGCGTTCCA CGATGGCAGG AAACTCGCCG TTGCAGACGT GATCGCAACC TTGAAGCGGC ACTCCGACGA AGGCTCCAAA TCGGGCGCGC TGGGACTTAT GCGCTCCGTG AAGGCAATTG AGGAAAAGGC CGGCGATCTC GTCCTGACGC TGACGGAAGG CAATGCCGAC CTGCCGCTTC TGCTGACCGA CTATCACCTC ATCATTCAGC CGGACGGCGG CGTCGATAAT CCGGCTTCGC CGGTCGGCAC CGGACCCTAC AAGCTCGCAA GCTACGAAGC CGGTATACGT GCGACCTTTG AAAAGAACAC CGCCGACTGG CGCTCGGACC GCGGCTATGT CGACAGCGTC GAAATCATCG TCATGAATGA CACCACGGCG CGCATTGCGG CGCTTTCCTC CGGGCAGGTG CACTTCATCA ACGGCGTCGA ACCGAAAACC GTGCCGCTGC TCAAGCGCGC GCCCCGCGTG GAAATTCTGC AGACTTCGGG CAAGGGCTTC TACAGCTTCC TCATGCATTG CGACACGAGC CCCTTCGACA ATAACGACCT GAGGCTCGCG CTGAAATATG CGATCGACCG GCAGGCCATT CTCGACCGTG TGCTCGGCGG CTTTGGCACG CTCGGCAACG ATTATCCGGT CAATGAAAAC TACGCGCTCG CACCTGAAGG CATCGAGCAG CGTGCCTACG ATCCCGACAA GGCCGCCTTC CACTACAAGA AGTCCGGCCA TGACCGGCCG ATCCTGCTGC GCACATCCGA CGCCGCCTTC CCGGGCGCGG TCGACGCCTC CGTCCTCTTC CAGGAAAGCG CCCGCAAGGC CGGCATCGAA ATCGAGGTAC GTCGCGAACC GGAAGACGGC TACTGGACCA ATGTCTGGAA TGTGCAGCCT TTCTGCGCAT CCTACTGGGG CGGCCGCCCC ACACAGGACT CCCGCTATTC CACCTCCTAT CTTTCGAGTG CGGAATGGAA CGACACGCGG TTCAAGCGAG AGGATTTCGA CAAGCTGCTC CTGCAGGCGC GCTCCGAACT GGACGAAGCC AAGCGCAAGG CGCTCTACCA TACCATGGCT CTCATGGTGC GCGATGAAGG CGGCCTCATT CTGCCGGTCT TCAACGACTA TGTGAACGCC GCCTCCGCTT CGCTGAAAGG TTTCGTGCAC GACATCGGCA ACGACCTTTC GAACGGCTAT GTCGGAAGCC GCGTCTGGTT CGACAGCTAA
|
Protein sequence | MTEFTKRPDG LIVPAGINRR GFLAGTAALG IAAGLGSSLV GGEARAQEPK RGGHLKLGLD GGATTDSLDP ASYSGSVSFV IGHLWGDTLV ESHPETGAPL PSIASSWESS ADASVWTFKI RKDIAFHDGR KLAVADVIAT LKRHSDEGSK SGALGLMRSV KAIEEKAGDL VLTLTEGNAD LPLLLTDYHL IIQPDGGVDN PASPVGTGPY KLASYEAGIR ATFEKNTADW RSDRGYVDSV EIIVMNDTTA RIAALSSGQV HFINGVEPKT VPLLKRAPRV EILQTSGKGF YSFLMHCDTS PFDNNDLRLA LKYAIDRQAI LDRVLGGFGT LGNDYPVNEN YALAPEGIEQ RAYDPDKAAF HYKKSGHDRP ILLRTSDAAF PGAVDASVLF QESARKAGIE IEVRREPEDG YWTNVWNVQP FCASYWGGRP TQDSRYSTSY LSSAEWNDTR FKREDFDKLL LQARSELDEA KRKALYHTMA LMVRDEGGLI LPVFNDYVNA ASASLKGFVH DIGNDLSNGY VGSRVWFDS
|
| |