Gene Information |
Locus tag | Smed_5796 |
Symbol | |
ID | 5320098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | - |
Start bp | 769200 |
End bp | 770780 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640777500 |
Product | extracellular solute-binding protein |
Protein accession | YP_001314432 |
Protein GI | 150377837 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
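The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 769200
END_BP = 770780


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))
```

Under these assumptions the result agrees with the listed Gene Length (1581 bp) and Protein Length (526 aa).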
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00815808 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.00391201 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGCGTT TAAACAGGTT TCTCATTTCG GCGCTGACGG TAGCGGCGAT TACCGCGCCG GCGCTGTCCA CCTCCGCAAG TGCCGCCACG CTTCGCTGGG GCAGCCGCGC AGACATCTAT TCGCTCGATC CGGATTCCGT TCCCTCGACA TCCAACCTTG CGTTCCTGAA CCACATCTAT GAAGGTCTGA TCCGGTATGG ACCGAACTTC GAGATCGAGC CGGCGCTCGC CACCGAGTGG AAGCTGATCG ACGACAAGCA CTGGCGTTTC ACGCTGCGCA AAGGCGTGAA GTTCCACAAC GGCGCAGACT TCACCGCAGA CGACGTTGTC GCCTCCATGA ACCGCGTGTC GGACCCGGCC TCGCCTCTGC GCGGCAACAT CCCGCTCTAT GTCGGCGTAA AGAAGGTGGA CGATTTCACG GTCGACATCG AGGTTTCGGC GCCGACTGCG CTGTTCCTGA ACGACATGAC CAATATCTTC ATGTTCAACG CGAAATGGCT GACGGATAAC AAGGCAGAAA AACCGACCGA TATCGCGTCC AATACCGAGA ACTACGCGAC GCACAACACG AACGGTACGG GCCCGTTCAA GCTTGAGAGC CGCGTTCCGG ACAGCAAGAC CGTTCTCATC GTCAACGACC TGTGGTGGGA TCAGAAGAAG CACAATCTGG ACCGGATCGA GTATGTTCCG ATCGCATCGG CGGCGACGCG TGTCGCAGCG CTTCTTTCCA ACGAAATCGA TCTGGTCGAT TCCGCACCCA TTCAGGACCT TCCTCGCCTG GAATCCTCGC CTGGTATCAA AGTAAGCAAG CGCACGGAGC TGCGCACCGT GTTCATCGGC TTCAACGGCA AGGCGAAGCT TGAAGATGGG CGTGCGAACC CGTTCCTCGA CGTTCGCGTT CGTCAGGCCG TTGACGCAAG CATTGATCGC GATCTCATCA ACAAGAAGAT CATGCGCGGT CTGGCGCGGC CCTCCGGCTC GCTCATTGCA CCGGAAATCG CCGGCTATGC GAAATCGCTT GATACCTATC AGCCCGTCGA CACCGAGCTT GCCCAGAAGC TGCTCGCGGA AGCCGGCCAG GAGGGTCTTG CCTTCACCTA TCTTTGCATG AACGACGAGA GCATCAACGA GGAAGACTTC TGCTCGGGCA TCGCGAACAT GCTGAAGCGT GCCGGCTTTC AGCCCACAAT CGACATGGGA CCGCGCGCCG TGCAGCAGCC TAAGCGCACC AATGGCAAGG CCGACATTTT CAACCTGAGC TGGGCAAACG AACCGACGCT CGATGCCTAT TCGCTGCTCT CTCAAGTCCT CTCCACGCGC AGCGGTTCGA CGGGCGTTTC CAACTATGGC GGCTGGTCCT ACCCGGAGCT CGACGAGCTG GTGAAGAAGG CGGCACAGGA ACCTGACACC GCAAAGCGGC TCGCACTCGA AGAACAGGCC CTGAAAATCG CCAAGGACAA GGCGATCCTG ATCCCGCTCC ACCAGCAGCC GATTGCCTGG GGCATGCTGG ACACGGTCAA GAGCGTGGAT TTCCGCGCCG ACAACAAGCC GCGCCACTGG CACACGCAAA TGGCCGAATA A
|
Protein sequence | MSRLNRFLIS ALTVAAITAP ALSTSASAAT LRWGSRADIY SLDPDSVPST SNLAFLNHIY EGLIRYGPNF EIEPALATEW KLIDDKHWRF TLRKGVKFHN GADFTADDVV ASMNRVSDPA SPLRGNIPLY VGVKKVDDFT VDIEVSAPTA LFLNDMTNIF MFNAKWLTDN KAEKPTDIAS NTENYATHNT NGTGPFKLES RVPDSKTVLI VNDLWWDQKK HNLDRIEYVP IASAATRVAA LLSNEIDLVD SAPIQDLPRL ESSPGIKVSK RTELRTVFIG FNGKAKLEDG RANPFLDVRV RQAVDASIDR DLINKKIMRG LARPSGSLIA PEIAGYAKSL DTYQPVDTEL AQKLLAEAGQ EGLAFTYLCM NDESINEEDF CSGIANMLKR AGFQPTIDMG PRAVQQPKRT NGKADIFNLS WANEPTLDAY SLLSQVLSTR SGSTGVSNYG GWSYPELDEL VKKAAQEPDT AKRLALEEQA LKIAKDKAIL IPLHQQPIAW GMLDTVKSVD FRADNKPRHW HTQMAE
|
| |
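The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the gene sequence can be cleaned, its GC fraction computed, and its translation compared against the listed protein. The helper names below are hypothetical. The codon table is written out by hand on the assumption that translation table 11 assigns amino acids exactly as the standard code does (the two differ in permitted start codons, which this sketch ignores).

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGTCGCGTT TAAACAGGTT TCTCATTTCG"))
```

Passing the full Gene sequence field to `gc_content` and `translate` gives values that can be compared with the GC content and Protein sequence fields of this record. No reverse complement is applied here because the sequence, although on the minus strand, is already listed in coding orientation (it begins with ATG and ends with TAA).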