Gene Information |
Locus tag | Smed_2064 |
Symbol | |
ID | 5322923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 2114764 |
End bp | 2116113 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640791001 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327732 |
Protein GI | 150397265 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
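The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 2114764
END_BP = 2116113
GENE_LENGTH_BP = 1350
PROTEIN_LENGTH_AA = 449


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the final codon is a stop and is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 2116113 - 2114764 + 1 gives 1350 bp, and 1350 / 3 - 1 gives 449 residues, which agrees with the listed values.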
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0548204 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCAAGC CAATAATTTC TTCGCCCAGA CCGCACTTGA GCAGACGGCA GGTCTTGCAG GGCGTGGCGG CCGCGGCCGG TGCCGGATTG GCGGGATTTC CCGGACAGCT CGGCGCAGCA AGTCAGGTCA AGGAACTGAT CGTTCTAACC GGAACCACCC CATGGCTCCC GGCTTACCAG AAGGCCGCAG CAGCCTACGA GGCGGAGAAG GGCATCAAGA TCACGTTCCG CGCATTTCCC TATGGCGGAA TGCGTACCCA GATGACCAAC GCAATTCAGA GCAAGAACGC GGCCTTCGAC GTCTTCCAGC TCGACGAACC CTGGACCGGC CAGTTCTACG ACAACGGATG GGTCAAACCG CTCGAGGAGA TTATCGAGGG CTACAAGCTC GACCCGAACA TTCTGACCTA TGACAGCCTT CCCCTCTGGG ACAAACAGCA GAGGCGCGGC AAAGCCGGCG GGAAGATCAT GGGCCTGCCG ATAAACGGCA ACGTCGATCT ATTCGTCTAC CGAAAGGACA TCTATGAGAA GCTGGGTCTG ACGGTTCCGA AGACCTGGGA TGAGGCGATC GAAAACGGCA AGAAGGCCGT CGAAGCAGGC GAAGTCCGAT ATGGTTATGT CACCCGTGGC CAACCCACGG CCGGCGGGCA ATCGGTCAGC TTCGAGTTCA TGCACGTGCT TTACGGCTTC GGCGGCGATT GGTTCAAGGC CGACGGCGCT ACACTCGTCC CGACAATCAA CAATGACGCT GCAAAGACCG CGGCAGCGAC TTTCCGCCGG CTTCTTGAGC TTGGGCCCTC ACGGTCTCAA ACGGTCGGTC AGGCGGACTG CATCGCCCTG ATGCAGAGCG GCCAGGCTCT GCAGGGTCAC TTCGTTGCTG CCGGCATGCC CCAGCTCGAA GACGAGACCC GTTCATCGGT CGTCGGCAAA TGCGGCTACA CCATCGTTCT GGCAGGATCA CTCGACCATC CCGTTCCGGC AAGTGGTGTC TGGTCGCTTT GCGTCCCGGC GGATCAGGCG CCCGAACGGC AGCTCGCGGC AGCCGAGTTC ATTATGTGGA TGCTGGACAA GAAGCAGCAG GAAGCCTTTG CAGGCGCTGG CGGGATGCCG ACCCGCAAAG ACGTCGACGT GTCCGGAGCA GGCGCATTGC GACCGATCAT GGAAGCCGCC AGGGATTCGG CGGCCCTCAC GCAGGGCGCC ATCCGCTATG TCTTCGCAGC CCAGATGCTT GAAGCCGTCG AACCGGTCAT CGGTCAGATC GGCTCCGGTG ACCTGGCCGT CGACGAAGGA CTGGACGAAC TGCAAGCGAA ACTCGCGGAG ATCGCCAAGG CGAGCGGCTT CGCGAAATAG
|
Protein sequence | MIKPIISSPR PHLSRRQVLQ GVAAAAGAGL AGFPGQLGAA SQVKELIVLT GTTPWLPAYQ KAAAAYEAEK GIKITFRAFP YGGMRTQMTN AIQSKNAAFD VFQLDEPWTG QFYDNGWVKP LEEIIEGYKL DPNILTYDSL PLWDKQQRRG KAGGKIMGLP INGNVDLFVY RKDIYEKLGL TVPKTWDEAI ENGKKAVEAG EVRYGYVTRG QPTAGGQSVS FEFMHVLYGF GGDWFKADGA TLVPTINNDA AKTAAATFRR LLELGPSRSQ TVGQADCIAL MQSGQALQGH FVAAGMPQLE DETRSSVVGK CGYTIVLAGS LDHPVPASGV WSLCVPADQA PERQLAAAEF IMWMLDKKQQ EAFAGAGGMP TRKDVDVSGA GALRPIMEAA RDSAALTQGA IRYVFAAQML EAVEPVIGQI GSGDLAVDEG LDELQAKLAE IAKASGFAK
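The sequences are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch for stripping the spacing, computing GC content, and checking that a CDS is framed by a start and stop codon. The helper names are hypothetical, and the start-codon set is a simplification (only the three most common bacterial starts); the stop codons are those of translation table 11.

```python
# Stop codons of translation table 11; the start set is a simplified assumption.
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the ends are start/stop codons."""
    return (
        len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # Toy input in the same blocked layout; paste the full "Gene sequence" field here.
    toy = clean_sequence("ATGGCC TAG")
    print(len(toy), round(gc_fraction(toy), 3), looks_like_complete_cds(toy))
```

Applied to the full gene sequence, the cleaned length can be compared with the Gene Length field and the GC fraction with the GC content field, bearing in mind that the record reports GC content rounded to a whole percent.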