Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_2164 |
Symbol | |
ID | 5323024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2233979 |
End bp | 2235424 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640791102 |
Product | extracellular solute-binding protein |
Protein accession | YP_001327832 |
Protein GI | 150397365 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGAGA AGGAGAAAGA CCTCATTGGC GCATTCCTGC GCGGAGAGGT GGACCGTCGC GGCCTGTTGA AGGGCCTTGG CGCGGCAGGC CTGACGGCGG GCACCGCCGG CACCCTGTTC AACATGATGT CGACCCAGGC CCTTGCTGCC GACTTCGACT GGAAGGCACA TTCCGGCAAG TCGCTGAAAC TGTTGCTGAA CAAGCATCCT TACGCGGATG CGATGATTGC CAATCTGCAG GCGTTCAAGG ACCTTACCGG TATCGAAGTC ACCTATGACG TCTTCCCGGA GGACGTCTAT TTCGACAAGG TGACGGCGGC GCTTTCTTCC GGCTCGTCGG AATACGACGC CTTCATGACC GGCGCTTACA TGACCTGGAC CTACGGGCCG GCCGGCTGGA TCACCGACCT CAATGAATGG ATCAAAGATC CGTCGAAGAC CAATCCGCAA TATGGCTGGG ACGACTTCCT GCCGGGCGTC AAGGCATCCT GCGCCTGGAA CGGTCAGCCG GGCGGCGCGC TCGGTTCGGA AGATGCCAAG CAGTGGTGCA TTCCGTGGGG CTACGAGCAG AACAATCTCT CCTATAACCA GGAAATGTTC GAAAAGGCCG GCGCCAGCGT TCCGAAGAAC CTCGATGAAC TCGTCGCTAC GGCGGCAAAG CTCAACAAGG ATGTCGGCGG CGGTGTCTAC GGCATCGGCG TGCGTGGTTC CCGTTCCTGG GCAACCATTC ATCCGGGTTT CCTCTCCGGC TACGCCAATT TCGGCCAGAA GGATCTGAAC GTCTCGGAGG ACGGCAAGCT TTCGGCCGCG ATGAACACGG CGGAGTCCAA GTCCTTCCAC GCCAAATGGG TGCAGATGAT CCAGGAAAGC GGCCCCAAGG ACTGGTCGAC CTATACCTGG TATCAGGTCG GCACCGACCT CGGCGCCGGC GCTTCCGCCA TGATCTACGA CGCCGACATC CTCGGCTATT TCATGAATGG CGGCGACAAC AAGATGGCCG GCAAGCTCGC TTACGCGCCC TTTGCCGCCA ACCCTGAGGC GAAGGCTCCT ACGCCGAACA TCTGGATCTG GTCGCTGGCC ATGTCCAATT TCGCGAAGGA TAAGGATGCG ACCTGGTATT TCCTGCAATG GGCATCGGGT CTCGAGCACG CGATCTTCGG CGCAACCAAG ATGGACTTCG TCAACCCGGT CCGGGCATCC GTCTGGAAGG ACGAGATCTT CCGGGAGCGG CTGAACAAGA GCTATCCCGG TTATGTGGAG ATGCACGACG TTTCGGCGCC GGGCGCGAAG ATCCACTTCA CCGCCCAGCC TCTCTTCTTC GATCTCACCA CCGAATGGGC GGCGACGCTG CAGAAGATGG TGGCGAAGGA AGTGCCGGTC GACGAAGGTC TCGACAGGCT TGCCGAGAGC ATCAACCGGC AACTTGCGGA AGCCGGGCTC GGCTGA
|
Protein sequence | MYEKEKDLIG AFLRGEVDRR GLLKGLGAAG LTAGTAGTLF NMMSTQALAA DFDWKAHSGK SLKLLLNKHP YADAMIANLQ AFKDLTGIEV TYDVFPEDVY FDKVTAALSS GSSEYDAFMT GAYMTWTYGP AGWITDLNEW IKDPSKTNPQ YGWDDFLPGV KASCAWNGQP GGALGSEDAK QWCIPWGYEQ NNLSYNQEMF EKAGASVPKN LDELVATAAK LNKDVGGGVY GIGVRGSRSW ATIHPGFLSG YANFGQKDLN VSEDGKLSAA MNTAESKSFH AKWVQMIQES GPKDWSTYTW YQVGTDLGAG ASAMIYDADI LGYFMNGGDN KMAGKLAYAP FAANPEAKAP TPNIWIWSLA MSNFAKDKDA TWYFLQWASG LEHAIFGATK MDFVNPVRAS VWKDEIFRER LNKSYPGYVE MHDVSAPGAK IHFTAQPLFF DLTTEWAATL QKMVAKEVPV DEGLDRLAES INRQLAEAGL G
|
| |