Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3591 |
Symbol | |
ID | 5318971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 19283 |
End bp | 20548 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640775406 |
Product | extracellular solute-binding protein |
Protein accession | YP_001312339 |
Protein GI | 150375743 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.317951 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCA TCCGGAAATA TGCCGTCGCA ACGACGGTCG CAGCGATGCT GGCCTGCACG GCCCTGCCGG TCACCGCCAA GGCGGAAGTG CTCAAATTCG TCTCGTGGCA GAAGGACGAG AAGGGCATCG GCGACTGGTG GGGAACGGTG GTCAAGGAGT GGGAGGCAAA GCACCCCGGT AATACGATCG AATGGACCAA GGTGGAACGC AGCGCCTATG CCGACACCAT GACGACGCTC TTTGCAGGCG GCACGCCGCC GGACATCGTC CACCTGGCCT CCTTCGAATT CCAGACCTTT GCGAACAATG GCTGGCTCGA GGACCTTGGT CCCTGGGTCG AGAAGTCGGG GCTCAATCTC GATGGATGGA GCGGGCAGGA CATCTGCAAT TTCCAGGACA CGACCGTGTG CATCATGATG CTCTATTACG GCACGATCTT CGGCTATAAC GAGGAGATGC TGAAGCAGGC GGGCGTCGCG GTCCCCACCA ATTACGAGGA GTTCCTCGCA GCGGCCCGCG CGACCACCAA GGACCTGAAT GGCGACGGCA TCGTCGACCA ATTCGGAACC GGCCACGAGA CCAAAGGTGG CGGCGGGCAG TATATCGCCG AGATGGCGAG CTATCTTTTC GATGCCGGCG CACGCTTCAC CAATGCAGAA GGCAAGGTGA CGATCGACAC CCCCGAAATG GTCGAGGGCC TGACCCGCTG GAAAACCGTG GTCAAGGAAA GCCTGACCCC GCGCGACCTC TCGGCGGGCG AGGTCCGGAA ACTCTTCGCC GATGGAAAGA TCGCTTTAAA GGTCGACGGT CCCTGGATCT ATTCCATCAT GCAGCAGGGA GCGGCAAAGG ATAAGCTGAA GCTTGCCAGC GTTCCCTTCG ACCCGCCGCT GGGAGGGTCA TCCAACATTC TCGCGATGCC GAGCGAGATT TCCGATGAGA AGAAGCAGCT TGTCTGGGAT TTCATCGCAA TTGCGACCTC CGACAAATTC CAGACCAGCT TCGCGACGCT TGCCGCCTCG ACTCCGCCGA GCCCGCGCGC CGATCTCACC GAAGCCAAGG CGCAGATTCC ACATTTCGAT CTGATGGCGA AGTCGCAGAA GGCTGCGGCA GAGCACAAGA TCGACCGCAT TCCGACCGGA CTCGAGATCC AGTTCAACGA GTTCTCGAAA ATGATTCAGG AGGAGGCGCA GAGAATGATC ATAGAAGATC TCGATCCTGC AGCCGTCGCC AAGACGATGC ACGAGAAGGC CGAGGCGCTT CAGTAG
|
Protein sequence | MKIIRKYAVA TTVAAMLACT ALPVTAKAEV LKFVSWQKDE KGIGDWWGTV VKEWEAKHPG NTIEWTKVER SAYADTMTTL FAGGTPPDIV HLASFEFQTF ANNGWLEDLG PWVEKSGLNL DGWSGQDICN FQDTTVCIMM LYYGTIFGYN EEMLKQAGVA VPTNYEEFLA AARATTKDLN GDGIVDQFGT GHETKGGGGQ YIAEMASYLF DAGARFTNAE GKVTIDTPEM VEGLTRWKTV VKESLTPRDL SAGEVRKLFA DGKIALKVDG PWIYSIMQQG AAKDKLKLAS VPFDPPLGGS SNILAMPSEI SDEKKQLVWD FIAIATSDKF QTSFATLAAS TPPSPRADLT EAKAQIPHFD LMAKSQKAAA EHKIDRIPTG LEIQFNEFSK MIQEEAQRMI IEDLDPAAVA KTMHEKAEAL Q
|
| |