Gene Information |
Locus tag | Smed_4328 |
Symbol | |
ID | 5319177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 825769 |
End bp | 827073 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640776133 |
Product | extracellular solute-binding protein |
Protein accession | YP_001313066 |
Protein GI | 150376470 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
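The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(825769, 827073)
print(length_bp, protein_length(length_bp))  # prints: 1305 434
```

Under these assumptions the result agrees with the Gene Length (1305 bp) and Protein Length (434 aa) fields.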
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0170975 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATAA AGAGACGTGA TTTTCTCGCA ACCTCCGCCG CACTCGCCGG CGTTGCGGGT CTGGGCATCC GCCCGTCCTT CGCGCAGGCC GAGCCGAGCT ACAAGCCGGA GGAGGGCGCG AGCCTCAGGC TTCTCAGATG GACGCCTTTC GTCAAGGGCG ACGAGGACGC CTGGATCGCC AATACCAAGA AATTCACTGA GACGACGGGC GTCGAGGTCC GCATCGACAA GGAAAGCTGG GAGGATATCA GGCCGAAGGC GGCGGTCGCC GCAAATGTCG GCTCCGGTCC GGACATGGTG ATGTGCTGGT TCGACGACGC GCATCAATAT CCCGACAAGC TCGTCGACGT GACGGAGCTC GCTGATTATC TCGGCAATAA ATATGGCGGA TGGTACGACG GCCTGAAGGG CTATGCCAGC CGCGAGGGGC AGTTCATCGC CATGCCGTTG GCCGCGATCG GTAACGCCGT CTGCTACCGC GAAAGCCACA TGAAGGCGGC CGGGTTCAGC GAGTTCCCGA AAGACACGGC CGGCTTCCTC GAGCTCTGCA AGGCTCTCAA GGCAAAGGGA ACTCCCGCCG GTTTCCCGCA TGGAAAGGCC GTCGGCGACG GCAACAACTA CGCTCATTGG CTGTTGTGGA GTCATGGCGG CAAGATGGTC GACGAGAGCG GCAAGGTCAT GATCAACAGC CCGGAGACGC TCGCGGCGGT CAACTATGCC AAATCGCTGT ATGAGACGTT CATTCCGGGC ACGGAGAGCT GGCTCGACAT CAACAACAAT CGTGCCTTTC TCGCCGGGCA GGTATCGCTT ACGGCAAACG GCGTCTCGCT CTACTATGCG GCCAAGAAGG ACCCCGCGCT TGCCGAACTT GCGGCTGACA TCCGCACGAC CAACTTCCCG ATCGGTCCGG TCGGCCAGAG CGTCGAACTG CATCAGACGA GTTCGATCCT GCTCTTCAAG CACAGCAAGT ATCCGGAAGC TGCCAAGGCA TATCTGAAGT TCATGATGGA AGCCGACCAG ATGAACGCCT GGATCGAGGG GTCAAGCGCC TATTGCTGCC AGCCGTTGAA AGCGTTCGCC GATAACCCGG TCTGGACCTC GGACCCGATT CACGCGCCCT ATGCGCGGGC CTCGGAGACG CTGCGGCCGA ACGGCTATGC CGGTCCACTC GGCTATGCCT CGGCGGGCGT CATGGCTGAC TACGTACTGG TAGACATGTT CGCAAGTGCT GTTACCGGCC AGGCAACGCC GGAGGATGCG ATCGTCGAGG CCGAACGGCG GGCGAACCGC TACTATCGGG TCTGA
|
Protein sequence | MPIKRRDFLA TSAALAGVAG LGIRPSFAQA EPSYKPEEGA SLRLLRWTPF VKGDEDAWIA NTKKFTETTG VEVRIDKESW EDIRPKAAVA ANVGSGPDMV MCWFDDAHQY PDKLVDVTEL ADYLGNKYGG WYDGLKGYAS REGQFIAMPL AAIGNAVCYR ESHMKAAGFS EFPKDTAGFL ELCKALKAKG TPAGFPHGKA VGDGNNYAHW LLWSHGGKMV DESGKVMINS PETLAAVNYA KSLYETFIPG TESWLDINNN RAFLAGQVSL TANGVSLYYA AKKDPALAEL AADIRTTNFP IGPVGQSVEL HQTSSILLFK HSKYPEAAKA YLKFMMEADQ MNAWIEGSSA YCCQPLKAFA DNPVWTSDPI HAPYARASET LRPNGYAGPL GYASAGVMAD YVLVDMFASA VTGQATPEDA IVEAERRANR YYRV
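The sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example string is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the gene sequence, used only as a short demo.
example = "ATGCCGATAA AGAGACGTGA"
print(len(clean_sequence(example)), gc_percent(example))
```

Applied to the full gene sequence, `len(clean_sequence(...))` would be expected to match the Gene Length field, and `gc_percent(...)` to round to the reported GC content.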