Gene Information |
Locus tag | Smed_4544 |
Symbol | |
ID | 5319045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 1031566 |
End bp | 1032591 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640776345 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001313277 |
Protein GI | 150376681 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
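
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (which matches the stated 1026 bp) and that the final codon is a stop codon that is not counted in the protein length (the gene sequence ends in TGA). The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1031566
end_bp = 1032591

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields of the record.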
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.284119 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCTA ATAGACTTGC CGGAATAGCG AAATTAGCGC TCGTCGTCGG AAGCCTGCAG CTTGGCTCGG TTGGCCTCGT GCATGCGGAC ACGACGATCC TGAACGTGTC CTACGACCCG ACTCGGGAAC TTTATAAAGA GTTCAATGCA GCCTTTGCCG AGAAGTGGCA GGCTGATACC GGCGAAACCG TGACGATCCA AACATCGCAT GGCGGCTCCG GCAAGCAGGC GCGTTCGGTT ATCGACGGGC TGGAGGCGGA CGTGGTGACC CTGGCACTAG AAGCCGACAT CGATGCGATC GCCAGAGAGA GCGGAAAAAT TCCCGCCGAT TGGAAGACCC GTTTGGAAAA CAACAGTTCT CCCTACACCT CGACGATCGT CTTCCTCGTC CGCAAGGGAA ACCCGAAGGG GATCAAGGAT TGGGGCGATC TTGTCCGGGA GGACGTGCAG GTGATCACGC CCAATCCGAA GACCTCGGGC GGCGCTCGCT GGAACTTCCT CGCTGCCTGG GCCTGGGCCC GCGCTGCCAA TAACGGCGAC GACACAAAGG CGCAGGAATA CGTGACGCAA CTCTTCAAGC ACGTTCCGGT TCTCGACACC GGCGCGCGCG GTGCGACGAC CACCTTCGTT CAGCGCGGGT TGGGAGACGT GTTGCTCGCC TGGGAAAACG AAGCCTATCT GTCGCTGGAA GAGCTCGGCC CGGACAATTT TGACATAGTA ACCCCGTCTA TTTCGATCAA GGCGGAACCA CCCGTGGCGC TCGTCGATGG CAATGTCGAT CGCAAGGGCA CGCGTAAGGT GGCGGAAGCC TATCTCGACT ATCTCTACAG CGATGCCGGC CAGAAGATCG CGGCCAAGCA CTATTACCGG CCGTTCAAGC CGGAAGCGGC AGACGCTGAG GACACGGCCC GCTTCAAGGA ATTGAAGCTA GTCACGATCA ACGACTTCGG CGGCTGGAAG GAAGCTCAAC CGAAATTCTT CGGCGATGGC GGAATTTTCG ACCAGATTTA CAGACCGGGG CAATGA
|
Protein sequence | MSSNRLAGIA KLALVVGSLQ LGSVGLVHAD TTILNVSYDP TRELYKEFNA AFAEKWQADT GETVTIQTSH GGSGKQARSV IDGLEADVVT LALEADIDAI ARESGKIPAD WKTRLENNSS PYTSTIVFLV RKGNPKGIKD WGDLVREDVQ VITPNPKTSG GARWNFLAAW AWARAANNGD DTKAQEYVTQ LFKHVPVLDT GARGATTTFV QRGLGDVLLA WENEAYLSLE ELGPDNFDIV TPSISIKAEP PVALVDGNVD RKGTRKVAEA YLDYLYSDAG QKIAAKHYYR PFKPEAADAE DTARFKELKL VTINDFGGWK EAQPKFFGDG GIFDQIYRPG Q