Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_3147 |
Symbol | |
ID | 5324026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 3298164 |
End bp | 3299198 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640792095 |
Product | aliphatic sulfonate ABC transporter periplasmic ligand-binding protein |
Protein accession | YP_001328806 |
Protein GI | 150398339 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACACC TAATTGCCGC CATCCTTGCC GCGACGCTTT CGACAGGCAC CGCCTGGGCC GAGGATCCCT TGCCCATCCG CATCGGCGCC GCTTCGGCTG TCGATCATGC TCCGGTCTTC ATCGGGGTCG AGAAGGGAAT ATTCGCCGCA CATGGCCTGG ATGCGGACGT GGTTATGTAT CAGTCCGGCG TTGATATGGT GAACGGGCTC ATGAACGGGG CGCAGGAAGT CAATGTCATG GGCTCGGTCC CGTTCCTGTC CGGGATTTCA CGCGGGTTTC CCCTTGTCCT GATCGGGCAT TTGCACGGTG ACCCGAACCG TACCGACTAT TCCGACAACC AATCCGTTAT CGCCTCGGCT GAATCGGGCG TCAAAAAAGC CGATATTGCG GCGCTTGCCG GGAAACGGAT CGGCTTGCCG CGCGGTACGG GTGCGGAGGG TTATCTGTTC GGGTTACTGA AATCGGCAGG GCTCAGCGAG AAGGATGTGC AACTGGTCAA TGTGCAGCCG GCAGAGCTGG TGACTGCGCT GACTCAGGGG GACGTTGATG CCATTTCGAT CTGGCAGCCC TGGGCCGCGA CCGCGCTGAC GAAAATCGAA GGAACGGTTG AGGTCGTCGC GGGGGGCTGC GCGGGTTGCT ATGATCCCGG CACGATCCTG ACAACCCGCA TGGTTGCCAC GGAAAAGCCT GAGGAGCTGA AGCGCTTCAT GGCTGCCTTC GCCGAAGCGC AGCAATGGGT GCGTCAGAAC CCCGATAAGG CGGCTGAGAT AAACACCCGC TGGATTTCGG GCGTCGATGC CGAAACCATG GCTCTAGCAC TGAAGAATAT CCCGCTGGAT TCGCGCATTT CGGCGCATAC CGCCGCAATG TATCAGGAAA AGACATTGCC GTTTCTGGTT GGTCTCGGCC GGGTCGAAAA AGTATTCGAT CCATCGGATT CCATCGACAC AAGCTTTCTG AAGGCCGCGC AGGAAAGCAA TCCCAAGGCA TTTTCGGATC TGGAACCTAT CCCGGAAGAC ATTCAGGTCA AGTGA
|
Protein sequence | MKHLIAAILA ATLSTGTAWA EDPLPIRIGA ASAVDHAPVF IGVEKGIFAA HGLDADVVMY QSGVDMVNGL MNGAQEVNVM GSVPFLSGIS RGFPLVLIGH LHGDPNRTDY SDNQSVIASA ESGVKKADIA ALAGKRIGLP RGTGAEGYLF GLLKSAGLSE KDVQLVNVQP AELVTALTQG DVDAISIWQP WAATALTKIE GTVEVVAGGC AGCYDPGTIL TTRMVATEKP EELKRFMAAF AEAQQWVRQN PDKAAEINTR WISGVDAETM ALALKNIPLD SRISAHTAAM YQEKTLPFLV GLGRVEKVFD PSDSIDTSFL KAAQESNPKA FSDLEPIPED IQVK
|
| |