Gene Information |
Locus tag | Smed_0650 |
Symbol | |
ID | 5321486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 702769 |
End bp | 703767 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640789586 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001326341 |
Protein GI | 150395874 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
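The start, end, gene-length and protein-length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length(702769, 703767)
print(length)                  # 999
print(protein_length(length))  # 332
```

Under these assumptions the computed values agree with the Gene Length (999 bp) and Protein Length (332 aa) fields.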
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.324621 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAAC TTCTTGCAAC GACATGCCTG GCCGCTGGTC TTCTTGGACT GGGCAGTACG GCATCGGCGG CGGAATGCGG CGATGTGACC ATTGCCAACA TGAACTGGCA GAGCGCTGAA GTCCTGGCGA GTGTGGACAA GTTCATTCTG ACCGAGGGTT ACGGCTGCAA TGCCGATCTC GTCGTGGGCG ATACGGTGCC GACCATCACC TCGATGATCG AAAAGGGCGA GCCGGACATT GCGCCGGAAG GCTGGGTCGA TCTGCTGCCC GACGTCGTGA ACCGTGGTCT CGAGGAAGGC AAGCTTGTAG GCGCCGCAGT GGCGCTTTCA GACGGCGCCG TCCAGGGTTG GTGGGTGCCG AAATATATCG TCGACGCCAA TCCGGACATC AAGACGATCG ACGACGTCCT GAAGCACAAG GACCTCTTCC CGGATCCCGA AGATCCAAGC AAGGGCGCGA TTTTCAACGG CCCGCAGGGC TGGGGCGGCA CGGTCGTTAC GACGCAGCTC TATAAGGCTT ACGGCGCCGA GCAGGCGGGC TTCACGTTAG TCGATACCGG CTCGGCAGCC GGCCTCGACG GATCGATTGC CAAGGCGTAT GAGCGCAAGC AGGGCTGGGT CGGCTACTAC TGGGCTCCGA CGGCGCTGCT AGGCAAGTAC GAGATGGTCA AGCTCGGCCA TGGCGTTCCG AACGACATGG CGGAATGGAA GCGTTGCAAT ACGGTTGCGG ACTGCCCGGA CCCGAAGAAG AACGATTGGC CGAAGGACAA GGTCCAGACG CTGGTGACCA AGGAATTTGC CGATCGTGCT GGTCCGGCCA TGGAGTACCT CAATACGCGC GCCTGGACGA ACGACACGGT GAACAAGCTG ATGGCCTGGA TGACCGACAA TCAGGCGAGC GGCGAGGAAG GTGCGAAGCA CTTCCTCGAG GAGAACCCGG ACCTCTGGAC CAAGTGGGTC TCTCCCGAAG TCGCCGAGAA GATCAAGTCG GCCCTTTAG
Protein sequence | MKKLLATTCL AAGLLGLGST ASAAECGDVT IANMNWQSAE VLASVDKFIL TEGYGCNADL VVGDTVPTIT SMIEKGEPDI APEGWVDLLP DVVNRGLEEG KLVGAAVALS DGAVQGWWVP KYIVDANPDI KTIDDVLKHK DLFPDPEDPS KGAIFNGPQG WGGTVVTTQL YKAYGAEQAG FTLVDTGSAA GLDGSIAKAY ERKQGWVGYY WAPTALLGKY EMVKLGHGVP NDMAEWKRCN TVADCPDPKK NDWPKDKVQT LVTKEFADRA GPAMEYLNTR AWTNDTVNKL MAWMTDNQAS GEEGAKHFLE ENPDLWTKWV SPEVAEKIKS AL
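The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the spacing, compute GC content, and translate a fragment of the gene sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons, not in amino acid assignments); the helper names are hypothetical, and only short fragments of the gene sequence are used.

```python
from itertools import product

# Codon order follows the usual T, C, A, G convention, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First two blocks and last nine bases of the gene sequence in this record.
head = clean("ATGAAAAAAC TTCTTGCAAC")
tail = "GCCCTTTAG"

print(head)              # ATGAAAAAACTTCTTGCAAC
print(gc_percent(head))  # 30.0
print(translate(head))   # MKKLLA
print(translate(tail))   # AL*
```

The translated fragments match the start (MKKLLA) and end (AL) of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence would give the figure to compare against the GC content field.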