Gene Information |
Locus tag | Smed_1959 |
Symbol | |
ID | 5322818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | - |
Start bp | 2010852 |
End bp | 2011796 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640790897 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001327628 |
Protein GI | 150397161 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
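
The coordinate and length fields above are consistent with one another, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2010852, 2011796)
print(length, protein_length_aa(length))  # 945 314
```

Both values match the Gene Length (945 bp) and Protein Length (314 aa) rows of the record.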
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.874851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.400976 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGAGAG GCGGAGAGTT CTTTTGCGCG GCGGCGCTGG CGGTAGCGAT GACTGCGCCG GCAGCCGCGG CGGATCTGGT AATCGCGATG CCGCCGTGGC CGTCAGGTCA GGCGGCGGCG AACATCCTCA AATTCGGCAT CGCCAAGAAA TTCAGCCTCG ATGCGGAGGT GCGGGAACTC GGTACGCTCA ACGCTTTCGT CGGCCTAGAG AAGGGCGAAA TCGACATCCA GCCGGAGGTT TGGCGGCCAA ATTTCGACGA GCTCGTCCGC AAGTTCGTGA CCGAAAAGGG CGCCGTGACG CTGAGTACGC GTGCGGTACC TGCATGGCAG GGGATTTGCG CCACGCCGGA GGCGGCCGCG ACGATCAAGA CCGTTGCGGA TCTCGGCGAC CCGGCCAAGA CGAAATTCCT GGACACCGAC GGTGACGGAC GCGGAGAAAT GTGGATCGGC GCCGCCGAAT GGCTTTCGAC CGGAATCGAA CGTGCGCGGG CGGCCGGCTA TGGCTATGCG GCAAACCTGA CGCTTGTCGA GGCCAAGGAA GATGTTGCAA TGGCGGCGGT GGATGCGGCA ATCGCGACGG CGCGGCCGAT GGTCTTCTAC TGCTACGCTC CGCATCATGT TTTCAAGCTG CACCAGATCT CCCGGCTTGA GGAGCCGCCC CATGATCCTT CCAAATGGAA AATCGCGCCG CCGAACGATC CGCTATGGGT CAGCAAGTCG AGCGCGTCCA CGGCCTGGGA CGCGGGCCAG TTCCAGATCG GCTATGCGAC GGCTTTTGCA AAGAAACATC CCGAAGTCGC GCAGTTCCTT CAGAATGTGG ACTTCTCCCC GGATGAAGTG ACGGCGATGA GTTATGCGCT CCAAGTCGAG CGGCAGACGC CGGTGGACTA CGCCAGGAAG TGGGTTGAAA GCCACGCGGA ACGGATCGAC GGATGGGCGA AATGA
|
Protein sequence | MRRGGEFFCA AALAVAMTAP AAAADLVIAM PPWPSGQAAA NILKFGIAKK FSLDAEVREL GTLNAFVGLE KGEIDIQPEV WRPNFDELVR KFVTEKGAVT LSTRAVPAWQ GICATPEAAA TIKTVADLGD PAKTKFLDTD GDGRGEMWIG AAEWLSTGIE RARAAGYGYA ANLTLVEAKE DVAMAAVDAA IATARPMVFY CYAPHHVFKL HQISRLEEPP HDPSKWKIAP PNDPLWVSKS SASTAWDAGQ FQIGYATAFA KKHPEVAQFL QNVDFSPDEV TAMSYALQVE RQTPVDYARK WVESHAERID GWAK
|
| |