Gene Information |
Locus tag | Smed_1767 |
Symbol | |
ID | 5322625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009636 |
Strand | + |
Start bp | 1851013 |
End bp | 1852011 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640790705 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001327437 |
Protein GI | 150396970 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
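The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 1851013
END_BP = 1852011


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 999 332
```

Under these assumptions the result agrees with the listed Gene Length (999 bp) and Protein Length (332 aa).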
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.473646 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0264882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAC TCCTTGCTCC CCTCCTACTC GCTGCGGGCA TATGCCTTAT GGCGGGTTCC TCAGGAGCTG ACGAATGCGG AAACGTCACT ATTGCCGAGA TGGACTGGGC CTCGGCTGGC GTTGCGGCTC GCGTGGACAG GCTCATCCTT GAAAACGGCT ATGGATGCAG CGTCAAGCTG GTCCCCGGCG ACACTATGCC GACATTCTCT TCGATGAACG ATAAGGCAGA GCCCGACCTG GTCCCGGAAC TCTGGATCAA CTCCGTGCGG ACGCCTTTCG ACGCCGCAGT CAAGGAGGGT CGCCTCGTCG AAGGCGCCAG GGTCTTGAGC GAAGGCGGGG TAGAAGGCTG GTGGATTCCA AAGTTCATCG CCGACGCACA CCCGAAGATC CGAACGGTGC AGGACGCTTT GAAGCACCCG GAACTCTTTC CCTCACCCGA CGACCCCGCA AAGGGGGCAG TCCATAATTG CCCATCGAGC TGGAACTGCC GGTTCTCCAC CGCCAACCTC TACAAGGCAC TCGAAGGGGA TAAGGCCGGC TTCCAGCTTG TCGACCCAGG CTCGGCCGCC GGCCTCGACG GTTCTATCGC CGAAGCATTC GAGAAGAAGA CCGGCTGGCT CGGCTATTAT TGGGCCCCCA CCGCAGTGCT CGGTAAATAC GAGATGACGT TGTTGTCCTT CGGCGTCGAT CACGACAAGG CGGAGTGGGA CTCCTGCACC GCCGTTCCGG ACTGCCCCGC CCCGAAGGTA AACTCCTATC CGGTCTCCGA TGTCTACACT GTGGTGACCG GGTCCTTTGC CGACAGAGCA GGTAGCGCCA TGAATTATGT CGGGGCGCGG CAATGGGAGA ACGCGACGAT CAACAAGGTT CTGGCCTGGA TGGATCAGAA CCAGGCCACG AACGAAGACG CGGCCCGCTA CTTCCTCGGA AACTTTCAAG AGATATGGAC GCAGTGGGTG AGTCCGGAAG TCGCCGAAAA AGTGAAGGCG GCGCTCTGA
|
Protein sequence | MKKLLAPLLL AAGICLMAGS SGADECGNVT IAEMDWASAG VAARVDRLIL ENGYGCSVKL VPGDTMPTFS SMNDKAEPDL VPELWINSVR TPFDAAVKEG RLVEGARVLS EGGVEGWWIP KFIADAHPKI RTVQDALKHP ELFPSPDDPA KGAVHNCPSS WNCRFSTANL YKALEGDKAG FQLVDPGSAA GLDGSIAEAF EKKTGWLGYY WAPTAVLGKY EMTLLSFGVD HDKAEWDSCT AVPDCPAPKV NSYPVSDVYT VVTGSFADRA GSAMNYVGAR QWENATINKV LAWMDQNQAT NEDAARYFLG NFQEIWTQWV SPEVAEKVKA AL
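The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short, dependency-free sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names are hypothetical, and the sequences are written in the 10-character blocks used above.

```python
# Standard codon assignments, shared by translation table 11.
# Codons are enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Ambiguous bases (anything other than A, C, G, T) raise KeyError.
    A trailing partial codon is ignored.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAAGAAAC TCCTTGCTCC CCTCCTACTC"))  # MKKLLAPLLL
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared against the GC content field.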