Gene Information |
Locus tag | Msil_2139 |
Symbol | |
ID | 7093360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2313324 |
End bp | 2314244 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643465464 |
Product | Substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_002362440 |
Protein GI | 217978293 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
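The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: `gene_length` and `protein_length` are hypothetical helper names, and it assumes that Start bp and End bp are inclusive 1-based coordinates and that the unspliced CDS ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2313324, 2314244)
print(length, protein_length(length))  # 921 306
```

Under these assumptions the result agrees with the Gene Length (921 bp) and Protein Length (306 aa) fields.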
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.668278 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCAG CGCTGCTTTC GCGACGCAGC CTTGTCGCCT CGGCGCTGGC GTTTTCCGCC GCCGGGACTT TTGCCGCGGC GCCAGTGGTC GTCAGCTCGA AACTCGATCT TGAAGGCGCG CTGCTGGGGG AGATGATGCT CATCGCCTTG CGCCGCGCGG GCGTGCCGGC TGTCGCGAAT CTACAGATCG GACCGACCGC GATTTTGCGT CAGGCGCTGA TCTCCGGCGC GGTCGATCTT TGCGTCGAAT ATACCGGCAA CGCCGCTTTC TTTTTCCATC GGGAAGACGA TCCCGTCTGG CGCGACGCCA GGGCCGGCTT TTTGCGCGCG GCGATGCTGG ACCTTAAGGC CAATAGTCTC GTCTGGCTCG ATCCCGCGCC CGCCGACAAC AGCTGGGTCA TCGCCGTCCC CAACGCGCTC GCGCGAGAGC AATCGTTGTC GACGCTCGAA GATTTCGCGA CGTGGGTCAA TTCCGGCGCG CGCGCAAAGC TCGCGGCCTC CGCCGAATTT GTCGAGAGCG AGGCCGGACT TCCGGCCTTT GAAGCGGCCT ATCATTTTCA TCTCAGCGCC GATCAATTGC TGGTTCTCTC CGGCGGCGAG ACCACGGCGA CCATGAAGGC CGCGTCCGCC GGCATATCCG GCGTCAACGC TGCGATGGCC TATGCGACGG ACGGGGCACT CGACGCGCTC GATCTTCGCG CGCTCGAAGA CCCGCGCCGC GCCGAGCCGG TCTACGCGCC GGCGCCCGTC ATTCGCGCCG GGACGCTCGC CGCCTATCCG CAGATCAGAG GCGCGCTGGC GCCCGTTTTT GCGGGCCTCG ATCTCAAAAC ATTGCGCGCG CTGAATGAAA GAATCGCGGT GGAAGGCGAG GACGCGGCCC TTGTCGCGCG CGATTATATG ATGGAGAAAA GCCTGCTGTG A
|
Protein sequence | MNAALLSRRS LVASALAFSA AGTFAAAPVV VSSKLDLEGA LLGEMMLIAL RRAGVPAVAN LQIGPTAILR QALISGAVDL CVEYTGNAAF FFHREDDPVW RDARAGFLRA AMLDLKANSL VWLDPAPADN SWVIAVPNAL AREQSLSTLE DFATWVNSGA RAKLAASAEF VESEAGLPAF EAAYHFHLSA DQLLVLSGGE TTATMKAASA GISGVNAAMA YATDGALDAL DLRALEDPRR AEPVYAPAPV IRAGTLAAYP QIRGALAPVF AGLDLKTLRA LNERIAVEGE DAALVARDYM MEKSLL
|
| |
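The relationship between the Gene sequence and Protein sequence fields can be sketched as follows. This is a minimal illustration, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the codon table is the standard one, on the understanding that translation table 11 shares its codon-to-amino-acid assignments and differs in the permitted start codons. The gene sequence is assumed to be given in coding orientation, as its leading ATG suggests.

```python
from itertools import product

# Standard codon table built in TCAG order; assumed to match the
# amino-acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence field.
prefix = "ATGAATGCAG CGCTGCTTTC GCGACGCAGC"
print(translate(prefix))  # MNAALLSRRS
```

The translated prefix matches the first ten residues of the Protein sequence field. Passing the full Gene sequence string to `translate` and `gc_percent` would be expected to reproduce the Protein sequence and approximate the GC content field in the same way.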