Gene Information |
Locus tag | Rleg_5236 |
Symbol | |
ID | 8007410 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 647999 |
End bp | 648847 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644822144 |
Product | ectoine/hydroxyectoine ABC transporter solute-binding protein |
Protein accession | YP_002973404 |
Protein GI | 241113569 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR02995] ectoine/hydroxyectoine ABC transporter solute-binding protein |
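
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon excluded from the protein.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the Start bp and End bp fields of Rleg_5236.
print(cds_lengths(647999, 648847))  # (849, 282)
```

Under these assumptions the result agrees with the reported Gene Length (849 bp) and Protein Length (282 aa).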
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.897169 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.367584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC GATATCTTTT GAGCGCCGCC AGCCTGTCAG TGCTTCTGAT CACGGCGGCT TCGCCTGCCT CCGCCGCCGA TGACAAGCTC GAGCAGTTGA AGGAGCAAGG CTTTGCGCGT ATCGCCATCG CCAACGAGCC GCCGTTCACC GCCGTCGGCG CCGACGGCAA GGTTTCCGGC GCGGCCCCCG ATGTGGCGCG CGCAATATTC GAAAAGCTGG GCGTCAAGGA AGTGGTCGCC TCGATCTCGG AATATGGCGC AATGATCCCC GGCCTGCAGG CCGGCCGCCA CGACGCGATC ACCGCAGGCC TCTTCATGAA GCCCGAGCGC TGCAACGCCG TCGCCTATTC CGAACCGATC CTTTGCGACG CCGAAGCTTT CGCGCTCAAG AAGGGCAACC CGCTGAAGCT GACGAGCTAC AAGGACATCG CCGACAATCC GGACGCCAAG ATCGGCGCGC CGGGCGGCGG TACCGAGGAG AAGCTGGCGC TTGAGGCCGG CGTGCCGCGC GATCGCGTCA TCGTCGTTCC GGATGGCCAG AGCGGCATCA AGATGCTGCA GGACGGCCGC ATCGACGTCT ACTCGCTGCC GGTTCTGTCG ATCCACGATC TGATGGCCAA GGCGAACGAT CCGAACCTCG AGACCGTCGC ACCCGTCGTC AATGCGCCGG TCTATTGCGA TGGCGCGGCC TTCCGCAAGC AGGACGTTGC GCTCCGCGAC GCCTTCGATG TCGAGCTGAA GAAGCTGAAG GAATCCGGCG AATTCGCCAA GATCATCGAG CCCTACGGTT TCTCGGCCAA GGCGGCGATG TCGACGAGCC GCGAAAAGCT TTGCGCCGCG GCGAAGTAA
|
Protein sequence | MKTRYLLSAA SLSVLLITAA SPASAADDKL EQLKEQGFAR IAIANEPPFT AVGADGKVSG AAPDVARAIF EKLGVKEVVA SISEYGAMIP GLQAGRHDAI TAGLFMKPER CNAVAYSEPI LCDAEAFALK KGNPLKLTSY KDIADNPDAK IGAPGGGTEE KLALEAGVPR DRVIVVPDGQ SGIKMLQDGR IDVYSLPVLS IHDLMAKAND PNLETVAPVV NAPVYCDGAA FRKQDVALRD AFDVELKKLK ESGEFAKIIE PYGFSAKAAM STSREKLCAA AK
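
The gene and protein sequences above can be compared by translating the DNA, and the GC content field can be recomputed from the same string. The sketch below is an illustration under stated assumptions, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs mainly in the start codons it permits, and this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-letter groups."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGAAAACAC GATATCTTTT GAGCGCCGCC"))  # MKTRYLLSAA
print(translate("GCGAAGTAA"))                          # AK
```

The two fragments translate to the first ten and last two residues of the listed protein sequence. Passing the full Gene sequence string to `translate` and `gc_percent` would allow the same comparison against the complete Protein sequence and the reported GC content of 64%.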