Gene Information |
Locus tag | Rleg2_5515 |
Symbol | |
ID | 6978609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1164605 |
End bp | 1165453 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643394614 |
Product | ectoine/hydroxyectoine ABC transporter solute-binding protein |
Protein accession | YP_002279432 |
Protein GI | 209547514 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR02995] ectoine/hydroxyectoine ABC transporter solute-binding protein |
|
|
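The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which agrees with the reported 849 bp) and that the final codon of the CDS is a stop codon. The function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table; the names are illustrative.
START_BP = 1164605
END_BP = 1165453
GENE_LENGTH_BP = 849
PROTEIN_LENGTH_AA = 282


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 1165453 - 1164605 + 1 = 849 bp, and 849 / 3 - 1 = 282 aa.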
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.859546 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00100925 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGCAG GATATGTTTT GAGCGCCGTC AGTCTGTCGG TGCTTCTGGC CGCGGCCGCC GGCCCGGCTT CCGCTGCCGA TGACAAGCTC GAGCAGCTGA AGGAGCAGGG TTTTGCCCGC ATCGCCATCG CCAACGAGCC GCCGTTCACG GCCGTCGGCG CCGATGGCAA GGTGTCGGGT GCTGCGCCCG ATGTCGCGCG CGTGATCTTC GAAAAGCTCG GCGTCAAGGA AGTGGTCGCT TCGATCTCGG AATATGGCGC GATGATCCCG GGCCTGCAGG CCGGCCGCCA TGACGCGATC ACAGCCGGCC TGTTCATGAA GCCCGAGCGT TGCAATGCCG TCGCCTATTC CGAACCGATC CTCTGCGACG CCGAAGCCTT CGCCCTCAAG AAGGGCAACC CGCTGAAGCT GACCAGCTAC AAGGATATCG CCGACAATCC CGACGCCAAG ATCGGCGCGC CGGGCGGCGG CACCGAAGAG AAGCTGGCGC TCGAGGCCGG CGTGCCGCGC GACCGCGTCA TCGTCGTTCC GGACGGCCAG AGCGGCATCA AGATGCTGCA GGACGGCCGC ATCGACGTCT ATTCGCTGCC GGTCCTGTCG ATCCACGACC TGATGGCCAA GGCGAATGAT CCCAATCTCG AAACCGTCGC TCCCGTCGTC AACGCGCCGG TCTATTGCGA TGGCGCCGCC TTCCGCAAGC AGGATGTCGC GCTGCGCGAC GCCTTCGACG TCGAGCTGAA GAAGCTGAAG GAATCGGGCG AATTCGCCAA GATCATCGAG CCCTACGGCT TCTCGGCCAA GGCGGCGATG TCGACGAGCC GCGACAAGCT CTGCGCCGCA GCCAAGTAA
|
Protein sequence | MKAGYVLSAV SLSVLLAAAA GPASAADDKL EQLKEQGFAR IAIANEPPFT AVGADGKVSG AAPDVARVIF EKLGVKEVVA SISEYGAMIP GLQAGRHDAI TAGLFMKPER CNAVAYSEPI LCDAEAFALK KGNPLKLTSY KDIADNPDAK IGAPGGGTEE KLALEAGVPR DRVIVVPDGQ SGIKMLQDGR IDVYSLPVLS IHDLMAKAND PNLETVAPVV NAPVYCDGAA FRKQDVALRD AFDVELKKLK ESGEFAKIIE PYGFSAKAAM STSRDKLCAA AK
|
| |
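The gene and protein sequences above are related by translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch in plain Python, with hypothetical helper names (`clean`, `gc_content`, `translate`). It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. That difference does not matter here because the gene begins with ATG, but the sketch would not convert an alternative start codon such as GTG or TTG to M. Characters other than A, C, G, and T raise a `KeyError`.

```python
# Codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-letter blocks of the gene sequence in this record.
    print(translate("ATGAAAGCAG GATATGTTTT GAGCGCCGTC"))  # MKAGYVLSAV
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence field (after removing spaces) gives a quick consistency check, and `round(100 * gc_content(...))` can be compared with the reported GC content.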