Gene Information |
Locus tag | Rleg2_4531 |
Symbol | |
ID | 6977625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 171621 |
End bp | 172394 |
Gene Length | 774 bp |
Protein Length | 257 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643393709 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002278527 |
Protein GI | 209546609 |
COG category | [E] Amino acid transport and metabolism; [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | [TIGR01096] lysine-arginine-ornithine-binding periplasmic protein |
|
|
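The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(171621, 172394)
print(length, protein_length(length))  # 774 257
```

Both results agree with the Gene Length (774 bp) and Protein Length (257 aa) fields in the record.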
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.417135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.112125 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTGC TCCCCATGCT TTTTGCCGGC GCGGCACTTG CGCTTTCCGC GGTCACCGCG CAGGCCGAAG TCCGCTTCGG CGTCATGAAT GAATCCTATC CGCCCTTTTT CGCCAAGGAC GCCTCGGGCG AGTGGCACGG ATGGGAAATC GATCTCATGA ACGCCGTCTG CACGGAGATG AAGGAGAAAT GCTCGATCGT CGACATTTCC TGGGATGGTC TCATTCCGGC TCTCCAGAGC AAGAAGTTCG ACGTGATCTG GTCGTCGATG TCGAACACGG CAGAACGCTC GAAGGTGATC GACTTCACCG ACAAATACTA CAACACGCCG AGCACCCTGA TCGGTCCCAA GGACCAGAAG CCGGGCGCCA CTGCCGAAGA CGTGAAGGGC AAGACCATCG GCATCCAGGT GTCGACGATC CAGTCTGAAT ATTACAAGAA GTATTTTGCC AATGCTGCCG AGGAGAAGAC CTACCAGACG CTCGACGAGG CTTTCCAGGA TCTGGCCTCC GGCCGTATCG ACTACGTCTT CGGCGATTCG CTGGCGCTCG ACGCGTTCCT GAAAAGCGAC GGCGGCAAGG ATTGCTGCGC CAAGATGGGC GATGTCGCCG ACGACAAGGA AATTCTCGGC GCCGGCGTTT CGGGCGGTCT GCGCAAGGAA GACACGGAGC TGAAAGCCAA GCTGAATGCG GCGATCGCTG CGGTTCGCGC CAACGGCCAG TACGAGACCA TCACCAAGAA ATACTTCGAC TTCGACATCT ACGGCGCGAA GTAA
|
Protein sequence | MKLLPMLFAG AALALSAVTA QAEVRFGVMN ESYPPFFAKD ASGEWHGWEI DLMNAVCTEM KEKCSIVDIS WDGLIPALQS KKFDVIWSSM SNTAERSKVI DFTDKYYNTP STLIGPKDQK PGATAEDVKG KTIGIQVSTI QSEYYKKYFA NAAEEKTYQT LDEAFQDLAS GRIDYVFGDS LALDAFLKSD GGKDCCAKMG DVADDKEILG AGVSGGLRKE DTELKAKLNA AIAAVRANGQ YETITKKYFD FDIYGAK
|
| |
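The gene and protein sequences above are printed in blocks of ten characters, so the spaces need to be stripped before any analysis. The following is a minimal sketch of how the two sequences relate, not the database's own pipeline: it builds a codon table in the NCBI "TCAG" ordering (translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons) and translates the first 30 nucleotides of the gene. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon -> amino acid mapping in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
prefix = "ATGAAACTGC TCCCCATGCT TTTTGCCGGC"
print(translate(prefix))  # MKLLPMLFAG
```

The output matches the first ten residues of the protein sequence. This gene starts with ATG, so the sketch does not handle alternative start codons, which table 11 permits and which are read as methionine at initiation. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.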