Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5536 |
Symbol | |
ID | 6978630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1188227 |
End bp | 1189066 |
Gene Length | 840 bp |
Protein Length | 279 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394635 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002279453 |
Protein GI | 209547535 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.901177 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.273691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTATTT CCTTTCGCAC CGGCGTGATG TCGCTGGCCG CCGCCGCGCT GCTCTCCACG CCTGCATTGG CCGACGGCAG CAAGCTCGAT GAAGTGCTGG CCCGTGGCCA TCTCGTCCTC GGCACGGGCA GCACCAACGC GCCCTGGCAC TTCAAGAGCG CCGACGACAA GCTGCAGGGT TTCGACGTCG ACATGGGCCA TATCATCGCC AAGGCGCTGT TCGGCGATCC TGAGAAGATC GAATATGTAA ACCAGTCTTC CGACGCCCGC ATCCCGAACA TCACCACCGA TAAGGTCGAT ATCACCTGCC AGTTCATGAC CGTTACCGGC GAACGCGCCC AGCAGGTCGC CTTCACCATT CCCTATTATC GCGAGGGTGT CGGCCTGATG CTCAAGGCGG ACGGAAAATA TGGCGATTAC GCCGCTCTCA AGGCCGCCGG TTCCTCCGCC ACCATCTCGG TTCTGCAGAA CGTCTATGCC GAGACGATGG TGCATGCGGC ATTGCCGGAC GCGACCGTCG ATCAGTATGA TTCCGTCGAC CTGATCTATC AGGCTCTGGA GTCGGGACGC GCCGATGCCG TCGCCACGGA CCAGTCGTCG CTCGCCTGGT ACATGACGCA AAATCCGGGC CGATACAAAG ACGCCGGCTA CGGCTGGAAC CCGCAGACCT ACGCCTGCGC CGTCAAGCGC GGCGATCAGG ACTGGCTGAA CTTCGTCAAC ACCGCCCTGC ACGAAGCCAT GACCGGCGTC GAGTTCGACT TTTACGCCAA GTCCTTCAAG ACCTGGTTCG GCAAGGACCT GACGCCGCCG CAGATCGGCT TCCCCGTCGA GTTCAAATAA
|
Protein sequence | MTISFRTGVM SLAAAALLST PALADGSKLD EVLARGHLVL GTGSTNAPWH FKSADDKLQG FDVDMGHIIA KALFGDPEKI EYVNQSSDAR IPNITTDKVD ITCQFMTVTG ERAQQVAFTI PYYREGVGLM LKADGKYGDY AALKAAGSSA TISVLQNVYA ETMVHAALPD ATVDQYDSVD LIYQALESGR ADAVATDQSS LAWYMTQNPG RYKDAGYGWN PQTYACAVKR GDQDWLNFVN TALHEAMTGV EFDFYAKSFK TWFGKDLTPP QIGFPVEFK
|
| |