Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5128 |
Symbol | |
ID | 6978222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 766784 |
End bp | 767716 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643394259 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002279077 |
Protein GI | 209547159 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.212164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAACG TGCAGACAAT CCTGAAAGCC GGCGTCGCGA GCATGCTTAT TTCCCTCATG TCTATGTCCG CCCATGCGGC AGAGGGGACT GCCGAGAAAA TGGAGGCGGG CGCCGACGTC AGCAAACTCC CGGGTGTGGC AATAACCAAG AAGCTGGCAG ACGGCCTCCC CGCTTCCATC AAGGCATCCG GCACATTGAA AGTGGCGACG GACCTGACGC CGCCGATCAG CTTCCATGAT GACGATGGCA AGCTCGTTGG CATCGACGCC GATATCGCAG CCGCCCTTGG GGTCATCCTT GGCCTCGATG TCCAGATGAC GGATGTGGGA GCAGGTGCCG CCATCGTTCC TGCAATCCTG TCCAAACGGT TCGATATGAC GATCTCCGGC ATCAATGATG ACGTCGAGCT TGAGAAGCAG GTCGATGTTA TCGACTACAT GTACGATGCC ACCACGATCA TGACGATCAA GGACAATCCG CTCGGCATCA AGAGCATGGC GGAACTCTGC GGCAAGAAGG TCGCCGTGCC TGTCGGCACC TTCCAGGGCA AGATGGTCGA GGCCGCTTCG GCAAAGTGCC AGACGCCGAT CAACATCATG GCGATCCCGA AGATGCCTGA CGTGCTGCAG GCGGTGCGCA CCGGCCGCGC CGACGCTACT GTCAACGGCT ACGCGACCAG CGTCTACACA ACCGAACACC AGACCGGCAA CGGCAAGGGC CTGCAGGCGC TTCCCGATAT CCGTCTCGCA GTCGGTTATC TCGGCATGCT GACGGCAAAG GACAATCCGG AGCTCCGCGA CACCGTCGTC GCTGCGTTGC AACAGATGGT CGACAGCGGT GCCTATGAGA CGATCATGAA GAAGTGGAGC CTCGGCCCAC TGGCGGTCAA GACCGTCAAG GTCAACGACG CTGCCAGCAT GCCGGCGGAG TGA
|
Protein sequence | MRNVQTILKA GVASMLISLM SMSAHAAEGT AEKMEAGADV SKLPGVAITK KLADGLPASI KASGTLKVAT DLTPPISFHD DDGKLVGIDA DIAAALGVIL GLDVQMTDVG AGAAIVPAIL SKRFDMTISG INDDVELEKQ VDVIDYMYDA TTIMTIKDNP LGIKSMAELC GKKVAVPVGT FQGKMVEAAS AKCQTPINIM AIPKMPDVLQ AVRTGRADAT VNGYATSVYT TEHQTGNGKG LQALPDIRLA VGYLGMLTAK DNPELRDTVV AALQQMVDSG AYETIMKKWS LGPLAVKTVK VNDAASMPAE
|
| |