Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4947 |
Symbol | |
ID | 6978041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 589138 |
End bp | 590175 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643394099 |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_002278917 |
Protein GI | 209546999 |
COG category | [E] Amino acid transport and metabolism [T] Signal transduction mechanisms |
COG ID | [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0229057 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCCTAA TATCCTCTAT CGCGTATATA TCCGCCGTGA TCGCTCTCGC CGGGGCGGCT GCCACATCGG CGCGCGCTTC CGTGCTCGAC AGCGTAAAAC AGCGCGGCGT GCTCAATTGC GGCACTGACA ACACGACTCC GGGGTTCGGT TATCTCAACC CAAAAACCTC CAAGATGGAG GGTGTGGATG TGGATCTTTG CCGCGCGACT GCAGCCGCCG TTCTTGGTGA TCCGGACAAG GTTAACTTCG TGGTCGTCAC GGACAAAAGC CGGTTCAATG CCGTTCAGAC AGGGCAGGTA GACATTGTCT ATGCCCATAC GTCTGTGTTC GCGTCGCGCG CTGCAGCGCT CGCAGTTGAC TTTTTACCGT CGTATTTTTT TGATGGTGGC GGCGTGATGG TGACGGCCGC ATCTGGCGTG AAGTCGATCA ACGATCTGTC GGGGGCTACG ATCTGCACCA CTCAGGGGTC TGGCAGCGAG GCCACACTTG CCCAAGAGGT TAAGGCTAGG AATCTAACGA ACACAAAGAT TCTGACGTTC GATACCAGCG AGAAACTGTT CTCGGCGCTG ACCAGCGGGC GGTGCAACGG TATGTACACC GACAAGTCGG CCCTTGCCGC CTGGCGCGGT AACTCGCAGA AATCTGCCGA CTACGTGATC CTACCAGAGA CGCTGGCAGT GGCTCCATTC GCTGGTATCA TCGTTCAAAA CGATCCAGAA TGGCGAAAGC TGATGACGTG GACGCTCTAC GCCTTGTTTC AGGCCGAGGA ATGGGGCATC ACCAGTGCTA ACCTGAGCGA GATGCAGAAA TCTGCCGACC CCGCAATTCA GAAGTTCTTG GGTGTAAACG GCGGCTTCGG CGCGGACTTC CATGTGTCGG ACAGCTTCAT CGCCGACATG ATCAAGGGCG TCGGCAATTA CGGCGAGATC TATGACCGGT CTCTGGGGCC GAAGACGCCG CTCTATCTGG AGCGCGACAA GACGTCGAAC GCACTCTCGA AGAATGGCGG TCTGCTGTAC TCGATCCTGT GGCTCTGA
|
Protein sequence | MRLISSIAYI SAVIALAGAA ATSARASVLD SVKQRGVLNC GTDNTTPGFG YLNPKTSKME GVDVDLCRAT AAAVLGDPDK VNFVVVTDKS RFNAVQTGQV DIVYAHTSVF ASRAAALAVD FLPSYFFDGG GVMVTAASGV KSINDLSGAT ICTTQGSGSE ATLAQEVKAR NLTNTKILTF DTSEKLFSAL TSGRCNGMYT DKSALAAWRG NSQKSADYVI LPETLAVAPF AGIIVQNDPE WRKLMTWTLY ALFQAEEWGI TSANLSEMQK SADPAIQKFL GVNGGFGADF HVSDSFIADM IKGVGNYGEI YDRSLGPKTP LYLERDKTSN ALSKNGGLLY SILWL
|
| |