Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4006 |
Symbol | |
ID | 6982776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 4174052 |
End bp | 4175338 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643398735 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002283494 |
Protein GI | 209551577 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGTCA ACCGCCGTTC ATTTCTGATG GGATCAGCCG GAGCAGCCGC CGGCCTCGCC TTTGGCGCGG GAAGCGCCAT TCCGGCTTTT GCCGAGGACA CCTCGCTGCG CGCCATGTGG TGGGGTTCGA ACGACCGCGC CAAGCGCACG CTCGATGTCG CCAAGCTCTA TCAATCGAAG ACACCCGGCG TCACGATCGT CGGTGAATCG CTATCCGGCG ACGGCTACTG GACCAAGCTC GCAACGCAGA TGGCCGGGCG CTCGATCGCC GACATCTTCC AGCTCGAGCC GGGAACGATC TCCGATTATT CCAAGCGCGG CGCCTGCCTG CCGCTCGACG AATTCGTCCC CTCGACGCTG AAGGTCCAGT CCTTCGGCGC CGACATGCTG AAACTGACCA CCATCGACGG CAAACTCTAT GGTGTCGGCC TCGGCCTCAA CTCCTTCTCG ATGTTTTTCG ACACGGTCGA ATTCGAAAAG GCCGGCATCC CGCTGCCGAC ACCCGACCTT ACCTGGGATG AGTATGCCAA GCTCGCTGTC GAACTCACAA AGTCTTCCGG CAAGAGCGGA GGCCCCTATG CGGCCCGCTA CGCCTATGTG TTCGACGCCT GGCTGCGCCA GCGCGGCAAG AGCCTTTTTG CAAAGGAAAC CGTCGGGCTC GGCTTCACGG CCGACGATGC CACGGAATGG TTCGACTATT GGGAGAAGCT GCGCAAGGCG GGCGGCACCG TTGCCGCCGA TGTGCAGACG CTCGATCAGA ACACCATCGA CACCAATAGC CTCGGTGTCG GTAAATCGGT GATGGGCATG GCCTATTCCA ACCAGATGGT CGGCTACCAG CTGATCATCA AGAACAAGCT CGGCGTCACC ATGCTGCCGC GGGAAAAGAA GGGCGGACCC TCCGGCCACT ACTACCGTCC TGCGCTGATC TGGAGTCTCG GCGCCACGAC GAAGAACGGT GAAGCGGCCG CGAAATTCAT CGACTTCTTC GTCAACGATA TCGAGGCCGG CAAGATCCTC GGCGTCGAGC GCGGCGTGCC GATGTCGCCG ACCGTGCGTG AAGCCATTCT GCCGCAACTC AACCCGACGG AGCAGGAAAC GGTCAAATAC GTCAATCTTC TGAAAGATCA GGTCGGCGAA TATCCGCCGC CGGTGCCGAT GGGCGCAACC CAGTTCGACC AGCGCGTGCT GCGCCCGATC TGCGACGAAC TCGCTTTCGA ACGGATTTCG CCGGCCGATG CGGCGACCCG GCTCGTCGAA GAGGGGAAGG CGACGCTCAA AGGATGA
|
Protein sequence | MQVNRRSFLM GSAGAAAGLA FGAGSAIPAF AEDTSLRAMW WGSNDRAKRT LDVAKLYQSK TPGVTIVGES LSGDGYWTKL ATQMAGRSIA DIFQLEPGTI SDYSKRGACL PLDEFVPSTL KVQSFGADML KLTTIDGKLY GVGLGLNSFS MFFDTVEFEK AGIPLPTPDL TWDEYAKLAV ELTKSSGKSG GPYAARYAYV FDAWLRQRGK SLFAKETVGL GFTADDATEW FDYWEKLRKA GGTVAADVQT LDQNTIDTNS LGVGKSVMGM AYSNQMVGYQ LIIKNKLGVT MLPREKKGGP SGHYYRPALI WSLGATTKNG EAAAKFIDFF VNDIEAGKIL GVERGVPMSP TVREAILPQL NPTEQETVKY VNLLKDQVGE YPPPVPMGAT QFDQRVLRPI CDELAFERIS PADAATRLVE EGKATLKG
|
| |