Gene Information |
Locus tag | Rleg_5698 |
Symbol | |
ID | 8016661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012853 |
Strand | - |
Start bp | 281885 |
End bp | 283123 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644827851 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002979051 |
Protein GI | 241518423 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
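The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(281885, 283123)
print(length_bp, protein_length(length_bp))  # prints: 1239 412
```

Under these assumptions the result agrees with the Gene Length (1239 bp) and Protein Length (412 aa) rows.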
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.145305 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.141272 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATCA CCAAACGCGA ATTCCTTGTT GCGACAGCAG CGCTAGCGCT TGCCAGTGGT GTCCGCTCCG CAAGTGCTGC CACAGCGATC AATTATTGGC ATCACTTCGC CAGCCAGTCG GAAATGGCCG GCCTGGTGAA AATCATCGAG CTGTTTGGGA AATCCCATCC AGGCATCACC GTCACGCAAG AGAGCATTCC GAATTCGGAA TATATGGCCA AGGTTTCATC GGCTGTCGTG GCCGGCGGGC GGCCCGACAC CGGAATGGTC ATTGCGGAAC GCTTTGCCGA TCTGACGGCG ATGGGCGCGC TGACCGATAT CACCGAGCGG GTAAAAGGAT GGAAGGGAAA AGCCAATCTG CCGGACAACC GGTGGGCTGG CATGTCTCAG GACGGCGCGA TATACGCGGT TCCGGCCTAT GCTTTCGTCG ACTGGATGTA CTACCGGAAA GACTATTTCG AAGAAGCGGG CCTTTCGGGC CCACCAAGGA CTTTTGATGA GTTTGTCACC GCTTGCCGGA AGCTCACCGA TCCGGCAAAG GGACGCTACG CTTTCGGAAT GCGCGGGGGC GCGGGGGCGT TCAAATACGT CATCGACGTC ATGGAGGCCT TTGGCTCGCC AATTGTAAAG GACGGTCAGG CGGCCATCGA TAAGGCTGCC GCGGTGGAGG CGATCACCTT CTACTCAAGT CTGTTCTTGA AAGAGAAAGT CGTTCCTCCA AGTGTGCCGA ACGACAGCTA TCGGCAGATC ATGGAAGGTT TCCGAACTGG CCAGACAGCC ATGGTCTGGC ACCATACCGG ATCGCTGATC GAAATCTCGG CCGCCTTGAA GCCGGGAGAG CAGTTCGCCA CCGCTCCAAT GCCCGCGGGA CCGAAGGCAC ATATCGCGCG TGTTGCCTAC GCCGGCAACG GCATCATGAA GGACGACAAT ATCGACGCTG CCTGGGACTG GATCAGCTTC TGGGGAGAAA AAGACGCGGC GATAGCCTTG TTGGAAGCGA CGGGTTATTT CCCGGCATCA ACGGCAGCGC TTGAGGATGA GCGCATCAAG ACCAATCCAA TCTACCAAGC CGCTTCGCAG ACGCTCGACT TCGGTCGTCT GCCGAACAGT TTCGTCGGCG CTGCGGGCTG GTCCGAAAAT GTCGTCAATC CCACGTTCCA ATCCGTTCTG ACGGGTCAAC TCACCCCTGA GCAGGCCGTC GACCGAATGA TCGAGGGTCT GGAGACCGCG CTCCGGTAG
|
Protein sequence | MQITKREFLV ATAALALASG VRSASAATAI NYWHHFASQS EMAGLVKIIE LFGKSHPGIT VTQESIPNSE YMAKVSSAVV AGGRPDTGMV IAERFADLTA MGALTDITER VKGWKGKANL PDNRWAGMSQ DGAIYAVPAY AFVDWMYYRK DYFEEAGLSG PPRTFDEFVT ACRKLTDPAK GRYAFGMRGG AGAFKYVIDV MEAFGSPIVK DGQAAIDKAA AVEAITFYSS LFLKEKVVPP SVPNDSYRQI MEGFRTGQTA MVWHHTGSLI EISAALKPGE QFATAPMPAG PKAHIARVAY AGNGIMKDDN IDAAWDWISF WGEKDAAIAL LEATGYFPAS TAALEDERIK TNPIYQAASQ TLDFGRLPNS FVGAAGWSEN VVNPTFQSVL TGQLTPEQAV DRMIEGLETA LR
|
| |
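The sequences above are shown in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch of how the GC content and the translation could be rechecked. It assumes the displayed gene sequence is already in coding orientation (it begins with ATG and ends with TAG, although the gene lies on the - strand), and it relies on the fact that translation table 11 uses the same amino acid assignments as the standard code. All helper names are hypothetical.

```python
# Codon table built in TCAG order. Translation table 11 assigns the same
# amino acids as the standard code; it differs in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement, e.g. to compare against the replicon's + strand."""
    return clean(seq).translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCAGATCA CCAAACGCGA ATTCCTTGTT"
print(translate(prefix))  # prints: MQITKREFLV
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content rows. The prefix shown here reproduces the first ten residues of the listed protein.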