Gene Information |
Locus tag | Rleg_4344 |
Symbol | |
ID | 8015120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4467578 |
End bp | 4468864 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826920 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002978123 |
Protein GI | 241207027 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
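The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record, although both are consistent with the values shown. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4467578, 4468864)
print(length)                     # 1287
print(protein_length_aa(length))  # 428
```

Both results agree with the Gene Length and Protein Length fields in the record.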
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0302279 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00000351134 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTCC GTGTAAGCAG ACGTAATTTC GTTGCGGGAG GAGCGACGCT TCTTTCGCTC TCGGCGCTGG GAACCAGCGC TTTGGCACAG GAAACGCGCT TGCGTCTCCT GTGGTGGGGC TCGCAGCCGC GTGCGGATCG CACCAACAAG GTGTCGCAGC TCTATCAGTC GAAGAAGCCA GGCACCTCGG TGACCGGCGA ATTCCTCGGC TGGGGCGACT ACTGGCCGCG CCTTGCGACC CAGGTCGCCG GCCGCAACGC GCCCGACGTC ATCCAGATGG ATTATCGCTA TATCGTCCAG TATGCGCGGC GCGGCGCGCT CGCCCCGCTC GAATCCTATA TGCCGGCCAA ACTCAACCTC GACGATTTCG ACAAGGCGCA GATCGAAGGC GGCAGCGTCG ACGGCCATCT CTACGGCGTC AGCCTCGGGG CGAATTCGGC CGCCACGGTC CTGAACACCA CCGCCTTCAA GGAGGCCGGC GTCGATCTGC CGACCCAGGC GACCACCTGG GAAGAGTTCG CCCGCATGGG TGCGGAGATC ACCAAGGCAG GCAAACGCAA GGGCATGTTC GGCTTGGCCG ACGGCAGCGG CGGCGAACCG CTGTTCGAAA ACTGGCTGCG TCAGCGCGGC AAGGCGCTTT ATACCGCCGA CGGCAAGATC GCCTTCGACG TGGACGATGC CTCCGAATGG TACGACATGT GGGCCAAGTT CCGTGCGGCC GGCGCCTGCG TTCCTGCCGA TGTCCAGGCT CTCGACAAGA ACGATATCGA CACCAACACG GTTTCGCTCG GCAAGTCGGC CGCCGGTTTT GCCCATTCCA ACCAGTTCGT CGCCTATCAG GCAATGAACA AGGACAAGCT GGCGCTGACC AACTACATGC GCATTAAGCC GGAATCGAAG GGCGGCCACT ATCGCAAGCC TTCGATGTTC TTCTCGGTCT CCGCCCAGTC GAAAGCCGTG GACCTGGCCG TGGACTACGT CAATTTCTTC GTCAAGAACC CCGAGGCAGC GCTGCTTCTG GATGTCGAAC GCGGCATTCC GGAATCGAGC GCCATGCGTG AGGTCGTCGC GGCGAAGCTT GATGAGAACG GCAAGGTTGC GCTGGCCTAT GTCAGCGGCC TGGGCGATCT CGCCGGCAAA TTGCCGCCGC CGCCGCCTGC CGGCGCCGGT GAAGGTGAGT TGATGTTACG CAACATCGCC GAACAGGTCG GCTTCGGACA GCTGTCTCCC TCCGACGGCG GCAAACAGCT TGTCGCTGAA ATCACGCAGA TTCTCGCACG AGGCTGA
|
Protein sequence | MTFRVSRRNF VAGGATLLSL SALGTSALAQ ETRLRLLWWG SQPRADRTNK VSQLYQSKKP GTSVTGEFLG WGDYWPRLAT QVAGRNAPDV IQMDYRYIVQ YARRGALAPL ESYMPAKLNL DDFDKAQIEG GSVDGHLYGV SLGANSAATV LNTTAFKEAG VDLPTQATTW EEFARMGAEI TKAGKRKGMF GLADGSGGEP LFENWLRQRG KALYTADGKI AFDVDDASEW YDMWAKFRAA GACVPADVQA LDKNDIDTNT VSLGKSAAGF AHSNQFVAYQ AMNKDKLALT NYMRIKPESK GGHYRKPSMF FSVSAQSKAV DLAVDYVNFF VKNPEAALLL DVERGIPESS AMREVVAAKL DENGKVALAY VSGLGDLAGK LPPPPPAGAG EGELMLRNIA EQVGFGQLSP SDGGKQLVAE ITQILARG
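The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup, with a GC fraction and a stop codon check added for illustration. The helper names are hypothetical, the stop codon set is the one defined for translation table 11, and the short input string is a toy example, not the full gene sequence from this record.

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(displayed: str) -> str:
    """Remove the block spacing (and any other whitespace) from a displayed sequence."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons and the last one is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in TABLE_11_STOPS


# Toy input for illustration only.
toy = clean_sequence("ATGACATTCC GCTGA")
print(len(toy))             # 15
print(ends_with_stop(toy))  # True
print(f"{gc_fraction(toy):.0%}")
```

Running the same helpers on the full Gene sequence field would allow the reported Gene Length and GC content values to be checked against the sequence itself.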