Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_4015 |
Symbol | |
ID | 6982785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4185170 |
End bp | 4186456 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398744 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002283503 |
Protein GI | 209551586 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.946152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCTAC GTGTAAGCAG ACGTAATTTC ATAGCGGGAG GCGCAACGCT TCTTTCGCTC TCGGCGCTCG GTACCAGCGC TTTTGCACAG GAAAAGCGCT TGCGCCTCCT GTGGTGGGGC TCGCAGCCGC GCGCCGACCG CACCAACAAG GTTTCGCAAC TCTATCAGAC GAAGAATGCC GGTACGTCGA TCACCGGCGA ATTCCTCGGC TGGGGCGATT ACTGGCCGCG CCTTGCGACC CAGGTCGCCG GCCGCAATGC GCCCGACGTC ATCCAGATGG ATTATCGCTA TATCGTTCAG TATGCACGGC GTGGCGCACT CGCACCGCTT GAATCCTATA TGCCGGCCAA GCTCAACCTT GACGATTTCG ACAAGGCGCA GATCGAAGGC GGCAGCGTCG ACGGCCATCT CTATGGCGTC AGCCTGGGTG CAAACTCGGC CGCGACGGTG CTGAACACCA CCGCCTTCAA GGAAGCCGGG GTCGATCTGC CGACACAGGC GACCACCTGG GAAGAATTCG GCCGTATGGG TGCGGAGATC ACCAAGGCAG GCAAACGCAA GGGCATGTTC GGCCTCGCCG ACGGCAGTGG CGGTGAACCG CTGTTCGAAA ACTGGCTGCG CCAGCGCGGC AAGGGCCTCT ATACCGCCGA CGGCAAGATC GCCTTCGACG TCGACGATGC ATCGGAATGG TATGACATGT GGGCGAAGTT CCGTGAGGCC GGCGCTTGCG TTCCCGCCGA TATCCAGGCT CTCGACAAGA ACGATATCGA AACCAACACG GTGTCGCTCG GCAAGGCAGC CGCCGGTTTT GCACATTCAA ACCAGTTCGT CGCCTATCAG GCCATGAACA AGGACAAGCT GGCGCTCACC AATTACATGC GCATCAAGGC GGATTCGAAG GGCGGCCACT ACCGCAAGCC TTCGATGTTC TTCTCGGTCT CGGCCCAGTC GAAAGCGATC GACTTGGCAG TGGATTACAT CAACTTCTTC GTCAAGAACC CCGAAGCAGT GCTGCTCTTG GATGTCGAAC GCGGCATTCC GGAATCGGCT GCCATGCGCG AGGTTGTTGC GGCGAAACTC GATGAGAACG GCAAGGTCGC GCTGGCCTAT GTCAGCGGCC TTGGCGACCT CGCTGGCAAA TTGCCGCCGC CGCCGCCGGC CGGCGCCGGT GAAGGCGAGC TGATGCTGCG CAACATCGCC GAACAGGTCG GCTTCGGCCA GCTGTCTCCT TCCGATGGCG GCAAACAGCT TGTCACCGAA ATCACGCAGA TTCTCGCACG AGGCTGA
|
Protein sequence | MSLRVSRRNF IAGGATLLSL SALGTSAFAQ EKRLRLLWWG SQPRADRTNK VSQLYQTKNA GTSITGEFLG WGDYWPRLAT QVAGRNAPDV IQMDYRYIVQ YARRGALAPL ESYMPAKLNL DDFDKAQIEG GSVDGHLYGV SLGANSAATV LNTTAFKEAG VDLPTQATTW EEFGRMGAEI TKAGKRKGMF GLADGSGGEP LFENWLRQRG KGLYTADGKI AFDVDDASEW YDMWAKFREA GACVPADIQA LDKNDIETNT VSLGKAAAGF AHSNQFVAYQ AMNKDKLALT NYMRIKADSK GGHYRKPSMF FSVSAQSKAI DLAVDYINFF VKNPEAVLLL DVERGIPESA AMREVVAAKL DENGKVALAY VSGLGDLAGK LPPPPPAGAG EGELMLRNIA EQVGFGQLSP SDGGKQLVTE ITQILARG
|
| |