Gene Information |
Locus tag | Rleg2_4132 |
Symbol | |
ID | 6982904 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 4312042 |
End bp | 4313874 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643398862 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002283620 |
Protein GI | 209551703 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
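The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, although both agree with the reported values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4312042
END_BP = 4313874
GENE_LENGTH_BP = 1833
PROTEIN_LENGTH_AA = 610


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1833 610
```

Both computed values match the Gene Length and Protein Length rows, which is consistent with an unspliced gene (Is gene spliced: No).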
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCTT TGTGGTCGAA GATCGGTTTG TTTGTATCGC TTGCGGGTGT CTTGGCGCCC ATGACCGGAA CGGGTCAGGA CCAGCCGTTT CAGATCGGCA GTTCCGTCAT CAGTGAGATG AAGTACAAGC AGGGCTTTGC GCATTTCGAC TACGTCAATC CCAATGCCCC AAAAGGCGGA GACCTGCGCC TTTCTGCAAG CGGCGCTTTC GACACCTTCA ATCCTATCCT TGCCAAAGGC CAGATAGCGG CAGGGCTCTC GCTCGTCTAC GACACGTTGA TGAAGCCGAC CGATGACGAG CTCCTTGTCT CCTATGGTCT GCTTGCCGAG GGGCTGTCCT ATCCGGATGA CGTCTCAAGC GCGACCTTTC GCCTACGCAA GGAAGCGAAA TGGGCGGATG GCCAGCCGAT AACGCCCGAC GACGTCATCT TCAGTCTGGA TAAGACGAAG GAATTAAACG CTGCCACCGC GAACTATTAC CGGCACGTGG TGAAGGCCGA AAAGACGGGC GATCGCGACG TCACTTTCAC CTTCGACGAA AAGAACAATC GCGAGCTCCC GAATATTCTC GGCCAGTTGG TGATCGTGCC GAAACATTGG TGGGAGGGGC AGGGGCCGGA CGGCAAGCCG CGCGACATCT CAAAGACGAC GCTCGAGCCT GTGATGGGAT CGGGGCCTTA TAAGATCGCA TCCTTTTCGT CCGGCGCGAC GATCCGTTAT GAACTGCGCG ACGATTATTG GGGCAAGGAT CTCAATGTGA ATGTCGGCCA GAACAATTTC CGCAACGTCA TTTACACCTA TTTCGGCGAT CGCGATGTCG AGTTCGAAGC CTTTCGCGCC GGCAATAGTG ACTTCTGGCA GGAGACAACG GCGTCCCGCT GGGCGACGGG TTATGATTTT CCCGCAGTGA AGGAAGGACG CGTCAAGAAA GAAGAGGTTG CAAATCCGCT GCGCTCCACC GGCATTCTGC AAGCGCTCGT GCCCAATATG CGGCGTGACC TTTTCAAGGA TGAACGGGTC CGTGAGGCGC TGAATTACGG CCTCGATTTC GAGGAGCTGA ACCGGACCGT TGCCTTCAAC AGCTACAAGC GCATCGACAG CTACTTCTGG AACACCGAAC TCGCCTCCTC CGGCCTGCCG CAGGGGCGTG AACTGGAAAT ACTGCAGGCC ATGAAGGACA AGGTGCCGCC TGAGGTCTTC ACGACGCCCT ACACCAATCC GGTCGGGGGC GATCCGCAGA AAAGCCGCGA CAACCTCCGC AAGGCGATTG CATTGCTCAA AGAATCCGGA TGGGAGATCA AGAACAATCG CATGGTCAAT GGCAAGACCG GCCAGCCGAT GAGTTTCGAG ATCCTGTTGT CGAGCCCGGT ATTGGAGCGC TGGGCGGTGC CCTATGCCAA CAATCTCAAG AAAATCGGCA TCGATGCGCG GGTGCGCACA GTCGACGCCT CGCAAGCCGT CAACCGCGAA CGCAGCTTCG ATTACGACAT GATCTGGAAT GTCTGGGCGG AGACGATGAA CCCGGGCAAC GAGCAAGCAG ATTATTGGGG GTCTGGTTCG GTCGACCAGC AGGGCTCCCA CAATTATGCG GGCATCGCCA ATCCGGCGGT CGATGAACTC ATCCACATGA TCATCTTCGC ACCCAATCGT GCGGAACAGG TCGCAGCGAT CAAGGCAATG GATCGCGTAT TGCTGGCAAA CCACTACGTC ATCCCGCTGT TCTATCGCGA TAGCTATAAC CTCGCCTATT GGAACACGAT TACGCACCCG ACGGACTTCC CGACCTATGG ACTGGGTTTC 
CCGGAAGCCT GGTGGTCCGC CTCGGCAAAA TGA
|
Protein sequence | MTALWSKIGL FVSLAGVLAP MTGTGQDQPF QIGSSVISEM KYKQGFAHFD YVNPNAPKGG DLRLSASGAF DTFNPILAKG QIAAGLSLVY DTLMKPTDDE LLVSYGLLAE GLSYPDDVSS ATFRLRKEAK WADGQPITPD DVIFSLDKTK ELNAATANYY RHVVKAEKTG DRDVTFTFDE KNNRELPNIL GQLVIVPKHW WEGQGPDGKP RDISKTTLEP VMGSGPYKIA SFSSGATIRY ELRDDYWGKD LNVNVGQNNF RNVIYTYFGD RDVEFEAFRA GNSDFWQETT ASRWATGYDF PAVKEGRVKK EEVANPLRST GILQALVPNM RRDLFKDERV REALNYGLDF EELNRTVAFN SYKRIDSYFW NTELASSGLP QGRELEILQA MKDKVPPEVF TTPYTNPVGG DPQKSRDNLR KAIALLKESG WEIKNNRMVN GKTGQPMSFE ILLSSPVLER WAVPYANNLK KIGIDARVRT VDASQAVNRE RSFDYDMIWN VWAETMNPGN EQADYWGSGS VDQQGSHNYA GIANPAVDEL IHMIIFAPNR AEQVAAIKAM DRVLLANHYV IPLFYRDSYN LAYWNTITHP TDFPTYGLGF PEAWWSASAK
|
| |
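The gene and protein sequences above can be checked against each other with a small translation routine. This is a minimal sketch rather than the database's own method. It builds the codon table from the standard NCBI amino-acid string, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 and last 33 bases of the gene are included as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 and last 33 bases of the gene sequence, copied from the record above.
GENE_START = "ATGACGGCTT TGTGGTCGAA GATCGGTTTG"
GENE_END = "CCGGAAGCCT GGTGGTCCGC CTCGGCAAAA TGA"

print(translate(GENE_START))  # MTALWSKIGL
print(translate(GENE_END))    # PEAWWSASAK
```

The two outputs match the first and last ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.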