Gene Information |
Locus tag | Rleg_4863 |
Symbol | |
ID | 8007251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 243972 |
End bp | 245450 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821793 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973053 |
Protein GI | 241113218 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
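The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported protein length excludes a single terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 243972
end_bp = 245450
reported_gene_length = 1479      # bp
reported_protein_length = 492    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1479 492
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length, and the gene length is a multiple of three, as expected for an unspliced CDS.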
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.426159 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGGA TTTCACTCGG CGGTGCCGCC GCGCGCGCAA CTGTATCCAC TGCTTTGATG CTATCGTTGA TGTCCGCGTC TGCACTGGGC GCGCCGGTCG ACCTGAGCAA GTGGTCGCCG GAATATGTGC GCTCCATTGC CGGCACACAG GACTTTGACA CGGCGGGCGA TTGCGCCAAG GTCACCCCCC TCGACTACAA GGGGCGACTG ACTTTCTGGT ATCAGGGCGT GTTCGAGGGT GACCCCGACC TCCTGCGCCA GTATTACAAG GAGTTCTTCG AGACCTTCCG CAAGACCTAT CCAAACATCC AGCTTGAGGA ACAAGCCCTC ACCTATAACG ACCTGCTGGA CAAGTTCCGC ACCGCGCTCC TTGGCAATGC AGCGCCAATG GCGGTGCGTC TGCAAATCCT GGGCGGCACC GAGTTCGCCT CGAAGGGCTA TCTGGAACCC CTCAAACCAG AGGACGTAGG GTATTCGACC GACGACTTCT GGCCCGGTGC AATGAAGGCC GTAACCTGGG AGGGGGTGAC CTACGGCATC CCGACCAACA ACGAGACGAT GGCGTTCATC TGGAACGCCG ACGTCTTCAA GCGTGCAGGC CTCGATCCGG AAAAGGCTCC GGCAACCTGG GACGACGTCG TCAAATATTC CAAGCAGATC CACGACAAGC TCGGCATTGC CGGTTACGGC CTCGTGGCGC GCAAGAACGC CGGCAATACG CCGTATCGCT TCATGCCGCA GCTGTGGGCC TATGGCGGCG GCGTTTTCGA CGAAGCCACC GCCAATCCGA CCTACAAGCA GGTCCAGCTC GACAGCCCGC AGAGCAAAGC GGCATTGCAA GCCTCCTACG ATATGTATGT CCGCGACAAA TCGGTTCCGG TTTCGGCGCT CACCAACCAG CAGGCGGATA ACCAGCCCCT CTTCCTCGCT GGCCAGCTCG GCATGATGGT CTCGCACCCC TCCGACTACA ACGTCATGCT CGACCTGCAG AAAAAGACGA CGGGCGGCGA CAAGGACAAA GCGCAGACCG TCATCGACAA TATGCGCTAC GGCCTGATTC CGACTGGCCC CGACGGCAAG CGTGCCGTCG TGTTTGGCGG CTCGAACATT CACATCCTGA AGCCCGAATA TGTCGAGGGC GGCAAGGTCG ACGAGCCGGC TGCAAAGGCT ATCAGCTGCA TGTGGGCAAG CCCCGAATGG TCGCTGAAAA TGGCCTATGC CGGCTCGAAC CCGGGAAACC TTAACGGCTT CAAGACCAAA TGGATGAAGG AACGCCTGGA CAGTATAAAG TTCCTTGATG TCACGACTTC GATGCTGCCA TACGGCATCC CGTTTCCGGC GCTGCCCCAG TCCCCCGAGA TCATGAACAT CATCGTCCCG GACATGCTGC AGAATGCCCT CACCGGAGCC ATGACTGTCG ACCAAGCAGC GGACGACGCA GCCAAGAAGG TCAAAGACCT AACGGATGGC GGACTCTAG
|
Protein sequence | MTRISLGGAA ARATVSTALM LSLMSASALG APVDLSKWSP EYVRSIAGTQ DFDTAGDCAK VTPLDYKGRL TFWYQGVFEG DPDLLRQYYK EFFETFRKTY PNIQLEEQAL TYNDLLDKFR TALLGNAAPM AVRLQILGGT EFASKGYLEP LKPEDVGYST DDFWPGAMKA VTWEGVTYGI PTNNETMAFI WNADVFKRAG LDPEKAPATW DDVVKYSKQI HDKLGIAGYG LVARKNAGNT PYRFMPQLWA YGGGVFDEAT ANPTYKQVQL DSPQSKAALQ ASYDMYVRDK SVPVSALTNQ QADNQPLFLA GQLGMMVSHP SDYNVMLDLQ KKTTGGDKDK AQTVIDNMRY GLIPTGPDGK RAVVFGGSNI HILKPEYVEG GKVDEPAAKA ISCMWASPEW SLKMAYAGSN PGNLNGFKTK WMKERLDSIK FLDVTTSMLP YGIPFPALPQ SPEIMNIIVP DMLQNALTGA MTVDQAADDA AKKVKDLTDG GL
|
| |
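The gene and protein sequences above can be checked against each other by translating the nucleotide sequence with the translation table named in the record (11). The following is a minimal sketch using only the Python standard library, not the database's own procedure. The names `CODON_TABLE` and `translate` are hypothetical helpers. The amino-acid string is the standard codon assignment in TCAG order, which table 11 shares with the standard code (the two differ only in the permitted start codons, which does not matter here because this gene starts with ATG). Only the first 30 and last 9 nucleotides of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon assignments in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 9 nt of the gene sequence, copied from the record.
first_30 = "ATGACGAGGA TTTCACTCGG CGGTGCCGCC"
last_9 = "GGACTCTAG"

print(translate(first_30))  # MTRISLGGAA
print(translate(last_9))    # GL
```

The two outputs match the first ten and last two residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison over all 492 residues.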