Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3669 |
Symbol | |
ID | 6982431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3797646 |
End bp | 3799124 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643398391 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002283158 |
Protein GI | 209551241 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGGA TTTCACTCGG TGGTGCCATC CTGCGCGGCA CTGTTTCCAC TGCTTTGATG GTATCGTTGA TGTCCGCGTC GGCGCTGGGA GCGCCGGTCG ACCTGAGCAA GTGGTCGCCG GAATATGTGC GCTCCATTGC CGGCACGCAG GATTTCGACA CGGCGGCCGA TTGCGGCAAG GTCACCCCGC TCGACTACAA GGGGCGACTC ACTTTCTGGT ATCAGGGTGT GTTCGAGGGC GACCCCGATC TCCTGCGCCA GTATTACAAG GAGTTCTTCG AGACCTTCCG CAAAACCTAT CCGAACATCC AGCTTGAGGA GCAGGCCCTC ACCTATAACG ACCTGCTGGA TAAGTTCCGG ACCGCGCTCC TTGGCAATGC AGCGCCCATG GCGGTCCGCC TGCAGATCCT GGGTGGCACG GAGTTCGCCT CAAAGGGCTA TTTGCAGCCG CTCAAACCCG AGGATGTAGG CTATTCGACC GAGGATTTCT GGCCCGGCGC AATGAAGGCT GTAACCTGGG ATGGGGTAAC TTACGGCATC CCGACCAATA ACGAGACGAT GGCGTTCATC TGGAACGCCG ACATCTTCAA GCGTGCAGGC GTCGATCCGG ATAAGGCTCC GGCAACATGG GACGACGTCG TCAAGGATTC CAAGCAGATC CACGACAAGC TCGGCATTGC CGGTTACGGC CTCGTGGCTC GCAAGAATGC CGGCAATACG CCGTACCGCT TCATGCCGCA GCTGTGGGCC TATGGCGGCG GCGTCTTCGA CGAAGCGACC GCCAACCCGA CCTATAAGGA GGTCGAGCTC AACAGTCCGC AGAGCAAGGC GGCATTGCAA GCCTCCTACG ATATGTATGT TCGCGACAAG TCGGTTCCGG TTTCGGCGCT CACCAACCAG CAGGCCGACA ACCAGCCCCT CTTCCTCGCT GGCCAGCTCG GCATGATGAT CTCGCACCCG TCCGACTATA ACGTCATGCT CGACTTGCAG AAAAAGGCGA CGGATACCGA CAAGGACAAG GCGCAGACCG TCATCGACAA TATGCGCTAC GGCCTCATTC CGACTGGGCC CGATGGCAAG CGTGCCGTCG TGTTCGGCGG CTCCAACATT CACATCCTGA AGCCAGAATA TGTCGAGGGC GGCAAGGTAG ACGAGCCGGC TGCAAAGGCG ATCATCTGCA TGTGGACGAG CCCGGAATGG TCGCTGAAGA TGGCCTATGC CGGCTCGAAC CCGGGAAACC TCAACGGCTT CAAGACAAAA TGGATGAAGG AACGTCTTGA TAAAATCAAG TTCCTCGATG TCACGACCTC GATGCTGCCA TACGGCATTC CGTTCCCGGC GCTGCCACAG TCTCCCGAGA TCATGAACAT CATCGTCCCG GACATGCTGC AGAATGCCCT GACCGGGGCC ATGACCGTCG ACCAAGCCGC CGACGACGCA GCCAAGAAGG TAAAAGACCT GATGGACGGC GGACTCTAG
|
Protein sequence | MTRISLGGAI LRGTVSTALM VSLMSASALG APVDLSKWSP EYVRSIAGTQ DFDTAADCGK VTPLDYKGRL TFWYQGVFEG DPDLLRQYYK EFFETFRKTY PNIQLEEQAL TYNDLLDKFR TALLGNAAPM AVRLQILGGT EFASKGYLQP LKPEDVGYST EDFWPGAMKA VTWDGVTYGI PTNNETMAFI WNADIFKRAG VDPDKAPATW DDVVKDSKQI HDKLGIAGYG LVARKNAGNT PYRFMPQLWA YGGGVFDEAT ANPTYKEVEL NSPQSKAALQ ASYDMYVRDK SVPVSALTNQ QADNQPLFLA GQLGMMISHP SDYNVMLDLQ KKATDTDKDK AQTVIDNMRY GLIPTGPDGK RAVVFGGSNI HILKPEYVEG GKVDEPAAKA IICMWTSPEW SLKMAYAGSN PGNLNGFKTK WMKERLDKIK FLDVTTSMLP YGIPFPALPQ SPEIMNIIVP DMLQNALTGA MTVDQAADDA AKKVKDLMDG GL
|
| |