Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3430 |
Symbol | |
ID | 8014303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 3452102 |
End bp | 3453349 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644825988 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002977215 |
Protein GI | 241206119 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGCCG CATTAGCAGG TATCGCAGCA TCCGCGTTGA CATTGTGCAT ATCGACGTCT GCCGTCTTCG CGACGGATCT CCGCATGACG GTCTGGACCG GGAGCGAGGC GCATCTGAAG ATGCTGAATG GCATTGCGGA GAGCTTCAAA GCCACACACC CCGATGTGAA CGTGAAGTTC GAGACCGTGC CGGTCAGCGA CTACACGCAG AAACTGACCT TCCAGATCGC CGGCGGCAAT GCTCCCGACA TAGCCTGGAT GATGGAGGAT GCCGCTCCGG CTTTCGAAAA CGCCAATCTT CTGATGGATC TCGGCCCGAC GCTCAAGGCG GCGGAAGGCT ATGATTTCGA CGATTTTTCG AAGCCGGCCA TGGGCCTCTG GCAGAAGGAT GAAACGGTCT ACGGCATTCC GTTCTCCACC TCGCCTTTCA TGATCTACTA CAACAAGGAC ATGTTCGACA AAGCCGGGCT CGAAGATCCG CTGACGCTCG CCACCAAGGG CGAATGGAAC ATGGACAAGT TCCAGGAAGT CTCCAAGAAG CTCGCGGAAA CCAATCCCGG CAAATGGGGC TTCGAGTTCA AGGATGGGGA AGGCTATGCC TCTCGCATGA CCCATGCCCT TCTGCCGCCA ATCCGCGCCT ATGGTGGCGA TATCTGGTCG AACAAGGAAT GCGGCTTCGA CAAGCCCGAA GCGGTCAAGG CGGTCAAGCA GCTGCATGAC ATGGTCTTCA AGGACAAGTC CATCGTTCCG CCGGGCGAAC AGGGCGATTA CTTTTCCGGC AATTCGGCGA TGACGGTCAA CCAGATTTCC CGCGCTTCGA AGATGGCGGA AGCCGGCTTC AAGTGGGGCA TCGCACCGTT GCCCACCGGC CCAGGTGGTG AGTCACCCGT TATCGGCCAG GCCGGTCTTG TTGTGTTCGC CCAAGGCAAG AATACGGAAA TCGCCGCGGA ATTCGTGGCG CATATGACCA ACAAGGAAAA CGTCGCCACC ATGGCGCAGT TCTTCCCGCC CGCCCGCAAG AGCGTTCTGC AGGCCGATGC ATTCATCAAC GGCAACAAGC TCGTGCCGCC CGAGATGATG AAGAATGTGG CTGCCGCCAT AGAAAAGGGC CGGGTGGTTT CGGCTAACGA AAAAGCGCCA CAGATCCTTG CCGCCATGGC GCCTCGCGTC GATGCCTTGT GGAAGCCGGA TGCCGATGTC GATGCCGCCA TCAAGGGCAT CTGCGCGGCA ATCCAACCGC TGCTTTGA
|
Protein sequence | MKAALAGIAA SALTLCISTS AVFATDLRMT VWTGSEAHLK MLNGIAESFK ATHPDVNVKF ETVPVSDYTQ KLTFQIAGGN APDIAWMMED AAPAFENANL LMDLGPTLKA AEGYDFDDFS KPAMGLWQKD ETVYGIPFST SPFMIYYNKD MFDKAGLEDP LTLATKGEWN MDKFQEVSKK LAETNPGKWG FEFKDGEGYA SRMTHALLPP IRAYGGDIWS NKECGFDKPE AVKAVKQLHD MVFKDKSIVP PGEQGDYFSG NSAMTVNQIS RASKMAEAGF KWGIAPLPTG PGGESPVIGQ AGLVVFAQGK NTEIAAEFVA HMTNKENVAT MAQFFPPARK SVLQADAFIN GNKLVPPEMM KNVAAAIEKG RVVSANEKAP QILAAMAPRV DALWKPDADV DAAIKGICAA IQPLL
|
| |