Gene Information |
Locus tag | Rleg2_2351 |
Symbol | |
ID | 6981090 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2411512 |
End bp | 2412819 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643397064 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002281852 |
Protein GI | 209549935 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
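The length fields in the record above are consistent with one another, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates on the replicon and that the protein length excludes the stop codon; the helper names are hypothetical and are not part of any database API.

```python
# Illustrative check of the record's length fields (helper names are hypothetical).
# Assumption: Start bp / End bp are 1-based, inclusive replicon coordinates.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2411512, 2412819)
print(length)                     # 1308
print(protein_length_aa(length))  # 435
```

Both values match the Gene Length (1308 bp) and Protein Length (435 aa) fields listed above.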
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0456809 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0969201 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTT CCATTCTGGC GGCTGCGGCC GCGCTGATGG CGGGCGCATG TTCCGCACAG GCGGCCAATA TCGAATTCTG GTACGGCAAT ACCGGTGTCG TCGAGACGGC GATCCAAGCC CAATGCAGTG CATTCAATGC CGCGCAGACC GAGCATCACA TTACCTGCGT TGGCCAGGGC AGCTACGAGG TGTCGATGCA GAAGGCGATC GCAGCCTTTC GCGCCAAGAA CCATCCCGTC CTCATCCAGT TCTTCGACGC CGGCACGCTC GATCTGATGC TGTCGGACGC TGTTGTGCCA GTCCAGGAGG TGCTGCCCGA CGTCAAGTGG GAAAGCTATA TTGCCGGGGC GCGCGCCTAT TACGAAACCT CCGGCGGCAA GCTTTTCGCC CAGCCTTACA ATGCCTCGAC GCTGCTCTTC TACACCAACA AGACCGAGCT TCAGAAAGCC GGCGTCACCG AGACGCCGAC GACCTGGGAA GAGATCATCG AAGCTGCCCG CAAGCTGAAG GCATCGGGTC ATGCCTGCCC CTTCGTCACC GACGGCGATA CATGGCGCGT GCTCGAACAG TTCTCGGCCC GTCACGGCCT GCCGATCGCC TCCAGGCACA ATGGCTATGA CGGCCTCGAC GCCGGATATG TCTTCAACAC CACGTTCGCC GCCAAGCATT TGCAGAACCT GGTCGACTGG CGCAAGGAAG GCCTCGTCAG GCTCGCAAGC GATACCAAGG CCGGTAATTT CACCGCCGCC TTCAATGCCG GCGAATGCGC GATGATGGAG AATTCCTCGG GTTCTTATAC CGCTTCCGCC AAGGCGTTCG AAGGCAAGTA CGAGCTCACT GTCGGCATGG CGCCGATGTA CAAGGGGTAT GAGCGCCACA ACACGCTCGT TGGCGGCGCC TCGATCTACA TCATGAAAGG CCACGACAAG GCCGAAATCG AGGGCGCCAA GGCCTTCCTC GATTTCGTGC GCCGACCCGA ACAGCAGATG GCCTTCACGT CAGCCACCGG CTACGTGCCG GTCACCAGCG ACGTGATGGA CGCGATCGCC AAGAGCGGCG AGGCGAATAC GCCGAAATAC GCCACGGCCG CCGTCGGCAT CGGTTCGATG AATGAGCCGC GCACGCCGGA TACCCGCGGC ATCCGCCTCG GCTTCTACGT GCAGTTTCGC CAGGTCTTCA TGGAAGAAAC GCAGAAGGCT TTTGCCGGCG GGCAGACGAT GCAGGCAGCG CTCGATAACG CCAAGAAGCG CGGCGACGAG CTGCTGCGCC GCTTCGAGCA GACCTACAAG GGCGTAAAGC TGCACTGA
|
Protein sequence | MKISILAAAA ALMAGACSAQ AANIEFWYGN TGVVETAIQA QCSAFNAAQT EHHITCVGQG SYEVSMQKAI AAFRAKNHPV LIQFFDAGTL DLMLSDAVVP VQEVLPDVKW ESYIAGARAY YETSGGKLFA QPYNASTLLF YTNKTELQKA GVTETPTTWE EIIEAARKLK ASGHACPFVT DGDTWRVLEQ FSARHGLPIA SRHNGYDGLD AGYVFNTTFA AKHLQNLVDW RKEGLVRLAS DTKAGNFTAA FNAGECAMME NSSGSYTASA KAFEGKYELT VGMAPMYKGY ERHNTLVGGA SIYIMKGHDK AEIEGAKAFL DFVRRPEQQM AFTSATGYVP VTSDVMDAIA KSGEANTPKY ATAAVGIGSM NEPRTPDTRG IRLGFYVQFR QVFMEETQKA FAGGQTMQAA LDNAKKRGDE LLRRFEQTYK GVKLH
|
| |
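The gene and protein sequences above are displayed in space-separated blocks of 10 characters, and the displayed gene sequence begins with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The following is a minimal sketch, not the database's own method, of how the blocks could be cleaned, translated, and summarized for GC content. It assumes the amino acid assignments of translation table 11 (which are the same as the standard code; table 11 differs in its permitted start codons). All function names are hypothetical.

```python
# Minimal sketch: clean block-formatted sequence text, translate it, and
# compute GC content. Function names are hypothetical illustrations.

BASES = "TCAG"
# Amino acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def translate(raw: str) -> str:
    """Translate a coding-orientation DNA sequence; '*' marks a stop codon."""
    seq = clean_sequence(raw)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in seq if base in "GC") / len(seq)


# First three blocks and last nine bases of the gene sequence shown above.
print(translate("ATGAAGATTT CCATTCTGGC GGCTGCGGCC"))  # MKISILAAAA
print(translate("CTGCACTGA"))                         # LH*
print(gc_percent("ATGAAGATTT CCATTCTGGC"))            # 40.0
```

The translated fragments agree with the start (MKISILAAAA) and end (LH) of the protein sequence in the record. Passing the full gene sequence to gc_percent would give the value to compare against the GC content field.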