Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_3450 |
Symbol | |
ID | 6982204 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 3565387 |
End bp | 3566697 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643398168 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002282943 |
Protein GI | 209551026 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTGA GAACTTTTCT GCTGGGCGCC TGCTCAGCAC TGGCGTTTGC CGGCATGGCT TCGGCTGAGA CGCTGACAAT CGCAACCGTC AACAACGGCG ACATGATCCG GATGCAAAAG CTGACGGATG ATTTCAAGGC GAAGAATCCC GGCATCGACC TTGAATGGGT CACCCTCGAA GAGAACGTGC TGCGCCAGAA GGTCACGACC GACATCGCGA CCAAGGGCGG CCAGTACGAC GTTTTGACGA TCGGCACTTA CGAAGTTCCG ATCTGGGCAA AGCAGGGTTG GCTGCTGCCG CTCGACAATC TCGGCGCCAA TTATGACGTC GACGACCTGC TGCCGGCGAT CCGCAGTGGC CTGACCGTGG ACGGCAAGCT CTATGCTGCG CCGTTCTACG GCGAAAGCTC GATGGTCATG TATCGCAAGG ACCTGTTTGA CGCCGCCGGC CTGAAGATGC CCGACGCGCC GACCTGGGAT TTCGTTGCCG ACGCTGCCCG CAAGATCACT AACAAGGACA AGGAAATCTA CGGCATCTGC CTGCGTGGGA AGGCCGGCTG GGGCGAGAAC ATGGCCTTCC TGACGGCCAT GTCCAACTCC TTCGGCGCTC GCTGGTTCGA TGAAAAGTGG AAGCCGCAGT TCGATCAGCC GGAGTGGAAG GACACGCTCG ACTTCTACGT CAAGCTGATG AAGGATGCCG GCCCTCCGGG CGCTTCCTCC AACGGCTTCA ACGAGAACCT GGCGCTGTTC CAGACCGGCA AGTGCGGCAT GTGGATCGAC GCAACGGTTG CCGCTTCCTT CGTCGCCGAT CCGAAGCAGT CGCAGGTCGC CGACAAGGTC GGCTTTGCGC TCGCCCCGGA CAAGGGCCTC GGCAAGCGCG GCAACTGGCT CTGGGCCTGG AGCCTCGCCG TCCCGGCAGG TACGCAGAAG GCGGAAGCTG CCGAGAAGTT CGTCGCCTGG GCGACGAGCA AGGAATACAG CAATCTCGTC GCTGAGAAGG AAGGCTGGCT GAACGCACCT CCGGGCACCC GCAAGTCGCT CTATGCGAAT GCGGACTACC AGAAGGCGGC GTCGTTTGCC AAGATGACGC TCGACTCGAT CGAGGCGGCC GATCCGACCA AGCCGACCGT CAAGCCGGTT CCTTATGTCG GCGTCCAGTT CGTGGCGATC CCGGAATTCC AGGGTATCGG TACGGCGGTG GGTCAGCAGT TCTCGGCAGC CCTTGCCGGC CAGATTTCGG TCGACCAGGC GTTGAAGAGC GCACAGCAGC TGGCGACGCG CGAAATGACC AAAGCCGGCT ACATTAAGTA A
|
Protein sequence | MTLRTFLLGA CSALAFAGMA SAETLTIATV NNGDMIRMQK LTDDFKAKNP GIDLEWVTLE ENVLRQKVTT DIATKGGQYD VLTIGTYEVP IWAKQGWLLP LDNLGANYDV DDLLPAIRSG LTVDGKLYAA PFYGESSMVM YRKDLFDAAG LKMPDAPTWD FVADAARKIT NKDKEIYGIC LRGKAGWGEN MAFLTAMSNS FGARWFDEKW KPQFDQPEWK DTLDFYVKLM KDAGPPGASS NGFNENLALF QTGKCGMWID ATVAASFVAD PKQSQVADKV GFALAPDKGL GKRGNWLWAW SLAVPAGTQK AEAAEKFVAW ATSKEYSNLV AEKEGWLNAP PGTRKSLYAN ADYQKAASFA KMTLDSIEAA DPTKPTVKPV PYVGVQFVAI PEFQGIGTAV GQQFSAALAG QISVDQALKS AQQLATREMT KAGYIK
|
| |