Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5687 |
Symbol | |
ID | 6977078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | - |
Start bp | 85758 |
End bp | 86963 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393144 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002277962 |
Protein GI | 209546072 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.627079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCTCG CCTTGCTCGG TTCGACGGCA ATGACCACGG TCACTGCCCA TGCCGCCGAC AAGGAAATCA GCTGGATCTA TTGCGGCGAC ACGATCGACC CGGTCCACAC CAAATACATC AAGCAGTGGG AAGAAAAGAA CACTGGCTGG AAGATCACCC CTGAGGTCGT CGGCTGGGCG CAGTGCCAGG ACAAGGCGAC GACCCTTGCC GCCGCCGGCA CGCCGGTTGC CATGGCCTAT GTCGGCTCGC GCACGCTGAA GGAATTCGCG CAGAACGATC TTATCGTTCC GGTGCCGATG ACCGATGACG AAAAGAAGAC CTACTACCCC CACATCGTCG ACACGGTAAC CTTCGAGGGC AACCAGTGGG GCGTTCCGAT CGCCTTCTCC ACCAAGGCGC TTTACTGGAA CAAGGATCTC TTCAAGCAGG CGGGCCTCGA TCCCGAGAAG CCGCCGAAGA CCTGGGCCGA AGAAATCGAG ATGGCAAAGA CCATCAAGGA AAAGACCGGC ATTCCGGGCT TCGGTCTCTC CGCCAAGACC TTCGACAACA CCATGCACCA GTTCATGCAT TGGGTTTACA CCAACAACGG CAGCGTGATC GGTGCCGACG GCAAGGTCAC GCTCGACAGC CCGCAGATTC TCGCCGCGCT GAAAGCCTAT AAGGACATTG TCCCCTACTC CGAAGAAGGC CCGACGGCCT ATGAGCAGAA CGAAGTCCGC GCCATCTTCC TCGACGGCAA GGTGGCGATG ATCCAGGCCG GCTCGGGTGC GGCCGACCGT CTGAAGAAGA CGCAGATCAG CTGGGGCATC ACGACGCTGC CGCTCGGCCC CGACGCCAAG GGTCCCGGCA CGCTGCTGAT CACCGACAGC CTGGCGATCT TCAAGGGTTC GGGCGTCGAG GACAAGGCGA CGGAGTTCGC CAAGTTCATC ACCTCGCCGG ATGTGCAGTC GGAATATGAG CTGCAGGGCG GCGCCGGCCT CACGCCGCTG CGCCCCTCTG CAAAGGTCGA CGAGTTCGTC GCCAAGGATC CCTATTGGAA GCCGTTGATC GACGGCATCA GCTATGGTGG TCCGGAGCCG CTCTTCACCG ATTATAAGGG CTTCCAGAAC TCGATGATCG AAATGATCCA ATCGGTGGTG ACTGGAAAGG CCGAGCCGGA AGCCGCACTC AAGAAGGCTG CCGGCGAAGT CGAAGCCTTC AAGTAA
|
Protein sequence | MALALLGSTA MTTVTAHAAD KEISWIYCGD TIDPVHTKYI KQWEEKNTGW KITPEVVGWA QCQDKATTLA AAGTPVAMAY VGSRTLKEFA QNDLIVPVPM TDDEKKTYYP HIVDTVTFEG NQWGVPIAFS TKALYWNKDL FKQAGLDPEK PPKTWAEEIE MAKTIKEKTG IPGFGLSAKT FDNTMHQFMH WVYTNNGSVI GADGKVTLDS PQILAALKAY KDIVPYSEEG PTAYEQNEVR AIFLDGKVAM IQAGSGAADR LKKTQISWGI TTLPLGPDAK GPGTLLITDS LAIFKGSGVE DKATEFAKFI TSPDVQSEYE LQGGAGLTPL RPSAKVDEFV AKDPYWKPLI DGISYGGPEP LFTDYKGFQN SMIEMIQSVV TGKAEPEAAL KKAAGEVEAF K
|
| |