Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4871 |
Symbol | |
ID | 8007258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 251888 |
End bp | 253192 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644821800 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973060 |
Protein GI | 241113225 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.618682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.377624 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATTT TGATGGGTAA TCGTTTACTT TCCGCACTCC TAGCGTCCGC TACCCTGGTC GCGACCGCCG GTTTCGCCGA GGCGAAGACC GTCATCCACG TCATGCACCA GGGTGATCCC GGCTGGGTAA AGGCCTATGG CGACGTTGCC ACCCGCTTCG AGGCCGTCAA TCCCGATGTC GACATCGAGA TGATCTACGC GCCGCACGAT GCCTATAACG AAAAGTTCAG CGCCGCCGTC ATGGCCAAGC AGCTGCCCGA TATCATGGAA CTCGATGCTC CGTTCCTCGC CAATTATGTC TGGTCCGGCT ATCTACAGGC CGTCAAGCCG CTGATTGACA AGGACCTGCT CAATGACATG ACGGATTCGA ATATCGCCCA GGGCACCTAT CCGATCGACA AGGACCTCTA TGCGATCGGG CTCACCGACT CCTCGGTCGT GCTCTACGGC AATAAGAAAT ATCTTGAAGC CATTGGCGCC CGCATTCCGA AATCGGTCGA TGATGCCTGG ACCCGCGAGG AATTCGAAGG CTATCTCGAA AAGCTTTCCA AGCTCGAAGG CGTGAAGTGG CCGATCGACA CCTTCCGAGG TTACGGCATC AAGACCGAAT GGATCACTTA TGCCTACGGC CCCCTGCTCG AAAGCGCCGG CTGCGACCTG ATCGACCGCA AGACTTGGAA GGCCAGCGGC ACGCTCGATG GCGAGGCCTG CGTCAAGGCG CTGACGATGA TGCAGGACTG GGTGAAGAAA GGCTGGGTCG TACCGACTTC GTCAGGCACC AACCAGTTCT ACGCCGAGGG ACAGCCGGCC GCGCTCGCCA TGGGCGGTCA CTGGTTCTAC GCGGAGGCGT CGGCAGCGAT GAAGGACAAT ATCGTCGTCA TGCCGCTGCC GAAGATCGGC GACAAGGGCG TGAGCCCGAA CGGAACCTGG ATCTGGGGCA TTTCCGCGAC CTCGAAAAAC CCTGAGATCG CCGGCAAGTT CCTGAGCTTC CTTTTGAAGG ACAAGGAGTA CCGGGAGTAC GCCAAGACCC AGTCCGCCTA TCCGGGTTTG AAAAGCTTCG CCGCCGAATC CCCGCTCTAT GCGGAAGGCG GTCCGCTCGC CGTCGCCTTT GAGCAGGCTT CCAAGACCGC GGTCGCCCGT CCGCCGCACC CGGCCTATCC GACGATCACC TCGGCCTTCA TGCAGGCCGT CGACAAGATC TTCAACGGTG GTGACGCGCA GGAAGCGCTG ACGGCCGCTG CCGAGAAGAT CGACGAGGAT ATCGAGGACA ACGCCGGTTA CCCGCCCTTC GACGAGCAGA AGTAA
|
Protein sequence | MRILMGNRLL SALLASATLV ATAGFAEAKT VIHVMHQGDP GWVKAYGDVA TRFEAVNPDV DIEMIYAPHD AYNEKFSAAV MAKQLPDIME LDAPFLANYV WSGYLQAVKP LIDKDLLNDM TDSNIAQGTY PIDKDLYAIG LTDSSVVLYG NKKYLEAIGA RIPKSVDDAW TREEFEGYLE KLSKLEGVKW PIDTFRGYGI KTEWITYAYG PLLESAGCDL IDRKTWKASG TLDGEACVKA LTMMQDWVKK GWVVPTSSGT NQFYAEGQPA ALAMGGHWFY AEASAAMKDN IVVMPLPKIG DKGVSPNGTW IWGISATSKN PEIAGKFLSF LLKDKEYREY AKTQSAYPGL KSFAAESPLY AEGGPLAVAF EQASKTAVAR PPHPAYPTIT SAFMQAVDKI FNGGDAQEAL TAAAEKIDED IEDNAGYPPF DEQK
|
| |