Gene Information |
Locus tag | Rleg2_4568 |
Symbol | |
ID | 6977662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 203806 |
End bp | 205038 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643393745 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002278563 |
Protein GI | 209546645 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
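The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is only a sketch: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 203806
end_bp = 205038

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1233 410
```

Under these assumptions the computed values match the reported Gene Length (1233 bp) and Protein Length (410 aa).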
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.267408 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACAA TGATGAAGCT TTTTCTTGCC GGTGTGGCAT TCGCCGGTTT CTTCGCTTCG GCGCATGCTG AGGACAAGCC GACAATCGAG ATCATGTCGT CCTGGACGTC GGGCGGCGAA GCGGCAGCCC TCAATGTCAT CAGGACCGAG TTCGAAAAGC GCGGCGGCGT CTGGAAGGAT TCCTCGATCG CCGGCTTCGG CGCCGCTGAT GCCGCCTTCC AGAACCGTAT CGTCGCCGGT GACGCGCCGG GCGCCAAACA GGGCGTCATC GGTCTTGCGG CTGCGGATTT CGTCGGCCAG GGACTGTTCA ATCCGATCGA CGATGTCGCT GCTGCCGGCA AATGGGCCGA CGTGCTGCCG AAATCGATCC ATGATCTCAT CTCTTATGAC GGCAAGGTCT ATCTCGCGCC GACCGGCGCC CATGGCGAGA GCTGGGTCTT CTATTCAAAG GAAGCCTTCG CCAAGGCCGG CATCGCCGAG GAGCCCAAAA CCTGGGACGA GTTCTTCGCC GACTTCGACA AGCTGAAGGC TGCCGGTATC GTTCCCGTTG CATGGGGTGG CCAGCCCTGG CAGCAGACCA AGGTCTTCAA CATGATCCTG CTCTCGCAGG TCGGGATCGA CGGCTTTCTG AAGATCTATG TCGACAAGGA CAAGAGCCAG GCCTCTGTCG AGGGCGTGAA GAAGACCCTC GAAATTCTCG GCAAGCTGCG CGGCTATATC GATGCGGGGG CTGCCGGCCG CAACTGGAAC GACGCAACGG CGATGCTGAT CACCGCCAAG GCCGGCGTGC AGTTCATGGG CGACTGGGCA AAGGGCGAAT TCACCGTTGC CGGCAAGGAA CCGGGCAAGG ATTATGGCTG CATGATCGTG CCGGAGTCCA AGGGCATGGT CTACATCGCC GATTCCCTCT GGTTCCCGAA GACCGGCAAT GCCGCGACCG ACAAGGCGCA GAAACTTCTC GCCGAAGTCG TCATGGATCC CGCGGTCCAG GTCGAATTCG CTTTGAAGAA GGGCTCGGTT CCGATGCGCA CCGATGTCGA CAAGTCGAAG CTTGATGTCT GCGCCCAGAA GGGTGTCGAG TTGATGGCTT CCGGTGCGAT CGTCCCGGAT CAGGCAATCG TGCTGACACC CCAGCAGGTC GGCGCGCTCG ACGATTTCGT CGACGAATAC TGGAGCGGTG GCTCGAACGA TACGGCATCT GCGGCTGAGA ATTTCTTCGC CGTCTTCGAG TAA
|
Protein sequence | MKTMMKLFLA GVAFAGFFAS AHAEDKPTIE IMSSWTSGGE AAALNVIRTE FEKRGGVWKD SSIAGFGAAD AAFQNRIVAG DAPGAKQGVI GLAAADFVGQ GLFNPIDDVA AAGKWADVLP KSIHDLISYD GKVYLAPTGA HGESWVFYSK EAFAKAGIAE EPKTWDEFFA DFDKLKAAGI VPVAWGGQPW QQTKVFNMIL LSQVGIDGFL KIYVDKDKSQ ASVEGVKKTL EILGKLRGYI DAGAAGRNWN DATAMLITAK AGVQFMGDWA KGEFTVAGKE PGKDYGCMIV PESKGMVYIA DSLWFPKTGN AATDKAQKLL AEVVMDPAVQ VEFALKKGSV PMRTDVDKSK LDVCAQKGVE LMASGAIVPD QAIVLTPQQV GALDDFVDEY WSGGSNDTAS AAENFFAVFE
|
| |
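The relationship between the gene sequence and the protein sequence can be sketched with a small translation routine. The code below is a minimal illustration, not the database's own method: it builds the standard codon assignments (which translation table 11 shares for internal codons; table 11 differs mainly in the set of permitted start codons, which does not matter here because this gene starts with ATG), strips the spaces used to group the sequence into blocks of ten, and translates until the first stop codon. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
prefix = "ATGAAGACAA TGATGAAGCT TTTTCTTGCC"
print(translate(prefix))  # prints: MKTMMKLFLA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.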