Gene Information |
Locus tag | Rleg2_1699 |
Symbol | |
ID | 6980436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1731520 |
End bp | 1732779 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396423 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002281213 |
Protein GI | 209549296 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
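The length fields above follow from the coordinates. The sketch below is a quick consistency check, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1731520
end_bp = 1732779

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.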
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAA TACAATCGAT CGGTGCGGCT TTTGCTGCGG TTCTCTTGAG TTCCGTTGCC GCCCATGCCG GCGACGTGCG CATCATGTGG TATTCCGATG GCGGCGAAGG CGCTGTCATC AAGGATCTGC TGGCGCGCTT CTCGAAGGCC AATCCCGATG TCAACGTCAT TCTCGACGAG GTCTCCTATG ACGTCGTCAA GGAACAGCTG CCGGTGCAGC TCGAAGCCGG GAAGGGGCCG GATATCGCCC GCGTCACCAA TCTGAAGGCG CTGGCCCAGC ACTGGCTCGA TCTTCGCCCG CTCCTTGCCG ATGCGAAATA TTGGGACGAC AATTTCGGCG CCCAGGCCGA CTGGATGCGC CCCGACGGCT CGAACGCCAT CACCGGCTTC ATGACGCAGC TGACGCTGAC CGGCGGCTTC GCCAACAAGA CGCTGTTCGA TCAGGCCGGC GTCGAAATTC CCGGCCCGAA AGCCACCTGG GATGATTGGG CGGCGGCCGC CAAGAAGGTC GCCGACAGTC AGAAGGTCTT CGCCATGGCG ATTGACCGCT CCGGTCATCG CGTCTCCGGC CCGAACATCT CCTACGGCGC CAATTATATC GCCGCCGACG GCAAGCCGGC GCCGATCGAT CAGGGCGCCA AGGACTTCCT CAGCCGCTTC GTCAAATGGA ACGAGGAGGG CATCGTCAAC AAGGATGTCT GGGTCAGCGC CGCCGGCACC ACCTATCGCG CTGCCGCCGA AGACTTCATC AATGGCGGCC TTGCCTATTA TTATTCCGGC AGCTGGCAGG TCCCGGGCTT TGCCCAGAAG ATCGGCGATA ATTTCGATTG GGTCATGACC GGAAGCCCCT GCGGCACGGC CAGCTGCACC GGCATACAGG GCGGCGCCGC TCTTGTCGCC GTCAAATACA CCAAGAACCC CAAGGACGTC GCCAAGGTGA TGGATTACCT GGCAGGTGCC GACGTGCAGA AGGAATTCGC CGAGCGCAGC CTGTTCATTC CAGCGCATAA GGGTGTCGCC GCCGGCCAGG TGGACTTCAA GACCGACAAT CCGCATGTGC AGGCGGCGCT GAAGGCCTTC GTCGAAGCGG CCGGCCAGAC GGCGGCACCG GCCATGAAGC TGCCGGGCTG GAAGTGGTCG GATGCCTATT ACAGCGCCAT CGTCGCCCGC ATCAGCCAGG TGATCGCCGG CGAGATGAAG CTCGACGACG CCTATGCCCG CATCGACGAG GACATCAAGG CCAAGGTCGG CGCCAACTAA
|
Protein sequence | MTRIQSIGAA FAAVLLSSVA AHAGDVRIMW YSDGGEGAVI KDLLARFSKA NPDVNVILDE VSYDVVKEQL PVQLEAGKGP DIARVTNLKA LAQHWLDLRP LLADAKYWDD NFGAQADWMR PDGSNAITGF MTQLTLTGGF ANKTLFDQAG VEIPGPKATW DDWAAAAKKV ADSQKVFAMA IDRSGHRVSG PNISYGANYI AADGKPAPID QGAKDFLSRF VKWNEEGIVN KDVWVSAAGT TYRAAAEDFI NGGLAYYYSG SWQVPGFAQK IGDNFDWVMT GSPCGTASCT GIQGGAALVA VKYTKNPKDV AKVMDYLAGA DVQKEFAERS LFIPAHKGVA AGQVDFKTDN PHVQAALKAF VEAAGQTAAP AMKLPGWKWS DAYYSAIVAR ISQVIAGEMK LDDAYARIDE DIKAKVGAN
|
| |
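The gene and protein sequences above can be cross-checked against each other. The sketch below translates the first and last blocks of the gene sequence with the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in which start codons it permits). The helper names `clean`, `translate`, and `gc_content` are hypothetical and written for this illustration only. Only short excerpts of the sequence are used here; the full sequence can be passed in the same way.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_content(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last nine bases of the gene sequence in this record.
gene_start = clean("ATGACCAGAA TACAATCGAT CGGTGCGGCT")
gene_end = clean("GCCAACTAA")

print(translate(gene_start))  # compare with the start of the protein sequence
print(translate(gene_end))    # compare with the end of the protein sequence
print(round(gc_content(gene_start), 2))  # GC fraction of the excerpt only
```

The excerpt from the start of the gene translates to MTRIQSIGAA, matching the first block of the protein sequence, and the final nine bases translate to AN followed by a stop, matching the protein's last two residues. The GC content field (63%) refers to the whole gene, so the value for a short excerpt is not expected to match it.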