Gene Information |
Locus tag | Rleg_1885 |
Symbol | |
ID | 8015613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1871848 |
End bp | 1873107 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644824474 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002975706 |
Protein GI | 241204610 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
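The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1871848
end_bp = 1873107
reported_gene_length = 1260     # bp
reported_protein_length = 419   # aa

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1260 419
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the computed values match the reported gene length and protein length.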
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.35457 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAA TGAAATCGAT CGGCGCGGCT TTTGCTGCAA TTCTCTTGAG TTCCGTTGCC GCCCATGCCG GCGACGTGCG CATCATGTGG TATTCCGACG GCGGCGAAGG CGAGGTGATC AAGGACCTGC TCTCGCGCTT CTCCAAGGCC AATCCTGACG TCAACGTCAT CCTCGACGAA GTCTCCTACG ACGTCGTCAA GGAACAGCTG CCGGTTCAGC TCGAAGCCGG GCAGGGGCCG GATATCGCCC GCGTCACCAA TCTGAAGGCG CCGGCCCAGC ACTGGCTCGA TCTTCGGCCC TACCTCACCG ACGCGAAATA CTGGGAGGAC AATTTCGGCG CCCAGGCCGA CTGGATGCGT CCCGACGGCT CGAACGCCAT CACCGGCTTC ATGACGCAGC TGACCTTGAC CGGCGGCTTC GTCAACAAGA CGCTGTTCGA GCAGGCCGGG GTCGAAATTC CCGGCCCGAA AGCCACCTGG GACGATTGGG CGGCGGCGGC CAAGAAGGTT GCCGACAGCC AGAAGGTCTT CGCCATGGCG ATCGACCGCT CCGGCCACCG CGTCTCCGGC CCGAACATCT CTTACGGCGC CAACTACATC GCCGCCGACG GCAAGCCGGC GCCGATCGAT CAGGGCGCCA AGGACTTCCT CAGCCGCTTC GTCAAGTGGA ACGAGGACGG CACCATCAAC AAGGATGTCT GGGTGAGTGC TGCCGGCACC ACTTACCGCT CCGCCGCCGA GGACTTCATC AATGGCGGCC TTGCCTATCT TTATTCGGGC AGTTGGCAGG TTTCGGGCTT CGCCCAGAAG ATTGGCGACA ATTTCGACTG GGTGATGGCG GGCAGTCCCT GCGGTTCTGT CGCATGCTCC GGCATGCAGG GCGGCGCCGG TCTGGTGGCC GTCAAATACA CCAAGAACCC GAAGGACGTC GCCAAGGTGA TGGATTACCT GGCAGGTGCC GACGTGCAGA AGGAGTTTGC CGAACGCAGC CTGTTCATTC CGGCGCATAA GGGCGTCGCC GCCGGCCAGA TGGACTTCAA GACCGACAAT CCGCATGTGC AGGCGGCGCT GAAGGCCTTC GTCGAAGCGG CCGGCCAGAC GGCGGCACCC GCCATGAAGC TGCCGGGCTG GAAGTGGTCG GATGCATATT ACAGCGCCAT CGTCGCCCGC ATCAGCCAGG TGATCGCCGG CGAAATGAAG CTCGACGACG CCTATGCCCG CATCGACGAG GACATCAAGG CCAAGGTCGC CGGCAACTGA
|
Protein sequence | MTRMKSIGAA FAAILLSSVA AHAGDVRIMW YSDGGEGEVI KDLLSRFSKA NPDVNVILDE VSYDVVKEQL PVQLEAGQGP DIARVTNLKA PAQHWLDLRP YLTDAKYWED NFGAQADWMR PDGSNAITGF MTQLTLTGGF VNKTLFEQAG VEIPGPKATW DDWAAAAKKV ADSQKVFAMA IDRSGHRVSG PNISYGANYI AADGKPAPID QGAKDFLSRF VKWNEDGTIN KDVWVSAAGT TYRSAAEDFI NGGLAYLYSG SWQVSGFAQK IGDNFDWVMA GSPCGSVACS GMQGGAGLVA VKYTKNPKDV AKVMDYLAGA DVQKEFAERS LFIPAHKGVA AGQMDFKTDN PHVQAALKAF VEAAGQTAAP AMKLPGWKWS DAYYSAIVAR ISQVIAGEMK LDDAYARIDE DIKAKVAGN
|
| |
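The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to build the record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, the codon table is written out by hand, and only the first 30 bases of the gene are used. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignments are used here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order; table 11 shares them.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into blocks of ten."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence shown in the record.
gene_prefix = "ATGACCAGAA TGAAATCGAT CGGCGCGGCT"

print(translate(gene_prefix))  # MTRMKSIGAA
print(f"{gc_fraction(gene_prefix):.1%}")
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a figure comparable to the GC content field, although how that field is rounded is not stated in the record.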