Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6156 |
Symbol | |
ID | 8016169 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | + |
Start bp | 202415 |
End bp | 203647 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644827462 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002978662 |
Protein GI | 241258778 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.889741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.000305646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCTATCC GCAAATATGC AATTCTGGGC GCTCTCGCAC TTGCAGGCGT CTCGCTGTTC GGTCTTTCGG CCAAGGCCGA AGACGTCACA CTGACGCTCT GGTCGCTGGA TAAGGACACC CAGCCGGCGC CGAACCTCGT CAAGGAGTTC AACGCCCAGA ACAACGGCAT CAAGATCGAA TATCGGCTGA TCCAGTTCGA CGACGTCGTC ACCGAGGCGA TGCGTGCCTA TGCGACCGGC CAGGCGCCCG ACATCATCGC CGTCGACAAT CCGGAGCATG CGCTGTTTTC GTCGCGCGGC GCCTTCCTGG ATCTTACCGA CATGATCGCC AAGTCGACCG TCATCAAGCC GGAGAATTAT TTCCCCGGCC CGCTGAAATC GGTCGAGTGG GACGGCAAGT ATTTTGGCGT GCCGAAGGCG ACCAATACGA TCGCGCTTTA CTATAACAAG GACATGTTCA AGGCCAAGGG CCTCGACCCG AACAAGCCAC CGCAGACTTG GGACGAGCTC GTCGAGGACG CGCGTAAGCT GACCGACCCC GCCAAGAACG TCTATGGTCT CGCCTTCTCG GCCAAGGCCA ACGAGGAGGG CACCTTCCAG TTCCTTCCCT GGGCTCAGAT GGGCGGCGGC AGCTATGAGA ACATCAATGC CGAAGGCGCG GTGAAGGCGC TCGGGATCTG GAAGACGATC ATGGACGAGA AGCTCGCTTC TCCCGACACC TTGACGCGCG GCCAGTGGGA TTCGACCGGC ACCTTCAATT CCGGCAATGC GGCAATGGCG ATCTCGGGCC CGTGGGAGCT CGACCGCATG ACGCAGGAAG CGAAGTTCGA CTGGGGCGTC ACGCTGCTCC CGGTTCCCAA GGAAGGGGCT GAACGCTCCT CGGCCATGGG CGACTTCAAC TGGGCGATCT TCGCCACCAG CAAACATCCG GCCGAAGCCT TCAAGGCGCT CGAATATTTC GCCTCGCAGG ACGACAAGAT GTTCAAGAAC TTCGGCCAGC TTCCGGCCCG TTCCGACATC TCGATCCCCG AGACCGGCCA GCCGCTGAAG GATGCAGCCC TCAAGGTCTT CCTCGAACAG CTGAAATACG CCAAGCCGCG CGGCCCGCAT CCGCAATGGC CGAAGATCTC CAAGGCGATC CAGGACGCTA TCCAGGCAGC ACTCACCGGC CAGATGAGCC CGAAAGACGC GCTCGACCAG GCAGCCGACA AGATCAAGGC AGTACTAGGC TGA
|
Protein sequence | MAIRKYAILG ALALAGVSLF GLSAKAEDVT LTLWSLDKDT QPAPNLVKEF NAQNNGIKIE YRLIQFDDVV TEAMRAYATG QAPDIIAVDN PEHALFSSRG AFLDLTDMIA KSTVIKPENY FPGPLKSVEW DGKYFGVPKA TNTIALYYNK DMFKAKGLDP NKPPQTWDEL VEDARKLTDP AKNVYGLAFS AKANEEGTFQ FLPWAQMGGG SYENINAEGA VKALGIWKTI MDEKLASPDT LTRGQWDSTG TFNSGNAAMA ISGPWELDRM TQEAKFDWGV TLLPVPKEGA ERSSAMGDFN WAIFATSKHP AEAFKALEYF ASQDDKMFKN FGQLPARSDI SIPETGQPLK DAALKVFLEQ LKYAKPRGPH PQWPKISKAI QDAIQAALTG QMSPKDALDQ AADKIKAVLG
|
| |