Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6572 |
Symbol | |
ID | 6983641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | + |
Start bp | 246854 |
End bp | 248086 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643399567 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002284323 |
Protein GI | 209552408 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00103217 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.480926 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATCC GCAAATATGC AATTCTCGGC GCTCTGGCGC TTGCAGGCGT TTCACTCTTC GGTCTTTCGG CCAAGGCCGA GGACGTCACG CTGACGCTCT GGTCGCTGGA TAAAGACACC CAGCCGGCGC CCAATCTCGT CAAGGAATTC AACGCCCAGA ACAACGGCAT CAAGATCGAA TACCGGCTGA TCCAGTTCGA CGACGTCGTC ACCGAGGCCA TGCGCGCCTA TGCCACCGGC CAGGCGCCCG ATATCATCGC CGTCGACAAT CCGGAGCATG CGATGTTCTC ATCGCGCGGC GCCTTCCTCG ACCTCACCGA CATGATCGCC AAGTCGACCG TCATCAAGCC GGACAACTAT TTCCCCGGGC CGCTGAAATC GGTCGAATGG GAGGGCAAAT ATTTCGGCGT GCCGAAGGCG ACGAACACGA TCGCGCTCTA CTACAACAAG GACATGTTCA AGGCCAAGGG CCTCGATCCG AACAAGCCGC CGCAGACCTG GGACGAACTG GTCGAGGATG CGCGCAAGCT GACCGACCCC GCCAAGAACG TCTACGGCCT GGCTTTTTCG GCCAAGGCCA ACGAGGAGGG GACCTTCCAG TTCCTGCCCT GGGCGCAGAT GGGCGGCGGC AGCTATGAGC ACATCAATGC CGACGGCGCG GTGAAGGCGC TCGGGATCTG GAAGACGATC ATGGACGAGA AGCTCGCTTC TCCCGATACG CTGACACGCG GCCAGTGGGA TTCCACAGGC ACGTTCAACT CCGGCAATGC GGCCATGGCG ATCTCCGGTC CCTGGGAACT CGACCGCATG ACGCAGGAAG CGAAATTCGA CTGGGGCGTC ACCCTGCTTC CGGTTCCGAA GGAAGGGGCG GAACGATCCT CGGCCATGGG CGACTTCAAC TGGGCGATCT TCGCCAGCAG CAAACATCCG GCCGAAGCCT TCAAGGCGCT CGAATATTTC GCCTCGCAGG ACGACAAGAT GTTCAAGAAT TTCGGCCAGC TTCCGGCCCG TTCCGACATC TCCATCCCCG AGAGCGGCCA GCCGCTGAAG GACGCCGCCC TCAAGGTCTT CCTCGAACAG CTGAAATACG CCAAGCCGCG CGGGCCACAC CCGCAATGGC CGAAGATCTC CAAGGCGATC CAGGACGCCA TCCAGGCGGC GCTGACCGGC CAGATGAGCC CGAAGGACGC GCTCGACCAG GCCGCGGATA AGATCAAGGC TGTTCTAGGC TAG
|
Protein sequence | MAIRKYAILG ALALAGVSLF GLSAKAEDVT LTLWSLDKDT QPAPNLVKEF NAQNNGIKIE YRLIQFDDVV TEAMRAYATG QAPDIIAVDN PEHAMFSSRG AFLDLTDMIA KSTVIKPDNY FPGPLKSVEW EGKYFGVPKA TNTIALYYNK DMFKAKGLDP NKPPQTWDEL VEDARKLTDP AKNVYGLAFS AKANEEGTFQ FLPWAQMGGG SYEHINADGA VKALGIWKTI MDEKLASPDT LTRGQWDSTG TFNSGNAAMA ISGPWELDRM TQEAKFDWGV TLLPVPKEGA ERSSAMGDFN WAIFASSKHP AEAFKALEYF ASQDDKMFKN FGQLPARSDI SIPESGQPLK DAALKVFLEQ LKYAKPRGPH PQWPKISKAI QDAIQAALTG QMSPKDALDQ AADKIKAVLG
|
| |