Gene Information |
Locus tag | Rleg2_4695 |
Symbol | |
ID | 6977789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 336422 |
End bp | 337720 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643393868 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002278686 |
Protein GI | 209546768 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
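The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(336422, 337720)
print(length, protein_length(length))  # prints: 1299 432
```

Because the gene lies on the minus strand, the start and end values are positions on the replicon, while the gene sequence listed in the record already reads in the coding direction (it begins with ATG).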
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.52565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGCG CCACCGTCAG CGCATTTGCG CTGAGCACAA TGTTATTTTC AGCATCCGGT TCGTATGCCC AGGAACTTGC AACCAAGGAC AGGATCGGCC TTGCCGATGC GCCAAAATCC CTCGTCGTCC GTCTGACCAA CGACAGCCCG AACAATGCGG ATCCGGCGAT CGCCGAGGGC TATCAAAAGC TCTTCGTCGA CTTCATCAAA AAGCATCCCG ACTGGAAATT GCAGATGCAA TTCATGTCGT CTGATATCGG CACCGAACAG GCCAAGATGC TGGAGCAGGC CAAGGCCGGC AATGCGCCCG ATTGCGCCGC CGTCGACTCC TTCGTGCTCT CGCAGTTCAT GGTCAATCAT GTGCTGGCGG ACTTCACGCC CTATTTCTCG AAGGAAGAAG TAGACGACCT CTTCCCCTTC ATCCGCAACG GCATCACCGA CAAGGACAAG ACGGTGCGCG CCTGGTGGTG GGATACCGAC CTTCGTGTGC TCTACCGCAA CAAGTCTGTG GTCGCAGATG CGCCGCAGAC CTGGGATGAT CTGAAAAAGG CCGCGCTTGC CTCCACCAAG GAGGGCATGG AAGGCGTGCT CTTCAACGGC GGGCGCTGGG AAGGCACGAC CTTCGACTGG CTGGCGAACT ATTGGGCGCT CGGCGGCAAG CTCGTCGACG ACTCGGGCAA ACCGGTCTTC GGCGAAGGCG AGAACAAGGA GAAATTCCTG AAGGCGCTGA ACTATTTCAA GGATCTCGTC GATTCCGGCG CGGCCCCCAA GCGCGTCAGC ACGATCGCCA ATTACGACGA CATGAATGCC GCGGCAGCTG CCGCGACCAC AGCCCTCTTC ATCGGCGGCA ATTGGCAATA TGCCCAGCTG AAGTCCACGC TTGACGAAGA CGAGTTCAAG AACTGGACCT TCTCGCCGAT CCCCGGCCCG ACGGCCGATC AACGTTCGAC AGGCACCGGC GGCTGGACGA TCGCCTCGTT CAGCAAGGAC AAGGACAAGG TCGAGATGTG CGCCAACCTC GCACGCGAGG TCTATATGGG GCCGGCCAAC GCGCTGCAGC AGCAGCTGCC GACCCGCAAA TCGCTGTTCG ACAAGTACGA GGTCTTCTCG ACGGAAGCCA ATAAGACCTT CGCCAAGGCT CTGGTCGACG GACAGGCGCG CCCCGGCGTG CCGATCTATC CAGAGATCTC GAACCAGATC CAGATCATGA TGGGTGACGT GCTCTCTGGG ACTAAGAAAC CGGAAGAAGC GCTGGATGCC GCCTTCAATG CGGCGATGGA GGCCTACAAG CGTCTGTGA
|
Protein sequence | MKSATVSAFA LSTMLFSASG SYAQELATKD RIGLADAPKS LVVRLTNDSP NNADPAIAEG YQKLFVDFIK KHPDWKLQMQ FMSSDIGTEQ AKMLEQAKAG NAPDCAAVDS FVLSQFMVNH VLADFTPYFS KEEVDDLFPF IRNGITDKDK TVRAWWWDTD LRVLYRNKSV VADAPQTWDD LKKAALASTK EGMEGVLFNG GRWEGTTFDW LANYWALGGK LVDDSGKPVF GEGENKEKFL KALNYFKDLV DSGAAPKRVS TIANYDDMNA AAAAATTALF IGGNWQYAQL KSTLDEDEFK NWTFSPIPGP TADQRSTGTG GWTIASFSKD KDKVEMCANL AREVYMGPAN ALQQQLPTRK SLFDKYEVFS TEANKTFAKA LVDGQARPGV PIYPEISNQI QIMMGDVLSG TKKPEEALDA AFNAAMEAYK RL
|
| |
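The sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that formatting, compute GC content, and translate the gene sequence for comparison with the listed protein. It assumes the gene starts with ATG (as it does here), so the alternative start codons allowed by translation table 11 need no special handling; tables 1 and 11 share the same codon-to-amino-acid assignments. All function names are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (shared by translation tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = clean_sequence("ATGAAGAGCG CCACCGTCAG CGCATTTGCG")
print(translate(first_blocks))  # prints: MKSATVSAFA
```

Passing the full gene sequence through `clean_sequence` and `translate` should reproduce the cleaned protein sequence, and `gc_fraction` on the same string can be compared against the GC content field.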