Gene Information |
Locus tag | Rleg2_4783 |
Symbol | |
ID | 6977877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 417670 |
End bp | 418674 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643393947 |
Product | ABC sugar transporter, periplasmic ligand binding protein |
Protein accession | YP_002278765 |
Protein GI | 209546847 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
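
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(417670, 418674)
print(length_bp)                  # 1005
print(protein_length(length_bp))  # 334
```

Both values match the Gene Length and Protein Length fields of this record.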
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.121475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCA CAGGTCTCGG ACCCCATGGA GAGCGTGCGG CAGCACCCGA ACGGGTCCTC CTCTTGCCAG AAGACATGGC GCGCGCACGA GCTGGAGGGC TCCGCGTCGC GGTCGTCCTG CACACGACCG CCAGCGACTG GGCAAAGCAA CTGCTAGCCG GGCTGGTCGG CACGCTTGGT GCATGTGGGG TTGTGGTCAT CGACGTCGCC GACTGCGCCT TTGAACCAGA CACGCAAATC CAAGCACTTG ACCGGCTGGT CCGCGAAGCG CCCGATGCAA TAATCTCCCT GCCCGTCGCT AACGCCAAAG TCGCGGAAGC CCACAAGCGT GTCGGCGCCG CCGGCATAAA ACTCGTGCTG CTAGACAACG CGCCAACTGG CCTCTTGCCG ACTAAGGACT ACGTCTCACT TATTTCAGCC GATAATTTCG GCCTTGGAAA AATGGCTGCT GAGAGCCTGT CTCCCCATCT CCAAGTCGGG GCACAGGTAG GCGTTCTCGG TTATGCCGCG GACTTCTTTG CTACAAATGA GCGGGAAATC GCCTTCGTGA AATGGATGGG AATTAATCGC CGTGACCTGA AGATTACGAT TCGCCGGTTC TCGGCACTGT CGGACGCCGC ATCCACCGCA GAAGGGCTCC TGAGGCATAT CCCCGAGCTC GATGGCCTCT TTGTGGTTTG GGACACGCCT GCCATTGCAG CGGCAAGCGC GATCGTGGAA GCTGGGATGA CGATTCCGAT CGCGACCATC GATCTTGGCC GAGATGCGGC CATCGAACTT GCTGCCGGCG GGCCCATAGT AGCGATAGCG GCCCAGCAAC CCTTCAGACA GGGCCAGACT GCGGCGTCGA CAACGGTGAC TTCACTACTT GGTCGGCTTC CACCAGCGTG GGTGGCGCTA CCGGGTTTGG CGGTTACGCC AGACAATGTG GTGGAATCCT TCCAGACGGT CTGGCAGACA TCCGCGCCGC GGGAGCTGCT GCGTCGCAAG AAGCTGGTGC GATGA
|
Protein sequence | MTITGLGPHG ERAAAPERVL LLPEDMARAR AGGLRVAVVL HTTASDWAKQ LLAGLVGTLG ACGVVVIDVA DCAFEPDTQI QALDRLVREA PDAIISLPVA NAKVAEAHKR VGAAGIKLVL LDNAPTGLLP TKDYVSLISA DNFGLGKMAA ESLSPHLQVG AQVGVLGYAA DFFATNEREI AFVKWMGINR RDLKITIRRF SALSDAASTA EGLLRHIPEL DGLFVVWDTP AIAAASAIVE AGMTIPIATI DLGRDAAIEL AAGGPIVAIA AQQPFRQGQT AASTTVTSLL GRLPPAWVAL PGLAVTPDNV VESFQTVWQT SAPRELLRRK KLVR
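
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a sketch under stated assumptions rather than the method the database itself uses: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), it strips the 10-base spacing used in the record, and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. An alternative start codon such as GTG would be translated literally here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in the conventional NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the gene sequence above.
print(translate("ATGACGATCA CAGGTCTCGG ACCCCATGGA"))  # MTITGLGPHG
# Final four sense codons plus the stop codon.
print(translate("AAGCTGGTGC GATGA"))                   # KLVR
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the protein sequence and the GC content field reported in the record.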