Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4026 |
Symbol | |
ID | 8014832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 4102882 |
End bp | 4104132 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826595 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002977806 |
Protein GI | 241206710 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.557727 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGTTTT CGCCCACCGG AGACCGTTTT GCCGCGTTCC GGCACTCGTC CTATACGCGG TTCTTCTTCG CGCGCTTCCT GCTTTCCTTC TCGCAGCAGA TCGTCAGCGT CGCCGTCGGC TGGCAGATGT ACGACCAGAC GGGCAGCGCG ATCTATCTCG GTTTGATTGG TCTCGTGCAG TTCCTGCCGT CGCTGCTGCT CATCCTCGTC ACCGGTTCGG TAGCCGATCG GTACAATCGC CGGGCGATCG CCGCCCTCTG CTCGCTGGTG AGCGCGCTCT GTACGCTGGC ACTGCTGGTT ATGACTTTAA TGGGAAGCTT TACGCCGCTG CCTGTCTTCG CGGTGCTTTT GATCTTCGGC ATCGAGCGCG CCTTCATGTC GCCGGCGGTA CAGTCGCTGG CGCCCAATCT GGTGCCGGAG GAGGCACTCT CCAATGCGAT CGCCTGGAAT TCGTCGTCCT GGCAGCTCGC GGCAATCACC GGACCGGTGC TCGGTGGCCT GCTCTATGGT GTCAGCGCGC CGACTGCCTA TACGGTGGCG GTGATCTTTT CGGTGCTCGG TGCGGCCCTT CTCTACATGA TCCCGAAACC GGTGCAGAAG ACGACCGGCG AGACCAAGAG CTGGGCGATG ATCCTCGGCG GCTTCAGTTT CATCCGTGCC GAAAAGGTGG TGCTCGGGGC GATCTCGCTC GATCTGTTCG CCGTGCTGCT CGGCGGGGCC ACGGCGCTGA TGCCGATTTT TGCGCGCGAT ATCCTCACCC TCGGTCCCTG GGGCCTCGGA CTGCTGCGCG CCGCACCCGG ACTTGGCGCC ATCGTCATGG CGATCTTCCT GGCCGCCTAT CCGCTCAGAC ATCGCGCCGG CATCTACATG TTTATCGGCG TCGCCCTGTT CGGCGTCGGA ACGATCATCT TCGGCATCTC GACCAACACC GAGGTCTCGA TCGCGGCGCT AGCGCTAATG GGGGCGGCTG ACATGGTATC GGTCTATGTG CGCGAGAGCC TGATTGCGCT CTGGACGCCG GATCAGCTGC GCGGCCGCGT CAATGCGGTC AACATGGTCT TCGTCGGCGC TTCGAACGAG CTTGGGGAAT TCAGGGCGGG CACGATGGCG GCGCTCTTCG GCGCTGTGCC GGCGGTCGTC ATCGGCGGAA TCGGGACGCT TGTCGTGGCG GCGATCTGGG CGTCGAGTTT CCCCAAACTG CGCGGGATCG ATACGCTCGA CGCGCCCAGC GCATCGTCGA AATCGATTTA A
|
Protein sequence | MSFSPTGDRF AAFRHSSYTR FFFARFLLSF SQQIVSVAVG WQMYDQTGSA IYLGLIGLVQ FLPSLLLILV TGSVADRYNR RAIAALCSLV SALCTLALLV MTLMGSFTPL PVFAVLLIFG IERAFMSPAV QSLAPNLVPE EALSNAIAWN SSSWQLAAIT GPVLGGLLYG VSAPTAYTVA VIFSVLGAAL LYMIPKPVQK TTGETKSWAM ILGGFSFIRA EKVVLGAISL DLFAVLLGGA TALMPIFARD ILTLGPWGLG LLRAAPGLGA IVMAIFLAAY PLRHRAGIYM FIGVALFGVG TIIFGISTNT EVSIAALALM GAADMVSVYV RESLIALWTP DQLRGRVNAV NMVFVGASNE LGEFRAGTMA ALFGAVPAVV IGGIGTLVVA AIWASSFPKL RGIDTLDAPS ASSKSI
|
| |