Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1064 |
Symbol | |
ID | 8012193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 1037742 |
End bp | 1038968 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644823647 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002974898 |
Protein GI | 241203802 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.391824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAT CCTCCCGCTT CCGCTCGGCG CAGACGGTCA CCGTCCTCGC CGTTACCCAG CTGATCGGCT GGGGCACGAC GTTCGACATG CTCGGGGTCA TGGGCCGTGT CGTCGCGCCG GATCTCGGCC TGGCGAACGA AGTGGCGTTT GCCGGCCTGA CGATCATGAT GGTGGTCAGC GCCATCGTCG GTCCGGCGAC CGGCCGATGG CTCGGCCGCT ATGGTGCTGC CCGTGTGCTT TCGGCCGCCT CGCTGACCTT TGCGCTCGGG CTGCTTCTGC TTGCCGCCGC AAACGGCATC GTGCTCTATG CCAGCGCCTG GGTCATCATC GGCATCGGCG GCGCATTTGG CCTCTCGGCG CCGGCCTATA CCGCCGTCGT CGAGCGCGAA GGAGCAAACG GCAAACGCGT CATCGCCATC CTGATGCTGT TCACCGGGCT TTCGAGCGCC ATCTTCTGGC CGATCCTCAG CCTGCTCAAC GAGGCGGTCG GCTGGCGCCT CACCTTCCTG GTCTGTGCGG CGCTGCAATT CTTCGTCTGT CTGCCGCTGC ATCTCTTAGG CCTGCCGAAG CCGATCGCAA CACATGTCGA AGGCGGCACA GCCGAAATCG CTCCGGTGCC GCTGTCGAAA GCCAAGCAGC GAAAAGCCTT CCTGCTGATC GCCGCGGCGA CGACGATCTC GACCTTCGTC ACCTTCGGAA TCTCGCCATC ACTGCTCGAA ATCTTCCGCC AGTCCGGCGC CTCGCCGGCC TTTGCGCTGC AGCTCGGCTC GGCACGCGGC GTCCTCGGCA TCTCTGCACG TTTCCTCGAC ATGCTGCTCG GCCGGCACGG CAACCCCATG CTCAGCGCGG TCATGGGCAT CAGCCTGATG ATGATCAGTT TCCTGATGAT GCTGGTTGCC AGCCCGTCGA CGCCGCTGCT TGTCACCTTC GTCCTGTTTT ACGGGTTCGG CACCGGGGTC ATGACCGTCG CCCGCGCGCT GCTACCGCTG GCGCTGTTCT CACCGCGCGA ATTCGGACTG CAATCGGCCC GGCTGTCGCT GCCGCAGAAC CTCGCCAACG CCATCGCCCC CGTCATCTTC ACCGCCATCC TCGATCGCGC CGGCACCGGC CCGGCGCTCG CCGCCTGCGC CGTTCTCGCG GCCTTGTCGC TGGCCTTCGT GCTGATGCTG ATGGCGCTGG TGCGCGGTGC CCGCGCATCA GAGTCAGCCA TTCTTAATGT CTCTTGA
|
Protein sequence | MPKSSRFRSA QTVTVLAVTQ LIGWGTTFDM LGVMGRVVAP DLGLANEVAF AGLTIMMVVS AIVGPATGRW LGRYGAARVL SAASLTFALG LLLLAAANGI VLYASAWVII GIGGAFGLSA PAYTAVVERE GANGKRVIAI LMLFTGLSSA IFWPILSLLN EAVGWRLTFL VCAALQFFVC LPLHLLGLPK PIATHVEGGT AEIAPVPLSK AKQRKAFLLI AAATTISTFV TFGISPSLLE IFRQSGASPA FALQLGSARG VLGISARFLD MLLGRHGNPM LSAVMGISLM MISFLMMLVA SPSTPLLVTF VLFYGFGTGV MTVARALLPL ALFSPREFGL QSARLSLPQN LANAIAPVIF TAILDRAGTG PALAACAVLA ALSLAFVLML MALVRGARAS ESAILNVS
|
| |