Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_4854 |
Symbol | |
ID | 8007242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | - |
Start bp | 232142 |
End bp | 233329 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644821784 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002973044 |
Protein GI | 241113209 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.011677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.12131 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTCG AGTTGACATT GAATGAGACC TCACTGACGG ACGACACGGC GACCTCGTGG TCTGCTGTCA TCTGTCTCAC GCTTCTTACT TTCCTTCTGG TGGGGTTGGA GTTTCTCCCT GTGAGCCTTC TGACCCCGAT CGCCAGGGAC CTCTCGGTGT CAGAGGGGCA GGCCGGATTG GCGATCACCG TGTCGGGGGT CTTCGCGGTT GTCACCAGCC TTTTCGGCAA CGCCTTCCTG GCGAAGATCG ACCGGAAGTC TGTCTTCTTG CTTTATACCG CGGTTCTTGT TGTATCGAGC TTGGCGGTTG CCCTCGCACC CAACTTCCTT GTCTTTCTCG TCGGGCGGTC CCTGGTCGGC GTGTCGATCG GCGGCTTCTG GTCGCTGTCG ACGGCTATTC TGGCACGCCT GACGTCAGAC CGTGACCTGC CCAAGGCCAT TGCGCTTCTT CAAGGTGGCA CCGCATTCGC CCTCGTCCTT GCCGCGCCGC TCGGCAGTTT TCTTGGCGGG TTGATCGGAT GGCGCGGAAC CTTCTTCATC ACGGTACCGA TTGGATTTGC CGCGCTCGTC TGGCAACTGG TCGTCCTGCC GAGGATGCCG GCGACATCAA CCGTCTCGGT GGCCAGAATA TTCGGGCTGC TGCGCAATCG CACATTCGCG ATTGGAATGG CGGCGACCGC TCTCGCCTTC ATCGGCCAGA ACGCGCTGTC TATTTATCTT CGTCCGTTCC TCGAAGGCGT CACAGGACTG GAATTGGATG TTCTGTCCAT GGTGCTTCTC GGCCTCGGCG TCGGCGGACT GGCTGGAACC TCCGTCATTG GCTTCGCCGC CCGGCGCCAC CTCCTCTCCG TTCTCGTAGG CCTGCCGGCT GCTCTTTCGG TCCTTGCCCT GCTGCTGATC GCTCTCGGGC CGTTCGCGGC GGTTACCGCA TCCCTGCTTG TCATGTGGGG ATTTTTCTCG ACGCCGATTC CGGTCGCCTG GAACACCTGG ATGGCCGCTA TCGTCCCCGG TGAGTTGGAA GCGGCGGGTG GGCTGCAGGT GGCGCTGATC CAACTTGCCA TTGCCGGCGG CGCTTTCGCT GGCGGCATGC TGTTCGACAC CGTGGGATGG TGGAGCACCT TCCTTCTGGC TGCCTGCCTT CTTGCCGGTT CGGCAGTCCT TGCCGCGCTC GCCGGTCGCC GTTCCTGA
|
Protein sequence | MSVELTLNET SLTDDTATSW SAVICLTLLT FLLVGLEFLP VSLLTPIARD LSVSEGQAGL AITVSGVFAV VTSLFGNAFL AKIDRKSVFL LYTAVLVVSS LAVALAPNFL VFLVGRSLVG VSIGGFWSLS TAILARLTSD RDLPKAIALL QGGTAFALVL AAPLGSFLGG LIGWRGTFFI TVPIGFAALV WQLVVLPRMP ATSTVSVARI FGLLRNRTFA IGMAATALAF IGQNALSIYL RPFLEGVTGL ELDVLSMVLL GLGVGGLAGT SVIGFAARRH LLSVLVGLPA ALSVLALLLI ALGPFAAVTA SLLVMWGFFS TPIPVAWNTW MAAIVPGELE AAGGLQVALI QLAIAGGAFA GGMLFDTVGW WSTFLLAACL LAGSAVLAAL AGRRS
|
| |