Gene Information |
Locus tag | Rleg2_2685 |
Symbol | |
ID | 6981429 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2725778 |
End bp | 2726998 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643397398 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002282182 |
Protein GI | 209550265 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
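The coordinate and length fields in the table above are mutually consistent, which the following minimal sketch checks. It assumes 1-based inclusive coordinates (length = end - start + 1) and a complete CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count for a complete CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length = gene_length(2725778, 2726998)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both results match the reported gene length (1221 bp) and protein length (406 aa).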
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.188987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATAT CCCAAGGCCC GCGGAGCGTT CTTCGCCACC CCGGCTATCT GAACTTCGCT GCCTCCCGCG TCTTTTCCTC GCTTGCCTTC CAGTCCATCG GCATCGCCAT GGGCTGGATG ATCTATGATC AGACGCACAG CGCCTTTGCG CTCGGCCTCG TCGGCCTCTG CCAGTTCCTG CCGATGGCGG TGCTGACCTT TGTCGTCGGC CATGTCGCCG ACCGGTTCGA CCGGCGGCGT ATCGGCCTCA TCTGCCAGCT GATCGAGGCG GTGACGGCGC TGGTGCTGGC GGTTGCCACC TGGCAGCAAT GGCTGACGCC TTCAGGCATC CTTGTCGCCG TCACGGTGCT TGGCGCCGTT GTCGCCTTCG AGCGGCCGAC CATGGCGGCT CTGCTGCCGA ACATCGTGCC GGCTTCGATG CTGCAGAAGG CGGTCGCCAC CTCCACCTCG CTGATGCAGA CGGCGCTGAT CATCGGCCCC TCGCTCGGCG GCTTGCTCTA CGGCCTCCAC CCGGTGGCGC CCTTTGCCGT GTCGGCGCTG CTCTTTGCCG TTGCAAGCTT CAACGTCATC TCGATCCGCA TGCAGTGGGC GCCTTCCAAA CGCGAGCCGG TGACGCTCGC CTCGGTCTTT GCCGGCGTCT CCTTCATCCG CAGCCGGCCG GTGATGCTCG GCACGATCTC GCTCGATCTC TTCGCGGTGC TGCTCGGCGG CGCCACGGCA CTGCTGCCGA TGTTTGCCCG CGATATCCTG CATGCCGGTC CCTGGGAGCT CGGCCTTTTG CGCGCCGCAC CGGCGATCGG CGCGCTTGCC ATGTCGATCG TGCTCGCCCG CCGGCCGCTC GAGAGCAATG TCGGGCGCAA GATGCTTGCT GCCGTCGCCG TGTTCGGCCT CGCCACCATC GTCTTTTCGC TGTCCACCAA CATCACGCTT TCGGTCGCTG CCCTGCTTGT TGTCGGCGCG TCGGATACGG TCAGCGTCGT CGTGCGCAGT TCGCTGGTGC AGCTTCTGAC GCCGGATGAG ATGCGCGGCC GCGTCAGCGC GGTCAACTCG CTGTTCATCG GCACCTCCAA CCAGCTCGGC GAATTCGAAT CCGGCATGAT GGCGGCAGCC CTCGGGCCGG TCGCCACCGG CATCGTCGGC GGGCTCGGCA CGATCATCGT CGTGCTCCTG TGGATGAGGC TCTTCCCCGA TCTTACCAAG GTCAAGACGC TGCAGGGCTG A
|
Protein sequence | MDISQGPRSV LRHPGYLNFA ASRVFSSLAF QSIGIAMGWM IYDQTHSAFA LGLVGLCQFL PMAVLTFVVG HVADRFDRRR IGLICQLIEA VTALVLAVAT WQQWLTPSGI LVAVTVLGAV VAFERPTMAA LLPNIVPASM LQKAVATSTS LMQTALIIGP SLGGLLYGLH PVAPFAVSAL LFAVASFNVI SIRMQWAPSK REPVTLASVF AGVSFIRSRP VMLGTISLDL FAVLLGGATA LLPMFARDIL HAGPWELGLL RAAPAIGALA MSIVLARRPL ESNVGRKMLA AVAVFGLATI VFSLSTNITL SVAALLVVGA SDTVSVVVRS SLVQLLTPDE MRGRVSAVNS LFIGTSNQLG EFESGMMAAA LGPVATGIVG GLGTIIVVLL WMRLFPDLTK VKTLQG
|
| |
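The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below is a hedged example of doing that and computing GC content; the helper names are hypothetical, and it is applied only to the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence shown above (illustrative subset only).
first_blocks = "ATGGACATAT CCCAAGGCCC GCGGAGCGTT"
seq = clean_sequence(first_blocks)
print(len(seq))                   # 30
print(round(gc_percent(seq), 1))  # 60.0
```

Running `gc_percent` on the full cleaned gene sequence would give the value to compare against the "GC content" field.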