Gene Rleg_2943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2943 
Symbol 
ID8013869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2934758 
End bp2935978 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content65% 
IMG OID644825513 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002976741 
Protein GI241205645 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.747804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATAT CCCAAGGACC GGGGAGCGTT CTTCGCCACC CCGGCTATCT GAACTTCGCT 
GCCTCCCGCG TTTTTTCCTC GCTCTCCTTC CAGTCCATTG GCATCGCCAT GGGGTGGATG
ATCTACGATC AGACACACAG CGCCTTTGCG CTCGGCCTAG TCGGCTTGTG CCAATTCCTG
CCGATGGCGG TGCTGACCTT CGTCGTCGGC CATGTCGCCG ACCGGTTCGA CCGGCGGCGC
ATCGGGCTCG TCTGCCAGTT GATCGAAGCG GTGACGGCGC TGGTGCTGGC GGTTGCCACC
TGGCAGCAAT GGCTGACGCC GGCCGGCATC CTTGCCGCCG TCACCGTGCT CGGCGCCGTC
GTCGCCTTTG AACGGCCGAC CATGGCGGCG CTGTTGCCGA ACATCGTGCC GGCCTCGATG
TTGCAGAAGG CGGTCGCAAC CTCGACCTCG TTGATGCAGA CGGCGATGAT CATCGGCCCC
TCGCTCGGAG GCCTGCTTTA TGGCCTCCAC CCCGTCGCGC CTTTTGCTAT ATCGGCGCTG
CTCTTTGCCG TCGCAAGCTT CAACGTCATC TCGATCCGCA TGCAATGGAG CCCTGCCAAG
CGTGAGCCGG TGACGCTCGC CTCGGTCTTT GCCGGCGTCT CCTTCATCCG CAGCCGGCCG
GTGATGCTCG GCACGATCTC GCTCGATCTC TTCGCGGTGC TGCTCGGCGG CGCGACTGCG
CTGCTGCCGA TGTTCGCCAG CGATATCCTG CATGCCGGTC CCTGGGGCCT CGGTTTTCTG
CGCGCGGCCC CGGCGGTCGG CGCGCTTGCC ATGTCGATCA TGCTCGCTCG CCGGCCACTG
AGCAGCAATG TCGGTCGCAA GATGCTCGCC GCCGTCGCCG TGTTCGGCGT CGCCACCATC
GTCTTCTCGC TGTCGACCAA TATCGCGCTT TCCGTCGTCG CGCTGCTCGT CATCGGCGCG
TCCGATACGG TGAGCGTCGT CGTGCGCAGC TCGCTGGTGC AGCTTTTGAC GCCGGACGAG
ATGCGTGGTC GTGTCAGTGC CGTCAACTCG CTGTTCATCG GCACCTCCAA CCAGCTCGGC
GAATTCGAAT CCGGCATGAT GGCGGCGGCG CTTGGGCCGG TCGCCACCGG CATCGTCGGC
GGCTTCGGCA CGATCGTCGT CGTGCTCCTG TGGATGCGGC TCTTCCCCGA TCTTACCAAG
GTCAAGACGC TGCAGGGTTA G
 
Protein sequence
MDISQGPGSV LRHPGYLNFA ASRVFSSLSF QSIGIAMGWM IYDQTHSAFA LGLVGLCQFL 
PMAVLTFVVG HVADRFDRRR IGLVCQLIEA VTALVLAVAT WQQWLTPAGI LAAVTVLGAV
VAFERPTMAA LLPNIVPASM LQKAVATSTS LMQTAMIIGP SLGGLLYGLH PVAPFAISAL
LFAVASFNVI SIRMQWSPAK REPVTLASVF AGVSFIRSRP VMLGTISLDL FAVLLGGATA
LLPMFASDIL HAGPWGLGFL RAAPAVGALA MSIMLARRPL SSNVGRKMLA AVAVFGVATI
VFSLSTNIAL SVVALLVIGA SDTVSVVVRS SLVQLLTPDE MRGRVSAVNS LFIGTSNQLG
EFESGMMAAA LGPVATGIVG GFGTIVVVLL WMRLFPDLTK VKTLQG