Gene Rleg2_0835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0835 
Symbol 
ID6979553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp854119 
End bp855729 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content63% 
IMG OID643395546 
Productdrug resistance transporter, EmrB/QacA subfamily 
Protein accessionYP_002280355 
Protein GI209548438 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.861502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACTA TTCAGACCGC CGCAAACAGC AATTCGCTCA CACGCCCCGC CGCGGCGGCG 
CCCGCCGCCA TCAACCCGGT CCGCATGTGG ATGGCCGTCG TCGGCTCCAC CCTCGGCGCC
TTCATGGCGG TCTTGAACAT CCAGATCGTC AATGCCTCGC TGGCCGATAT CCAGGGCGCC
ATCGGCGCCG GCACGGATGA CGGCGGCTGG ATCTCGACCT CCTATCTGAT CGCCGAGATC
GTCGTCATTC CCTTGAGCGG CTGGCTGACG CGGGTCTTCT CGCTGCGCAA CTACCTGCTC
GTCAACGCCA TCCTCTTTCT GATCTTCTCC GTCGCCTGCG CCTTTGCGGC CAATCTGCAG
CAGATGATCG TCCTGCGCGC CATTCAGGGT TTTTCCGGCG GCGTGCTGAT CCCGATGGCC
TTCACCATCA TCATCACGCT GCTGCCCAAG GCGAGGCAGC CGATCGGGCT GGCGCTCTTT
GCCCTTTCGG CCACTTTCGC ACCGGCGATC GGCCCGACCA TCGGCGGTTA TCTCACCGAG
AACTGGGGCT GGGAATATAT CTTCTATGTC AACCTGGTGC CCGGCGCGCT GATGATCGGC
CTGCTCTTTG CCTCGCTCGA CCGGGCGCCG ATGAACCTGA AGCTGCTCGC CAAGGGCGAC
TGGCCCGGCA TCGTCACCAT GGCGATCGGG TTGGCGGGCC TGCAGACGGT GCTGGAAGAG
GGCAACAAGG AAGACTGGTT CGGCTCCGAT TTCATCCTGC GCCTGACTGT TATCGCTGCG
GTTTCGCTGA CCCTGTTCAT TGTCATCGAG CTGAAGACGG CCCATCCGCT CCTGAACCTG
CGTCTGCTGG TGCGCCGCAA TTTCGGCTTC GGCATCGTTG CCAACTTCCT GCTCGGCATC
GCGCTTTACG GCTCGGTCTT CGTGCTGCCG ATCTATCTCA CCCGTATCCA GGGTTATAAT
TCCGAGCAGA TCGGCATGGT GCTCGCGTGG ACCGGCATCC CGCAGCTGCT GCTGATCCCA
CTGGTGCCGC GGCTGATGAA GCGTTTCGAC CTCCGCCTGC TGATCGTCGT CGGCTTTGCC
CTTTTTGCCG CCTCGAATTT CATGAACGTG CATATGACCG GCGATTATGC CAGCGATCAG
CTGTTCTGGC CGAACATCGT GCGCGCCATC GGCCAGGCGC TGGTCTTCAC GCCCCTTTCG
GCGATCGCCA CATCCGGCAT CGAGCCGGAG AATGCCGGTT CGGCGTCGTC GCTGTTCAAC
ATGATGCGCA ATCTCGGCGG CGCGGTCGGC ATCGCCTCGC TGCAGACCTT CCTGTCGAAG
CGCGAGCAGT TCCACTCGAA TATCCTGACC AACTCGGTCT CGGTCTTCGA GGAAGCCACG
CGCGATCGCA TTGCCAGGTT GACGGCCTAT TTCATGAGCC ACGGCGTCAG CGATCAGGCG
CTCGCCGGCC ACAAGGCCGT CGTCGCCATT GCGCTCAAAA TACGCAAGCA GGCCAATATC
ATGGCCTTCA GCGACACCTT CTTCCTGCTC GGCGTCGCCC TGGTCGTCGC CCTGCTGGCA
AGCCTGCTTC TCAGCAAACC CGGTCAACTC TCCGGCGGCG GCGCTCACTA G
 
Protein sequence
MATIQTAANS NSLTRPAAAA PAAINPVRMW MAVVGSTLGA FMAVLNIQIV NASLADIQGA 
IGAGTDDGGW ISTSYLIAEI VVIPLSGWLT RVFSLRNYLL VNAILFLIFS VACAFAANLQ
QMIVLRAIQG FSGGVLIPMA FTIIITLLPK ARQPIGLALF ALSATFAPAI GPTIGGYLTE
NWGWEYIFYV NLVPGALMIG LLFASLDRAP MNLKLLAKGD WPGIVTMAIG LAGLQTVLEE
GNKEDWFGSD FILRLTVIAA VSLTLFIVIE LKTAHPLLNL RLLVRRNFGF GIVANFLLGI
ALYGSVFVLP IYLTRIQGYN SEQIGMVLAW TGIPQLLLIP LVPRLMKRFD LRLLIVVGFA
LFAASNFMNV HMTGDYASDQ LFWPNIVRAI GQALVFTPLS AIATSGIEPE NAGSASSLFN
MMRNLGGAVG IASLQTFLSK REQFHSNILT NSVSVFEEAT RDRIARLTAY FMSHGVSDQA
LAGHKAVVAI ALKIRKQANI MAFSDTFFLL GVALVVALLA SLLLSKPGQL SGGGAH