Gene Rleg2_0834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0834 
Symbol 
ID6979552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp852853 
End bp854043 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content66% 
IMG OID643395545 
Productsecretion protein HlyD family protein 
Protein accessionYP_002280354 
Protein GI209548437 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.541495 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAGT TGCCCCGCAA GGATGTTTTC GACACGGCCC GGCAGGCCGA TGGCGCGCCT 
GCGACCGAAG CGGCTGTCGT CGAAGCCCCA GCCGCGGCCG TGCCGAAGAA AACCGGCCGC
AAGATTGTCA AGCGCGCGGT CATCGCCGCA GCCCTGCTTG CCGGCGTCGG TTTATCAGGC
GATTTCGGTT ACCGCTATTG GACGGTCGGC CGCTTCATCG AATCCACCGA CGATGCCTAT
GTGAAGGCCG ATTACACCAC CGTCGCCCCG AAGGTTGCCG GCTATATCAG GCAGGTGCTG
GTCAACGACA ACGACCCGGT CAAGTCAGGC CAGGTTCTCG CTCGCATCGA CGACCGCGAC
TTCCAGGCCG CATTGTCGCA GGCGAGGGCC GCCGTGAAGG CGGCCGATGC CGCGATCGCC
AATATCGACG CCCAGATCGC CTTGCAGCAG TCGGTGATCG GCCAGGCCAA GGCCACGATC
GATGCCTCGC AGGCCTCGCT CGATTTTGCC GTTTCGGATG CTGCCCGCTC GGCCCGGCTG
ATCACCAGCG GCGCCGGCAC GCAATCGCGC GCCGAACAGA GCCAGTCGGC CCGCGACCAG
GCCGCCGCCG CCGTCGAGCG CGACCGGGCA GCCCTCGTCG CGGCTGAGAA CAAGGTGCCG
GTCCTTGAAA CGCAGCGCCA GCAGGCAATT GCCGAGCGCG ATCGGGCGGC AGCCGCCGCC
CAGCAGGCCG AACTCAACCT GTCCTATACT GATATCGTCG CCGCCGTCGA CGGCACGGTC
GGCGCCCGTT CGATCCGCGT CGGCCAGTAT GTCACCTCGG GCACGCAGCT GATGGCCGTC
GTGCCGCTGC ATGCCGTCTA TGTCGTCGCC AATTTCAAGG AGACGCAGCT GACCCATGTC
CGCCCCGGCC AGCCGGTCGA GATCAAGGTG GACAGCTTTC CCGACATGGC GATCAAAGGC
CATGTCGACA GCGTTTCACC GGCGAGCGGC CTCGAATTCT CGCTGCTGCC ACCTGACAAC
GCCACCGGCA ATTTCACCAA GATCGTCCAG CGCATCCCGG TCAAGATCGT CATCGACGAC
GAGGCGCTGA GCGGCCTGTT GCGCTCGGGC ATGTCGGTCG AGCCGGAGAT CGATACCAAG
GCTGCAGAGA CCTCTGTGGC CGAGGAAGAA TTATCCCGGC ACGCCGGATA G
 
Protein sequence
MVELPRKDVF DTARQADGAP ATEAAVVEAP AAAVPKKTGR KIVKRAVIAA ALLAGVGLSG 
DFGYRYWTVG RFIESTDDAY VKADYTTVAP KVAGYIRQVL VNDNDPVKSG QVLARIDDRD
FQAALSQARA AVKAADAAIA NIDAQIALQQ SVIGQAKATI DASQASLDFA VSDAARSARL
ITSGAGTQSR AEQSQSARDQ AAAAVERDRA ALVAAENKVP VLETQRQQAI AERDRAAAAA
QQAELNLSYT DIVAAVDGTV GARSIRVGQY VTSGTQLMAV VPLHAVYVVA NFKETQLTHV
RPGQPVEIKV DSFPDMAIKG HVDSVSPASG LEFSLLPPDN ATGNFTKIVQ RIPVKIVIDD
EALSGLLRSG MSVEPEIDTK AAETSVAEEE LSRHAG