Gene Rleg2_4040 details

Gene Information

Locus tag: Rleg2_4040
Symbol:
ID: 6982811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 4213905
End bp: 4215062
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 61%
IMG OID: 643398770
Product: secretion protein HlyD family protein
Protein accession: YP_002283528
Protein GI: 209551611
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID:
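
The start, end, gene length, and protein length values above are mutually consistent, which can be checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that matches the reported 1158 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(4213905, 4215062)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```

Both results agree with the Gene Length and Protein Length fields of this record.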


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTGC TCCTGATACT CACCTACGCC GCGATCTGTT GGGCAATCTT CAAAATATTC 
AGGATCCCGG TAAACCAGTG GACGCTCGCA ACCGCCGTCC TTGGGGGGAT CTTCCTTCTG
TCTACCCTCC TGCTCCTGAT GAGCTACAAC CATCCCTATT CGAGCGACGG CCGCATCTAT
TTCACCTCCG CGCCGGTTAT TCCCGTCGTC GGAGGCCAGG TGGTCGAGGT GCCGGTGACA
CCGAATGCGC CTCTCAAAAA GGGCGATATC CTCTTTCGCA TCGATCCGCG GCCCTATCAG
TTTACCGTCG ATCAAAAGAA AGCGGCGCTC GCCGAGGCTG AGCAGTCCGT CCTGCAGTTG
AAAGCCGCCA TGGATGCCGC CGAATCAGGG GTCACGGGCG CCGAGGCCAC GAGGGACAGG
TCACAGCAGG CCTTCGAAAA GTTCCAGCAG ACGAACGAGA ATGCGAAGTC GAGCGGCAAG
GGTGCGGCCT TTTCCGAACT CGAGGTCGAA AACCGACGCG GCATCTACCT GACATCGGAG
GCTGCGGTCG CCACGGCCCG CGCGCAAGCG GTGCAGGCAA AGCTTGCCTA TGAGTCCGAG
ATCGACGGAA CCAACCCGAC AGTCGCAAGG CTGCAAGCGG AATTGCACAA TGCCGAGTAC
GAACTCGACC AGACGGTTGT GCGGGCGCCG ACCGATGGCT ACGTCACGCA GGTCTTCCTG
CGCCCTGGAA TGATGGCCAA CCCGCTACCC CTGCGGCCGG TCATGGTGTT CATCGACAGT
CAGGACCGCA TGCTGGCGGC AGCCTTCATC CAGAACTCGC TGCAGCGCGT CCGTGTCGGC
GATGAGGCGG AGGTCTCTTT CAAAGCCGTG CCCGGCAAGA TTTTCAAGGC GCGGGTTCAG
GAGGTCATCG ATGTGATGGC CCAGGGCCAA CTGCAGCCGA GCGGTGCGCT GATCGATCCG
CAATCGCCCG AGCGCGTCTC GCCGGGACAG ACGCTGGCTC GGATCGAGTT GCTCGAAAGT
ACCGACGAAT ATCAACTGCC CGGCGGCGTC GTCGCCGAGG TCGCGGTCTA CACCCATCAT
TGGCACCATG TCGCTGTCCT TCGCAAGGTG CTGCTGCGGA TGAGCAGCTG GATGAACTTC
GTGTTCCTCG AACACTAA
 
Protein sequence
MDLLLILTYA AICWAIFKIF RIPVNQWTLA TAVLGGIFLL STLLLLMSYN HPYSSDGRIY 
FTSAPVIPVV GGQVVEVPVT PNAPLKKGDI LFRIDPRPYQ FTVDQKKAAL AEAEQSVLQL
KAAMDAAESG VTGAEATRDR SQQAFEKFQQ TNENAKSSGK GAAFSELEVE NRRGIYLTSE
AAVATARAQA VQAKLAYESE IDGTNPTVAR LQAELHNAEY ELDQTVVRAP TDGYVTQVFL
RPGMMANPLP LRPVMVFIDS QDRMLAAAFI QNSLQRVRVG DEAEVSFKAV PGKIFKARVQ
EVIDVMAQGQ LQPSGALIDP QSPERVSPGQ TLARIELLES TDEYQLPGGV VAEVAVYTHH
WHHVAVLRKV LLRMSSWMNF VFLEH
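
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the pipeline used to produce this record. It builds the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The helper name `translate` and the two excerpt variables are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
first_bases = "ATGGATCTGC TCCTGATACT CACCTACGCC"
last_bases = "GTGTTCCTCG AACACTAA"

print(translate(first_bases))  # MDLLLILTYA
print(translate(last_bases))   # VFLEH
```

The two outputs match the first ten and last five residues of the protein sequence shown above.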