Gene Rleg_4324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4324 
Symbol 
ID8015104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4441647 
End bp4442996 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content61% 
IMG OID644826900 
Producttype I secretion membrane fusion protein, HlyD family 
Protein accessionYP_002978103 
Protein GI241207007 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.111635 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACGGA AAAATCAGGA GACGGCCGGT CCGGTGCAGC TCGAATGGTA TAGCGACGTG 
CCGCGCTCAA TCCGGATGCA CAGCATCGTC GGCCTGACGG TTCTCTTCAC CTCGTTCGGA
GGCTTCGGCT TCTGGGCGGC GACGGCGCCG CTCGCCTCCG CCGTTATCGC GCAGGGAAGC
TTCGTCGCGA CCGGCAACAA CAAGATCGTC CAGCATCTGG AAGGCGGCAT CATCAAGGAG
ATGCGCGTCA GCGAAGGCGA TACGGTCAAG GAAGGCGATA TCCTGCTGAC CCTCGATCCT
ACAGCATCCC GCTCAAACGA GCGGATGCTG CAGCTTCGCC GTCTGCGCCT GGAAGCCATT
GTCGCAAGGC TCCGTGCCGA AGCGCAGGGC CTGCGTGAAC TCCAGCTTCC CAATATCGTG
ACGAAGGAGG CCAGCGATAC CGATATCAAC GCGATCATCC AGAGCCAGAA CGTCGTCTTC
CACAGCAAGC AGATCAAGCT CGAAGAGCAG CTCAATCTGA TCGGGAAGAA CATCGCGTCG
CTGGAATTCC GGTTCGCCGG CTATAGGGGC CAGAGAGATT CCTTCGAAAG GCAGCTGTCG
CTGCTGACCG AGGAACGCGA CTCCAAGGCT CGGCTGGTGA AGGTCGGCTA TATGCGCAGG
ACGGATCTGC TGGCAATCGA GCGGGCGATC GCCGATGCGA TGGGCGACAT TGCTCGACTG
AACGGCGAGC TCAACGAGAG CGAGGCCGAG ATCGCCAAAT TCCGCCAGGA AGCCGTCATC
GCCGTCAACT CCAACAAGCA GGCGGCACTC GATGCACTCG AGACCGCCGA AACCGATCTG
GACAGTGTGC GCGAACAGAT GCGCGAGGCC GCCGGCGTGC TCGAACGGAC CACTATCCGT
TCGCCGGTGT CGGGCACGGT GGTGCGCTCC TATTTTCACA CAGCCGGCGG TGTCATCACG
ACAGGAAAGC CGATCATGGA AATCCTGCCG TCGCACGTGC CGCTGATCCT GGAAGCGCAG
GTATTGCGCA CCTCGATCGA CCAGTTGCAT GAGGGAGAAA CGGCCTCCAT CCGCCTCACG
GCGCTCAACC GGCGCACGAC CCCTGTTCTG CAGGGCAAGG TCTTCTACGT CTCGGCCGAT
TCGATCGAGG AAAATTCGGG AGCTTCGGTC AAGGACGTCT ATATCGTCCG CGTCGGCATA
CCCGATTCCG AGATTGCGCG GGTGCACAAT TTCCATCCCG TTCCCGGCAT GCCGGCCGAA
GTGCTGATTC AGACATCGGA ACGCACCTTC TTCGAATATC TGAGCAAACC GATCACCGAC
AGCATGTCCC GGGCATTCAA GGAGCGCTGA
 
Protein sequence
MGRKNQETAG PVQLEWYSDV PRSIRMHSIV GLTVLFTSFG GFGFWAATAP LASAVIAQGS 
FVATGNNKIV QHLEGGIIKE MRVSEGDTVK EGDILLTLDP TASRSNERML QLRRLRLEAI
VARLRAEAQG LRELQLPNIV TKEASDTDIN AIIQSQNVVF HSKQIKLEEQ LNLIGKNIAS
LEFRFAGYRG QRDSFERQLS LLTEERDSKA RLVKVGYMRR TDLLAIERAI ADAMGDIARL
NGELNESEAE IAKFRQEAVI AVNSNKQAAL DALETAETDL DSVREQMREA AGVLERTTIR
SPVSGTVVRS YFHTAGGVIT TGKPIMEILP SHVPLILEAQ VLRTSIDQLH EGETASIRLT
ALNRRTTPVL QGKVFYVSAD SIEENSGASV KDVYIVRVGI PDSEIARVHN FHPVPGMPAE
VLIQTSERTF FEYLSKPITD SMSRAFKER