Gene Smed_0062 details


Gene Information

Locus tag: Smed_0062
Symbol:
ID: 5320889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 67949
End bp: 71134
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 64%
IMG OID: 640788993
Product: hydrophobe/amphiphile efflux-1 (HAE1) family protein
Protein accession: YP_001325757
Protein GI: 150395290
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
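
The start, end, gene length and protein length listed above agree with one another, which can be checked with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The record states neither convention explicitly, and the helper name `cds_lengths` is purely illustrative.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon;
    both are assumptions, not something the record itself documents.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies the final 3 bp and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
gene_len, protein_len = cds_lengths(67949, 71134)
print(gene_len, protein_len)  # 3186 1061
```

Under those assumptions the result matches the listed 3186 bp and 1061 aa. It is also consistent with the gene sequence ending in the stop codon TGA.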


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.410108
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.744352
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTCT CACGCTTCTT CGTCGACCGC CCGGTCTTCG CGGGCGTTCT CTCCGTGATC 
ATCTTCGTCG CGGGCCTGAT CGGCATGACT GGTCTGCCTA TTTCCGAATA TCCGGAAGTC
GTGCCGCCGC AGATCGTGGT CAGGGCGCAA TATCCGGGCG CCAATCCGGC CGTCATCGCC
GAAACCGTCG CCACACCGCT GGAAGAGCAG ATCAACGGCG TCGAGGGCAT GCTCTACATG
CAGAGCCAGG CGACCGCTGA CGGCCTGATG ACCCTGACCG TCACGTTCGA GCTCGGAACC
GATCCGGACC AGGCCCAGCA GCTGGTGCAG AACCGCGTCT CGCAGGCTGA GCCGCGCCTG
CCCGAAGAGG TCCGTCGGCT CGGCGTCACC ACAGTCAAGA GCTCGCCCGA CCTGACGCTC
GTCGTGCACC TGATTTCGCC GAACGGACAG TATGACATCA ATTATCTGCG CAACTACGGC
GTCCTCAACG TCAAGGACCG GCTGGCGCGG GTGGACGGCG TGGGGCAAGT GCAGATCTTC
GGCGGCGGCG ACTATTCGAT GCGCGTCTGG ATCGACCCCG AAAAGGCGGC CGAGCGCGGC
CTGGCCGCGA GCGACATTGC CAATGCTGTC CGGGGGCAGA ACGTTCAGGC CGCCGCCGGC
GTCATTGGCG CTTCGCCGTC TGTTCCGGGC CTCGATCTTC AGCTCTCCGT CAACGCCCAG
GGGCGCCTCA AGACACCCGA AGATTTCGCC GACATCGTCG TCAAATCGGG CGAGGGCGGT
GAGATCACGC GGCTCGGCGA TGTCGCCCGC GTCGAGATGG GGGCGGCGGA CTATTCGCTG
CGCTCGCTTC TCGACAACAA GGCCGCCGTC GGCATGGGCG TCTTCCAGGC TCCCGGATCG
AACGCCATCG AGATATCCGA AAACGTCCAC AAGGTTATGG CCGAGCTGAA GCAGACCATG
CCGGAAGGCG TCGACTACGA GATCGTCTAC GACACGACGC AGTTCGTCCG TGCCTCGATC
GAGTCGGTCA TCCATACTCT GCTCGAAGCA ATCGCGCTCG TAGTCATTGT CGTCATCGTC
TTCCTGCAGA CCTGGCGCGC CTCGATCATC CCGCTTGTCG CGGTACCCGT ATCGATCGTC
GGTACCTTCG CGGTGATGTA TGTTTTCGGC TTTTCGATCA ACGCGCTCAG CCTCTTCGGC
CTGGTGCTGG CGATCGGCAT CGTCGTCGAC GACGCCATCG TCGTCGTGGA GAATGTCGAG
CGTAACATCT CCCAGGGTCT CTCGCCGGTA CAGGCAACCT ACCGCGCCAT GCAGGAGGTT
TCCGGGCCCA TCATCGCGAT CGCGCTCGTG CTCGTCGCGG TATTCGTGCC GCTCGCCTTC
ATCACCGGGC TCACCGGCCA GTTCTATCGC CAGTTCGCGC TGACCATCGC GATTTCGACG
GTGATTTCCG CACTCAACTC GCTGACGCTC TCGCCAGCGC TCGCTGCCCT TCTTCTGAAG
GACCACCACG CGCCTAAGGA CTGGCTGACG CGAGTCATGG ACGGATTGTT CGGGTGGTTC
TTCCGAGGCT TCAACCGCTT CTTCGGAGCG AGTTCCGAGG CCTATGGCCG CGGCGTTGGT
GGTATTCTCA CCCGCAAGTC GCTGGTCATG GGCGTCTATG TCGTCCTCCT CGGCGTCACC
TTCGTGCTCT TCCGCGCCGT GCCCGGCGGC TTCGTGCCGG CCCAGGACAA GCAGTACCTC
ATCGGCTTTG CGCAGTTGCC GGACGCAGCG ACGCTCGACC GGTCGGAAGA CGTCATCCGG
CGCATGAGCG AGATCGCGCT GAAGCACCCG GGCGTCGAGC ACGCCATCGC ATTCCCGGGA
CTGTCGATCA ACGGCTTCAC CAACTCGTCC AATTCGGGCA TCGTCTTCGT CTCGCTGAAG
CCGTTCGAGG AGCGCAAGAC GGCCGAGCTT TCGGGCGGCG CCATCGCCAT GCAGTTGAAC
CAGGAATTCG GTGCGATACA GGACGCCTTC ATTGCCATGT TCCCGCCGCC GCCCGTGCAG
GGTCTCGGTA CGACCGGCGG ATTCAAGCTG CAGATCGAAG ACCGCAACGG CCTCGGCTAC
CGGGCGCTCG ACGACGCGGC CAAGGCCGTC CTCGCCAAGG CAATGGCGGC CCCGGAACTG
GCCGGACTCT ATTCGAGCTT CCAGATCAAC GTGCCGCAAC TCTATGCCGA TCTTGACCGC
ACCAAGGCGC GGCAGCTCGG CGTGGCGGTG ACGGACGTTT TCGAAACGCT GCAGATCTAT
CTGGGCTCGC TCTATGTCAA CGACTTCAAT GCGTTCGGCC GGACCTACAG CGTGCGCATC
CAGGCCGATG CGAGCTATCG CAGCCATGCC GACGACATCG GCAAGCTGAA GGTGCGTTCG
CAGACGGGCG AGATGATCCC CCTATCGGCT CTGCTCAACA TCGAGCAGAC GGTCGGCGCC
GAACGGGCGA TCCGCTACAA CGGCTTCCTG GCCGCGGATA TCAACGGCGG GCCGGCACCC
GGCTTCTCCT CCGGCCAGGC ACAGGCAGCG ATAGAGCGGA TTGCGGCCGA GACGCTGCCG
CCCGGCATTA CATACGAGTG GACGGACCTG ACCTATCAGC AGATCCTCGC CGGCAATTCG
GGCATCCTCG TCTTCCCGCT GGCGCTGCTG CTCGTCTATC TCGTGCTTGC CGCCCAATAC
GAGAGCCTGC TGCTTCCCAT CGCGATCATC CTGATTGTGC CGATGGGCAT CATGGCGGCG
CTGACGGGCG TCTGGCTGAC AGGCGGCGAC AACAACGTCT TCACCCAGAT CGGTCTTATC
GTTCTGGTGG GTCTATCGGC CAAGAATGCG ATCCTGATCG TCGAATTCGC ACGCGAATTG
GAGTTGTCCG GCAGCAATGC CGTCAGCGCC GCGATCGAGG CAAGCCGGCT CCGCCTGCGG
CCCATCCTGA TGACCTCGAT GGCCTTCATC ATGGGCGTCG TTCCACTCGT CACCTCCACC
GGCGCGGGTG CCGAGATGCG CTCGGCCATG GGCGTGGCGG TGTTTGCCGG CATGATCGGC
GTCACGGCCT TCGGGATCTT CATGACGCCG GTGTTCTATG TGCTCATCCG CAAAGTGTCC
GGCGAACGGC CGCTGAAGCA TACCGGGGCG CATCTCGAAG CCCCGCATCT CGCTCCGGGC
GAGTGA
 
Protein sequence
MNFSRFFVDR PVFAGVLSVI IFVAGLIGMT GLPISEYPEV VPPQIVVRAQ YPGANPAVIA 
ETVATPLEEQ INGVEGMLYM QSQATADGLM TLTVTFELGT DPDQAQQLVQ NRVSQAEPRL
PEEVRRLGVT TVKSSPDLTL VVHLISPNGQ YDINYLRNYG VLNVKDRLAR VDGVGQVQIF
GGGDYSMRVW IDPEKAAERG LAASDIANAV RGQNVQAAAG VIGASPSVPG LDLQLSVNAQ
GRLKTPEDFA DIVVKSGEGG EITRLGDVAR VEMGAADYSL RSLLDNKAAV GMGVFQAPGS
NAIEISENVH KVMAELKQTM PEGVDYEIVY DTTQFVRASI ESVIHTLLEA IALVVIVVIV
FLQTWRASII PLVAVPVSIV GTFAVMYVFG FSINALSLFG LVLAIGIVVD DAIVVVENVE
RNISQGLSPV QATYRAMQEV SGPIIAIALV LVAVFVPLAF ITGLTGQFYR QFALTIAIST
VISALNSLTL SPALAALLLK DHHAPKDWLT RVMDGLFGWF FRGFNRFFGA SSEAYGRGVG
GILTRKSLVM GVYVVLLGVT FVLFRAVPGG FVPAQDKQYL IGFAQLPDAA TLDRSEDVIR
RMSEIALKHP GVEHAIAFPG LSINGFTNSS NSGIVFVSLK PFEERKTAEL SGGAIAMQLN
QEFGAIQDAF IAMFPPPPVQ GLGTTGGFKL QIEDRNGLGY RALDDAAKAV LAKAMAAPEL
AGLYSSFQIN VPQLYADLDR TKARQLGVAV TDVFETLQIY LGSLYVNDFN AFGRTYSVRI
QADASYRSHA DDIGKLKVRS QTGEMIPLSA LLNIEQTVGA ERAIRYNGFL AADINGGPAP
GFSSGQAQAA IERIAAETLP PGITYEWTDL TYQQILAGNS GILVFPLALL LVYLVLAAQY
ESLLLPIAII LIVPMGIMAA LTGVWLTGGD NNVFTQIGLI VLVGLSAKNA ILIVEFAREL
ELSGSNAVSA AIEASRLRLR PILMTSMAFI MGVVPLVTST GAGAEMRSAM GVAVFAGMIG
VTAFGIFMTP VFYVLIRKVS GERPLKHTGA HLEAPHLAPG E