Gene Smed_3398 details

Gene Information

Locus tag: Smed_3398
Symbol:
ID: 5324282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 3603392
End bp: 3606538
Gene Length: 3147 bp
Protein Length: 1048 aa
Translation table: 11
GC content: 63%
IMG OID: 640792349
Product: hydrophobe/amphiphile efflux-1 (HAE1) family protein
Protein accession: YP_001329054
Protein GI: 150398587
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
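
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though the listed values agree with it) and that the final codon of the CDS is a stop codon. The function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3603392, 3606538)
print(length, protein_length(length))  # prints: 3147 1048
```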


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.392181
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGTT TTTTCATAGA CAGGCCTATC TTTGCCTGGG TGGTGGCGAT CTTCATCATG 
ATCGCCGGCA TCATCGCGAT CCCGCTATTG CCGGTGTCGC AGTATCCGGA TGTCGCGCCC
CCGCAGATAT CGATCAACAC GAACTATCCC GGCGCCTCGT CGCAGGACAC CTATCAGAGC
GTGACCCGTC TGATCGAGGA CGAGCTGAAC GGCGTCGAGG GGCTGCTTTA CTTCGAGTCT
TCGACCAGTG CCTCCGGCTC GGTCTCGATC GACGCCACCT TCCAGCCCGG CACCGATCCG
AGCCAGGCTT CGGTGGACAT CCAGAACCGG GTGCAGCGCG TCGAGCCGCG CCTGCCTGAC
CCCGTCAGGC AGCAAGGCGT ACAGGTGGAC GAGGCCGGCG CCGGCTTCCT GCTGATCATC
TCGCTGACTT CGACCGACGG CTCGATGGAC GCCATCGGGC TTGGCGATTA CCTCAGCCGC
AATGTGCTCA GCGAGATCCA GCGCGTGCCC GGCGTCGGAC GCGCCCAGCT CTTTGCGACC
GAGCGCTCGA TGCGCGTCTG GCTCGATCCC GACAAGATGC TCGGCCTCAA TCTGACCGCG
GCTGACGTCA CCGCGGCGAT CCAGGCTCAG AACGCCCAGA TCGCCTCCGG CTCGATCGGC
GCCCAGCCGA ATCCGATCAC GCAGCAAGTC ACCGCACCGG TCGTCATCAA GGGCCAGCTT
TCGAGCCCCG AGGAATTCGG TGCGATCGTA CTCCGGGCGA ATGCCGACGG CTCCGCCGTG
CGGCTGCGCG ACGTTGCGCG TCTCGAGATC GGCGGCGAAA GCTACACCTT CTCGACCCGC
CTCAACGGCA GCCCCTCGGC GGCCATCGCC GTGCAGCTCT CGCCGAGCGG CAATGCGATG
TCGACCTCGG CGGCGATCAA GACACGCATG GACGAGCTCG CGGCGTTTTT CCCCCAAGGG
CTCGAATACT CGATCCCCTA TGACACCTCG CCCTTCGTCG CGGTGTCGAT TGAGAAGGTG
CTGCACACCC TTGTCGAGGC CGTCGGCCTC GTATTCCTGG TGATGTTCCT CTTCCTGCAG
AACGTCCGCT ATACGCTCAT CCCCACCATC GTGGTGCCGG TCGCCCTGCT CGGCACCTGC
GCGGTCATGC TGGCGATGGG CTTCTCCATC AACGTGCTCA CGATGTTCGG CATGGTGCTG
GCGATCGGTA TCCTCGTCGA CGATGCGATC ATCGTGGTCG AGAACGTCGA ACGCATCATG
TCGGAGGAAG GGCTGACGCC GAAGGATGCA ACCCGGAAGG CGATGAAGCA GATCACCGGA
GCGGTGATCG GCATCACGCT GGTTCTGGCT TCGGTGTTCA TTCCGATGGC CTTCTTCCCC
GGCGCCGTGG GCGTGATCTA TCGCCAGTTC AGCCTGACGA TGGTCGTCTC GATCCTGTTT
TCGGCGCTCC TCGCATTGTC GCTGACACCG GCACTCTGCG CGAGCTTCCT CAAGCAGGTA
CCGAAGGGGC ACCATCATGC CAAGCGCGGC TTCTTCGGTT GGTTCAACCG CGGTTTCGAC
CGGACGTCGC ACGGCTACAC GCGTTTCGTC GGAAGCGTCA TCAGGCGCAC GGGCCGCTTC
ATGATCATCT ATCTGATCCT GCTTGCAGCA ATGGCCTGGG CATTTATCCG GCTGCCCTCG
TCCTTCCTGC CGGACGAGGA CCAGGGCTTT GTCATCGTCA TGATGCAATT GCCGTCGGAA
GCGACGGCAA ACCGCACCAC GGAGGTGATC GAGCAGACCG AGAAGATCTT CGGCCAGGAA
CCGGCGGTCG ACACGATCGT GGCGATCAAC GGTTTCTCCT TCTTCGGCAG CGGGCAGAAT
GCGGGTCTTG CATTCGTGAC GCTGAAGGAC TGGAGCGAAC GCGACGCCGA CAATGCAGCG
CAGTCGATCG CCGGCCGCGC GACGATGGCC ATGAGCCAGA TCAAGGATGC GATCAGCTTC
GCGCTGTCGC CTCCGGCGAT CCAGGGCCTC GGAACGACAG GCGGGTTTTC CTTCCGCCTT
CAGGACCGTG CGGGTCTCGG CGAAGCGGCG CTGGCAGAAG CGCGCGACCA GCTTCTCGGC
CTCGCCTCCC AGAGCAAGAT CCTGACCGGC GTCCGTTTCG AGGGCATGCC CGATGCGGCG
CAGGTCAGCG TCGATATCGA CCGCGAGAAG GCCAATACCT TCGGCGTGAC CTTTGCCGAT
ATCAACTCGA CCATCTCGAC CAATCTCGGC TCGTCCTACG TCAACGACTT CCCGAATGCC
GGCCGCATGC AACGCGTGAC GGTGCAGGCG GACGAAACGA AGCGCATGCA GCCGGCGGAC
CTCCTGAACC TGAACGTCCG CAATTCCAAC GGCGGCATGG TTCCGCTCTC GGCCTTCGCG
AATGTGGAAT GGGTCAAGGC GCCGACGCAG ACGGTCGGCT ACAACGGCTA TCCGGCAGTG
CGCATCAGCG GCGAGGCCGC TCCGGGCTAT TCCTCCGGCG ATGCCATTGC CGAGATGGAG
CGTCTGGTAG CCCAGCTTCC GGCTGGTTTC GGCTATGAAT GGACGGGGCA GTCGCTACAG
GAAATCCAGT CGGGGACGCA GGCGCCGCTC CTTATCGCGC TCTCCTGCCT GCTCGTCTTC
CTGTGCCTGG CCGCGCTCTA TGAGAGCTGG TCCATTCCAG TCTCGGTCAT CATGGTCGTG
CCGCTCGGCG TCATCGGCGC CGTGCTCGCC GTGACGATGC GCGACATGCC GAACGACGTC
TACTTCAAGG TCGGCCTGAT CGCGATCATC GGGCTTTCGG CGAAGAACGC GATCCTGATC
ATCGAATTCG CCAAGGAACT CCGCGACCAG GGTAAATCGC TTATCGATGC GACACTGGAA
GCTGCACATC TGCGCTTCCG GCCGATCCTG ATGACCTCGC TCGCCTTCAC CCTCGGCGTT
CTGCCGCTGG CGATCGCGAC AGGCGCAAGT TCCGGCAGCC AGCGCGCCAT TGGCACCGGC
GTCATGGGCG GCATGATCTC GGCGACGGTA CTGGCGATAT TCTTCGTCCC CGTCTTCTTC
GTGTTCGTCA TGAAGATCTT CGAACGCGGA AGAGCGGCGC CCGAGCCGGC CAAGCAGGCA
TCCGAAACCG TCAGCCCCGC CGAGTGA
 
Protein sequence
MPSFFIDRPI FAWVVAIFIM IAGIIAIPLL PVSQYPDVAP PQISINTNYP GASSQDTYQS 
VTRLIEDELN GVEGLLYFES STSASGSVSI DATFQPGTDP SQASVDIQNR VQRVEPRLPD
PVRQQGVQVD EAGAGFLLII SLTSTDGSMD AIGLGDYLSR NVLSEIQRVP GVGRAQLFAT
ERSMRVWLDP DKMLGLNLTA ADVTAAIQAQ NAQIASGSIG AQPNPITQQV TAPVVIKGQL
SSPEEFGAIV LRANADGSAV RLRDVARLEI GGESYTFSTR LNGSPSAAIA VQLSPSGNAM
STSAAIKTRM DELAAFFPQG LEYSIPYDTS PFVAVSIEKV LHTLVEAVGL VFLVMFLFLQ
NVRYTLIPTI VVPVALLGTC AVMLAMGFSI NVLTMFGMVL AIGILVDDAI IVVENVERIM
SEEGLTPKDA TRKAMKQITG AVIGITLVLA SVFIPMAFFP GAVGVIYRQF SLTMVVSILF
SALLALSLTP ALCASFLKQV PKGHHHAKRG FFGWFNRGFD RTSHGYTRFV GSVIRRTGRF
MIIYLILLAA MAWAFIRLPS SFLPDEDQGF VIVMMQLPSE ATANRTTEVI EQTEKIFGQE
PAVDTIVAIN GFSFFGSGQN AGLAFVTLKD WSERDADNAA QSIAGRATMA MSQIKDAISF
ALSPPAIQGL GTTGGFSFRL QDRAGLGEAA LAEARDQLLG LASQSKILTG VRFEGMPDAA
QVSVDIDREK ANTFGVTFAD INSTISTNLG SSYVNDFPNA GRMQRVTVQA DETKRMQPAD
LLNLNVRNSN GGMVPLSAFA NVEWVKAPTQ TVGYNGYPAV RISGEAAPGY SSGDAIAEME
RLVAQLPAGF GYEWTGQSLQ EIQSGTQAPL LIALSCLLVF LCLAALYESW SIPVSVIMVV
PLGVIGAVLA VTMRDMPNDV YFKVGLIAII GLSAKNAILI IEFAKELRDQ GKSLIDATLE
AAHLRFRPIL MTSLAFTLGV LPLAIATGAS SGSQRAIGTG VMGGMISATV LAIFFVPVFF
VFVMKIFERG RAAPEPAKQA SETVSPAE
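
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table from the NCBI amino-acid string, which is the same for translation table 11 as for the standard code (table 11 differs only in which codons may act as starts). The names `clean_sequence` and `translate` are hypothetical helpers. The sketch does not handle alternative start codons, which is not an issue here because this gene begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display blocks of 10."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(dna)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCCAGTT TTTTCATAGA CAGGCCTATC"))  # prints: MPSFFIDRPI
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the protein sequence shown above.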