Gene Nmul_A0043 details


Gene Information

Locus tag: Nmul_A0043
Symbol:
ID: 3786386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 45802
End bp: 48930
Gene Length: 3129 bp
Protein Length: 1042 aa
Translation table: 11
GC content: 57%
IMG OID: 637810112
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_410744
Protein GI: 82701178
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
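
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 45802
end_bp = 48930

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (3129 bp) and Protein Length (1042 aa) fields.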


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTTTT CCAGGTTCTT CATCGACCGT CCCATATTTG CCATCGTTCT TTCGATCGTT 
ATTTTCGCGG CGGGACTCAT CGCCATCCCG CTCCTGCCTG CGGGCGAGTA TCCGGAAGTG
GTGCCTCCAA GCGTGGTTGT GCGTGCAATG TATCACGGCG CCAATCCCAA AGAGATCGCG
GATTCAATCG CCGTTCCCCT GGAAGAAGCG ATCAACAGCG TCGAAGGCAT CATGTATATG
AAGTCCGTTG CGGGCTCCGA TGGAAGCCTG CAAGTGACGA TCACCTTCAC GCCCGGCATC
GATCCCGATA CCGCGGCCGT CCGCGTACAG AATCGCGTCG CCCAAGCCCT GTCCCGTCTG
CCCAACGACG TACGCCAGTA TGGCGTAACC ACGCAGAAGC AATCCCCGAC TCCGCTCATG
TACGTGAATC TTTCTTCACC CGACGGTCGC TACGACGCGG TCTATCTGCG CAACTATATG
ACGCTGAATA TCAAGGACCA GCTTTCGCGC CTGAAGGGGG TTGGCAACGT CGGATTGTAC
GGGGAAGGCG ATTATGCGAT GCGCCTGTGG CTCGACCCGA ACAAGCTCGC CGCACGTGGC
TTGACAGCCA GCGATGTGGT GGATGCCGTT CGCGAGCAGA ACGTTCAGGT ATCCGCAGGT
CAACTGGGCG CCCAGCCATC ACCCAGGAGC GCGGATTTCC TCGTTCCGAT CAATGTTCGT
GGACGGCTTC GCAGCGAGCA GGAATTCGGC GACATTGTCC TGAAAAGTGC CGATGATGGC
CAATTGGTCC GCCTGGCGGA TGTTGGGCGC GTCGAGCTCG GCTCCAGCGA TTACACGCTC
CATGCCATGA TCGATAACAG GGGCAGCACT AGCGCCGGAA TTTTTCTGAC TCCCGGCGCG
AACTCCCTGG AGGTGGCCAG GACGGTCTAT GCCAAGTTTG AAGAACTTGC AGCAAGCTTT
CCTCCGGGTA TCGAGTATCG GGTGGTGTGG GACCCCAATG TTTTTGTCCG GGATTCCATC
CGGGCCGTGG GTCAAACACT GATCGAAGCG GTCTTCCTGG TCGTGCTGGT GGTCATTCTC
TTTCTGCAGA CTTGGCGCGC ATCGATCATC CCCCTGATTG CGGTGCCGGT CTCGATTATC
GGCACGTTTG CCTGGCTCTA CCTTCTCGGA TATTCCATCA ATACGCTGAC ATTGTTCGGA
CTGGTCCTGG CAGTGGGGAT TGTCGTCGAC GATGCGATCG TCGTCGTGGA GAACGTTGAG
CGCAATATCG AGGAAGGTCT GAGTCCTCTC GCGGCAGCAC ACCAGGCCAT GAAGGAAGTC
TCCGGTCCGA TCATCGCCAT TGCGCTGGTG CTATGCGCCG TGTTCGTGCC CATGGCCTTC
CTGAGCGGTG TCACGGGGGA GTTCTACAAG CAGTTTGCCG TCACCATCGC CATATCCACC
GTCATTTCCG CGATCAATTC GCTCACACTG TCCCCGGCAC TGGCGGCAAA ACTGCTCAGA
GGACATGATG CTCCGAAGGA TGCGCCTACC CGAGTGCTCG AGCGCAGCCT GGGATGGTTG
TTCCATCCGT TCAATCGGCT GTTTCTTCGC AGCTCCGGCA AATACCAGAG CACGGTAGCC
CGAACACTGG GCCAGCGCGG CAGGGTGTTT ACCGTCTATG CAGGACTATT GCTCGCAACA
GTCATGCTAT TCCATATCGT GCCGACCGGC TTTATTCCAA ATCAGGACAA GCTCTACCTC
TTTGCAGGCG CCAAATTGCC GGAGGGCGCT TCGCTTGCCC GTACTGACGC CGTCACACGT
CAATTAAACG ACCTTGCCAG CAGTATCGAA GGCGTGGAAA TGACCGAAAG CTATGTTGGC
ATGAATGCGT TGCAGTCAGT CAACACGCCC AATCAGGCGG CCTCCTACGT TATCCTCAAG
CCTTTTGATC AACGCCGGCG TTCCGCAGAA TCCATCACCG CTGAACTGAA TGCCAGGCTT
GCCGACATCA AGGACGGGCA AGCCTATGCA TTACTGCCTC CGCCTATTCA AGGACTGGGC
AATGGGTCGG GCTATTCGCT CTATTTGGTC GATCGCGGCG GTCTGGGATA TGGGGCGTTG
CAAAGCGCAA CCACAAAATT CCAGAACGCG ATTGCGCAGA CGCCAGGCAT GACTTTTCCT
GTCAGCTCAT ACCAGGCCAA CGTCCCGCAA CTTGAGGTCA AGGTGGATCG GGTCAAAGCC
AAGGCCCAGG GAGTAAATTT GACGGACCTC TTCAATACGT TGCAGGTCTA CCTTGGATCC
TTCTACATCA ACGATTTCAA TCTATACGGT CGCGTCTATC GGGTGATGGC ACAGGCGGAT
GCAGCTTTCC GCCAGAATGC TGAAGACATC AACAATCTGT ATACACGCAA TAGTCGAGGA
GAAATGGTTC CGCTAGGGTC GATGGTGACG ATTCAACATA CTTTTGGTCC GGATCCCGTC
ATCCGCTACA ACGGTTATCC CGCTGCCGAC CTTATCGGGG ATGCGGATCC CCGCGTACTT
TCCTCGGGAC AAGTCATCGC CAAGCTATCG GAAATTGCTA CGGAGGTACT GCCCCCCGGC
ATTACCCTGG AATGGACCGA CTTGAGCTAT CAGCAGGTCA CCCAGGGCAA CGCGGCAGTC
ATCGTCTTCC CATTATCGAT CATGCTGGCC TTTCTGGTGC TGGCCGCCCT CTATGAGAGC
TGGACATTAC CCTTGGCAGT CATCCTGATC GTGCCGGTAT GCATGTGCGC CGCGCTTTTC
GGGATCTGGC TCACAGGCGG GGACAACAAT ATATTCGTAC AGGTGGGACT GGTCGTACTG
ATGGGTCTGG CATGCAAGAA TGCCATCCTG ATCGTTGAGT TTGCACGAGA ACTGGAGATT
CAGGGCAAGG ATACCATCGA AGCCGCACTG GAAGCCTGCC GCCTTCGACT TCGGCCGATT
GTGATGACTT CGATCGCCTT CATCGCTGGA TCTGTTCCCC TAGTGCTCAG TCAGGGTGCA
GGTAGCGAGG TGCGCTACGC AACGGGTATA ACTGTCTTTG CCGGGATGCT CGGTGTGACT
CTATTCGGAT TGTTCCTTAC CCCGGTTTTC TACGTGGCTT TGCGCAAGTT GGCCCGCCCC
AAGCTGTAG
 
Protein sequence
MDFSRFFIDR PIFAIVLSIV IFAAGLIAIP LLPAGEYPEV VPPSVVVRAM YHGANPKEIA 
DSIAVPLEEA INSVEGIMYM KSVAGSDGSL QVTITFTPGI DPDTAAVRVQ NRVAQALSRL
PNDVRQYGVT TQKQSPTPLM YVNLSSPDGR YDAVYLRNYM TLNIKDQLSR LKGVGNVGLY
GEGDYAMRLW LDPNKLAARG LTASDVVDAV REQNVQVSAG QLGAQPSPRS ADFLVPINVR
GRLRSEQEFG DIVLKSADDG QLVRLADVGR VELGSSDYTL HAMIDNRGST SAGIFLTPGA
NSLEVARTVY AKFEELAASF PPGIEYRVVW DPNVFVRDSI RAVGQTLIEA VFLVVLVVIL
FLQTWRASII PLIAVPVSII GTFAWLYLLG YSINTLTLFG LVLAVGIVVD DAIVVVENVE
RNIEEGLSPL AAAHQAMKEV SGPIIAIALV LCAVFVPMAF LSGVTGEFYK QFAVTIAIST
VISAINSLTL SPALAAKLLR GHDAPKDAPT RVLERSLGWL FHPFNRLFLR SSGKYQSTVA
RTLGQRGRVF TVYAGLLLAT VMLFHIVPTG FIPNQDKLYL FAGAKLPEGA SLARTDAVTR
QLNDLASSIE GVEMTESYVG MNALQSVNTP NQAASYVILK PFDQRRRSAE SITAELNARL
ADIKDGQAYA LLPPPIQGLG NGSGYSLYLV DRGGLGYGAL QSATTKFQNA IAQTPGMTFP
VSSYQANVPQ LEVKVDRVKA KAQGVNLTDL FNTLQVYLGS FYINDFNLYG RVYRVMAQAD
AAFRQNAEDI NNLYTRNSRG EMVPLGSMVT IQHTFGPDPV IRYNGYPAAD LIGDADPRVL
SSGQVIAKLS EIATEVLPPG ITLEWTDLSY QQVTQGNAAV IVFPLSIMLA FLVLAALYES
WTLPLAVILI VPVCMCAALF GIWLTGGDNN IFVQVGLVVL MGLACKNAIL IVEFARELEI
QGKDTIEAAL EACRLRLRPI VMTSIAFIAG SVPLVLSQGA GSEVRYATGI TVFAGMLGVT
LFGLFLTPVF YVALRKLARP KL
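
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 and last 9 nucleotides of the gene. This is a minimal sketch, not the database's own method. The helper `translate` is hypothetical, and the codon table is built on the assumption that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in its permitted start codons, which this sketch ignores because the gene starts with ATG).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: table 11 shares the standard amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base column spacing used in this record)
    is ignored. Hypothetical helper for illustration only.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 9 nt of the gene sequence, copied from the record.
first_30 = "ATGGACTTTT CCAGGTTCTT CATCGACCGT"
last_9 = "AAGCTGTAG"

n_term = translate(first_30)
c_term = translate(last_9)
print(n_term)  # compare with the first 10 residues of the protein sequence
print(c_term)  # compare with the last 2 residues of the protein sequence
```

The two fragments translate to MDFSRFFIDR and KL, matching the start and end of the protein sequence listed above. The final codon TAG is a stop codon and contributes no residue.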