Gene Nmul_A0688 details

Gene Information

Locus tag: Nmul_A0688
Symbol:
ID: 3784065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 789801
End bp: 792974
Gene Length: 3174 bp
Protein Length: 1057 aa
Translation table: 11
GC content: 56%
IMG OID: 637810770
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_411387
Protein GI: 82701821
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
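
The Start bp, End bp, Gene Length, and Protein Length fields above are related by simple arithmetic, which can be checked as a sanity test on the record. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length; both are assumptions about the database's conventions, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmul_A0688.
length = gene_length(789801, 792974)
print(length)                  # 3174
print(protein_length(length))  # 1057
```

Both results agree with the Gene Length (3174 bp) and Protein Length (1057 aa) fields reported for this gene.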


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.67862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCCC GCTTTTTTAT CGATCGTCCA ATTTTCGCAT CCGTCCTGTC CATCATTATT 
GTGGTGGTGG GACTCGTTGC GCTGAGAAAC CTCCCAATTG CGCAGTTTCC GGAAATCACG
CCGCCCATGG TGCAGATCGA TGCCGATTAC CCCGGGGCGA GCGCGGAAGT TGTTGCGGAA
TCTGTCGCGC GCCCCATCGA GGTCCAGCTT CCAGGTATCG ATAATCTTCT TTATTACGAA
TCCACTAGCT CGAACGACGG GCACATGACC ATGAAGCTCA CGTTCGAGAT CGGAACGGAC
GTGGATATCG CGCAGGTCCA GACGCAGAAC AGGCAGCGCC TAGCCGAACC GCAACTTCCG
GACGAAGTCG TGCGCCAGGG TATAACCGTG AAAAAAACGT CGCCTGATCT TCTGGCGGTC
ATTGCCCTGA GTTCTTCCGA CCCGCGGCAC GATACCATTT ACCTCTCGAA TTATGCCTTG
CTGCGGGTTC TCGATAACGT CAAGCGTCTT CCGGGCGTAG GAGATGCCAT TATCTTCGGC
AGCCAGAATT ACTCCATGCG GCTGATCCTG GATCCTGTCC GCATGGCGCA ACTGGACCTC
ACTCCCACCG ATATCGCGGC GGTGGTCCGC GAGCAGAACC GGGATTTTCC CGCCGGCAGG
ATAGGACGGG AGCCTTCGCC GAAAGGAACG GAGCTTACCA TCCCGGTCAT TACCCAGGGC
CGCATGAGCG AAGTGAAGGA ATTCGAGGAT ATGATCGTAA GGGCGTATCC CGACGGTTCC
ATGGTGCGAT TGCGGGATGT AGCGAGAGTG GAGCTGGGTG CGCAATCGTA TGATCTGGAA
GGGAGATGGA ACGGAAAGCC CAACACCTTT CTTCTGACCT TTCTGGCCCC CGGCGCCAAC
GCGCTCGATA CGGTTCACCG GGTACGCCAG GAAATGGACA AGCTCGCGCG CAGTTTCCCC
GCCGGCGTCT CCTACGACAT ACCCTATGAC ACTACCATAT TCATCGAAGT TTCCATCAAG
GAAGTCCTGA AGACGCTGGT CGAAGCAACG CTTCTTGTCA TACTGGTCGT TTTTGTTTTT
CTGCAAAGCT GGAGGGCAAC GATCATTCCA GCGGTCGCGG TTCCTATTTC ACTGATCGGA
ACCCTGGCCG GAATGGCAGC GCTTGGATTT TCAATCAATA CCCTTACCCT GTTCGGCATG
GTGCTTGCCA TCGGGATCGT GGTGGATGAT GCGATCGTGG TGGTGGAAAA TGTCGAGCGG
CATATGCGGG AGGGGTTGCC GCCCAGGGAG GCGGCCAGGG TGGCCATGGA TGAAGTGGCG
GGCCCCGTCA TTGCCATCGT CCTGGTGCTG GGCGCGGTGT TTGTTCCAGT CGCTTTTCTG
GGCGGAATCA CCGGTGAATT GTACAAACAG TTTGCGATTA CCATTGCCCT GTCCGTCGCG
ATATCGGGTT TTGTGGCGCT TACCCTGAGC CCCGCGCTCT GTGCGCTTAT TCTCAAGCCG
GGGCATGGGG AGCCCGCAAA GTACTGGAAG CTGTTCAACC GCTCGTTTGA CTGGATGCAG
ACACGCTATA CAAATGGCGT CGGAATGGTA TTGAAACGAT CCATGATCGC TCTCTGTATT
TTTGCCGTGA TGATATTCGT CCTCCTTGGC TTGTTCAGAA CGATCCCGGG CAGCTTTCTC
CCGGAAGAAG ACCAGGGCTA TTTCATTACC GTTGTCCAGT TGCCGGACGG AGCCTCCAAG
GAACGCACGA TTGATGTATT GAGCAAAGTA GAGCAATACT TCCTGTCGAT TCCCGCGGTA
CACTCAACGG ATGCGCTGGC CGGCCAGAAC TTCGTGTTCG GCACGCGGGG AGCGAATCAG
GCGACGATGT TCGTTCCGCT GCAGTCTTGG GACACGCGCA AGAGTGCCGG GGAGCATGTC
ACCGGCCTTA TCGCATCCGC CTTCCAGGAG TTTGCAAAGA TACCGGAAGC ACTGATTCTT
GCCTTCAATG CCCCATCCAT CAGAGGCCTG GGTTCCACCG GGGGTTTTTC CCTGCAGGTA
CAGGATCCGA GCGGGGGCGA TTTCAAAGAG TTTGCCGAGA TTACGCAGAA ATTCGTCGCC
AAGGCTGTGG AACATCCTGC TATCGCTGCT GCCAGCACCA ATTTTCGCGT CAGTGCCCCC
CGGCTTTATG CCCGCGTTGA CCGGGAACGC GCCAAAGCGC TGGGCGTGCC GATTTCCGAA
GTCTTCGACA GCATGCAGGC TTATTTCGGC AACCTGTATA TCAACGACTT CGTGAAATAT
GGTCGTATCT ATCGCGTCCA GACGGAAGCG CAGCCTCAGT ACCGATCAAG GCCGGAGGAT
ATCGAGAAAA TTTACGTGCG TGCGCGGAAC GACAAGGGCC ACGTCATGAT TCCACTGAAT
TCGGTGATCA CCACCGAATT CACCAGCGGA CCTGATCCCG TCACCCACTT CAACGGATTC
AATTCCGCAC TCGTGCTGGG CGGCGCCGCT TCCGGCTACA GCTCGGGGCA GGCCCTCGAT
GCGCTGGAAC AGATCGCAGA TGAGATATTG GCGCCAAAAG GTTATACGAT CGACTGGAGT
GGAATATCCT TTCAGGAGCG CCAGGCAGGA GGGAAATCGG TTCTGGTATT CGCCTTCGCC
CTGCTCATGG TCTTTCTGGT GCTGGCCGCC CTTTACGAAA GCTGGTCAGT TCCGCTCGCG
GTGATTCTTG CAATTCCGTT CGGAATTTTA GGCGCATTAC TGGCTATCTG GGTTCGCGAA
TTGACCAACG ACATCTATTT CCAGATCGGG CTGGTGACAT TGATCGGATT ATCCGCGAAG
AACGCCATCC TGATCGTGGA ATTCGCCAAT CAGCGTTATG CGAACGGGGA GCCTTTGCTC
GACGCGGCGA TGGAGGCCGC CCGGCTACGT TTCCGTCCCA TTATCATGAC CTCCATGGCT
TTTATCCTGG GGGTGTTTCC GCTGGTGATC GCTTCCGGCG CAGGGGCCGC CAGCCGGAAT
TCCATCGGGA CGGGTGTTTT CGGGGGGATG CTGGCTGCGA CCTTTCTTGC CATCTTTTTC
GTACCTCTCT TTTTCGTAGT AATAAGAAAA ATGACGCATC GGCGCGGACA GCCAGAGGTT
CGAGCATCCC ATGCGGCGCC GGATAATTCT CCTTCCACCG TCGAAGACGA ATAA
 
Protein sequence
MSSRFFIDRP IFASVLSIII VVVGLVALRN LPIAQFPEIT PPMVQIDADY PGASAEVVAE 
SVARPIEVQL PGIDNLLYYE STSSNDGHMT MKLTFEIGTD VDIAQVQTQN RQRLAEPQLP
DEVVRQGITV KKTSPDLLAV IALSSSDPRH DTIYLSNYAL LRVLDNVKRL PGVGDAIIFG
SQNYSMRLIL DPVRMAQLDL TPTDIAAVVR EQNRDFPAGR IGREPSPKGT ELTIPVITQG
RMSEVKEFED MIVRAYPDGS MVRLRDVARV ELGAQSYDLE GRWNGKPNTF LLTFLAPGAN
ALDTVHRVRQ EMDKLARSFP AGVSYDIPYD TTIFIEVSIK EVLKTLVEAT LLVILVVFVF
LQSWRATIIP AVAVPISLIG TLAGMAALGF SINTLTLFGM VLAIGIVVDD AIVVVENVER
HMREGLPPRE AARVAMDEVA GPVIAIVLVL GAVFVPVAFL GGITGELYKQ FAITIALSVA
ISGFVALTLS PALCALILKP GHGEPAKYWK LFNRSFDWMQ TRYTNGVGMV LKRSMIALCI
FAVMIFVLLG LFRTIPGSFL PEEDQGYFIT VVQLPDGASK ERTIDVLSKV EQYFLSIPAV
HSTDALAGQN FVFGTRGANQ ATMFVPLQSW DTRKSAGEHV TGLIASAFQE FAKIPEALIL
AFNAPSIRGL GSTGGFSLQV QDPSGGDFKE FAEITQKFVA KAVEHPAIAA ASTNFRVSAP
RLYARVDRER AKALGVPISE VFDSMQAYFG NLYINDFVKY GRIYRVQTEA QPQYRSRPED
IEKIYVRARN DKGHVMIPLN SVITTEFTSG PDPVTHFNGF NSALVLGGAA SGYSSGQALD
ALEQIADEIL APKGYTIDWS GISFQERQAG GKSVLVFAFA LLMVFLVLAA LYESWSVPLA
VILAIPFGIL GALLAIWVRE LTNDIYFQIG LVTLIGLSAK NAILIVEFAN QRYANGEPLL
DAAMEAARLR FRPIIMTSMA FILGVFPLVI ASGAGAASRN SIGTGVFGGM LAATFLAIFF
VPLFFVVIRK MTHRRGQPEV RASHAAPDNS PSTVEDE