Gene Nmul_A0691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0691 
Symbol 
ID3786153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp795752 
End bp798868 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content55% 
IMG OID637810773 
Producthydrophobe/amphiphile efflux-1 HAE1 
Protein accessionYP_411390 
Protein GI82701824 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.553697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCAGAT TTTTTATTCT GCGACCGATA TTTGCGTCGG TTATCTCCAT TATCATCGTC 
CTTGCCGGAC TCGCCGCCGC ATTGCAGCTT CCCATTGCGC AATATCCGGA GATCGCGCCG
CCTACGGTAC TGATAACCGC AACCTATCCC GGCGCCAGCG CTGAAACCTT GACCAAAACA
GTGGCCGCGC CCATCGAAGA GCAATTGAGC GGTGTCCAGG ACCTTTTGTA TTTCAGCTCG
AGCGCCGACT CCAGCGGAAC GCTGACCATT ACTGCAACGT TTGAGGTCGG CACGGACGTT
GACCAGGCGA CTTTCAATAT CAGTAACCGC GTAAACGTCG CACTACCGCG ACTGCCGGAT
GAAGTGCGCC GCACCGGCCT GAAAATTGAG AAGCGCTCCA ACGACATCCT GATAGTTTTC
ATGCTGATCT CGACCGAGAA AGGCAAATAT GACCCGCTCT ATCTCAGCAA TTATGCCACG
CTCAACATAA TGGATGAGCT GAGACGTACG GATGGAGTCG GTGATGCCAC TGTCTTTGGC
GGCCAGGATT ATTCCATGCG CATCTGGCTG CATCCCGATC GCATGGCCCA ACTGGGCGTG
GCCACTACGG ACATCACTGC AGCTATCCAG GCGCAGAACG CGCAATATGC TGCCGGCAAG
ATCGGTCAGG AACCGGCTCC GCCTGATCAG CAACTGATTT ATACCGTAAA TGCAAGAGGT
CGGCTGGTAA CACCTGAACA GTTCGGCGAC ATCATTTTGC GGGCGGATGG TCCGCGCGGA
GTCCTCTACC TCAAGGATGT CGCCCGGATA GAACTGGGGG CGCAAAGCTA CGATGTTCGT
ACGGCCTTGA ATGGTCAGCC CGGCGCCGGC ATTCCCGTCT TCCTGCAACC CGGTGCGAAT
GCGCTCGACA CGAAAAACGC GCTGGTCGCT AAAATGGAAG AGCTGAAGAA ACACTTTCCG
CAGGGAATGG ACTACGTGGT GCCGTACGAT ACCAGTCTGT TCGTCAAAGC TTCGATGTGG
GAGGTGCTCA AGACGCTTGG CGAAGCGATG GTGCTCGTTC TGCTGGTTGT TTATCTTTTC
CTGCAAAGCT GGCGCGCCAC GCTGATTCCC ATTATTGCCG TGCCCATTTC CCTCGTGGGC
ACTTTCGCCG GATTGTGGGC ATTCGGCTTC TCCATTAACA CGCTTACGCT TTTCGCCATG
GTGCTTTCCA TCGGCATTGT AGTGGATGAT GCAATTGTCG TACTGGAAAA TATCGAGCGG
TTGATGGACG AAGAGAAACT GTCACCGCTG AAAGCTGCCA TCCGTTCCAT GGAGCAGGTG
GCAAGTGCGG TGGTGGCAAT TGTCCTGGTA CTGTGCGCCG TATTTGTGCC GGTTGCATTC
ATGGGGGGAA TCGCAGGCGA ACTGTATCGC CAGTTCGCGG TCACCGTCGC CGTAGCGGTA
ACCATTTCCG GTCTGGTGGC ACTGACCTTG ACCCCTGCCT TGTGCGCTAT TCTGCTCAAG
CATACCCACG GTGAATCCCG GTTCTTTCTT GCGTTCAACA GCGGTTTTCA GCGGGTGACG
AATTTTTATA TCCGCATGGT CAATATCACG CTCCGGCATA AGGTTATCGG TGCCTTCGTA
TTCCTAAGCA TCATTGCCTT GTCGGCCTAT CTGTTCAAAA CAGTACCTGA GAGTTTCGTG
CCGCCAGAAG ATCAGGGCTA TGTCGTCACG GCCACCATCC TGCCGGATGG CGCCACACTG
GCCAGAACGA CAAAAACCGC GGAATCGGTG CGGGCGGGCA TCGCGGATGA TCCCGCCGTG
GCCCATCAAT TCGTTGTAAA CGGCTTCGAT CTGATCGGTG GCGGAAATAA AACCAGTTCT
GCTACCATGT TCGTCGCATT CAAGGACTGG TCGGAGCGGA AGGCAACAGC GGAGGATATC
ATCCAGAAGC TGATGGGTAT CGGAATGCAA CAACCGGATG GAATCGCAAT CTCCTTCAAC
CCGCCCGCCA TCCGCGGGCT GGGAACCGCA GGCGGCTTCG AAGTCTATGT GCAAAGCCGC
GCAGGCGCTA ATCCGGTTCA ATTATCCATA GTGGTGAACA ACCTTATCGC TGCACTGAAC
CAGGAACCGC GGCTAGCCGG CATCAATACT TTTTTCCGTC CCACCGTGCC GCAATTCTTC
GTTGAAGTGG ATGAGGAAAA AGCGATTTCT CAGCAAGTGC GAATCGCCGA CATTTACGCC
ACGCTGCAAA GTACGATGGG TTCGCTGTAT ATCAACGATT TCAATTACTC CGGTCGTACT
TACCGGGTGC AAATGCAGGC GGAACCGCAA TACCGGATGC ATCCCGAGGA TCTCGGCAGA
GTGTACGTAC GCTCCCAATC CGGCGCGATG GTGCCCATGT CCGCCCTCAG CAAACTCAGC
ACCATTGTGG GCGCGGAGCA GCTCGAGCGC TATAACGGCC TCCTTGCTGC AAAGATTCTG
GGAAGCGGTG CGCCTGGCGT GAGTTCAGGC GATGCTATCC GGCTGGTAGA GGAGATTGCG
GCAAAAAATT TGCCCGATGG CTACCAGATT GCGTGGACAG GACAGGCTTA TCAGGAAAAG
CGGACTGGTT CAGCCGCAAT TTTTGCCTTC AGCTTCGCAA TCATAATGGT ATTTCTCATT
CTTGCTGCCC AGTTCGAGAC CTGGGCCCTG CCGCTCGCCG TTATCATGGC GGTGCCTTTT
GCTCTTGCCG GTGCCCTGCT TGCCGTTTTG GCCCGCGGCA TGCCCAACGA TATCTATTTT
CAGATCGGCC TGATCACGCT CATCGGCCTG GCTGCAAAAA ATGCCATTCT GATCGTCGAG
TTTGCTACGC AGAAAATGGC TGAAGGTCTG CCTGTGGCAG AGGCAGCGAT CGAGGCGGCG
CGTTTGCGTT TCCGACCCAT CGTCATGACA TCCATGGCTT TTGTGCTGGG TATCGTGCCC
CTATTGATCG CAACGGGAGC TGGCGCTGCC GCGCGTCGCT CCATGGGAAC CGGCGTGTTT
GGGGGAATGC TGCTTGCCAC TTTTGTTGCC ACAATATTTA TTCCCTTGTT TTTTACCTGG
CTTTCACGCA AGAATAAAGG TAAGCCAGTC GAAAACCTGT CGCAGGAAAC ATCATGA
 
Protein sequence
MTRFFILRPI FASVISIIIV LAGLAAALQL PIAQYPEIAP PTVLITATYP GASAETLTKT 
VAAPIEEQLS GVQDLLYFSS SADSSGTLTI TATFEVGTDV DQATFNISNR VNVALPRLPD
EVRRTGLKIE KRSNDILIVF MLISTEKGKY DPLYLSNYAT LNIMDELRRT DGVGDATVFG
GQDYSMRIWL HPDRMAQLGV ATTDITAAIQ AQNAQYAAGK IGQEPAPPDQ QLIYTVNARG
RLVTPEQFGD IILRADGPRG VLYLKDVARI ELGAQSYDVR TALNGQPGAG IPVFLQPGAN
ALDTKNALVA KMEELKKHFP QGMDYVVPYD TSLFVKASMW EVLKTLGEAM VLVLLVVYLF
LQSWRATLIP IIAVPISLVG TFAGLWAFGF SINTLTLFAM VLSIGIVVDD AIVVLENIER
LMDEEKLSPL KAAIRSMEQV ASAVVAIVLV LCAVFVPVAF MGGIAGELYR QFAVTVAVAV
TISGLVALTL TPALCAILLK HTHGESRFFL AFNSGFQRVT NFYIRMVNIT LRHKVIGAFV
FLSIIALSAY LFKTVPESFV PPEDQGYVVT ATILPDGATL ARTTKTAESV RAGIADDPAV
AHQFVVNGFD LIGGGNKTSS ATMFVAFKDW SERKATAEDI IQKLMGIGMQ QPDGIAISFN
PPAIRGLGTA GGFEVYVQSR AGANPVQLSI VVNNLIAALN QEPRLAGINT FFRPTVPQFF
VEVDEEKAIS QQVRIADIYA TLQSTMGSLY INDFNYSGRT YRVQMQAEPQ YRMHPEDLGR
VYVRSQSGAM VPMSALSKLS TIVGAEQLER YNGLLAAKIL GSGAPGVSSG DAIRLVEEIA
AKNLPDGYQI AWTGQAYQEK RTGSAAIFAF SFAIIMVFLI LAAQFETWAL PLAVIMAVPF
ALAGALLAVL ARGMPNDIYF QIGLITLIGL AAKNAILIVE FATQKMAEGL PVAEAAIEAA
RLRFRPIVMT SMAFVLGIVP LLIATGAGAA ARRSMGTGVF GGMLLATFVA TIFIPLFFTW
LSRKNKGKPV ENLSQETS