Gene Nmul_A1727 details


Gene Information

Locus tag: Nmul_A1727
Symbol:
ID: 3786204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1972086
End bp: 1975265
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 49%
IMG OID: 637811814
Product: hydrophobe/amphiphile efflux-1 HAE1
Protein accession: YP_412417
Protein GI: 82702851
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
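
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1972086
end_bp = 1975265
gene_length_bp = 3180
protein_length_aa = 1059


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Compare the derived values with the values stated in the record.
coords_match = span_length(start_bp, end_bp) == gene_length_bp
protein_match = expected_protein_length(gene_length_bp) == protein_length_aa
print("coordinates consistent with gene length:", coords_match)
print("gene length consistent with protein length:", protein_match)
```

Under these assumptions both checks agree with the record: the span covers 3180 bp, which is 1060 codons, of which 1059 encode amino acids.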


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0256707
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGGTA TTTTTCTCAA AAGACCTGTG CTGGCGATCG TGTTATCGCT GTTGTTCATA 
TTCATGGGCT TGTTGGCAAT GAAATCATTG CCCGTTTCCC AGTTTCCAGA TATTGCACCG
CCCCGGGTCA CGGTTTCAGT ATCGTTTCCC GGTGCCAGCG CCCAAGTACT GGTGGAATCG
TCCCTGATCC TCTTAGAACG CGCTATCAAT GGCGTGCCTG GTATGAGATA CATCACTTCG
GATGCAACCA GCGCCGGAGA AGCGACCATT CAAGTATATT TCAATTTAGG CGTCGATCCG
AATATCGCGA TGGTTAACGT GAAGACACGC GTGGATCAGA TGATGAGCAG GCTACCCATA
CTGGTGCAGT TGGAAGGCGT GATTGTAAAC TTCGTCCAGC CCAGCATGCT CATGTACGTC
AACCTCTACA GCAAAGACAA AGACGCCGAT CAGAAGCTTT TGTTTAATTA TGCCTATGTC
AATGTTATTC CGGAGATTCA ACGCATTACT GGAATTGCCC AAGCAACCAT ATTGGGTTCC
CGCCAATATG CGATGCGGAT TTGGATGAAT CCGGAACGGA TGAGAGCCTA TGCCGTCTCG
TCAGAAGAGG TGATGAAGGC ACTGGCGACT CAAAGCATTA TCGGACGACC TGGTCGGTTG
GGGCAAAGCA CAGGCATGCT CGCCCAGTCC AAAGAGTACG TGCTGGTTTA TAAAGGACGC
TTTAACAAGC CGGAAGAATA TGGGGACATT ATTCTCCGGG CTGCTCCAAC AGGCGAGATT
CTGCGCATTA AAGACGTTGC CAAGGTCACT CTGGACAGCG AGTTTTATAA TATTTTCTCC
GATAAGGACG GATTCCCTTC CGCGTCCATC GTGCTGAAGC AAAACTACGG CAGTAATGCC
CAAGTAGTCA TAGAGAACGT GAAGAAAAAA CTGGAAGAAT TGAAAAAATC TTTCCCGGAG
GGCATGGACT ACGAAATCAA TTACGATGTG TCGCGTTTCG TTGAGGCGTC TATCGACCAA
GTGCTGCACA CATTGGTGGA AGCCTTCATT CTGGTCTCCT TGGTGGTTTT CATTTTCCTT
GGAGACTGGC GTTCTACCCT GATTCCCGTC CTCGCGGTAC CGGTTTCGCT GGTCGGAGCA
TTTGCCGTCA TGTCTGCGTT CGGGCTTTCC ATCAACCTCA TCACCTTGTT TGCCTTAGTG
TTGGCTATTG GTATTGTGGT CGATGATGCC ATTGTGGTAG TGGAGGCAGT GCATGCAAAA
ATGGCAGCCG AGCATTGCAC CCCTTATGAA GCCTCCCGAA AAGTGTTGCA GGAAATCGGT
GGCGCCATTA TAGCGATTAC GCTGGTCATG GTATCAGTGT TCGTCCCTCT TGCGTTTATG
CCAGGACCGG TCGGGGTATT TTATCGACAG TTCGCTATCA CTATGGCGTC GTCGATCACT
ATTTCAGCAA TTGTGGCTTT ATCGCTCACC CCGGTACTGT GTGCCATGAT TCTTAAAAAC
ACTCACGGCC AGCCGAAACG GAGAAACCCG ATAAACCTTT TCATCGATGC CTTTAACTCT
GTTTTCGAAA AGGTCACCGG CCGCTACATC AAATTCCTGA ATGTCACGGT CACCCGGCGC
ATTGTCACGC TTTTGGTATT GGTCGGGTTT AGCTTCGGCA TTTTCGCGGT CGATAAAGTC
TTGCCTTCCG GTTTTATTCC CGGCGAAGAT CAGGGCATGA TTTATGCGAT TATCCAAACA
CCGCCCGGGT CAACCATCGA AGTGACCAAC AAAGTGGCAC GTGAACTGGA AATAATGGCC
GCAGAAATCG AAGGCGTACA ATCAATTTCT TCCTTAGCCG GCTACGAAGT ACTTACCGAA
GGCCGTGGAT CAAATGCAGG GACTGTTGTG GTCAACCTGA AAGATTGGTC GGAACGTAAA
CATTCGGTAA ATCAAGTTAT CCACGAGTTG GAGGAAAAAG CCCATAACCT GGGTGCGACC
ATCGAGTTTT TTCAACCGCC GGCAGTGCCG GGATACGGTG CGGCATCGGG TTTGGCGTTT
CGCTTGGTGG ACAAGACGTT AGACACGGAT TACTTCGAAT TCGATGAACT TAATAAAAAA
TTCATGAATG CGATGCGTGA ACGTAAGGAA TTGACCGGCT TGTTCACTTT TTATGCGGCC
AATTATCCGC AATACGAACT GGTCGTCGAC AATAGGCTGG CCATGCAAAA AGGGGTTACC
ATCGAAGACG CGATGAATCA CCTAGACATC ATGATCGGCA GTACTTATGA ACAGGGCTTT
ATTCGCTTCA ATAACTTTTT TAAGGTCTAC ACCCAGGCCG CGCCGGAATT CAGGCGAGCC
CCTGAAGATG TGTTGAAATA TTTTGTGAAA AATGATAAAG GCGAAATGGT GCCGTACTCT
GCCTTCATGA CCATGGTGAA AAAGCAGGGG CCCAACGAGC TGACGCGTTA TAACCTCTAT
AACACGGCGG CGATTCGGGC CGAGCCCGCA CCGGGTTATA CCACTGGCGC AGCTATCAAG
GCAATCGAAG AGGTTGCGGA AGCCACTTTG CCGCGAGGTT TTGAAGTTGC CTGGGAAGGC
TTGACATTCG ACGAAGCCCT TCGGGGCAAT GAAGCACTGC TGATCTTCGC CGTCGTTATC
CTGTTTGTCT ATCTGGTGCT GGCCGCACAA TACGAAAGCT TTGCTTTACC CTTGGTCGTG
TTATGTTCGC TACCTCCCGG TATTTTCGGT GCCTTTTTAT TGTTAAAAGG AACAGGGCTT
GCCAATGATG TCTATTCCCA ACTGGGAATG GTGATGTTGA TTGGCTTGCT CGGCAAAAAC
GCGGTACTGA TTGTCGAGTT TGCAAGCCAG AGGCAAGTCG AAGGAGGAAT GACGGTCAAA
GAAGCGGCGA TTGCGGGCGC AAAAGAGCGT TTCCGGCCGA TTTTGATGAC TTCATTTGCC
TTTATAGCCG GGCTGATACC CTTGGCGATG GCTACGGGAG CTGGAGCGGT TGGCAACAGG
ACGATTGGTA CCTCAGCACT GGGTGGGATG CTTACCGGTA CCGTGTTTGG CGTTATCGTT
ATTCCAGGAC TTTATTATAT TTTTGCGAAA ATGGCAGAGG GTAAAAAATT AATTCAAGGT
GAAGACCTTG ATCCTCTAGC CGAAACTTAT CACTACGGCC CGGATGACAA TGAAAAATAA
 
Protein sequence
MFGIFLKRPV LAIVLSLLFI FMGLLAMKSL PVSQFPDIAP PRVTVSVSFP GASAQVLVES 
SLILLERAIN GVPGMRYITS DATSAGEATI QVYFNLGVDP NIAMVNVKTR VDQMMSRLPI
LVQLEGVIVN FVQPSMLMYV NLYSKDKDAD QKLLFNYAYV NVIPEIQRIT GIAQATILGS
RQYAMRIWMN PERMRAYAVS SEEVMKALAT QSIIGRPGRL GQSTGMLAQS KEYVLVYKGR
FNKPEEYGDI ILRAAPTGEI LRIKDVAKVT LDSEFYNIFS DKDGFPSASI VLKQNYGSNA
QVVIENVKKK LEELKKSFPE GMDYEINYDV SRFVEASIDQ VLHTLVEAFI LVSLVVFIFL
GDWRSTLIPV LAVPVSLVGA FAVMSAFGLS INLITLFALV LAIGIVVDDA IVVVEAVHAK
MAAEHCTPYE ASRKVLQEIG GAIIAITLVM VSVFVPLAFM PGPVGVFYRQ FAITMASSIT
ISAIVALSLT PVLCAMILKN THGQPKRRNP INLFIDAFNS VFEKVTGRYI KFLNVTVTRR
IVTLLVLVGF SFGIFAVDKV LPSGFIPGED QGMIYAIIQT PPGSTIEVTN KVARELEIMA
AEIEGVQSIS SLAGYEVLTE GRGSNAGTVV VNLKDWSERK HSVNQVIHEL EEKAHNLGAT
IEFFQPPAVP GYGAASGLAF RLVDKTLDTD YFEFDELNKK FMNAMRERKE LTGLFTFYAA
NYPQYELVVD NRLAMQKGVT IEDAMNHLDI MIGSTYEQGF IRFNNFFKVY TQAAPEFRRA
PEDVLKYFVK NDKGEMVPYS AFMTMVKKQG PNELTRYNLY NTAAIRAEPA PGYTTGAAIK
AIEEVAEATL PRGFEVAWEG LTFDEALRGN EALLIFAVVI LFVYLVLAAQ YESFALPLVV
LCSLPPGIFG AFLLLKGTGL ANDVYSQLGM VMLIGLLGKN AVLIVEFASQ RQVEGGMTVK
EAAIAGAKER FRPILMTSFA FIAGLIPLAM ATGAGAVGNR TIGTSALGGM LTGTVFGVIV
IPGLYYIFAK MAEGKKLIQG EDLDPLAETY HYGPDDNEK
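
The sequences above are displayed in space-separated blocks of 10 characters, 60 per line. Before any computation, such as reproducing the GC content field, the blocks need to be joined. The sketch below shows one way to do that. It is not part of the original record, and `join_blocks` and `gc_percent` are hypothetical helper names. Only the first line of the gene sequence is used as input here, so the printed GC value describes that line alone, not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied as shown in the record.
first_line = "ATGTTTGGTA TTTTTCTCAA AAGACCTGTG CTGGCGATCG TGTTATCGCT GTTGTTCATA"
seq = join_blocks(first_line)

print(len(seq))          # number of bases on the first line
print(seq[:3])           # start codon
print(gc_percent(seq))   # GC percentage of this line only
```

To check the full gene, paste all lines of the gene sequence into one string and pass it through the same two functions.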