Gene Information |
Locus tag | Nmul_A0043 |
Symbol | |
ID | 3786386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 45802 |
End bp | 48930 |
Gene Length | 3129 bp |
Protein Length | 1042 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810112 |
Product | hydrophobe/amphiphile efflux-1 HAE1 |
Protein accession | YP_410744 |
Protein GI | 82701178 |
COG category | [V] Defense mechanisms |
COG ID | [COG0841] Cation/multidrug efflux pump |
TIGRFAM ID | [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family |
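The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the table does not state its convention explicitly) and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 45802
end_bp = 48930
protein_length_aa = 1042

# Assumption: coordinates are 1-based and inclusive, so the span is
# end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final codon is the stop
# codon and is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 3129
print(remainder)                # 0
print(expected_protein_length)  # 1042
```

Under these assumptions the computed values agree with the listed Gene Length (3129 bp) and Protein Length (1042 aa).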
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTTT CCAGGTTCTT CATCGACCGT CCCATATTTG CCATCGTTCT TTCGATCGTT ATTTTCGCGG CGGGACTCAT CGCCATCCCG CTCCTGCCTG CGGGCGAGTA TCCGGAAGTG GTGCCTCCAA GCGTGGTTGT GCGTGCAATG TATCACGGCG CCAATCCCAA AGAGATCGCG GATTCAATCG CCGTTCCCCT GGAAGAAGCG ATCAACAGCG TCGAAGGCAT CATGTATATG AAGTCCGTTG CGGGCTCCGA TGGAAGCCTG CAAGTGACGA TCACCTTCAC GCCCGGCATC GATCCCGATA CCGCGGCCGT CCGCGTACAG AATCGCGTCG CCCAAGCCCT GTCCCGTCTG CCCAACGACG TACGCCAGTA TGGCGTAACC ACGCAGAAGC AATCCCCGAC TCCGCTCATG TACGTGAATC TTTCTTCACC CGACGGTCGC TACGACGCGG TCTATCTGCG CAACTATATG ACGCTGAATA TCAAGGACCA GCTTTCGCGC CTGAAGGGGG TTGGCAACGT CGGATTGTAC GGGGAAGGCG ATTATGCGAT GCGCCTGTGG CTCGACCCGA ACAAGCTCGC CGCACGTGGC TTGACAGCCA GCGATGTGGT GGATGCCGTT CGCGAGCAGA ACGTTCAGGT ATCCGCAGGT CAACTGGGCG CCCAGCCATC ACCCAGGAGC GCGGATTTCC TCGTTCCGAT CAATGTTCGT GGACGGCTTC GCAGCGAGCA GGAATTCGGC GACATTGTCC TGAAAAGTGC CGATGATGGC CAATTGGTCC GCCTGGCGGA TGTTGGGCGC GTCGAGCTCG GCTCCAGCGA TTACACGCTC CATGCCATGA TCGATAACAG GGGCAGCACT AGCGCCGGAA TTTTTCTGAC TCCCGGCGCG AACTCCCTGG AGGTGGCCAG GACGGTCTAT GCCAAGTTTG AAGAACTTGC AGCAAGCTTT CCTCCGGGTA TCGAGTATCG GGTGGTGTGG GACCCCAATG TTTTTGTCCG GGATTCCATC CGGGCCGTGG GTCAAACACT GATCGAAGCG GTCTTCCTGG TCGTGCTGGT GGTCATTCTC TTTCTGCAGA CTTGGCGCGC ATCGATCATC CCCCTGATTG CGGTGCCGGT CTCGATTATC GGCACGTTTG CCTGGCTCTA CCTTCTCGGA TATTCCATCA ATACGCTGAC ATTGTTCGGA CTGGTCCTGG CAGTGGGGAT TGTCGTCGAC GATGCGATCG TCGTCGTGGA GAACGTTGAG CGCAATATCG AGGAAGGTCT GAGTCCTCTC GCGGCAGCAC ACCAGGCCAT GAAGGAAGTC TCCGGTCCGA TCATCGCCAT TGCGCTGGTG CTATGCGCCG TGTTCGTGCC CATGGCCTTC CTGAGCGGTG TCACGGGGGA GTTCTACAAG CAGTTTGCCG TCACCATCGC CATATCCACC GTCATTTCCG CGATCAATTC GCTCACACTG TCCCCGGCAC TGGCGGCAAA ACTGCTCAGA GGACATGATG CTCCGAAGGA TGCGCCTACC CGAGTGCTCG AGCGCAGCCT GGGATGGTTG TTCCATCCGT TCAATCGGCT GTTTCTTCGC AGCTCCGGCA AATACCAGAG CACGGTAGCC CGAACACTGG GCCAGCGCGG CAGGGTGTTT ACCGTCTATG CAGGACTATT GCTCGCAACA GTCATGCTAT TCCATATCGT GCCGACCGGC TTTATTCCAA ATCAGGACAA GCTCTACCTC TTTGCAGGCG CCAAATTGCC GGAGGGCGCT TCGCTTGCCC GTACTGACGC CGTCACACGT 
CAATTAAACG ACCTTGCCAG CAGTATCGAA GGCGTGGAAA TGACCGAAAG CTATGTTGGC ATGAATGCGT TGCAGTCAGT CAACACGCCC AATCAGGCGG CCTCCTACGT TATCCTCAAG CCTTTTGATC AACGCCGGCG TTCCGCAGAA TCCATCACCG CTGAACTGAA TGCCAGGCTT GCCGACATCA AGGACGGGCA AGCCTATGCA TTACTGCCTC CGCCTATTCA AGGACTGGGC AATGGGTCGG GCTATTCGCT CTATTTGGTC GATCGCGGCG GTCTGGGATA TGGGGCGTTG CAAAGCGCAA CCACAAAATT CCAGAACGCG ATTGCGCAGA CGCCAGGCAT GACTTTTCCT GTCAGCTCAT ACCAGGCCAA CGTCCCGCAA CTTGAGGTCA AGGTGGATCG GGTCAAAGCC AAGGCCCAGG GAGTAAATTT GACGGACCTC TTCAATACGT TGCAGGTCTA CCTTGGATCC TTCTACATCA ACGATTTCAA TCTATACGGT CGCGTCTATC GGGTGATGGC ACAGGCGGAT GCAGCTTTCC GCCAGAATGC TGAAGACATC AACAATCTGT ATACACGCAA TAGTCGAGGA GAAATGGTTC CGCTAGGGTC GATGGTGACG ATTCAACATA CTTTTGGTCC GGATCCCGTC ATCCGCTACA ACGGTTATCC CGCTGCCGAC CTTATCGGGG ATGCGGATCC CCGCGTACTT TCCTCGGGAC AAGTCATCGC CAAGCTATCG GAAATTGCTA CGGAGGTACT GCCCCCCGGC ATTACCCTGG AATGGACCGA CTTGAGCTAT CAGCAGGTCA CCCAGGGCAA CGCGGCAGTC ATCGTCTTCC CATTATCGAT CATGCTGGCC TTTCTGGTGC TGGCCGCCCT CTATGAGAGC TGGACATTAC CCTTGGCAGT CATCCTGATC GTGCCGGTAT GCATGTGCGC CGCGCTTTTC GGGATCTGGC TCACAGGCGG GGACAACAAT ATATTCGTAC AGGTGGGACT GGTCGTACTG ATGGGTCTGG CATGCAAGAA TGCCATCCTG ATCGTTGAGT TTGCACGAGA ACTGGAGATT CAGGGCAAGG ATACCATCGA AGCCGCACTG GAAGCCTGCC GCCTTCGACT TCGGCCGATT GTGATGACTT CGATCGCCTT CATCGCTGGA TCTGTTCCCC TAGTGCTCAG TCAGGGTGCA GGTAGCGAGG TGCGCTACGC AACGGGTATA ACTGTCTTTG CCGGGATGCT CGGTGTGACT CTATTCGGAT TGTTCCTTAC CCCGGTTTTC TACGTGGCTT TGCGCAAGTT GGCCCGCCCC AAGCTGTAG
|
Protein sequence | MDFSRFFIDR PIFAIVLSIV IFAAGLIAIP LLPAGEYPEV VPPSVVVRAM YHGANPKEIA DSIAVPLEEA INSVEGIMYM KSVAGSDGSL QVTITFTPGI DPDTAAVRVQ NRVAQALSRL PNDVRQYGVT TQKQSPTPLM YVNLSSPDGR YDAVYLRNYM TLNIKDQLSR LKGVGNVGLY GEGDYAMRLW LDPNKLAARG LTASDVVDAV REQNVQVSAG QLGAQPSPRS ADFLVPINVR GRLRSEQEFG DIVLKSADDG QLVRLADVGR VELGSSDYTL HAMIDNRGST SAGIFLTPGA NSLEVARTVY AKFEELAASF PPGIEYRVVW DPNVFVRDSI RAVGQTLIEA VFLVVLVVIL FLQTWRASII PLIAVPVSII GTFAWLYLLG YSINTLTLFG LVLAVGIVVD DAIVVVENVE RNIEEGLSPL AAAHQAMKEV SGPIIAIALV LCAVFVPMAF LSGVTGEFYK QFAVTIAIST VISAINSLTL SPALAAKLLR GHDAPKDAPT RVLERSLGWL FHPFNRLFLR SSGKYQSTVA RTLGQRGRVF TVYAGLLLAT VMLFHIVPTG FIPNQDKLYL FAGAKLPEGA SLARTDAVTR QLNDLASSIE GVEMTESYVG MNALQSVNTP NQAASYVILK PFDQRRRSAE SITAELNARL ADIKDGQAYA LLPPPIQGLG NGSGYSLYLV DRGGLGYGAL QSATTKFQNA IAQTPGMTFP VSSYQANVPQ LEVKVDRVKA KAQGVNLTDL FNTLQVYLGS FYINDFNLYG RVYRVMAQAD AAFRQNAEDI NNLYTRNSRG EMVPLGSMVT IQHTFGPDPV IRYNGYPAAD LIGDADPRVL SSGQVIAKLS EIATEVLPPG ITLEWTDLSY QQVTQGNAAV IVFPLSIMLA FLVLAALYES WTLPLAVILI VPVCMCAALF GIWLTGGDNN IFVQVGLVVL MGLACKNAIL IVEFARELEI QGKDTIEAAL EACRLRLRPI VMTSIAFIAG SVPLVLSQGA GSEVRYATGI TVFAGMLGVT LFGLFLTPVF YVALRKLARP KL
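The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes NCBI translation table 11, as listed in the record, whose amino acid assignments are the same as the standard code; it does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_content` are hypothetical, and only the first and last blocks of the sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11 uses
# the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    cds = clean(cds)
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks and the last nine bases of the gene sequence.
first_30 = "ATGGACTTTT CCAGGTTCTT CATCGACCGT"
last_9 = "AAGCTGTAG"

print(translate(first_30))  # MDFSRFFIDR
print(translate(last_9))    # KL
```

The two outputs match the first ten and last two residues of the listed protein sequence. Applying `gc_content` to the full gene sequence would give the value to compare against the GC content field.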