Gene Nmul_A1970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1970 
Symbol 
ID3784994 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2265630 
End bp2268719 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content57% 
IMG OID637812059 
Productacriflavin resistance protein 
Protein accessionYP_412657 
Protein GI82703091 
COG category[V] Defense mechanisms 
COG ID[COG0841] Cation/multidrug efflux pump 
TIGRFAM ID[TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCTCT CGGAACTGTG CATTCGCCGT CCCATCATGA CGGTGCTGTT GTCAGCCTCG 
GCTATTCTCG CCGGCATTAT GGCTTACGGT AATTTGCCTA TTGCCGCCCT TCCCAGTTAT
GACTCGCCCA CTATTTCGGT GGCAGCGGTA TTGCCGGGCG CCAGTCCCGA GACAATGGCT
TCCTCTGTGG CAACCCCACT GGAGCGGCAA TTCTCGACGA TCTCTGGTCT TTCGGTAATC
AGTTCCACCA GCACGCTTGG CAATACTGCG ATTACCCTGG AATTTGACCA GAAACGGGAT
ATCGACAGCG CGGCAATTGA CGTACAGGCG GCCTTGCTGC GAGCCGAGCG CGCTCTCCCG
ATCGAGATGA CGTCACCCCC TTCGTATCGC AAGATCAATC CTGCCGACTC GCCCATCCTC
TTTCTTACGC TGACATCGCC TTCGATGTCG TTGTCGGAAC TGAACAGCCT TGCCGAAGAC
CTGATTGCTC CCACTTTATC AACGCTCTCG GGCGTTGCGC AGGTCCAGAT CAACGGCCAG
AAAAAATACG CCGTACGTAT CCATGTCAAT GCGGAAGCGC TTGCTGTCCG TAATTTGACG
CTGGACGATA TCGCCACTGC GCTGGGAACT GCCAACGCCA ATACGCCGAT AGGCACTCTC
GAGGGACCGC GTCAGACGCT TACCATTCAG GCCAACCGCC AGCTCGGAAA CGCGGCAGCC
TTCGCGGATC TGATCGTAGC GACCTCTGAT GCAGGGAACC CCGTGCGCCT CTCGGAAGTA
GCGGAAGTTG AAGACAGCGT TCAGTCGGTC AAGACCGCAA GCTGGGCCAA TGGGGAGCGG
GCAATAACCT TGTCGGTGCA GCGCCAGCCG GGCGCCAATA CAGTGGCGAC CGTGGATGCC
ATAAAAGCCG CCCTGCCGGC GCTGGTCGCC CAAATGCCTT CGTCGGTGCA ACTCAAGCTG
ACCAGCGACC GCTCGGTTTC GATTCGCGAC TCGATTCATG ATGTGCAGGT GACGCTGGCG
GTAACGATCG TACTCGTTCT ACTTGTGATT TTCCTGTTCC TGCGGAAGGC TTCCGCCACG
CTGATTCCGG CGCTGTCGCT GCCGATCTCC CTGCTTGGAA CCGTAGCGAT GATGTATCTG
CTCGATTACA GCCTGGACAA TATTTCCCTG CTGGCAATCA CGCTCGCGGT CGGCCTGGTC
GTGGATGATG CCATCGTAAT GCTGGAAAAT ATCGTTCGTC ATATCGAGGA GGGGATGCCG
CCCTTGCAGG CGGCTCTGGT GGGGTCGAAA GAGGTGAGTT TTACCATCAT GTCGATTTCC
CTCTCGCTGG TGGCCGTCTT CATCCCGATT TTTTTCATGC CCGGAGTAAT CGGCCTGCTT
TTTCATGAAT TTGCAGGGGT AGTGGGCATC GCCATCATGA TGTCGGCACT TGTCTCGCTC
ACGTTGGTGC CCGTGATGAC AAGCCGGTAT ATCGTCCAGC ACGAGTCCGA GGGCAGAAGC
CTGAAGTGGA CCGCCTGGTT CGAGCGGGGA TTCATCCGTA CCCTGGGAGT CTATGAACGC
TTCCTCGATC TCGCGTTGCT TCATCGCCGT ACCGTGCTCG GCATTGCCAT GGGCACCTTC
ATTGCCACGG CAGGCTTGTT CGTGGTCTTG CCCAAAGGTT TTTTCCCTAC GGAGGATATC
GGACAGGCGC TGATCACCGT GGACGCGGTT GAAGACATTT CCTTTCCAGC CATGACAGAA
TTGTTGCAGC AAACGGGGGA AATCATGCGC GCCAATCCGG CTGTTGACAC ACTCATCGTC
AATGCGACCG AAAGCAATAG CGCCCGGTTG TTCATGACCT TGAAGCCGCG GAGCGAACGG
CCCCCTCTCA ACAAGGTAAT GGAAGATTTG CGGCGCGAGG TCAGCGCTAT TCCTGGCGTC
AACGTTTTCA TCAATCCCAT TCAGAATCTC AGGCTGGGAG GACGCACCAG CAAGAGCCGT
TATCAATATG TCATGCGCAG TGTGCGCGCG GAGGAATTGC GCGGTGCGGC AGAGGGTTTG
ATGGCGCGCA TGCGTGCCGA TCCGATATTC CGCGATGTCA CCAGTGATTC CCAGTTGAAG
GGGCTGCAGG CGCAACTCAA TATCAATCGT GACAAAGTCA ATCTGCTGGG CGTGCAGATG
TCCGATATAC GTTCTCTGCT GTACAGCGCG TTTGGTGAGC GCCAGGTGTC GACGATCTAT
ACCTCCAGCG ACAGTTATCC GGTTATCCTG CAGGTAGCAT CGGAGGACCG CGCCGACGAG
AGTGCGTTCG ATAAAATCTA TCTGCGTGGC AAGAACGGCA TGCTGGTGCC ACTCTCCAGC
GTTGCATCGG TTGAACGCCA GGTGGGACCC CTTGCGATCA ACCACTCCGG CCAGCTCGAG
TCGTTGACCA TCTCGTTCAA TCTCGCGCCC GGGGCCGCGC TGGGAGAAGC CTCTGTCCGA
ATAGAGAAGT TCAAGCGCGA ACTGAATGTC CCCGCCAGCA TTCTCACGAG CTATGCGGGG
GATGCGGCAG CATTCCAGTC TTCACAGGCA AGCCAGGTGA TACTCATCAT CGGCGCATTG
CTGGTCATCT ATGTTTTGCT TGGTGTGCTG TATGAAAGCT ATATCCATCC CGTTACGATC
CTTTCAGGAC TGCCATCGGC TGCAGTTGGA GCTTTGGGAA TGTTATGGCT GTTCAATATG
GAACTCTCGA TCATCGCCAT GATCGGCATC CTTATGCTGA TCGGCATAGT TAAAAAGAAC
GCGATCATGA TGATCGATTT CGCGCTCGAT GCCCAACGCA ACGAAGGGAT GACACCTCAG
GAAGCGATCA GGACCGCTTG CCTGCTGCGA TTCCGGCCAA TCATGATGAC TACTCTGGCG
GCCCTGATGG GTGCGCTTCC TATCGCCCTG GGATTGGGAG CAGGCGCGGA ACTGCGCCAG
CCGCTGGGTC TTGCCGTGGT GGGTGGACTA CTGTTTTCTC AGGTGATCAC CTTGTTCATC
ACACCGGTGA TCTATCTATA CCTGGACCGA TACTCCGGCA AAGGACCATT GAAGCTCGAG
ACGGGGGACC TGGCGGGACA GCGGGCGTAG
 
Protein sequence
MNLSELCIRR PIMTVLLSAS AILAGIMAYG NLPIAALPSY DSPTISVAAV LPGASPETMA 
SSVATPLERQ FSTISGLSVI SSTSTLGNTA ITLEFDQKRD IDSAAIDVQA ALLRAERALP
IEMTSPPSYR KINPADSPIL FLTLTSPSMS LSELNSLAED LIAPTLSTLS GVAQVQINGQ
KKYAVRIHVN AEALAVRNLT LDDIATALGT ANANTPIGTL EGPRQTLTIQ ANRQLGNAAA
FADLIVATSD AGNPVRLSEV AEVEDSVQSV KTASWANGER AITLSVQRQP GANTVATVDA
IKAALPALVA QMPSSVQLKL TSDRSVSIRD SIHDVQVTLA VTIVLVLLVI FLFLRKASAT
LIPALSLPIS LLGTVAMMYL LDYSLDNISL LAITLAVGLV VDDAIVMLEN IVRHIEEGMP
PLQAALVGSK EVSFTIMSIS LSLVAVFIPI FFMPGVIGLL FHEFAGVVGI AIMMSALVSL
TLVPVMTSRY IVQHESEGRS LKWTAWFERG FIRTLGVYER FLDLALLHRR TVLGIAMGTF
IATAGLFVVL PKGFFPTEDI GQALITVDAV EDISFPAMTE LLQQTGEIMR ANPAVDTLIV
NATESNSARL FMTLKPRSER PPLNKVMEDL RREVSAIPGV NVFINPIQNL RLGGRTSKSR
YQYVMRSVRA EELRGAAEGL MARMRADPIF RDVTSDSQLK GLQAQLNINR DKVNLLGVQM
SDIRSLLYSA FGERQVSTIY TSSDSYPVIL QVASEDRADE SAFDKIYLRG KNGMLVPLSS
VASVERQVGP LAINHSGQLE SLTISFNLAP GAALGEASVR IEKFKRELNV PASILTSYAG
DAAAFQSSQA SQVILIIGAL LVIYVLLGVL YESYIHPVTI LSGLPSAAVG ALGMLWLFNM
ELSIIAMIGI LMLIGIVKKN AIMMIDFALD AQRNEGMTPQ EAIRTACLLR FRPIMMTTLA
ALMGALPIAL GLGAGAELRQ PLGLAVVGGL LFSQVITLFI TPVIYLYLDR YSGKGPLKLE
TGDLAGQRA