Gene Moth_2369 details

Gene Information

Locus tag: Moth_2369
Symbol:
ID: 3832549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2492355
End bp: 2495507
Gene Length: 3153 bp
Protein Length: 1050 aa
Translation table: 11
GC content: 61%
IMG OID: 637830288
Product: acriflavin resistance protein
Protein accession: YP_431194
Protein GI: 83591185
COG category: [V] Defense mechanisms
COG ID: [COG0841] Cation/multidrug efflux pump
TIGRFAM ID: [TIGR00914] heavy metal efflux pump (cobalt-zinc-cadmium)
            [TIGR00915] The (Largely Gram-negative Bacterial) Hydrophobe/Amphiphile Efflux-1 (HAE1) Family
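
The start, end, gene length, and protein length values above are consistent with one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2492355, 2495507)
print(length, protein_length(length))  # 3153 1050
```

Both results match the Gene Length and Protein Length fields of the record.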


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000403174
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACTGGT CCCAGGCAGT CATCAAACGC CCGGTAGCTT TAACTATGGT GGTCCTGGTG 
GTCATCCTCA TGGGTGTGGT GTCCCTGTCC CGCCTGAAGG TCGACCTTTT ACCCGACATG
AAGTTGCCCT ACGCGGCCGT GATTACCTCC TATAGCGGGG CCGGCCCCGA GGAAATCGAA
AAGACGGTGA CCAGGCCCCT GGAGGACGCC CTGGGTACCG TCCAGGGGAT TAAGAATATC
CGTTCCATGA GCATGTCCGG GAGCTCCGTT ATCATCCTGG AGTTTAACTG GGGCCAGGAC
ATGGACTTTG CCACCCTGAA CATGCGCGAA AAGATTGACC AGATCGAGAG CAGGCTCCCT
GACGGGGTGG ATAAGCCCAT GGTCATGAAA ATGGACCCCA ATATGTTTCC GGTCATGACC
CTGGCCCTGC ATGGCGACCT GGACCAGCAG CGGCTGAAAG ATATAGCTGA AAACACCGTC
AAGAACCGCC TGGAACGCCT GGACGGGGTG GCGGCGGTCA ACGTCACCGG CGGCCTGGAG
CGGGAGATCC AGGTCCTGGT GGACCCGGCT CGCCTGCAGA CATTCGGTCT CTCCATCAGC
CAGGTGGTCC AGGCCCTGCA GACTGAGAAT ATAACCTCCT CCGGCGGCCA GGTGACCGAC
GCCGGCAAGA AAGTCCTGGT GCGGGTTAAC GGGGAGTTTA ACAACCTGGA CCAGATCCGC
CAGGTGGGCC TGACCACCCC CGGGGGGGCG GTGGTGCGCC TGGGCGACGT GGCGACGGTC
AAGGATACGA CGGCCGAACA GAAGCAGTTT GCTCTTTTCG ACGGTAAACC GGCCATCGGC
CTGTCCATCC AGAAACAAAC CAACGGCAAT ACCGTCCAGA TATCCCATGC CGTCAAAAAA
GCCCTCCAGG AGCTGCAGCA GGAACTGCCG CCCGGGGTGA CCATTGAAGC GGTCAACGAC
CAGTCAAAGT ATATCGAGTC GGCCATTAAC ACCGTCTATC GGGACATGAT CCTGGGCGGC
CTGCTGGCCA TGTTGATTAT TTTTCTCTTT TTGCGCAGCT TTCGCAGCAC CATTATCATC
GGTTTGACCA TCCCCATCTC GGTGATTACC ACCTTTGTCC TGCTCTATTT CAACCACATG
ACCCTGAACA TGATGACCCT GGGGGGCCTC TCCCTGGGGA TAGGCCGGAT GGTCGACGAT
GCCATCGTCG TTTTTGATAA CATCTACCGC CACCGCCAGG GAGGGCAGGA CGCCATGACG
GCGGCGGCCG GCGGCGCCCA GGAAGTGACC ATGGCGGTGG TGGCTTCCAC CCTGACGACG
GTGGGTGTCT TTTTACCTAT TGCCTTTGTC GAAGGCCTGG CGGCCCAGAT CTTCGGACCC
CTGGCCCTGA CGGTGACCTG CTCCCTTCTA GCCTCCCTGG CGGTTTCCCT GTCGGTTACC
CCCGCCCTGG CTTCCCGGAT CCTTAAGGGC AACCTGCCGC CGGAGGCCAC GGCGGCCCGG
GGCTTCCGGC AGCACCTGGT CACCGGGTAC TGGATGACCC GGTTGAGCGA TTCCTACCGC
CGCTTCCTGG CCTGGGCTTT AAACCACCGC AAACTGGTGG TGGCTGCCGT TCTCCTCGTC
TTTGTGGGCA GCCTGGCCCT GGCACCGGCC GTGGGCTTTG AATTTATGCC CCAGACGGAC
GAGGGAAGCA TCAGCATGAC CATCGAATTG CCCCGGGGAA CGGAGCTGGC AACCACAGCG
GCCATGACCG ACCGGGTAGT GCATTTGATC CAGCAGCAGC CGGAAATCCA GAGCATCTAC
CAGGAAATCG GTAGTGGCGG CGGGCAGAGT TCCTTTCTGG GTGGCGAAAC CCCGGAAATG
GCCAGTATCA ATTTGACCCT GGTGCCCTTA AAGCAGCGGC AGCGGAGCGC CGCGGAGGTG
GCCGCGGCCA TCCGCCGGTC GGTGGCCGGC ATTGCCGGGG CGAGGATTAC CGTAACACCA
ACCTCGTCTT TTATGGGCAG TACGGGGCAG GCTCCTGTCC AGGTGGATAT TCACGGTGAC
GATTTGAAGG TCCTGCAGGA TCTGGCGGAA AAGGTCCAGG AAGCAGTGGC CCGGGTGCCG
GGCACGGTGG CCGTGGACAG CAGCATCACC CGGGGGCGGC CCCAGGTAGA AATCCTGGTC
GACCGGGACA GGGCCGCCCT GTACAACCTG GGCGCGGCCC AGATAGCCGC CACCGTTTCT
ACAGCCGTAG GGGGCCAGGT GGCCAGCCGT TACCGGGTCG GCGGCGATGA GTATGACATC
CGCGTCCAGC TGCCGGCGGA CCGGCGCCAG GATTTAAACA GCCTGGCCAA TTTGATGGTA
CCTTCCCCCA GGGGGACCCA GGTGCCCCTG AAAGAAATCG CCACCCTGCA GATGGATACC
ACCCCCAGCA CCATCAACCG TTACAACCAG GACCGGGTGG CCAGCATCAC CGCCAACCTG
GGCGACCGCC CTTTAGGAGC GGTTATGCAG GACATCCGCC GGGAGGTTGC CAGGATCAAC
CTCCCGCCTG GCTACAGCAT CGAGTATACC GGCCAGAACC AAATGATGAT GGAAACCTTC
GGTCAACTGG GACTGGCTTT GATCCTCGCC ATTGCCCTGG TATACATGAT CATGGCGGCC
CAGTTTGAAT CCCTGCTCCA TCCCTTTGTC ATCATGTTCG CCATCCCGGT GGCCATCACC
GGGGTTATCC TGGCCCTCCT GGCCACCGGT CGTACCTTTG ACGTGGTGGT CTTCATGGGA
ATCATCATGC TGGTAGGTAT TGTCCTCTCC AACGCCATTG TCCTGGTGGA CTATATCAAC
ATCCTGCGCC GGCGGGGCAC ACCGCGCCGT GAGGCTATCC TGATCGCCGG CGGTAACCGC
CTGCGGCCGA TTTTAATGAC CGCCCTGGTA ACCATCCTGG CGATGCTGCC CCTGGCCATG
GGTATAGGTG AAGGGGCGGA GATGAACGCC GGCCTGGGCA CCGCCGTCAT CGGCGGCCTC
ACGGTGTCCA CCATCCTGAC CCTGGTCCTG GTGCCGGTTC TCTATACACT CTTTGAAGAC
CTTGGCCAGC GCCTGGGTCG CTTCCTGCGC CTGCCCGGGT ACCGGCAGAA GCTGGACGCT
TCCGGCGCCG GTACGGGAAC CACAGCAGGT TAG
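
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following sketch, with hypothetical helper names, removes the whitespace and computes the G+C fraction, which is one common way a GC content figure such as the one in the record could be derived. It is shown here only on the first two blocks of the sequence, not on the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_two_blocks = "ATGAACTGGT CCCAGGCAGT"
seq = clean_sequence(first_two_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # 20 0.55
```

Passing the whole gene sequence to `clean_sequence` and then to `gc_fraction` gives the value for the full CDS.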
 
Protein sequence
MNWSQAVIKR PVALTMVVLV VILMGVVSLS RLKVDLLPDM KLPYAAVITS YSGAGPEEIE 
KTVTRPLEDA LGTVQGIKNI RSMSMSGSSV IILEFNWGQD MDFATLNMRE KIDQIESRLP
DGVDKPMVMK MDPNMFPVMT LALHGDLDQQ RLKDIAENTV KNRLERLDGV AAVNVTGGLE
REIQVLVDPA RLQTFGLSIS QVVQALQTEN ITSSGGQVTD AGKKVLVRVN GEFNNLDQIR
QVGLTTPGGA VVRLGDVATV KDTTAEQKQF ALFDGKPAIG LSIQKQTNGN TVQISHAVKK
ALQELQQELP PGVTIEAVND QSKYIESAIN TVYRDMILGG LLAMLIIFLF LRSFRSTIII
GLTIPISVIT TFVLLYFNHM TLNMMTLGGL SLGIGRMVDD AIVVFDNIYR HRQGGQDAMT
AAAGGAQEVT MAVVASTLTT VGVFLPIAFV EGLAAQIFGP LALTVTCSLL ASLAVSLSVT
PALASRILKG NLPPEATAAR GFRQHLVTGY WMTRLSDSYR RFLAWALNHR KLVVAAVLLV
FVGSLALAPA VGFEFMPQTD EGSISMTIEL PRGTELATTA AMTDRVVHLI QQQPEIQSIY
QEIGSGGGQS SFLGGETPEM ASINLTLVPL KQRQRSAAEV AAAIRRSVAG IAGARITVTP
TSSFMGSTGQ APVQVDIHGD DLKVLQDLAE KVQEAVARVP GTVAVDSSIT RGRPQVEILV
DRDRAALYNL GAAQIAATVS TAVGGQVASR YRVGGDEYDI RVQLPADRRQ DLNSLANLMV
PSPRGTQVPL KEIATLQMDT TPSTINRYNQ DRVASITANL GDRPLGAVMQ DIRREVARIN
LPPGYSIEYT GQNQMMMETF GQLGLALILA IALVYMIMAA QFESLLHPFV IMFAIPVAIT
GVILALLATG RTFDVVVFMG IIMLVGIVLS NAIVLVDYIN ILRRRGTPRR EAILIAGGNR
LRPILMTALV TILAMLPLAM GIGEGAEMNA GLGTAVIGGL TVSTILTLVL VPVLYTLFED
LGQRLGRFLR LPGYRQKLDA SGAGTGTTAG
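
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. It does not handle alternative start codons (this gene starts with ATG) or ambiguous bases, and all names in it are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # remove spaces and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("ATGAACTGGT CCCAGGCAGT CATCAAACGC"))  # MNWSQAVIKR
```

The output matches the first block of the protein sequence above.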