Gene Moth_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2080 
Symbol 
ID3831830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2170940 
End bp2172304 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content48% 
IMG OID637830007 
ProductMATE efflux family protein 
Protein accessionYP_430917 
Protein GI83590908 
COG category[V] Defense mechanisms 
COG ID[COG0534] Na+-driven multidrug efflux pump 
TIGRFAM ID[TIGR00797] putative efflux protein, MATE family 


Plasmid Coverage information

Num covering plasmid clones67 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.634791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACAGTCG AACGTTCTCA GGAACTCGAG TCTGGTCCTG TCGGGCGCCT ATTGTGGCAG 
TTTTCCTGGC CGGCTATTGT AGGAATGATG TGCAATGCTC TCTATAACAT TGTAGACCGG
GCATTCGTGG GACGCGGGGT CGGTACTCTG GCAATCGCCG CTACCACCGT AGCCTTCCCC
TTGATGATAA TACTACTGGC GTTGTCCCTT TTGATAGGGG TAGGAGCTAC TGCCTTGATT
TCCATCCGGT TGGGGGAACA AAAAAAGGAA GAGGCGGAAG TGGTAGCAGC CAACGCTACC
TCATTACTGG TATTATTACC TCTCTGTTTT TCGATTATCT ATCTGTTATT TCCAGAACCT
ATTTTAAGAC TGTTCGGGGC CAGTTCCGAG GTTTTGCCTT ATGCCCGTGA TTTTATGCAT
ATTATTATGC TGGGTTCGGT ATTTGGAGGC CTTAGCATGG GCATGAACAA TTTTATCCGA
GCGGAAGGCA ATCCCGTGAT GGCCATGTCC ACCCAGGTAC TGGGTGCTCT AATCAACGGG
GTTTTAAATT ATACATTCGT TTTTCAAGTA GGAATGGGGA TCAAAGGCTC GGCTCTGGCG
ACCGTACTAG GTCAATTATT CTCTACGATA TGGGTGTTAA GCTATTATTT AACCGGCCGC
AGCCTGATCA AATTAAAGTT AAGAAACTTT CGGCCACGGC TGCCAATTCT CTTAAGCATT
GTTTCCATAG GCTTTGCCCC GTTTGCAATG GAACTAGCCA CTTGCCTGCA ACAGGTAATC
TTGAATAAAT CCGTCTTGAC ATATGGCGGT GATTTAGGTT TGTCCGCGGT TGGAATACTT
ATGAGCATTA TCACTTTATT GTTCATGCCC ATTCTGGGCA TCAGCCAGGG TGCGCAACCA
CTTATCGGGT TTAATTATGG CGCCCGCCGG TTTGACCGGG TTAAGGCAAC CTTAAAAAAG
GCGATATTTG CCGGTAGCTG CGTTTCCGTA ACAGGTTATC TGGTTATGCG TATCTGGCCA
GTCGAGATCG CAGGAATATT CACCAAAGGC GACATCGCTC TTACCAGAAT GACTGCCGAC
GCGATGCTCG TGTTTTTCTG CATGATCTTT ATGCTCGGTT TTCAAATCGT ATGTTCGCAA
TATTTCCAGG CCGTGGGCAA AGCGGTACAG GCGGCAATAC TCAGCCTGTC GCGGCAGGTT
CTGTTTTTCA TCCCGTTGCT GCTTATCCTT CCTCACTTCT GGGGCATAAA CGGCGTTTGG
CGAACGGCTC CCATTGCCGA TGGCCTTTCG GTCATAATTA CGGCCGTCTT CATTTTAAAT
GAAATGAAAT CTCTAGCTAC AGAAGCTAAA GAAGCATCTC CCTAA
 
Protein sequence
MTVERSQELE SGPVGRLLWQ FSWPAIVGMM CNALYNIVDR AFVGRGVGTL AIAATTVAFP 
LMIILLALSL LIGVGATALI SIRLGEQKKE EAEVVAANAT SLLVLLPLCF SIIYLLFPEP
ILRLFGASSE VLPYARDFMH IIMLGSVFGG LSMGMNNFIR AEGNPVMAMS TQVLGALING
VLNYTFVFQV GMGIKGSALA TVLGQLFSTI WVLSYYLTGR SLIKLKLRNF RPRLPILLSI
VSIGFAPFAM ELATCLQQVI LNKSVLTYGG DLGLSAVGIL MSIITLLFMP ILGISQGAQP
LIGFNYGARR FDRVKATLKK AIFAGSCVSV TGYLVMRIWP VEIAGIFTKG DIALTRMTAD
AMLVFFCMIF MLGFQIVCSQ YFQAVGKAVQ AAILSLSRQV LFFIPLLLIL PHFWGINGVW
RTAPIADGLS VIITAVFILN EMKSLATEAK EASP