Gene Moth_1211 details


Gene Information

Locus tag: Moth_1211
Symbol:
ID: 3832978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1249466
End bp: 1250890
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 53%
IMG OID: 637829144
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_430068
Protein GI: 83590059
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
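
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1249466
end_bp = 1250890

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Split the CDS into codons; a remainder of 0 means a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.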


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000173794
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTGCAAA ATGATGCTTA CCCGGCGCCG AATCCCCGGC GCTGGCTCAT ACTATTTGCT 
GTTATGGCCG CCGGGATTAT GGGGCCCATT GACGGCAGTG TTGTCAACGT TGCCCTACCC
ACTATAGGAC GAGTTTTTAA TGTTGACTTA AATACCGTAG GCTGGGTATC CATGGCCTAT
CTTCTGGTCC TGGGAAGTTT AATCTTGACC TATGGTCGAC TGGGAGATAT GTACGGCTTT
CGCCGGGTGC TTTTAACGGG TATTGTCATA TTTACAATAG CATCCGGCAT TTGCGCCCTG
GCACCCAATA TCTGGGTTCT CATAGTTTTC CGGGCTGTCC AAGCTATCGG GGCGGGCATG
TTTATGGCCA TGGGCCCGGC CATAATTACC TCAGTATTTC CGCCCTACGA GCGCGGTCGT
GCCCTTGGAA CCAACGGGAT GATTATTGCC GTAGGTCTGG CTTTGGGACC GACCCTGGGC
GGTTTTCTGG TTACGGTGGC CGGTTGGGAG GCTATTTTCA CCATTAACAT TCCCATCGGG
ATCATCAGTT ATATTATGTG CCGGCAGGTA GTACCGGAAT CCAGGGATTT AAAACCGCAA
CAATTTGATC TCATTGGCGC GGCTATGGGC TTTATTTCCT TGAGTGCTTT CCTCCTGGCC
GGGAGTTATG GCGAAGAGTG GGGTTGGACC TCGCCGGCTA CCCTGGTGTT AAGCGTGGTA
TTTCTTGTTG GCGGGTGGCT TTTTTTACGC TGGGAAAAAC GCGTGCAGGA GCCAATGCTT
GATTTAACCC TTTTTCACAA TAAAGTCTTC AGCGCCGCCA ATTTTGCCGC CTTGATGAAT
TTTATGTCCC AGTATGCCCT AATTTTTTTG CTCCCCTTCT ATTTACAGCA GATATTAAAC
TATACCGCCG GGCATACCGG TTTGATTCTT ACGGCTTCCC CATTGGTGGT TTTAATGTTG
GCGCCCGTGA GCGGTGCCTT ATCAGATCGC CTGGGGACCC GTTGGCTGGC TTTTACCGGC
CAGGCCATCG TCAGTCTGGC CCTTTTCCTG ATGGTGGGTC TGAAGGTTAC GTCCCGGGCC
TTTGATATCA TCTGGCGTCT TTGCCTCTTC GGACTTGGTA CCGGTATTTT TCAATCTCCC
AATAATAGCG CTGTTATGGG TAGTGTCCCT CGCCATCGTC TGGGTATAGG TTCCGGTGTC
CTGGCCACAG TCCGCAACGT GGGGATGGTC TTGGGTATCG CCGTGAGCAG CGGGGTGTTT
ACGTGGCAGC GTTCAGCTAA GCTGGTAGCT TGGGGGCCCG GAAGTGCAAC TGCGGCCTTT
ATGGCTGGCA TGAAGTCCGC CTTCCTTGTT GGAGCCATAC TTGCGGCTGC CGGCGCCATA
GCTTCCCTGG TCAGGAGCGA TAATTTCCCC CCCGGCGAGC ATTAA
 
Protein sequence
MLQNDAYPAP NPRRWLILFA VMAAGIMGPI DGSVVNVALP TIGRVFNVDL NTVGWVSMAY 
LLVLGSLILT YGRLGDMYGF RRVLLTGIVI FTIASGICAL APNIWVLIVF RAVQAIGAGM
FMAMGPAIIT SVFPPYERGR ALGTNGMIIA VGLALGPTLG GFLVTVAGWE AIFTINIPIG
IISYIMCRQV VPESRDLKPQ QFDLIGAAMG FISLSAFLLA GSYGEEWGWT SPATLVLSVV
FLVGGWLFLR WEKRVQEPML DLTLFHNKVF SAANFAALMN FMSQYALIFL LPFYLQQILN
YTAGHTGLIL TASPLVVLML APVSGALSDR LGTRWLAFTG QAIVSLALFL MVGLKVTSRA
FDIIWRLCLF GLGTGIFQSP NNSAVMGSVP RHRLGIGSGV LATVRNVGMV LGIAVSSGVF
TWQRSAKLVA WGPGSATAAF MAGMKSAFLV GAILAAAGAI ASLVRSDNFP PGEH
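
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is given 5' to 3' on the coding strand and uses the codon assignments of NCBI translation table 11, in which the alternative start codon TTG at the first position is read as methionine. The function name `translate_cds` and the constant names are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the NCBI base order T, C, A, G (third base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, reading the first codon as Met."""
    # Drop the spaces and line breaks used to group the bases in tens.
    dna = "".join(dna.split()).upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    protein = ["M"]  # any valid initiation codon is translated as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("TTGTTGCAAA ATGATGCTTA CCCGGCGCCG"))
```

For the first 30 bases this yields MLQNDAYPAP, which matches the start of the listed protein sequence.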