Gene Moth_2324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2324 
Symbol 
ID3831076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2445061 
End bp2446665 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content60% 
IMG OID637830248 
ProductEmrB/QacA family drug resistance transporter 
Protein accessionYP_431154 
Protein GI83591145 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.937948 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGTTT TGGCCGATCA TGAAAGGGAA CAATCGGGGG CAGGAGCAGG CCTGCCGGCA 
GCCGACGGCG AGGTACAGGG CTCCTGGATT ATCCCCGTCC TGGTGGCCCT CATCGGGGCT
TTTATGTCCA TCCTGGACTC CAGCATCGTC AATGTGGCCA TCCCGACCAT CATGCATGTT
TTTAATACCG ATACCAGCAC GGTCGAGTGG GTGGTCACCA TTTACATGCT AGCGCTGGGG
GTTATTGTCC CCTTAAGCGG CTGGCTGGGT GACAAGCTGG GCTTCAAAAA GTTATACGTC
ATCGCTCTGG TTATATTTAC CTTCGGGTCA CTCCTGTGCA CCTTGAGCTG GAACGTCGAT
TCCCTCATCG CGGCCCGGGT GGTCCAGGCC CTGGGCGGCG GTTTGATCAT GCCCACTACC
ATGGCCATGA TTTATCGTAT GGTACCGCGG GAAAGAATTG GCAGCGCCAT GGGAGTGCTG
GGGATTGCCC TCTTTGTGGC GCCGGCCATC GGGCCGACCC TGGGCGGCTA CCTGGTGGAG
TATGTTGACT GGCGCTGGAT TTTTACCATT AATCTGCCCA TCGGGGTGCT GGGGGTGCTG
CTTTCCCTGG TCCTCCTGCC AGATTTCCCG GCTGCCGAAG CGGGCAGGCT GGATATCGGG
GGGGCCGTAA CGGCGGCGGT AGGCCTTTTT ACCCTCCTCC TGGCCCTGAG CAAGGGCGCG
GACTGGGGCT GGACCTCAGA AGCCACCGTC TTCCTGTTTT ACACCAGCGC GGTTTCCCTC
GGCCTCTTTA TTTACCTGGA ACTTACCTGT GCCAACCCCC TCCTGGAGCT GAGGGTATTC
CGCTATCCGG CCTTTACCCT GGCCAATCTC ATGGTGGTGG TAACCACCAT TGGCCTTTTT
GGCGGCATTT TCTACGTCCC CCTTTTTCTC CAGACCGTCC GCGGCCTGGG AGCTATGGAA
ACGGGTCTGC TGTCCATGCC CGGCGCCCTG GCCTCGGCGC TGATGATGCC GGTAACCGGC
CGCCTCTACG ACCGCATCGG CCCCCGCCTG ATGGCGGTGA CCGGGCTGGT AGTGCTGGCG
ATAACAACCT ATCTCTTTCA CTTCTTAGAT ATCGTTACCC CCGACAGGGT CATCATTACC
TGGCTGATCC TGCGGAGCGT TAGCATGTCT TTTGCCTCCA TGCCGGCCCA GACGGCGGCC
CTGGCGGGGC TGCCGCCAGA ACTGGTAGGC CGGGCTTCGG CCATGACCAA TATTATCAAC
CGGGTGTCGG GTTCTTTCGG GATAGCCATC TTGACCTCGA TTTTAAATCA CCGTACGGCC
CTGCACGCTA CGCAGCTAGC AAGCCAGATC ACAGCGGACA ACCCGGCCGT TACGGCCTTT
TTCCAGCAGG TGGCTCTCTA CCTGGGGAGC GGGTCGGCAG CGACCCAGGT GAAGAGCCTG
GGTACCACTT ATCTGGCAGG ACTGGTGTCC CAGGCGGCTT TTATACGAGG TATTGACGAC
ATTTTTGTCG TGATGACCGG TTTTGCCCTG GCCGGCGTCC TCCCGGCCTT TTTCCTCCAA
AAAGGGCCTG GCGGCGCCCG GCCGGGCTTT GGCGGCGGCG AGTAA
 
Protein sequence
MKVLADHERE QSGAGAGLPA ADGEVQGSWI IPVLVALIGA FMSILDSSIV NVAIPTIMHV 
FNTDTSTVEW VVTIYMLALG VIVPLSGWLG DKLGFKKLYV IALVIFTFGS LLCTLSWNVD
SLIAARVVQA LGGGLIMPTT MAMIYRMVPR ERIGSAMGVL GIALFVAPAI GPTLGGYLVE
YVDWRWIFTI NLPIGVLGVL LSLVLLPDFP AAEAGRLDIG GAVTAAVGLF TLLLALSKGA
DWGWTSEATV FLFYTSAVSL GLFIYLELTC ANPLLELRVF RYPAFTLANL MVVVTTIGLF
GGIFYVPLFL QTVRGLGAME TGLLSMPGAL ASALMMPVTG RLYDRIGPRL MAVTGLVVLA
ITTYLFHFLD IVTPDRVIIT WLILRSVSMS FASMPAQTAA LAGLPPELVG RASAMTNIIN
RVSGSFGIAI LTSILNHRTA LHATQLASQI TADNPAVTAF FQQVALYLGS GSAATQVKSL
GTTYLAGLVS QAAFIRGIDD IFVVMTGFAL AGVLPAFFLQ KGPGGARPGF GGGE