Gene Moth_0999 details


Gene Information

Locus tag: Moth_0999
Symbol:
ID: 3833302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1027223
End bp: 1028683
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 56%
IMG OID: 637828928
Product: EmrB/QacA family drug resistance transporter
Protein accession: YP_429857
Protein GI: 83589848
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
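
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1027223
END_BP = 1028683


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1461, matches "Gene Length"
print(protein_length(length_bp))  # 486, matches "Protein Length"
```

Under these assumptions both computed values agree with the record: 1461 bp and 486 aa.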


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00068647
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00553022
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTTGCGA AGACAAAAGA AAACTATAAG TGGTATGCCC TTTCCTGTAC TACCCTGGGG 
GCCCTGCTAT CTGTGCTTAA CAGTAATACC CTGCTTATCG CTTTGCCGGT TATCGCCAGG
GCCCTTCATG CTTCCCTGGA AACCATTATC TGGACCTTGA TGATTTATAT GCTGGCAGTC
ACAGTAATGG TGCCGGCAAT CGGCAGGGTG GCAGATATTA TCGGCCGGAA GAAGCTTTAT
GTAAGCGGCT TTGCCCTCTT TACCGTGGCA TCTTTACTGT GCGGACTGGT CCAGTCGGGG
GGGCAGCTGG TAGCGGCCCG CTTCATTCAA TCGGTGGGCG GTTCCTTGAT GCTGGCCAAC
AGCACTGCCA TCGTCACCGA CGCTTTTCCT AAAGGCCAGC TGGGGCGAGC CCTGGGGATC
AACAGTATGG TTATTGGTGC CGGGGCGGTA ATCGGGCCCA TCCTGGGAGG CCTACTAACT
TCCTGGCACT GGCGCTGGAT ATTTTTCTTT AACGTGCCCC TGGGGATTAT CGGTACCCTG
TGGGCGGCTA TCCAGCTCAG GGAAATAATC GAATTGCCCG AAGGCCAGCG CTTCGATTGG
CTGGGGACCT CGCTCTTCAC CATTGGTTTT ACCTTCATCC TCCTGGCCCT GACCTTTGGG
GATATGGTCG GCTGGCATAC GCCCTGGATA GTAGCCAGCC TGGTTGGCGG CAGCCTGCTC
ATGTTGCTCT TCATTTATAT AGAAAACCAC GTGGATCAGC CCATGCTGGA TCTGTCCCTT
TTTCGGCAGC GCTTGCTGGC GGCGGCCTAT GCCAGTAACC TTTTAAACGG CATAGCCCGC
GGGGCGGTGA CCTTTTTGTT GATTTTCTTT TTCCAGGGCA TCTGGGGTAT TGACCCCCTG
TGGGCCGGTA TCTTATTGAC CCCCTTCGCC CTGGCGATGA TGTTCGTAGC TCCGGTGAGC
GGTATTTTAT CCGACCGGTA CGGTTCCCGG GAACTCAGCA GCCTGGGATT GGCGGTTTCG
GCCATAGGTC TCTATGGCCT CACCAGGCTC CAGATTAACA CTCCCATGAC GGTAGTTATC
CTCTGGATGG TCATCATGGG CCTGGGGTCC GGCTTCTTCT TTTCACCAAA TACCAACGCA
ATTATGGGGG CCGTTGCCGC CGAACGCCGC GGCATAGCCG CCGGTACCCG GACCATGATG
AATAATGCCG GCATGGTCAT CAGTATTGCC CTGGGGCTGG CCATGACCGC CTCCAGCATG
ACACCGGAAG CCATGCAGGG GCTTTTCGCC GGCACCCAGG TGGGTTCCCA GGGTATCGCC
GTCCAGGAGT TTATGAACGG CCTGCACCGG GCATTCTGGC TGTCGTTTAT CATTAGCATC
GTTGCCGCCG TCGTAGCCCT CATGCGCGGC CCCCACGAGG TTTATTACCA AGAGACCGGC
TCCGGTTCGA ATAAGGCCTG A
 
Protein sequence
MLAKTKENYK WYALSCTTLG ALLSVLNSNT LLIALPVIAR ALHASLETII WTLMIYMLAV 
TVMVPAIGRV ADIIGRKKLY VSGFALFTVA SLLCGLVQSG GQLVAARFIQ SVGGSLMLAN
STAIVTDAFP KGQLGRALGI NSMVIGAGAV IGPILGGLLT SWHWRWIFFF NVPLGIIGTL
WAAIQLREII ELPEGQRFDW LGTSLFTIGF TFILLALTFG DMVGWHTPWI VASLVGGSLL
MLLFIYIENH VDQPMLDLSL FRQRLLAAAY ASNLLNGIAR GAVTFLLIFF FQGIWGIDPL
WAGILLTPFA LAMMFVAPVS GILSDRYGSR ELSSLGLAVS AIGLYGLTRL QINTPMTVVI
LWMVIMGLGS GFFFSPNTNA IMGAVAAERR GIAAGTRTMM NNAGMVISIA LGLAMTASSM
TPEAMQGLFA GTQVGSQGIA VQEFMNGLHR AFWLSFIISI VAAVVALMRG PHEVYYQETG
SGSNKA