Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0999 |
Symbol | |
ID | 3833302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1027223 |
End bp | 1028683 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828928 |
Product | EmrB/QacA family drug resistance transporter |
Protein accession | YP_429857 |
Protein GI | 83589848 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00068647 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00553022 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTTGCGA AGACAAAAGA AAACTATAAG TGGTATGCCC TTTCCTGTAC TACCCTGGGG GCCCTGCTAT CTGTGCTTAA CAGTAATACC CTGCTTATCG CTTTGCCGGT TATCGCCAGG GCCCTTCATG CTTCCCTGGA AACCATTATC TGGACCTTGA TGATTTATAT GCTGGCAGTC ACAGTAATGG TGCCGGCAAT CGGCAGGGTG GCAGATATTA TCGGCCGGAA GAAGCTTTAT GTAAGCGGCT TTGCCCTCTT TACCGTGGCA TCTTTACTGT GCGGACTGGT CCAGTCGGGG GGGCAGCTGG TAGCGGCCCG CTTCATTCAA TCGGTGGGCG GTTCCTTGAT GCTGGCCAAC AGCACTGCCA TCGTCACCGA CGCTTTTCCT AAAGGCCAGC TGGGGCGAGC CCTGGGGATC AACAGTATGG TTATTGGTGC CGGGGCGGTA ATCGGGCCCA TCCTGGGAGG CCTACTAACT TCCTGGCACT GGCGCTGGAT ATTTTTCTTT AACGTGCCCC TGGGGATTAT CGGTACCCTG TGGGCGGCTA TCCAGCTCAG GGAAATAATC GAATTGCCCG AAGGCCAGCG CTTCGATTGG CTGGGGACCT CGCTCTTCAC CATTGGTTTT ACCTTCATCC TCCTGGCCCT GACCTTTGGG GATATGGTCG GCTGGCATAC GCCCTGGATA GTAGCCAGCC TGGTTGGCGG CAGCCTGCTC ATGTTGCTCT TCATTTATAT AGAAAACCAC GTGGATCAGC CCATGCTGGA TCTGTCCCTT TTTCGGCAGC GCTTGCTGGC GGCGGCCTAT GCCAGTAACC TTTTAAACGG CATAGCCCGC GGGGCGGTGA CCTTTTTGTT GATTTTCTTT TTCCAGGGCA TCTGGGGTAT TGACCCCCTG TGGGCCGGTA TCTTATTGAC CCCCTTCGCC CTGGCGATGA TGTTCGTAGC TCCGGTGAGC GGTATTTTAT CCGACCGGTA CGGTTCCCGG GAACTCAGCA GCCTGGGATT GGCGGTTTCG GCCATAGGTC TCTATGGCCT CACCAGGCTC CAGATTAACA CTCCCATGAC GGTAGTTATC CTCTGGATGG TCATCATGGG CCTGGGGTCC GGCTTCTTCT TTTCACCAAA TACCAACGCA ATTATGGGGG CCGTTGCCGC CGAACGCCGC GGCATAGCCG CCGGTACCCG GACCATGATG AATAATGCCG GCATGGTCAT CAGTATTGCC CTGGGGCTGG CCATGACCGC CTCCAGCATG ACACCGGAAG CCATGCAGGG GCTTTTCGCC GGCACCCAGG TGGGTTCCCA GGGTATCGCC GTCCAGGAGT TTATGAACGG CCTGCACCGG GCATTCTGGC TGTCGTTTAT CATTAGCATC GTTGCCGCCG TCGTAGCCCT CATGCGCGGC CCCCACGAGG TTTATTACCA AGAGACCGGC TCCGGTTCGA ATAAGGCCTG A
|
Protein sequence | MLAKTKENYK WYALSCTTLG ALLSVLNSNT LLIALPVIAR ALHASLETII WTLMIYMLAV TVMVPAIGRV ADIIGRKKLY VSGFALFTVA SLLCGLVQSG GQLVAARFIQ SVGGSLMLAN STAIVTDAFP KGQLGRALGI NSMVIGAGAV IGPILGGLLT SWHWRWIFFF NVPLGIIGTL WAAIQLREII ELPEGQRFDW LGTSLFTIGF TFILLALTFG DMVGWHTPWI VASLVGGSLL MLLFIYIENH VDQPMLDLSL FRQRLLAAAY ASNLLNGIAR GAVTFLLIFF FQGIWGIDPL WAGILLTPFA LAMMFVAPVS GILSDRYGSR ELSSLGLAVS AIGLYGLTRL QINTPMTVVI LWMVIMGLGS GFFFSPNTNA IMGAVAAERR GIAAGTRTMM NNAGMVISIA LGLAMTASSM TPEAMQGLFA GTQVGSQGIA VQEFMNGLHR AFWLSFIISI VAAVVALMRG PHEVYYQETG SGSNKA
|
| |