Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2208 |
Symbol | |
ID | 3832883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2307566 |
End bp | 2308630 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637830130 |
Product | arsenical-resistance protein |
Protein accession | YP_431040 |
Protein GI | 83591031 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00142179 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.156991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACCAG TAAATAATTA TCAACCCGTG GCCAGGTTAT CCCTCCTGGA TCGCTTTTTG ACCCTGTGGA TCTTCCTGGC CATGGCCCTG GGCGTGGGCC TGGGTTATAT GGTGCCCGGG GTAGCGGATG CTCTGAATAA AATGTCTGTG GGCACAACTT CCATCCCCAT CGCCATCGGC CTTATCGTCA TGATGTACCC GCCCCTGGCC AAGGTCAAGT ACGAGGAACT GGGCAAGGTA TTCCGCAACG GCAAAGTTAT GGCCTTATCC CTTATCCAGA ACTGGATTAT CGGTCCCATC CTGATGTTTG TCCTGGCCAT TCTCTTTTTG CACAATTATC CGGAATACAT GGCCGGCCTC ATTTTAATCG GCCTGGCGCG CTGCATCGCC ATGGTCATCG TCTGGAACAG CCTGGCCCGG GGCGACGCCG AGTACGCCGC CGCCCTGGTG GCCTTGAACT CCATCTTCCA GGTTATCTTT TATTCAATTT ACGCCTATAT TTTCATCACC CTGCTACCTT CCTGGCTGGG TTTTCAAGGA ATGAAAGTCC ACGTTTCTAT CGGCGAGGTG GCCACCAGCG TGGCCATCTA CCTGGGCATC CCCTTCCTGG CCGGGGTGTT TACCCGTTTT ACCCTCATAC CGGCCAGGGG CAAAGAATGG TATGAGAAAA CCTTTGTCCC CAAAATCAGC CCCCTGGCTT TAATCGCCCT GCTATTTACT ATTGTGGTCA TGTTTTCCCT GAAAGGCCAG TATATCGTTT CGTTACCCAT GGATGTAGTC AGGATCGCTG TACCCCTGAT CTGCTACTTT GTTATTATGT TTCTCATTTC ATTTTTCGTC AGCATGCGCC TGGGGGTCAA CTACGAGCAA ACCACGACCC TCTCCTTTAC GGCCGCCAGC AACAATTTTG AGCTGGCCAT CGCCGTGGCC GTGGCCGTCT TCGGCATTAA TTCCGGCCAG GCCTTCGCAG CGGTCATCGG GCCCTTAATT GAGGTGCCAG TGATGATCGG CCTGGTGAAC GTCGCCCTGG GTTTCCAGCG GCGATATTAC GGCGTAGGGG GATAG
|
Protein sequence | MAPVNNYQPV ARLSLLDRFL TLWIFLAMAL GVGLGYMVPG VADALNKMSV GTTSIPIAIG LIVMMYPPLA KVKYEELGKV FRNGKVMALS LIQNWIIGPI LMFVLAILFL HNYPEYMAGL ILIGLARCIA MVIVWNSLAR GDAEYAAALV ALNSIFQVIF YSIYAYIFIT LLPSWLGFQG MKVHVSIGEV ATSVAIYLGI PFLAGVFTRF TLIPARGKEW YEKTFVPKIS PLALIALLFT IVVMFSLKGQ YIVSLPMDVV RIAVPLICYF VIMFLISFFV SMRLGVNYEQ TTTLSFTAAS NNFELAIAVA VAVFGINSGQ AFAAVIGPLI EVPVMIGLVN VALGFQRRYY GVGG
|
| |