Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1994 |
Symbol | |
ID | 3832327 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2075299 |
End bp | 2077872 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637829923 |
Product | heavy metal translocating P-type ATPase |
Protein accession | YP_430833 |
Protein GI | 83590824 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2217] Cation transport ATPase [COG2608] Copper chaperone |
TIGRFAM ID | [TIGR00003] copper ion binding protein [TIGR01494] ATPase, P-type (transporting), HAD superfamily, subfamily IC [TIGR01511] copper-(or silver)-translocating P-type ATPase [TIGR01525] heavy metal translocating P-type ATPase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000675494 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGTTA ATGTAACAAG CAATCTGGCC CAGGTAAACT TACCTGTTCA GGGTATGAGC TGCGCCGCCT GTGTGGCCAA AGTGGAAAAG GCCTTGAAGA ATATGCCGGG TGTTGAAGAG GCCCGGGTAA ACCTCCTCAC CGGCAGGGCC GCGGTAAAAT ATCACCCCGA CCGCGTAAGC ATCCCCCAGA TAGCCAGGAC CATCCAGGAG ATAGGCTACG AGGTGCCGGA GGAGGAGATG CTCTTAACGG TACGGGGCAT GAGCTGCGCC GCCTGCGTGG CCAAAGTAGA AAAGGTGGTC AAAGGTATAC CCGGAGTAAC CTCGGTGGCG GTCAGCCTCC CGGCCGAATC AGCCCGCATC CGTTATTACC AAGGTACAGT GGACCGGGCG CGCATCAAAA AGGAAATCAA CGCCCTGGGT TATGAAGCTA CGGAGAAGAT CTCCGGACAG GCTGCCCTGG ACCGGGAGAA AGAGGCCCGG GAACGGGAGA TCAGGTACCA GCGCCGCAAT ATGTGGATCG CCTGGCCCCT GGCGACCTTG GTGATGATCG GCATGTTCCG TGATATGTGG ATTTTCCCCT ACTTCGTACC CAAATGGCTG GGGAATGTTT ATGTCCTCTG GGCTTTGACT ACGCCGGTAG CCTTTATTCC CGGCTGGCAG TTCTTCGTCC ACAGCTGGAA CGGCCTCAAG CGCGGGGCTA CCGATATGAA CCTCCTCTAC GCCACCGGTA TTGGCGCCGC CTACATCATC GCCACTATCA ACACCCTGTG GCCGGAGGCC GGTTTCGGCG GGCGCGGGGC CACCTTCTTC GAGTCCGCCG CCCTCTTAAC CGCCTTCATT GTCCTGGGTC GTTACCTGGA GGCTATCACC CGCGGCCGTA CTTCCGAGGC CATCCGCAAG CTCATGAGCC TGCAGGCCAA AACAGCCCGG GTCATCCGGG ACGGCCAGGA GATGGAGATT GCTGCCGATG AGGTCGAGGT TGGGGATATC GTCGTCGTCC GACCGGGCGA GAGCATCCCG GTAGACGGGG AAGTCGTGGA AGGATATTCG GCAGTCGATG AATCCATGAT CACCGGCGAG AGCATCCCGG TGGAAAAACG CCCCGGTGCC CAGGTAGTGG GGGCAACCAT CAATAAAACC GGGTCCTTCA AGTTCCGGGC CACCCGAGTG GGGAGCGAGA CGGCCCTGGC TCAGATAATC AAGATGGTAG AAGAGGCCCA GGCTTCCAAG GCTCCTATCC AGAGGCTGGC CGACTTTGTA GCCGGCCACT TCATCGCTGG CGTCCACGTC CTGGCCCTCA TCGTCTTCTT TTTCTGGTTC TTTATTGGCT ACGACGCCTT CTTCCGGCCT GACAGCCACT TTATCCTGTC GCCCTACAGC CTGGCCCAGG TGGGTGTCTT CGGCTTCGCC CTGTTACTCT CGGTAACCAC CCTGGTCATT TCCTGCCCCT GTGCCTTAGG GCTCGCCACC CCCAGCGCCG TCATGGCCGG TACAGGCAAG GGTGCTGAGA ATGGCATCCT TTTCAAGGGT GCCGATGCGG TAGAGGCGAG CAGCAAGCTC AATGCCATTG TCTTTGACAA AACAGGGACC TTGACCAGAG GCGAGCCCTC GGTCACCGAT GTGATTGTCG CTCCAGGTTT TGAGCAAAAA GAAATCCTGC GGCTGGCCGC TATGGCAGAA AAAACCTCCG AACATCCCCT GGGCGAGGCC ATTGTCCGCA ACGCCGTGGA AAAAGGCTTA GAATTAGAGG AAGTAGAAGA CTTTGAGGCC ATCCCGGGCC ATGGTGTCCG GGCCATCTAC CAGGGGCGAG AAATCCTGCT GGGCAATCGC CGGCTGATGC AGCAGCGTAA CATCGCCATC AGCGACCTGG CCGGGCACAT GGAGAAACTC GAAGAAGAAG GCAAAACCGC CATGTTAATG GCAGTAGACG GCAGGGCCGC CGGGATCATC GCTGTGGCCG ACACTTTAAA AGAGCACGTA AAGGTTGCCA TTGAACGCCT GCACAAGATG GGTATCCAGG TGGCCATGAT CACCGGTGAT AACCGGCGGA CGGCTGCGGC CATCGCCCGC CAGGTAGGTA TAGAGACCGT CCTGGCCGAG GTGCTGCCTC AGGACAAAGC CGAAGAAGTG AAGAAGCTCC AGGAAAAGGG CCTTAAAGTG GCCATGGTGG GGGATGGCAT CAACGACGCC CCGGCCCTGG CTCAGGCTGA CGTGGGCATC GCCATCGGCT CAGGTACCGA TGTGGCCAAG GAGACGGGGG ATATTATCCT CATCAAGGAC GACATCCGTG ACGTGGTGGG AGCCATAGAG ATCGGGCGGG CCACTATGCG CAAGATCAAG GAGAACCTTA TCTGGGCCTT CCTCTATAAT TCCCTGGGTA TTCCCATCGC CGCTGGCATC CTCTACCCCA TCACAGGGTT AATCGTCAGC CCCGAGCTGG CTTCATTCTT CATGGCCATG AGTTCAATCT CCGTGACCCT GAATACCTTG ACCCTGAAGC GCTTCCGGCC TTCCCTCCGC GCGGAGCGGG AGGAAGCTGT ACCCCGGCAC CGGCCGGCAC CCCAGGCAGG CTAA
|
Protein sequence | MTVNVTSNLA QVNLPVQGMS CAACVAKVEK ALKNMPGVEE ARVNLLTGRA AVKYHPDRVS IPQIARTIQE IGYEVPEEEM LLTVRGMSCA ACVAKVEKVV KGIPGVTSVA VSLPAESARI RYYQGTVDRA RIKKEINALG YEATEKISGQ AALDREKEAR EREIRYQRRN MWIAWPLATL VMIGMFRDMW IFPYFVPKWL GNVYVLWALT TPVAFIPGWQ FFVHSWNGLK RGATDMNLLY ATGIGAAYII ATINTLWPEA GFGGRGATFF ESAALLTAFI VLGRYLEAIT RGRTSEAIRK LMSLQAKTAR VIRDGQEMEI AADEVEVGDI VVVRPGESIP VDGEVVEGYS AVDESMITGE SIPVEKRPGA QVVGATINKT GSFKFRATRV GSETALAQII KMVEEAQASK APIQRLADFV AGHFIAGVHV LALIVFFFWF FIGYDAFFRP DSHFILSPYS LAQVGVFGFA LLLSVTTLVI SCPCALGLAT PSAVMAGTGK GAENGILFKG ADAVEASSKL NAIVFDKTGT LTRGEPSVTD VIVAPGFEQK EILRLAAMAE KTSEHPLGEA IVRNAVEKGL ELEEVEDFEA IPGHGVRAIY QGREILLGNR RLMQQRNIAI SDLAGHMEKL EEEGKTAMLM AVDGRAAGII AVADTLKEHV KVAIERLHKM GIQVAMITGD NRRTAAAIAR QVGIETVLAE VLPQDKAEEV KKLQEKGLKV AMVGDGINDA PALAQADVGI AIGSGTDVAK ETGDIILIKD DIRDVVGAIE IGRATMRKIK ENLIWAFLYN SLGIPIAAGI LYPITGLIVS PELASFFMAM SSISVTLNTL TLKRFRPSLR AEREEAVPRH RPAPQAG
|
| |