Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1729 |
Symbol | |
ID | 3833029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1777548 |
End bp | 1780301 |
Gene Length | 2754 bp |
Protein Length | 917 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637829653 |
Product | CRISPR-associated helicase Cas3 |
Protein accession | YP_430573 |
Protein GI | 83590564 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGATT TCTGGGCTAA AAGTAATGGC ATTTCTTTAA GCAAGCACAC CGGGGACGTG CTGGCGGCGA CCGGGGTTCT TCAAAAGAAA CTGGGTAATA AGGTCCCCGG CGACTGGTGG CTGGGTTTAA AGTATGCCGC CCTGCTCCAT GACTCAGGTA AAGTTGATCC CGCTTTTCAG GAGCGCATAA AAAAGCAGGG CCAAAAGGCG GGGGACGTCC CGGCAGACGA ACCAAAATAT AAAGATATCC CTCACAGCAT CCTTTCTCTG TTTTTCGTTA GGCCGGAGAA CTTTACTTTT TCCCAGCCCT ACTTGGCCTA TGCGGTTATA TCGGCCATTG TTTTTCATCA CTGGCGCGAG AGCTTTCCTG ACTACTTCCT GGGCAGCCAG GAATATCTCA TCAGGAAAAA GGCCGGGGAA CTGGAGGAAA GGGCGGCTTT ATGGAACCGG TTGAGCCGGG ATTTAGTCGC CGAGCTTACC ATTCTGGCGG AAAAGTACGA ACTGGATCCG GCCGTGATGG GCCTTAACAC CACCTTGACG GAGTATCTAC GTTACAATAC CCTCGGTTCG GCCGGGCTCC TCATCCCGCC TTATACCCTG GTTTATCTTC CAGAACAGAT CAGGGCCAAA GCCGCCCGGG GAGATGAAAA AGATAGAGCC AGGGTTTTTA TTGCCGGCAA TCTCATGCGC GCCGACCACT TTGCCTCAAT GGTAGAAGAA AGCAACACCG GCTTGCAGAT TGACGCCATC GAAAGCGGCC GGGTTTTCAC TACCGGTGAA ATTGAAAGGA CCTTAAAGGA AAAATTCAAT ACACCGGACT ACTGGCAAAA GCAGTTCTTT CAAAACCATA AGGAGGCAAA AGGTGACAAC CTGGTGCTGG TGGCACCGAC GGGATTCGGC AAGACCGAGT TTGCCTTCCT CTGGGGGGCG GAAAAGAAAA ACTTTATCCT CCTGCCCATG CGGGCGGCCG TCAATAAGAT ATGGGAACGC ACCAGGGATA TGGTTGAGAG CCTGGGGGAA AAAGGCGACG ATCAGGTGGC CCTGTTGCAC AGCGATGCCG CCCTGGAGAC TTTTAGCCGC TACCAGCAGC AGGGCGACCT TGAAAGCGAG AGCAATACCC GCAAAGCCAT GGAGCTGGCC CGCCACCTGG CCCGGACTTA TATTATAGCC ACTGCCGATC AGGTCGCCCC GGCGGCTTTG AGGTATCCTG GCTATGAACG GATCTTTGCT GCCCTTATGA ACGGGGCCCT GGTTATTGAT GAAGTCCAGG CCTATGATCC CCGCGCGGCG GCAATCATCA CCCACCTGGT ACAGCAAAAT GCCTATCTCG GTGGCAACAA TTTGATTATG ACCGCAACCC TGCCGCCTTT CATCAGCCGG GAGCTGGTCA AACGCCTGGG GCTGGCCGAC CATCAGGTTA TACGCCTGAT TGATGAGCCG GAGTTCGAAG GGGTGGCTGC TTCTTGCCGG CATCGGCTGG GATTTAACGT CCATGCAGGC GATTATGCCA GCGCTGTAAC GGCTATTATC CGGGCAGCTC AACAGGGTAA AAAGGTCCTG GTAGTGATGA ATACCGTCAG GGCCGCCTGT GAGATCTACG ACAAAATTAT GGCGGTACTC CAGCAGGAGC AACTGGCTAT TGAAACACTC CTTCTTCATT CGCGCTTTAT CCTTAAACAG AAAGAAGAGC TGGAAAAAGC GGTGGTCGAA AAATATATGC CCAACCGGCC GGACAGGGAT GCCGGTCCCT GTATAGTTGT AGCCACCCAG ATTGTTGAAG CTTCCCTGGA TATTGACGCG GACGTACTTT TTACCGAGCC TTCCCCGGCC GACAGCCTGA TCCAGCGGAT GGGCCGCGTT TTTCGTCGGT TTGCCCGCCT GACGGGTAAC AATGCTCCCC CGGAGGCCAA CGTCATAATT ATAATCAATG GAGGGGATCA ACCCCTCAAT CGCCGCCGGC GTTCAGACCA GGAAGAAAAA ACTGCCGGTG ACGTCCAACT GGCATCCGGC TTAAAAACAG TCTATAACCG GGACCTGACG GCCCTCTCCC TGGTAATCCT GCTCATGGCC TTAAAGGGCG AAGCCGGCCT CAGCCCGGGC AGCAGCCTGG CGGAAGAACT AAAACGGAAA CCATGGGTTG ATTGTTTTAA GAAAAGCAAG GGTAAGGGGA ACTCCGGGAA CACGAACAGG CGCCTGGTGG AAGTTATTAA ATCAATAACA GGACAAACCT TGCTTTTAAC TGAAAAGGAA AAAATGGATT GGGTGGAGCT CACCTATGGT ATCCTTGATG AACTACGGCG GATAGACAAC TATCCCCTCC AACTGGAAGA CTACATTCAG AAGTATGAAG AAACCTTGGC AATCCTAGAT CATGGCTACT GCTCGGATCG CAAAAGGGAT GCGGAAAGGC TTTTTCGAGA TATAAATGCC ATAACCGGTG TTCCCATAGA AAAGGCCGGT GAATTCTATG ACAGCATCCG AACGTGGCTG AGCGGCAAAA ACGCGGGAAG CTTAAATTTC TTGGAACTGG CCCTGGCCAT CCTGCCGCGG TTTACCGTTA ACTGCCCTTT TCTCGCGACG TCCAACGGGG AAAGGGTGCG GGACCTGGAT TTTGAAGCCA TGCTCCCCCC GGAGCTTGAC ATCAAAAGCA TTACTAAAAT CAGATCCAGG CTGGAAAGAT GGCTAAAGGG TATATACATC CTCGATTTGC CCTATGATCA GGTTAAAGGG TTAGAGGTGC AAAATGAACA GTGA
|
Protein sequence | MADFWAKSNG ISLSKHTGDV LAATGVLQKK LGNKVPGDWW LGLKYAALLH DSGKVDPAFQ ERIKKQGQKA GDVPADEPKY KDIPHSILSL FFVRPENFTF SQPYLAYAVI SAIVFHHWRE SFPDYFLGSQ EYLIRKKAGE LEERAALWNR LSRDLVAELT ILAEKYELDP AVMGLNTTLT EYLRYNTLGS AGLLIPPYTL VYLPEQIRAK AARGDEKDRA RVFIAGNLMR ADHFASMVEE SNTGLQIDAI ESGRVFTTGE IERTLKEKFN TPDYWQKQFF QNHKEAKGDN LVLVAPTGFG KTEFAFLWGA EKKNFILLPM RAAVNKIWER TRDMVESLGE KGDDQVALLH SDAALETFSR YQQQGDLESE SNTRKAMELA RHLARTYIIA TADQVAPAAL RYPGYERIFA ALMNGALVID EVQAYDPRAA AIITHLVQQN AYLGGNNLIM TATLPPFISR ELVKRLGLAD HQVIRLIDEP EFEGVAASCR HRLGFNVHAG DYASAVTAII RAAQQGKKVL VVMNTVRAAC EIYDKIMAVL QQEQLAIETL LLHSRFILKQ KEELEKAVVE KYMPNRPDRD AGPCIVVATQ IVEASLDIDA DVLFTEPSPA DSLIQRMGRV FRRFARLTGN NAPPEANVII IINGGDQPLN RRRRSDQEEK TAGDVQLASG LKTVYNRDLT ALSLVILLMA LKGEAGLSPG SSLAEELKRK PWVDCFKKSK GKGNSGNTNR RLVEVIKSIT GQTLLLTEKE KMDWVELTYG ILDELRRIDN YPLQLEDYIQ KYEETLAILD HGYCSDRKRD AERLFRDINA ITGVPIEKAG EFYDSIRTWL SGKNAGSLNF LELALAILPR FTVNCPFLAT SNGERVRDLD FEAMLPPELD IKSITKIRSR LERWLKGIYI LDLPYDQVKG LEVQNEQ
|
| |