Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0498 |
Symbol | |
ID | 3832821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 513340 |
End bp | 515514 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637828432 |
Product | CRISPR-associated helicase Cas3 family protein protein |
Protein accession | YP_429371 |
Protein GI | 83589362 |
COG category | [R] General function prediction only |
COG ID | [COG1203] Predicted helicases |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR01587] CRISPR-associated helicase Cas3 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAAT TCCTGGCCCA CACTGCAAAT TCCCAGGGTA AAGAACAATC ACTATTAGAT CATCTTGAGG CGGTTGCTAA CCTAACCCAA AAATTTAACG AACCACTGGG AGTACCACAT ATTGGTTACC GTACCGGCCT GTGGCATGAC CTGGGGAAAT TCCATCCGGA GTTCCAGGCT TACCTGCAAG GCGAAGGCGG CCGCCGCGGC CCCAACCATT CCAGCGCCGG TGCCATGTTG GCCTCAAAAT ATTTTGAACC CCTGGCCTTC GTTATCGCCG GCCATCACGG CGGCCTGCCC GCCCGGGCAG AATTAAAGAC TCGCTTAAAA GATAAGGTAC AGCTGAAACG TTATAACACC GCCCTGCAAA ACGCCCGCCG GGTTATCCCC TCTTTAGAAC CGAAGCAACC CCTGGATATG GAATTGCCGC CCTTTTTGCA GGAAGGAGCA GGTGTAGATG TAGCTAGCAG GGTTGAATTA TTCCTGCGCC TTCTATTCAG TTCCCTGGTT GATGCCGATT TCCTTGACAC GGAAAGGCAT TTTGATCCTT CCCTGACAAA ATTGCGACAA AAGGAAGCCT CCATTGCTAA TTTATGGTCT TTATTAGCAG CGAACCAGCA AAGGCTAATG GATAAAAGCG GCGGCCATGT CAACCAGATC CGCCGGGAAA TCTACGAGCA CTGTTGCCGG GCGGCCGAAC TAACGCCAGG TTTCTTTTCT TTGACCGTAC CCACCGGTGG CGGCAAGACC CGTTCCGGGA TGGCCTTCGC CCTGCTCCAC GCCCTGCGTT ATCAGAAGGA GAGAATCATT GTCGCCATAC CCTACACAAG CATTATCGAG CAGACGGCCG ACGTTTACCG CGATATCTTT GGTAACGCCA ATGTCCTTGA GCATCACAGC GCCGTTGCCC CGCCGCTAGA TCCCGAAAAC CCTACCCCTG AGGAACTGTG GGCCCGCCTG GCCAGCGAGA ACTGGGATGC CACCCTCATC GTTACAACCA CGGTACAGCT ATTTGAGAGT TTATTCTCCG ACCGCAGCAG CAGTTGCCGC AAACTCCACA ACATCGCCAA CAGCGTCATC ATCCTGGATG AGGTCCAGAC CCTGCCGGTA CATCTTTTAG AACCCATCCT TGATGTCCTG CAGCAGCTGG TCCACTTTTA TGGAGTAACG GTCATCCTGT GCTCGGCCAC CCAGCCGGCC CTGGAGACCA GCCCCTTTTT CCGGGGACTG CAGGGCGTAA GGGAAATAAT TCCTGATCCG CAAAAATACT TTGCCCTGCT CAAAAGGGTG GGTTACCAGG TACCCGCCGG CAACGAAAAA TGGAGCTGGG AACAGGTAGC CGGAGTAATA CGCCAATCCC GCCAGGCCAT GGCTGTTGTT AACACCAAAC AGGATGCCCT GGCCCTCCTG GAGGCTCTGG ATGACCCTGA AGCTTTACAT CTCTCCACGC TCCTTTGCGG GGCTCACCGG CGTAAGGTCC TGCAGGAGGT ACGCCGGCGC CTGGGAAACG GCCGGCCCTG CCGCCTGGTC GCCACCCAGG TAGTCGAGGC GGGGGTTGAT CTGGATTTCC CCCTCGTCCT GCGGGCTGTG GGTCCCCTGG ACAGGATTGT CCAGGCCGCC GGGCGGTGCA ACCGGGAAGG GAAGCTGCAA GAAGGCCGGG TGATAATTTT CAATCCGGAA GAAGGCTGCC TTCCTCCTGG CAGCTATAAG ACCGGGACCG AAATTGCCGC CACGCTGCTC GCTAGTGTGA CAGGGGTTGA CCTGCATGAC CCCGGCCTGT ACCGGATCTA TTTTCAGCGC CTCTACCAGT CCTGCACCCT GGATGCCAAA GGAATTCAGG CCAGCCGCCG CAGCTTGAAT TATCCCGAAG TGGCGCAAAA ATTCCAGATG ATCGAGCAAA GGGTGGTTCC GGTAATTGTC AACTACCATG AAGGCCCCGA TGACCGGAGG GTGGATGAAC TAATCAGCGC CCTCCGCCAC CGGGGTGTAA GTCGTCTAAT CATGCGGCAG TTGCAGCCCT ACCTGGTAAA TATCAACGCT TACACTGTTA ATTCTCTCCA AAGGGAAGGC GTGGTCCAGG AAATATTTCC CGGCCTGTAT GAATGGCTGG GGGGTTACGA CGAAATTCGT GGCCTGGTGA CGACGGCCCG CGACCCGGAG GAGCTGGTAT TTTAG
|
Protein sequence | MEQFLAHTAN SQGKEQSLLD HLEAVANLTQ KFNEPLGVPH IGYRTGLWHD LGKFHPEFQA YLQGEGGRRG PNHSSAGAML ASKYFEPLAF VIAGHHGGLP ARAELKTRLK DKVQLKRYNT ALQNARRVIP SLEPKQPLDM ELPPFLQEGA GVDVASRVEL FLRLLFSSLV DADFLDTERH FDPSLTKLRQ KEASIANLWS LLAANQQRLM DKSGGHVNQI RREIYEHCCR AAELTPGFFS LTVPTGGGKT RSGMAFALLH ALRYQKERII VAIPYTSIIE QTADVYRDIF GNANVLEHHS AVAPPLDPEN PTPEELWARL ASENWDATLI VTTTVQLFES LFSDRSSSCR KLHNIANSVI ILDEVQTLPV HLLEPILDVL QQLVHFYGVT VILCSATQPA LETSPFFRGL QGVREIIPDP QKYFALLKRV GYQVPAGNEK WSWEQVAGVI RQSRQAMAVV NTKQDALALL EALDDPEALH LSTLLCGAHR RKVLQEVRRR LGNGRPCRLV ATQVVEAGVD LDFPLVLRAV GPLDRIVQAA GRCNREGKLQ EGRVIIFNPE EGCLPPGSYK TGTEIAATLL ASVTGVDLHD PGLYRIYFQR LYQSCTLDAK GIQASRRSLN YPEVAQKFQM IEQRVVPVIV NYHEGPDDRR VDELISALRH RGVSRLIMRQ LQPYLVNINA YTVNSLQREG VVQEIFPGLY EWLGGYDEIR GLVTTARDPE ELVF
|
| |