Gene Information |
Locus tag | Moth_0495 |
Symbol | |
ID | 3832818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 509899 |
End bp | 510792 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637828429 |
Product | CRISPR-associated Csh2 family protein |
Protein accession | YP_429368 |
Protein GI | 83589359 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family; [TIGR02589] CRISPR-associated protein, Csd2 family |
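
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon excluded from the protein length. Both are assumptions about how the record was built, not facts stated in it. The helper name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the assumed stop codon
    return gene_len, protein_len


# Values copied from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(509899, 510792)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the Gene Length (894 bp) and Protein Length (297 aa) fields.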
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTT ATACCAATCC CGAAGTACGC CATGATTTTG TCCTCTTATT CGACGTCCGG GACGGCAACC CCAATGGCGA TCCCGATGCC GGCAATCTGC CGCGCCTTGA CCCCGAAACC ATGCAGGGTC TGGTGACCGA CGTCTGCCTT AAGCGCAAAA TCCGCGACTG GGTGGATATG ACCCGCGGCA GCGAGGCTAA CATGAAGATT TATGTCCAGC ATCACGGCAT TTTAAACGCC CAGCACCAGC GAGCCTATGA CGCCATCGGG GAAAAATCCA CCGGCAGCAA ACAAAACCGG GAGATCGTCG ACAAGGCCAG GCAGTGGATG TGCCAGAACT TCTATGATAT CCGCATGTTC GGCGCCGTAA TGACTACCGG CGTCAACTGC GGCCAGGTGC GGGGGCCAAT GCAGCTAACC TTTGCCCGGT CAATCGACCC CATCGTTCCC CTGGACATCT CCATCACCCG CGTCGCCATC ACCAGGGTAG AAGATGCCGC TACAAGCGAA CAGGGTGAGG GAGGCAAGGT CACAGAAATG GGCCGTAAAA CCCTGGTACC CTATGGCCTG TACCTGGGCT ATGGATTTTT CAACCCCCAT TTTGCCGCCG ATACTGGCGT CAGCGCCGCC GACCTGGAGA TCTTCTGGGA GGCCCTGCAG CGGATGTGGG ATGTGGATCG TTCCGCCAGC CGCGGCATGA TGGCCTGCCG GGGACTTTAT ATCTTCAGCC ATGCATCCGC CCTGGGCAAT GCTCCGGCGG ATAATCTCTT TAAACTCATC ACCGTTAAAC GCCGGGATGG AGTAAAAGCA GCGCGCTCTT TTGCCGACTA CCAGGTGACA ATTAATGAAG AGGACTTGCC GCCTGGGGTA ACTCTGACAC GGTTAGTGGG ATAG
|
Protein sequence | MTVYTNPEVR HDFVLLFDVR DGNPNGDPDA GNLPRLDPET MQGLVTDVCL KRKIRDWVDM TRGSEANMKI YVQHHGILNA QHQRAYDAIG EKSTGSKQNR EIVDKARQWM CQNFYDIRMF GAVMTTGVNC GQVRGPMQLT FARSIDPIVP LDISITRVAI TRVEDAATSE QGEGGKVTEM GRKTLVPYGL YLGYGFFNPH FAADTGVSAA DLEIFWEALQ RMWDVDRSAS RGMMACRGLY IFSHASALGN APADNLFKLI TVKRRDGVKA ARSFADYQVT INEEDLPPGV TLTRLVG
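
The sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that layout, compute a GC fraction, and translate the gene sequence. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the codon assignments of NCBI translation table 11, which match the standard table for elongation codons. The function names are hypothetical and not part of any database API.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
prefix = clean("ATGACCGTTT ATACCAATCC CGAAGTACGC")
print(translate(prefix))  # MTVYTNPEVR
```

The printed peptide matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_fraction` gives a value that can be compared with the GC content field.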