Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0493 |
Symbol | |
ID | 3832816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 508068 |
End bp | 509099 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637828427 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_429366 |
Protein GI | 83589357 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATATCC TGTTAAACAC CCTGTACCTT ATGACACCGC GCACCCTGGT GCGCTTAGAT CACGAGACGG TAAAAATCGA GGTGGAAGGG GAATTAAAAA TGCAGATTCC CCTGCACCAC CTGGGCAGCA TCGTCTGTTT TGGTGATGTT ACACTTACCA CGCCCTTAAT AATGCGCTGT GCCGAGGACA AACGCTTGAT TGTCTTTCAC GACCAGCATG GCCGTTTTAA AGCCCGCGTG GAAGGCCCGG TATCAGGAAA CGTTTTTCTT CGCCACGCCC AGCATGAAGC CCTATCTGAT AGTAAAAAAA CAACCGCTAT CGCCAGAAAT ATAGTCGCCG GAAAAATCCG CAATACCCGG CAAGTTGTGC TACGCGGTGC CCGGGAAGCC GATAACGAAG ACGATAGGAA AGCCCTCCAG GAAACAGCCA GGGAACTGGC TCACGGACTG GATGCCCTGT CACGTTCCCC CGATCTAGAG CAAATTCGTG GTATCGAAGG TAACGCCGCC CGGCATTACT TCGCAACTCT TGATCGCATG GTCAGAGTTA ACCGAGAAGC CTTTAAAATC ACCCACCGAA ACCGGCGGCC GCCCCTGGAC AGGATGAACG CTTTATTATC CTATATATAT GCTCTCTTGC TGAACGATTG TCTTAGCGCT GCCGAAGGTG TAGGGCTTGA TCCCCAGATT GGATATCTCC ACGTCCTCCG TTCGGGGCGA CCAGCTCTGG CTTTAGATTT GATGGAGGAG TTTCGTGCAA TCCTGGCTGA CCGTTTGGCC CTGACTTTAG TAAACCGTCG TCAGATCGAT GAAAGGGATT TTGTCGAACG CCCTGGTGGC GCCGTGCATA TTAATGACAA CGCCAGGAAA GAGATTGCCA TTGCTTACCA GAAGCGCAAG CAGGAAGAAG TACTTCATCC CGTCCTGAAT AGAAAGGTGC CCCTAGGGCT GGTTCCCCAT ATCCAGGCCC GTTTACTCGC CCGTGTCCTG AGGGGAGATG CTGAGGAATA TTTGCCCTTT ATGTACCGGT AA
|
Protein sequence | MHILLNTLYL MTPRTLVRLD HETVKIEVEG ELKMQIPLHH LGSIVCFGDV TLTTPLIMRC AEDKRLIVFH DQHGRFKARV EGPVSGNVFL RHAQHEALSD SKKTTAIARN IVAGKIRNTR QVVLRGAREA DNEDDRKALQ ETARELAHGL DALSRSPDLE QIRGIEGNAA RHYFATLDRM VRVNREAFKI THRNRRPPLD RMNALLSYIY ALLLNDCLSA AEGVGLDPQI GYLHVLRSGR PALALDLMEE FRAILADRLA LTLVNRRQID ERDFVERPGG AVHINDNARK EIAIAYQKRK QEEVLHPVLN RKVPLGLVPH IQARLLARVL RGDAEEYLPF MYR
|
| |