Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1727 |
Symbol | |
ID | 3833027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1776043 |
End bp | 1777008 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637829651 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_430571 |
Protein GI | 83590562 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.914147 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACCT ACTATATTTT TTCTTCCGGA CGCCTCCGGC GAATGGATAA TACCCTGGCT TTGGAGCTAG AAACAGAACG ACGCGTGGTC CCGGTGGAAG ATATTGATCA TATTTACTGT TTTTCTGAGT TGGATTTGAA CACTAGGCTT TTAGACTTCC TGGCACAAAA ACAGATTTGC CTCCATTTTT TCAACTACTA CGGGCACTAT TCCGGCAGTT TTATTCCCCG GGAATCCCAG CTCTCCGGTT TTCTATTGGT AAGGCAGGTG GAGCATTACC TGGACCAGGC TAAGAGACTG GAACTGGCCC GGACCTTTGT CGAGGGAGCA CTGCACAACA TCCGCCGCAA CCTGGAAAAA AGGGAGTATG ATGATATTTG CAGCAAACTG GATGAGATCA GGGAGGGGAT AGGTAAGACT GCTTCCATTG AAGAACTCAT GAGCCTGGAG GCCCATGCTC GTAAAGCCTA CTATGACACC TGGGAAGAAA TCACGGGCTG GGAGTTTGGC AGCCGTAGCA AACGACCTCC TGCTAATGCT TTAAATGCCC TGATTTCCTT TGGCAACGCG ATGATGTATA CGGTAGTTTT AAAGGAGATC TACCGCACCG CCTTAAATCC GACCATTAGC TACCTGCATG AGCCATCAGA GCGAAGGTAT TCCCTGGCCC TTGATGTTGC GGAGATATTT AAGCCGGTTT TTGTCGACAG ACTAATATTC CGTTTGATAA ACCTCAACAT GCTAAAGGAA ACCCACTTCG ACACCAATGT CAATTTCGTC TACCTTACCG AAGGGGGAAG GAAGGTATTT GTTAAAGAAT TTGAAGAAAC TCTGGAAAAG ACGATTTTGC ACCGCAAGCT AAAAAGGAAT ATTCGCTATA AAAGTCTTGT CCGGTTAGAC TTATATAAGC TTATAAAGCA CCTCCTTGGT GAAGAAAAAT ATTCCCCCAT GAAGGTGTGG TGGTAA
|
Protein sequence | MRTYYIFSSG RLRRMDNTLA LELETERRVV PVEDIDHIYC FSELDLNTRL LDFLAQKQIC LHFFNYYGHY SGSFIPRESQ LSGFLLVRQV EHYLDQAKRL ELARTFVEGA LHNIRRNLEK REYDDICSKL DEIREGIGKT ASIEELMSLE AHARKAYYDT WEEITGWEFG SRSKRPPANA LNALISFGNA MMYTVVLKEI YRTALNPTIS YLHEPSERRY SLALDVAEIF KPVFVDRLIF RLINLNMLKE THFDTNVNFV YLTEGGRKVF VKEFEETLEK TILHRKLKRN IRYKSLVRLD LYKLIKHLLG EEKYSPMKVW W
|
| |