Gene Information |
Locus tag | Mthe_1559 |
Symbol | |
ID | 4461851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1694669 |
End bp | 1695661 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639700580 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_843969 |
Protein GI | 116754851 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
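The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 1694669
END_BP = 1695661


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 993, matching "Gene Length"
print(protein_length(length_bp))  # 330, matching "Protein Length"
```

Under these assumptions the 993 bp span corresponds to 331 codons, of which the last is the stop codon, leaving the 330 residues reported above.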
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.461244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCCA AGGCAGGAGA GGGGCTCTTA GAGAACAGCG TTGTTTACAT CACCAAACAG GGCGCCCAGG TTGGCGTGGA TGGTGGAAGG ATAGTTGTAT ACTCTAAAGA TGAGGGCGAG ATTGCGTCCT TTCCGATGGG TCAGGTTGAC ACGATAAACA TATTCGGGAA CATCAACTTC ACAACTCCAT TCGTCGCCAG AGCAAACGAG CACGGAATCG TGCTCAACTA TTTCACACAG AATGGTCATT ATCGGGGGAG CTTTGTCCCA GAGAGAAACA CGATAGCAGA AGTGCGGCGG AGGCAGTATG CTCTCTCGGA GGCCGACCGC CTGAGAATAG CGAGCGTGAT CATCCGGGCC AAGATAAGAA ACTCTAGGAC AATGCTCTAC AGGAGAGGGG CATCGGATGA CAGGCTGGGG GAACTGGAGG ACAGGGTTGC GGATGCTAAC GATCTTGATG AGCTCAGAGG GCTGGAGGGC GAGGCTGCGG AGATATATTT TGGTATACTC AAACGTTGTG TTCCACAGGA CTGGAGCTTC GAGCGGAGGA GCCGGAGGCC GCCGGCTGAC CATATGAATG CACTTCTCTC TCTGACGTAC AGCATGATTA AGAACGAGGT GCTGAGCGCC TTGCGGCAGT ACAATCTGGA CCCCTTTCTG GGTATCCTGC ATGCTGACCG ACACGGAAGG CCGGCGCTAG CGCTCGATCT CCTGGAGGAG TTCAGGCCGA TCTTCTGCGA TGCCTTCACG CTGCGTCTCA TCAACAGGGG TGTCCTGAAG CATGAGGATT TCCAGGTGAA CAATCATCTC AAGGAATACG CTTTCAAAAC ATATCTGGGG AAGTTTGACG AATACATGCA GGAAGAGTTC AGGCACCCGA GATTCGATTA CACTGTTACA AGAAGAAAGG CTGTGCGCAT GCAGGCGATC TTGCTCCGTA AAGCGATAAC CGGAGAGATG AAGGAGTATC ATCCGCTGGA GTTCAAAAAA TGA
|
Protein sequence | MMPKAGEGLL ENSVVYITKQ GAQVGVDGGR IVVYSKDEGE IASFPMGQVD TINIFGNINF TTPFVARANE HGIVLNYFTQ NGHYRGSFVP ERNTIAEVRR RQYALSEADR LRIASVIIRA KIRNSRTMLY RRGASDDRLG ELEDRVADAN DLDELRGLEG EAAEIYFGIL KRCVPQDWSF ERRSRRPPAD HMNALLSLTY SMIKNEVLSA LRQYNLDPFL GILHADRHGR PALALDLLEE FRPIFCDAFT LRLINRGVLK HEDFQVNNHL KEYAFKTYLG KFDEYMQEEF RHPRFDYTVT RRKAVRMQAI LLRKAITGEM KEYHPLEFKK
|
| |
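The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a minimal, dependency-free sketch. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in their permitted start codons), and that the sequence is given on the coding strand in 10-base blocks separated by spaces, as above. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares these codon-to-amino-acid assignments
# with the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError for codons with characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGATGCCCA AGGCAGGAGA GGGGCTCTTA"
print(translate(first_blocks))  # MMPKAGEGLL, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_content` in the same way should allow the listed protein sequence and the GC content field to be checked against the nucleotide data.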