Gene Mthe_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1559 
Symbol 
ID4461851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1694669 
End bp1695661 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content53% 
IMG OID639700580 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_843969 
Protein GI116754851 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.461244 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCCA AGGCAGGAGA GGGGCTCTTA GAGAACAGCG TTGTTTACAT CACCAAACAG 
GGCGCCCAGG TTGGCGTGGA TGGTGGAAGG ATAGTTGTAT ACTCTAAAGA TGAGGGCGAG
ATTGCGTCCT TTCCGATGGG TCAGGTTGAC ACGATAAACA TATTCGGGAA CATCAACTTC
ACAACTCCAT TCGTCGCCAG AGCAAACGAG CACGGAATCG TGCTCAACTA TTTCACACAG
AATGGTCATT ATCGGGGGAG CTTTGTCCCA GAGAGAAACA CGATAGCAGA AGTGCGGCGG
AGGCAGTATG CTCTCTCGGA GGCCGACCGC CTGAGAATAG CGAGCGTGAT CATCCGGGCC
AAGATAAGAA ACTCTAGGAC AATGCTCTAC AGGAGAGGGG CATCGGATGA CAGGCTGGGG
GAACTGGAGG ACAGGGTTGC GGATGCTAAC GATCTTGATG AGCTCAGAGG GCTGGAGGGC
GAGGCTGCGG AGATATATTT TGGTATACTC AAACGTTGTG TTCCACAGGA CTGGAGCTTC
GAGCGGAGGA GCCGGAGGCC GCCGGCTGAC CATATGAATG CACTTCTCTC TCTGACGTAC
AGCATGATTA AGAACGAGGT GCTGAGCGCC TTGCGGCAGT ACAATCTGGA CCCCTTTCTG
GGTATCCTGC ATGCTGACCG ACACGGAAGG CCGGCGCTAG CGCTCGATCT CCTGGAGGAG
TTCAGGCCGA TCTTCTGCGA TGCCTTCACG CTGCGTCTCA TCAACAGGGG TGTCCTGAAG
CATGAGGATT TCCAGGTGAA CAATCATCTC AAGGAATACG CTTTCAAAAC ATATCTGGGG
AAGTTTGACG AATACATGCA GGAAGAGTTC AGGCACCCGA GATTCGATTA CACTGTTACA
AGAAGAAAGG CTGTGCGCAT GCAGGCGATC TTGCTCCGTA AAGCGATAAC CGGAGAGATG
AAGGAGTATC ATCCGCTGGA GTTCAAAAAA TGA
 
Protein sequence
MMPKAGEGLL ENSVVYITKQ GAQVGVDGGR IVVYSKDEGE IASFPMGQVD TINIFGNINF 
TTPFVARANE HGIVLNYFTQ NGHYRGSFVP ERNTIAEVRR RQYALSEADR LRIASVIIRA
KIRNSRTMLY RRGASDDRLG ELEDRVADAN DLDELRGLEG EAAEIYFGIL KRCVPQDWSF
ERRSRRPPAD HMNALLSLTY SMIKNEVLSA LRQYNLDPFL GILHADRHGR PALALDLLEE
FRPIFCDAFT LRLINRGVLK HEDFQVNNHL KEYAFKTYLG KFDEYMQEEF RHPRFDYTVT
RRKAVRMQAI LLRKAITGEM KEYHPLEFKK