Gene Mthe_0651 details


Gene Information

Locus tag: Mthe_0651
Symbol:
ID: 4463145
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 685957
End bp: 687708
Gene Length: 1752 bp
Protein Length: 583 aa
Translation table: 11
GC content: 51%
IMG OID: 639699659
Product: CRISPR-associated TM1812 family protein
Protein accession: YP_843081
Protein GI: 116753963
COG category:
COG ID:
TIGRFAM ID: [TIGR02221] CRISPR-associated protein, TM1812 family
            [TIGR02549] CRISPR-associated DxTHG motif protein
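
The start, end, gene length, and protein length fields above agree with each other. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either, but both are consistent with its values. The function name `feature_lengths` is hypothetical.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    stop codon, which is not counted as an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Coordinates taken from the record above.
print(feature_lengths(685957, 687708))  # (1752, 583)
```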


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0409281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTTA GTTTACTGTG CATTAACTGC CGGCTGAAGA TTTGTATGGA TCGATACATT 
ATGGATAATC GCGTTCTTTT GACTGCCCTG GGGCTCAACC CAAAGGAGAC GACGTACACA
CTGGGGAACA GAACCTGCAA GTCCCTCCTG TCCCCTGTGG CTCTGTACAA TCTGCTTCCG
GAAGATGAGC GGTTCGATAG AGTGCTGGCC CTCTGCACGG AGGAGGTTCT CGAAAAGACG
TTTCCCCTGC TCAGGGAGGA GCTGCCTGTC ATGTGTGAGC CCACGCGTGT CCACAGCGGC
GTCAGCGCTG GCGATCTTCA GGCGACCCTG CATAACATGC TGGAGAAGAT ACCGGAGAAG
GCAGAGCTCA TCCTGGATGT GACGCACGGC TTCAGGCACT ACCCGTTTCT GTTCTTCACC
GCATGCCTGT ATCTGAAGGC TCTGAAGGAT GTTAAGATCA AAAAGATCTG GTACGGAAGG
TTGGATGCAA AGAACCCTGA TGGTGCAACA CCATTCATCG ATCTGAGCAT CCTCCTTGAG
ATGATCGACT GGTTCCGAGT CGTTCAGTCC TTCAGGGATA TGAGCAACCC GAAGGCCCTT
GCGGATAAGT TGCTTGCATA CAAGAGAGAT GTGATAAAGC CGGATACGAA CATACCTCCC
TCCACTAAAA AGAAATACAG CGCGGTCTCC AGCGCTGCAA AGGAGTTCGC AGAGCTGTTC
GGCATAGGCC TTCCAGTGGA GATCGGCATG GCTTCAGCAG ATCTGCAGAG GCTGATAGAA
GAGCTTCGTG TGGAGGGAGA TCTTTCAGCA ATAAAGATCC TCCTGGCAGA GGATCTGCTG
CTGGAGATAG AAAATGCAGC AGAGCGCTTC AGGCTGCCAG AGGGCGTTGA AAAGAAAGAT
CTCATTCTCG ATATCGGGGA GATCCACAGG CAGGAGAAGC TAATAGATGC ATACTTCGAG
GCAGGTTACC ACAACAACGC TATCGGGCTG ATCAGAGAGC TTATCATAAG CAGGCTAATG
CTCGAAGATG AGGATGGCAG GAATAGCTGG CTCGACAAAT CACAGCGAGA GAAGATCGAA
CGATGTATTG GAGCCCTCAT GGATTACAGC AAAAAGAACA GCGCAGATCT TACAAAAGAC
AGGAAAGAGC TTGCTGGTAT ATGGAGCTCC GTTATCGACG CCAGAAATAA GTTGCACCAT
TATGGGATGA CGAAAGATAT TGTGAGTGAT AAACAAATCG GGATCCCGAA GTGGTGGGAC
GATCTGAAGC AGCGTCTCGA CGACAATTCG TTTTGGGATT GCGGTTTTGG AGGTGGCTCT
GGCAGGCTTC TGATCTCGGC ACTTGGCCTT CACCCAGGCT TTCTCTATAC CGCTCTGAGG
AATGTGAATC CAGAGTTATG CCTGGTGATT GCATCACACG AAACTGTGGG CAGATGGGAT
GAGCTCGCCA GAATGGCCAG TTACAGTGGC GATTTCGATG TTAAGGAAAT GGATGCACAT
TCATTTGACG ATGGCAAAAC CATCGTTGTC AACTCAGGGG AGCTTCTCGT CAGATTCGAT
GAGGTTGTGT GCAACCTGAC GGGTGGCACA GCTGCCATGC AGTATGCGGT CATCCAGCTG
GCAAGGAGGG CAGAGCGGTT GGGAAGAAAT GTCAGATGGA TCGCGGTCAT CGATAATCGT
TCAATTGAGG AGCAGAATCT GAACCCTTAC GTCGAGGGGG AGGTGGTCTT TTTGGAGGAT
GCAGGGATAT GA
 
Protein sequence
MGFSLLCINC RLKICMDRYI MDNRVLLTAL GLNPKETTYT LGNRTCKSLL SPVALYNLLP 
EDERFDRVLA LCTEEVLEKT FPLLREELPV MCEPTRVHSG VSAGDLQATL HNMLEKIPEK
AELILDVTHG FRHYPFLFFT ACLYLKALKD VKIKKIWYGR LDAKNPDGAT PFIDLSILLE
MIDWFRVVQS FRDMSNPKAL ADKLLAYKRD VIKPDTNIPP STKKKYSAVS SAAKEFAELF
GIGLPVEIGM ASADLQRLIE ELRVEGDLSA IKILLAEDLL LEIENAAERF RLPEGVEKKD
LILDIGEIHR QEKLIDAYFE AGYHNNAIGL IRELIISRLM LEDEDGRNSW LDKSQREKIE
RCIGALMDYS KKNSADLTKD RKELAGIWSS VIDARNKLHH YGMTKDIVSD KQIGIPKWWD
DLKQRLDDNS FWDCGFGGGS GRLLISALGL HPGFLYTALR NVNPELCLVI ASHETVGRWD
ELARMASYSG DFDVKEMDAH SFDDGKTIVV NSGELLVRFD EVVCNLTGGT AAMQYAVIQL
ARRAERLGRN VRWIAVIDNR SIEEQNLNPY VEGEVVFLED AGI
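
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below translates the first and last rows of the gene sequence as a spot check. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code (the two differ only in permitted start codons), so a standard codon table is used here. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon table in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "ATGGGTTTTA GTTTACTGTG CATTAACTGC CGGCTGAAGA TTTGTATGGA TCGATACATT"
print(translate(first_row))  # MGFSLLCINCRLKICMDRYI

# Last row of the gene sequence, which ends with the TGA stop codon.
print(translate("GCAGGGATAT GA"))  # AGI
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed above.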