Gene Mthe_0638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0638 
Symbol 
ID4462278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp664105 
End bp665961 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content48% 
IMG OID639699646 
ProductCRISPR-associated Csh1 family protein 
Protein accessionYP_843068 
Protein GI116753950 
COG category 
COG ID 
TIGRFAM ID[TIGR02556] CRISPR-associated protein, TM1802 family
[TIGR02591] CRISPR-associated protein, Csh1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0097245 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGATCC GTGTGATCGA GGCAGTAGCA AGAATTGGTA AGCAAAGAAT TGGTGAGCAT 
GCTCAGAGCG GTTGTGAGGA TATCCTGTAT TACATTGTGG AGAATCCAAA TCTCAATGGC
GGTTACAACC ATGCTCTTGT TGTCACACTC GAGGAGAAGG ACGGAGATTT CTCCTACAGA
GGGGTGGAGC TGGAGGAGCT CAAGGATTAC AGAAGATACC TGTACAAGGG GAAGAAGGGG
AATGCCACCG ATGCCACCCC GACATGCAAG ATCGCAAAGG ATATCGAGAA AACATTCGAG
AAAAAATTTC TGAGATGGTT TGATGGCATT GATTCTTCAG ATCTAAGCGG GGAAGAAAGA
GCCACGCTGA AAAATATAAA GAATGTGTTA TTCTCAAACA AAGATAAAAT CTTTAAGGAG
CTACAGGAGA AGCGATCTCA TCTCAAACAG AGAGAGAACT GCATAATAAC CCTTGGATTT
GTGAAGGATG GTGATCTTAA ATACCTGGCG GATTATCCGG CGTTCAGAAA CGTTCTCCTC
AAAAATAGCT GCGATCGGTT CTTCCGTAAA TACGGCGGCG AGTCCAGAGG AACGGATGCG
CTCTGCTCTG TCTGCAAGGA GCAAAAGGAT GAGGTTTACG CATACGCCAT CCCCTGGCCG
TTCCACACAT TCGACAAGCC CGGATTCATA GCAGGCGGGT TCAGGCAGTC TGATGCATGG
AAGAACACGC CCGTCTGCTT AAACTGTGCG ATAAATCTCG ATGCGGGAAA GAAGTACATT
GAGGAGAGCC TTGACTTCAG CTTCTACGGC TTCAGGTACC TTCTCATACC CAAGCTCATA
ATCGGCGATG ACTACCAGGA GATCCTGGAT ATTCTGAGTG GTATGAAGAA AGAACTGAAG
ATGAGCCGGA AGACCAGGAA CCGCATAACC GACGACGAGG ATGAGATCCT GGATATGGTG
AAGGATCAGA AGGACTTCTT CAGCAACAGC CTGATGTTTT ATAAAAAGGA TAACTCCGCT
TACAGGATAC TCCTTCACAT CGATGGCATT CTCCCATCCA GGTTGAGAAG GCTGTTCGAG
GCCAAGGAGA GGGTTGAAGG TGCCTTCGGG ATATACAATG AGATGGTGCT ATCCGAAGAC
AAAGGCAGGC TTATATTTGA TTTTGGCGTT CTGAGGCGGT TCTTCCCCAG GGAGTCAGGA
AACGTTACAC ACGACAAGAT GTTTCTGGAA ATTGTCAACA GGATATTCGT GGGGAGGTAT
GTGGATCGCC ATCTTCTGAT CTCATTCATG ATGAAGAGAA TCAGAGATGA TTTCGTGCAT
GGGCGATCCA CGCTGATGAA CACGCTCAAC GGATTCATGC TTCTTCATTA TCTGAAGGAG
CTGAATCTTC TGAAGGATCT TGAGGTAGAC AGAATGTCAG GAATTGTTCT CAAAAGAGAG
GAGCTCGAGG CACTTCCTCT GGATAAGAGG GTTGAGAGGT TCTTCGAGGC GAACAGGGAG
TTCTTTGACA GCGATGCGAA GAAGGCAACG TTTCTGGAAG GCGTGCTTGT GCAGAAGCTG
CTGAACATCC AGTGGATGGA GAAGAATGCA AAGCCGTTCT ACACAAAGCT GCATGGGCTC
AAGATGAACG AGGCCCTGAT CAAGAGGCTT CTTCCGGAGA TACAGAACAA GCTTGAGGAG
TATGAGAAGA ACTACTACAG GGAGCTCGAG GCGATCATAG CGGAGCATTT CGTTCTTTCG
GGCCGCGGGT GGAAGGAGAC AGATGATGAG CTGAGCTTCT ACTTCGTGCT CGGTATGAAC
CTGCACGAGC TCTTCAGGGT GGATAAAGAG AAGGAAAAGA CGGAGGAAAC AGCATGA
 
Protein sequence
MVIRVIEAVA RIGKQRIGEH AQSGCEDILY YIVENPNLNG GYNHALVVTL EEKDGDFSYR 
GVELEELKDY RRYLYKGKKG NATDATPTCK IAKDIEKTFE KKFLRWFDGI DSSDLSGEER
ATLKNIKNVL FSNKDKIFKE LQEKRSHLKQ RENCIITLGF VKDGDLKYLA DYPAFRNVLL
KNSCDRFFRK YGGESRGTDA LCSVCKEQKD EVYAYAIPWP FHTFDKPGFI AGGFRQSDAW
KNTPVCLNCA INLDAGKKYI EESLDFSFYG FRYLLIPKLI IGDDYQEILD ILSGMKKELK
MSRKTRNRIT DDEDEILDMV KDQKDFFSNS LMFYKKDNSA YRILLHIDGI LPSRLRRLFE
AKERVEGAFG IYNEMVLSED KGRLIFDFGV LRRFFPRESG NVTHDKMFLE IVNRIFVGRY
VDRHLLISFM MKRIRDDFVH GRSTLMNTLN GFMLLHYLKE LNLLKDLEVD RMSGIVLKRE
ELEALPLDKR VERFFEANRE FFDSDAKKAT FLEGVLVQKL LNIQWMEKNA KPFYTKLHGL
KMNEALIKRL LPEIQNKLEE YEKNYYRELE AIIAEHFVLS GRGWKETDDE LSFYFVLGMN
LHELFRVDKE KEKTEETA