Gene Mthe_0649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0649 
Symbol 
ID4463143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp682795 
End bp684711 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content56% 
IMG OID639699657 
ProductCRISPR-associated RAMP Crm2 family protein 
Protein accessionYP_843079 
Protein GI116753961 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02577] CRISPR-associated protein, Crm2 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATGA GCGATAGCAG TTCCGAAATA AACTTCACAA TCGGTCCTGT TCAGGGGTTC 
ATCGCGCAGG CGCGCCGTAC GAGAGACCTC TGGAGCGGCT CGTTTCTCCT CTCGTACCTC
TCCGGATGCG CCATGGCTGA GATACGGAGA TGTGATGGCA GAATAAAGGT TCCAGATGTA
GAGGGAGATC AGCTGCTCCG CTGGATCGAG AGTGATGGTG AGGGTGCACC ACCCCGGATC
GGCACACTTC CCAACAGGTT CGTGGCATCT GCGGAAGACC CTGCTGGCGC TGCCAGAAGC
GCCGCGGAGG GTGTGAGGAG CAGGTGGAAG AAGATCGCAG ACTGTGTTTG GAATAAATAC
GTGGAGGGCG TCGCAGATCA GGGAAAGAAC ACGCGTGAGA TCTGGGAGAG GCAGGTGAAT
AACTTCTGGG AGATCCACTG GACGATCGGC GCGATGGAGG GAATGGAGGC GCGCAAGAAC
TGGAGGATCT GCACAGACTG GTCAGACGGT CGACCGACCA TTGAATGGGG AGATCACTGC
ACGATCATGA GCGACTGGCA GGAGATCTCA GGCTATGTGA GATCCATTGA GAGGAAAAAG
CAGGATGATT TCTGGAAAGA GATACGCCAG GAGACCAGCG GGATGGATCT CAGAAGCGAT
GAACGGCTCT GCGCGATCGC CCTGATCAAG AGGATGTTCC CATCGGTGGC AGAGGATGCC
ATAGGATGGG AGGTCTCCGC GGATCGCTGG CCCTCCACGC TCTACGTGGC AGCGATACCA
TGGCTCAGTG CTGTAATTGA ATCAGCTGAG AGGGATTATG CGAACGACTA CGCGAAAAAG
GCGTTTAGCT ATGCGGAGCA CATACAGAGA AAGGGTGTTG CAGAGCAGAT CTTCGGAAGG
AAAGACCTGT TCCTGGAGAT CGATGCGAAC TTCTACCACA CCACCTCCCT GAAGAACCCG
AAGAGCACCC CACTCAAAAT GACCCCGGAG GATGGGGATG AACCGGAGGA TGTGAGGAGA
AGGCGGGAAG AGCTTATTGA ATGCCTTGAG GAGCTTTACA ATAAAAAAGG GAAACCCTCT
CCATTCTACG CCATGCTCCT CATGGACGGG GACAACATGG GGAGGCTGAT CAGGGAGAGT
GGAGAGGCTG TGAGCAGAGC TCTTGCATCC TTCGCAAAAG ATGTTGAGCG TGTCGTGCAC
GATAACCTTG GAGTCCTGGT GTACGCGGGT GGAGACGACG TCCTCGCGAT GCTGCCTGTT
GAAAGAGCGA TGAGCTGCGC GTACAGGCTC TCTGTCAGTT TCGCGGAGTC GTTCGAAGCT
CAGGGGATGG AGGCGACCAT ATCAGCGGGA TTGGTCTTCG CCAGCCATCG CGTGCCGCTC
CGCTCTGTTA TGCGAGAGGC GCACTCCATC CTGGACGATA TAGCGAAGGA TGAGAACGGC
AGGGGAAGCA TGGCTGTCAG CGTCCTCAAG GGCAGCGGCA GGTACTGCAG ATGGGTCAGC
TCCTGGAAGG GTGCTGTATC TCCAGAGAAC GAGGTCGTTC TGGAGCGGCT GGCGAAGAAT
CTCTCAGAGA AACCTGACGG CATAGAGGCG TCGAGCTCGT TCTTCTACAA GTCGAGAGAG
CTGCTGCTCA TGCTCGCCGG CGAGCAGAGA TGGAGCCCGG GAATGTTCTT CGATCTCTCA
CATCTCGGAG GTCTGGATAT AGAGACGCTC CTGCAGGCCG AATACCTCAA CGCGCTTGAG
CACAGATCTG AGATAACAGA GGAGGTACGC GACAGCGCGG TGAGATACGT GAGAGATCTT
CTTTCTGTGA GCCGGAGGCA CCGGGGCCCT GAGCATGAGC GGGCAACTGG CAAGGAGATC
TGCGCGGATG GCATGCTGCT TGTGAAGTTC CTGTCACAGA AGGGGGTTCA GGAATGA
 
Protein sequence
MSMSDSSSEI NFTIGPVQGF IAQARRTRDL WSGSFLLSYL SGCAMAEIRR CDGRIKVPDV 
EGDQLLRWIE SDGEGAPPRI GTLPNRFVAS AEDPAGAARS AAEGVRSRWK KIADCVWNKY
VEGVADQGKN TREIWERQVN NFWEIHWTIG AMEGMEARKN WRICTDWSDG RPTIEWGDHC
TIMSDWQEIS GYVRSIERKK QDDFWKEIRQ ETSGMDLRSD ERLCAIALIK RMFPSVAEDA
IGWEVSADRW PSTLYVAAIP WLSAVIESAE RDYANDYAKK AFSYAEHIQR KGVAEQIFGR
KDLFLEIDAN FYHTTSLKNP KSTPLKMTPE DGDEPEDVRR RREELIECLE ELYNKKGKPS
PFYAMLLMDG DNMGRLIRES GEAVSRALAS FAKDVERVVH DNLGVLVYAG GDDVLAMLPV
ERAMSCAYRL SVSFAESFEA QGMEATISAG LVFASHRVPL RSVMREAHSI LDDIAKDENG
RGSMAVSVLK GSGRYCRWVS SWKGAVSPEN EVVLERLAKN LSEKPDGIEA SSSFFYKSRE
LLLMLAGEQR WSPGMFFDLS HLGGLDIETL LQAEYLNALE HRSEITEEVR DSAVRYVRDL
LSVSRRHRGP EHERATGKEI CADGMLLVKF LSQKGVQE