Gene Mthe_0641 details

Gene Information

Locus tag: Mthe_0641
Symbol:
ID: 4462281
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 667609
End bp: 670023
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 54%
IMG OID: 639699649
Product: CRISPR-associated helicase Cas3
Protein accession: YP_843071
Protein GI: 116753953
COG category: [R] General function prediction only
COG ID: [COG1203] Predicted helicases
TIGRFAM ID: [TIGR01587] CRISPR-associated helicase Cas3; [TIGR01596] CRISPR-associated endonuclease Cas3-HD
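
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 667609
end_bp = 670023
gene_length_bp = 2415
protein_length_aa = 804

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS has no partial codons and ends in one stop codon,
# which is counted in the gene length but not in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein = codons - 1

coords_match = span == gene_length_bp
protein_match = remainder == 0 and expected_protein == protein_length_aa
print(coords_match, protein_match)
```

Under these assumptions both checks pass: the span is 2415 bp, which is 805 codons, or 804 amino acids plus the stop codon.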


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.674824
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGAGAG GGTCGTTTTC ATTTAAACTC AGATCCCATC CCGATAAGCT TCTGGTGGAT 
CACCTCAGAA ACGTCGGGGA GATGTGCAGG AGGACAGTGG CTGAGAGCGC ACGGAACCTC
GATGATGCGG AGATTCTGAA GAATGTCGCA TACATCATCG GAGCGACACA CGATCTCGGA
AAGGCGACGG GCTTCTTCCA GGAGTACCTG AGGGAGACTG ATGAGAAAAG GAGGCGGGCT
CTGAGATCGA GAGATGTCAC GAAGCATGGT CTTCTCTCAT CGCTCTTCAC GTACGCAATT
GTCAGGGAGC ATCTGAGGAA GAGAGGATGC GAGATGCCCG GCGCACTTCC CCTGGTCTCA
TTCATCGCGG TGAGGCGCCA TCACGGGGAT CTGACGAACG CGCTGGATGA GATCTCGGCG
ATCTACGCTG AGAGGGAGAG GGTCCTATCT GTGATCGAGG AGCAGGTCTC CAGTCTTGAC
AGAAACGAAT GCGCGCAGAT ATTCAACGCC CTTCTCGGCG GTGTGGTTGA GATAGATCTT
GATTCTCTTT TGGACCACAT CCTCAGAGAT GCTCTGAAGG ATTTGAGGGG TGAGAAGCCG
CTTGTCAGGA ATCTCGGCAG GAGAAATGAT GTTCTGCCAT ACTTCGTGGC CCAGCTCCTG
TACTCCGCGC TTCTGGATGC GGACAAGACA GATGCAGGGC TGGAGGGCGT GGATCTGGGC
AGGGTGAATA TTCCTCCAGA TCTTGTGGAC AGGTACAGGG AGGCGCGCGG GTTCTCCGAG
AGGAGAGATG GCATCAATGG AATGAGGAAC GAGATATACC ATGATGCAGT CAGCCGGATC
TCAGGATGGG ACCTCAATGA GAGGATAATA TCGCTGAACG TGCCCACAGG TACAGGCAAG
ACGCTGACAT CCCTGGCGAT GGCGCTGCGG CTCAGGAGAA GGCTGATGAA CGAACGTGGG
GTCATGCCCA GGATAGTATA CGCTCTGCCG TTTCTCAGCA TAATCGATCA GACATGCGAT
GTCTTCGAGG ATGTGCTGGA AAGCGCGGGC ATCAGGGCTG ACTCAAGCGT GCTTCTGAAG
CACCATCATC TCTCCGACAT CACATACACA AAAGGGGATG AGGAATTCGA GTCTGACGTG
AGCCTTCTGC TGATGGAGGG ATGGAACTCT GAGATCGTGG TTACGACGTT CTGGCAGCTC
TTCCACACGA TATTCTCAAA CAGAAACAGA AGGCTTCGCA AGTTCAACAG GATTGCAAAT
TCAATAATAA TAATGGATGA AGTTCAGGCT GTGCCCCACA GGTACTGGCA TCTTCTGCAT
GATTCTCTGA AAATGCTCTG CGAGAAGTTC AACAGCTATC TGATACTCGT CACCGCGACC
CAGCCGCTGA TCTTCTCTGA GGAGAGCGGC GAGATCAGGG AGGCTGTGAG CGATAAGGAG
CGGTACTTCA GAGCGCTCAA CAGGGTTGAG CTGCGCCCCA TGATCGATGC TCCCATGAGC
CTGGAGGAAT TTGAGGCTCT TCTGGAGGGG GAGATTCACA ATAACCCGGA GAAGGATTTT
CTCATCGTGC TCAACACCAT CCGCTCAGCC AGGAATGTGT ACAGCGCCAT AAAAGCTCTT
GATCTGAGGG ATACAGAGCT GTTTTACCTC TCCACGCATG TCGTTCCCAG GGATAGAAGG
GATCGCATAC GCAGGATCAG AGATGAGCGT GGAGGGCGCA GGAAGATCAC AGTGAGCACG
CAGCTCATAG AGGCGGGTGT TGATATCGAT GCTGATATCG TGTTCAGGGA TCTCGCGCCC
CTCGACTCGA TAAATCAGGT CGCGGGAAGG TGCAACAGGA ACCTCTCGAA GGGCAGGGGG
CGGGTCTCAG TTGTCATCCT CAGGGATGAG CGGAGGGAGC TGTGCAGGTA CATCTACGAT
GGTTTTCTTA TCTCAAAAAC ACTTGACATA TTGAGGTCGT GCGGAGAATG CATAGAGGAA
AGGGAGATAC TGGAGCTGAA CAGCAGGTAC TTCAGGGTGG TGAGGAGCGG CATGAGCGAT
GAGACCTCCA GGAGGTGCAT GAGCATGCTA TCAGGCCTGG AGTTTGGAGA TCTCGGGGAG
AGCTTCAGGC TGATAGAGGA GGATGAGCCC AGGGTGGATG TTTTTGTGGA GACGGATGAG
AGGGCCTCTG ATGTCTGGCA GGCTTACAGA GATCTCAGAT CCGAGAGGGA TCTCTGGGAG
AGGAGAAGGA GATACCTGAG ACTTAGAAAC GCACTGAGCG ATTACATGAT CTCCCTTCCG
AGGAGACTCG CTGCGGATCT TGTCGTGGAG AATCAGGGCA CTGGCTACAT ATCGATGGAC
GAACTCCCTC ACTACTACGA TCCAGAGACC GGCTTCAGGG TGGAGGATGC TGGCGAGGGA
TCTATGATAA TCTGA
 
Protein sequence
MGRGSFSFKL RSHPDKLLVD HLRNVGEMCR RTVAESARNL DDAEILKNVA YIIGATHDLG 
KATGFFQEYL RETDEKRRRA LRSRDVTKHG LLSSLFTYAI VREHLRKRGC EMPGALPLVS
FIAVRRHHGD LTNALDEISA IYAERERVLS VIEEQVSSLD RNECAQIFNA LLGGVVEIDL
DSLLDHILRD ALKDLRGEKP LVRNLGRRND VLPYFVAQLL YSALLDADKT DAGLEGVDLG
RVNIPPDLVD RYREARGFSE RRDGINGMRN EIYHDAVSRI SGWDLNERII SLNVPTGTGK
TLTSLAMALR LRRRLMNERG VMPRIVYALP FLSIIDQTCD VFEDVLESAG IRADSSVLLK
HHHLSDITYT KGDEEFESDV SLLLMEGWNS EIVVTTFWQL FHTIFSNRNR RLRKFNRIAN
SIIIMDEVQA VPHRYWHLLH DSLKMLCEKF NSYLILVTAT QPLIFSEESG EIREAVSDKE
RYFRALNRVE LRPMIDAPMS LEEFEALLEG EIHNNPEKDF LIVLNTIRSA RNVYSAIKAL
DLRDTELFYL STHVVPRDRR DRIRRIRDER GGRRKITVST QLIEAGVDID ADIVFRDLAP
LDSINQVAGR CNRNLSKGRG RVSVVILRDE RRELCRYIYD GFLISKTLDI LRSCGECIEE
REILELNSRY FRVVRSGMSD ETSRRCMSML SGLEFGDLGE SFRLIEEDEP RVDVFVETDE
RASDVWQAYR DLRSERDLWE RRRRYLRLRN ALSDYMISLP RRLAADLVVE NQGTGYISMD
ELPHYYDPET GFRVEDAGEG SMII
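
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any downstream use. The sketch below is not part of the original record; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the Python standard library is used.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Last line of the gene sequence block above, used as a small input.
tail = clean_sequence("TCTATGATAA TCTGA")
print(len(tail), tail.endswith("TGA"))
```

Applied to the complete gene and protein blocks, `len(clean_sequence(...))` should correspond to the Gene Length and Protein Length fields, and `gc_percent` gives a value that can be compared against the listed GC content.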