Gene Mthe_0228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_0228 
Symbol 
ID4462001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp227266 
End bp229515 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content57% 
IMG OID639699235 
ProductHef nuclease 
Protein accessionYP_842666 
Protein GI116753548 
COG category[L] Replication, recombination and repair 
COG ID[COG1111] ERCC4-like helicases
[COG1948] ERCC4-type nuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTACC TGGTCCATCC GCTCCTCAAG CCGGAGGCCG TTGAGAAAAG GCTCTTTCAG 
ATAGATTTGG CTGCCAGGGC GCTGCGTGGA TCGACGCTGG TTGTGATGCC TACCGGTCTT
GGAAAGACCA TCGTGGCGCT TATGGTCATG CTTGCACGCC TCGAGAAGGG CAGGGTGTTA
TTCCTGGCGC CGACACGCCC GCTGGTGGAG CAGCATGCAG CGTTTCTCCG CAGGGTTCTC
ACATCCCCGG ATCTAGTCGC CTCTGTGACA GGGGAGACGG ATCCCGAGAG CAGGGCTGAG
ATCTGGAGGA GCTGCAGGAT AGCAGTCTCA ACTCCTCAGG TCGTGGAGAA CGACCTCCTC
TCAGGCAGGA TGGATCTCAG GGATGTATCA CTTGTGATAT TCGACGAGGC GCACAGGGCG
GCGGGCAACT ACGCTTATGT CTACATAGCA GAACGCTACA GGAGGGAGGC AAGGGATCCG
CTTGTTCTGG GAATGACAGC GAGCCCTGGA AGTGAAGCAG AGCGGATAGC TGAGATCTGC
GCCAACCTCG GGATCGAGAG CATCGAGATG AAGAGCGAGA GCGATCCCGA TGTCGCGCCC
TTCGTCCACC ACAGGGAGAT AGAGTGGATA AAAGTGGAGG TGCCGGAGCA GCTCCAGAAG
ATACGTGGCG TGATAGACGG CCTTGTAAGC GAGCGGATGG AGGAGATCAA CAGCCTGGGG
ATGTGCAGGA TAGATCCCAG GACATCAAAG GGGGAACTCC TGGACCTGCA GAAGCGGTTC
AGCAGCGCGC TTGCACGTGG ACCGAATCAG AACATCTTCA GGGGAATCTC TCTGCTCGCA
GAGATCATGA AGCTCAAGCA TGCAGTCGAG CTCGCGGAGA CCCAGGGAGT GAGCGCTCTG
AGACAGTACC TGGAGCGCCT GGCTCAGGAG GCGAGGTCGA GGGGCGGATC GAAGGCGTCC
CGCAGGCTAA TTGAGGATCC CAGGATACAG CATGTTCTCT CAGTGCTGAA GGATATTGAT
CTGGAGCACC CGAAGCTCAG CAGGGCGCTC GAGATCATCG AGGATCAGCT TGAGACATCT
CCGGAGTCGA GGATAATCGT GTTCACAAAC TACCGCGACA CAGCGACAGC GCTTCTCAGG
TTTCTTCAAG CGAACGCCTC TGATGCTGTG AAACCCGTTC GCTTTGTCGG CCAGGCGAGC
AGGGAGAATG ATGAGGGGCT GAGCCAGAGA AAGCAGTCAG AGATCCTGGA GAAGTTCAGA
GCAGGAGAGT ACAACGTCCT CATAGCGACC TCTGTTGGAG AGGAGGGCAT AGACATACCA
TCCACAGATA TGGTCCTGTT CTACGAGCCG GTACCCTCTG AGATAAGAAG CATACAGCGC
AAGGGCAGAA CCGGGCGTGC AAGGACCGGC CGGGTAGTTG TGCTGATAGC GAAGGGAACA
AGGGACGAGG CATACTACTG GATAAGCGAT CGAAAGGAAC GGACCATGAG GAGGCAGCTC
CAGGGCATGG CAGAGCCGCT GCCAGTAGAC TCTGCTGTAC CTGATACAGC TCCAATCTCA
TCGAGAGCCT CGAGGCAGAT CAGCATCACC GAGATATGCG AGCCGGATGA GCTGCCTCTG
ATTATCGTCG ATTCCCGTGA GCGCGATATG GCCAGGCTTC TCGAGAAGAC CGGGCTCAGA
ATAGTCCTGA GGTCTCTTGA GGTTGGTGAT TACGTCCTCT CAGAGCGGCT CGGAATAGAG
AGGAAGACTG CGGACGATCT CATCGATTCT ATCATAGATC CTGAGCGGGA TCTCTTCAGG
CAGATAGGAG ATCTTGCAAG AACATACGAT CGGCCGCTGC TGATCATAGA GGGCCAGAAC
CTCTACGCCC GACAGGTCCA TCCGAACTCT GTCAGGGGAA TTCTGGCCAC AATAGCGGTG
GATTTCGGCG TCCCGATCGT GCCCACCGGG AGCATTGAGG AGACTGCAGC TCTGATAGCA
CTGATGGCGA GAAGGGAGCA TGAGGCCGGC TACAGGGACG TGAAGCTGCA CGGGAGGAAG
ACGTCCAGAA CGCTGAAGGA ACAACAGGAG TACCTCATAT CCGCACTTCC CGGAGTCGGG
CCGTCAGTGG CGCGCAACCT CCTGCGCCAC TTCGGATCTG TGGAGAGGAT CATGACAGCG
AGCGAGGGGG AGCTGATGTC TGTGGACAAG GTCGGCCCAA AGACTGCTGC CAGGATCAGG
GAGATAGTGT CAGGCGAGTA CAAGGGGTGA
 
Protein sequence
MSYLVHPLLK PEAVEKRLFQ IDLAARALRG STLVVMPTGL GKTIVALMVM LARLEKGRVL 
FLAPTRPLVE QHAAFLRRVL TSPDLVASVT GETDPESRAE IWRSCRIAVS TPQVVENDLL
SGRMDLRDVS LVIFDEAHRA AGNYAYVYIA ERYRREARDP LVLGMTASPG SEAERIAEIC
ANLGIESIEM KSESDPDVAP FVHHREIEWI KVEVPEQLQK IRGVIDGLVS ERMEEINSLG
MCRIDPRTSK GELLDLQKRF SSALARGPNQ NIFRGISLLA EIMKLKHAVE LAETQGVSAL
RQYLERLAQE ARSRGGSKAS RRLIEDPRIQ HVLSVLKDID LEHPKLSRAL EIIEDQLETS
PESRIIVFTN YRDTATALLR FLQANASDAV KPVRFVGQAS RENDEGLSQR KQSEILEKFR
AGEYNVLIAT SVGEEGIDIP STDMVLFYEP VPSEIRSIQR KGRTGRARTG RVVVLIAKGT
RDEAYYWISD RKERTMRRQL QGMAEPLPVD SAVPDTAPIS SRASRQISIT EICEPDELPL
IIVDSRERDM ARLLEKTGLR IVLRSLEVGD YVLSERLGIE RKTADDLIDS IIDPERDLFR
QIGDLARTYD RPLLIIEGQN LYARQVHPNS VRGILATIAV DFGVPIVPTG SIEETAALIA
LMARREHEAG YRDVKLHGRK TSRTLKEQQE YLISALPGVG PSVARNLLRH FGSVERIMTA
SEGELMSVDK VGPKTAARIR EIVSGEYKG