Gene Information |
Locus tag | Mthe_0228 |
Symbol | |
ID | 4462001 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 227266 |
End bp | 229515 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639699235 |
Product | Hef nuclease |
Protein accession | YP_842666 |
Protein GI | 116753548 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1111] ERCC4-like helicases [COG1948] ERCC4-type nuclease |
TIGRFAM ID | |
|
|
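The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 227266
end_bp = 229515
gene_length_bp = 2250
protein_length_aa = 749

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches Gene Length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length is consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another, which is what one would expect for an unspliced, non-pseudo CDS.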
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTACC TGGTCCATCC GCTCCTCAAG CCGGAGGCCG TTGAGAAAAG GCTCTTTCAG ATAGATTTGG CTGCCAGGGC GCTGCGTGGA TCGACGCTGG TTGTGATGCC TACCGGTCTT GGAAAGACCA TCGTGGCGCT TATGGTCATG CTTGCACGCC TCGAGAAGGG CAGGGTGTTA TTCCTGGCGC CGACACGCCC GCTGGTGGAG CAGCATGCAG CGTTTCTCCG CAGGGTTCTC ACATCCCCGG ATCTAGTCGC CTCTGTGACA GGGGAGACGG ATCCCGAGAG CAGGGCTGAG ATCTGGAGGA GCTGCAGGAT AGCAGTCTCA ACTCCTCAGG TCGTGGAGAA CGACCTCCTC TCAGGCAGGA TGGATCTCAG GGATGTATCA CTTGTGATAT TCGACGAGGC GCACAGGGCG GCGGGCAACT ACGCTTATGT CTACATAGCA GAACGCTACA GGAGGGAGGC AAGGGATCCG CTTGTTCTGG GAATGACAGC GAGCCCTGGA AGTGAAGCAG AGCGGATAGC TGAGATCTGC GCCAACCTCG GGATCGAGAG CATCGAGATG AAGAGCGAGA GCGATCCCGA TGTCGCGCCC TTCGTCCACC ACAGGGAGAT AGAGTGGATA AAAGTGGAGG TGCCGGAGCA GCTCCAGAAG ATACGTGGCG TGATAGACGG CCTTGTAAGC GAGCGGATGG AGGAGATCAA CAGCCTGGGG ATGTGCAGGA TAGATCCCAG GACATCAAAG GGGGAACTCC TGGACCTGCA GAAGCGGTTC AGCAGCGCGC TTGCACGTGG ACCGAATCAG AACATCTTCA GGGGAATCTC TCTGCTCGCA GAGATCATGA AGCTCAAGCA TGCAGTCGAG CTCGCGGAGA CCCAGGGAGT GAGCGCTCTG AGACAGTACC TGGAGCGCCT GGCTCAGGAG GCGAGGTCGA GGGGCGGATC GAAGGCGTCC CGCAGGCTAA TTGAGGATCC CAGGATACAG CATGTTCTCT CAGTGCTGAA GGATATTGAT CTGGAGCACC CGAAGCTCAG CAGGGCGCTC GAGATCATCG AGGATCAGCT TGAGACATCT CCGGAGTCGA GGATAATCGT GTTCACAAAC TACCGCGACA CAGCGACAGC GCTTCTCAGG TTTCTTCAAG CGAACGCCTC TGATGCTGTG AAACCCGTTC GCTTTGTCGG CCAGGCGAGC AGGGAGAATG ATGAGGGGCT GAGCCAGAGA AAGCAGTCAG AGATCCTGGA GAAGTTCAGA GCAGGAGAGT ACAACGTCCT CATAGCGACC TCTGTTGGAG AGGAGGGCAT AGACATACCA TCCACAGATA TGGTCCTGTT CTACGAGCCG GTACCCTCTG AGATAAGAAG CATACAGCGC AAGGGCAGAA CCGGGCGTGC AAGGACCGGC CGGGTAGTTG TGCTGATAGC GAAGGGAACA AGGGACGAGG CATACTACTG GATAAGCGAT CGAAAGGAAC GGACCATGAG GAGGCAGCTC CAGGGCATGG CAGAGCCGCT GCCAGTAGAC TCTGCTGTAC CTGATACAGC TCCAATCTCA TCGAGAGCCT CGAGGCAGAT CAGCATCACC GAGATATGCG AGCCGGATGA GCTGCCTCTG ATTATCGTCG ATTCCCGTGA GCGCGATATG GCCAGGCTTC TCGAGAAGAC CGGGCTCAGA ATAGTCCTGA GGTCTCTTGA GGTTGGTGAT TACGTCCTCT CAGAGCGGCT CGGAATAGAG AGGAAGACTG CGGACGATCT CATCGATTCT ATCATAGATC CTGAGCGGGA TCTCTTCAGG 
CAGATAGGAG ATCTTGCAAG AACATACGAT CGGCCGCTGC TGATCATAGA GGGCCAGAAC CTCTACGCCC GACAGGTCCA TCCGAACTCT GTCAGGGGAA TTCTGGCCAC AATAGCGGTG GATTTCGGCG TCCCGATCGT GCCCACCGGG AGCATTGAGG AGACTGCAGC TCTGATAGCA CTGATGGCGA GAAGGGAGCA TGAGGCCGGC TACAGGGACG TGAAGCTGCA CGGGAGGAAG ACGTCCAGAA CGCTGAAGGA ACAACAGGAG TACCTCATAT CCGCACTTCC CGGAGTCGGG CCGTCAGTGG CGCGCAACCT CCTGCGCCAC TTCGGATCTG TGGAGAGGAT CATGACAGCG AGCGAGGGGG AGCTGATGTC TGTGGACAAG GTCGGCCCAA AGACTGCTGC CAGGATCAGG GAGATAGTGT CAGGCGAGTA CAAGGGGTGA
|
Protein sequence | MSYLVHPLLK PEAVEKRLFQ IDLAARALRG STLVVMPTGL GKTIVALMVM LARLEKGRVL FLAPTRPLVE QHAAFLRRVL TSPDLVASVT GETDPESRAE IWRSCRIAVS TPQVVENDLL SGRMDLRDVS LVIFDEAHRA AGNYAYVYIA ERYRREARDP LVLGMTASPG SEAERIAEIC ANLGIESIEM KSESDPDVAP FVHHREIEWI KVEVPEQLQK IRGVIDGLVS ERMEEINSLG MCRIDPRTSK GELLDLQKRF SSALARGPNQ NIFRGISLLA EIMKLKHAVE LAETQGVSAL RQYLERLAQE ARSRGGSKAS RRLIEDPRIQ HVLSVLKDID LEHPKLSRAL EIIEDQLETS PESRIIVFTN YRDTATALLR FLQANASDAV KPVRFVGQAS RENDEGLSQR KQSEILEKFR AGEYNVLIAT SVGEEGIDIP STDMVLFYEP VPSEIRSIQR KGRTGRARTG RVVVLIAKGT RDEAYYWISD RKERTMRRQL QGMAEPLPVD SAVPDTAPIS SRASRQISIT EICEPDELPL IIVDSRERDM ARLLEKTGLR IVLRSLEVGD YVLSERLGIE RKTADDLIDS IIDPERDLFR QIGDLARTYD RPLLIIEGQN LYARQVHPNS VRGILATIAV DFGVPIVPTG SIEETAALIA LMARREHEAG YRDVKLHGRK TSRTLKEQQE YLISALPGVG PSVARNLLRH FGSVERIMTA SEGELMSVDK VGPKTAARIR EIVSGEYKG
|
| |
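The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below is an illustration rather than part of the record. It takes only the first three blocks of the gene sequence, strips the grouping, translates the codons, and computes a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard amino-acid assignment string, which translation table 11 is assumed here to share with the standard code for all non-start codons.

```python
from itertools import product

# First three 10-base blocks of the gene sequence, as displayed in the record.
GENE_EXCERPT = "ATGTCCTACC TGGTCCATCC GCTCCTCAAG"
# First 10-residue block of the protein sequence, as displayed in the record.
PROTEIN_EXCERPT = "MSYLVHPLLK"

# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 uses these same assignments for elongation codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(bases): aa
    for bases, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a stop codon appears as '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


dna = clean(GENE_EXCERPT)
protein = translate(dna)

print(protein == PROTEIN_EXCERPT)
print(f"GC fraction of excerpt: {gc_fraction(dna):.3f}")
```

Applying the same functions to the full gene sequence would be expected to reproduce the listed protein followed by a trailing `*` for the stop codon, although that has not been run here. The GC value printed by the sketch describes only the 30-base excerpt, not the whole gene.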