Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0406 |
Symbol | |
ID | 4462600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 416488 |
End bp | 417363 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639699410 |
Product | apurinic endonuclease Apn1 |
Protein accession | YP_842839 |
Protein GI | 116753721 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0163374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGTTT GGTCTGAGGG TGTTATCTTG GTGAGATTTG GCGTGCATAT ATCGATAGCT GGTGGCATTC ACCTCTCTGT GAGGAGGGCG ATCGAGCTTG GATGTGATAC ATATCAGATA TTCACATCGA ATCCGAGAGG CTGGCACACA AAGCAGCTCT CAGATGATGT CATAGAGGCC TTTAAAAGCC AGCTCAGCCG CTCGGGAATA TGGCCGGTTG TGGGACACAT GCCATATCTT CCTAATCTGG CCTCCCCCAG GGAGAGCGTT TATTCCAGAT CGGTACAGGC ACTGAAGGAT GAGCTGATGA GATGCAGCGC CCTGGGGATT CCGTATCTGG TAACGCACAT GGGAAGCCAC CTGGGTTCTG GCAGAGATTC AGGCATATCC CGCATAGTCG GCGCGATCGA GGCCGCACTC CCTGATTCAA ATGGCACAAA GATCCTTCTG GAGACAACAT CTGGATCAAA AAACAGCATC GGTGGCAGGT TCGAAGATCT TGCTGATGTC CTGGAGAGGG TTGGGTCGGA TCTACTCGGC ATCTGCCTTG ATACTTGCCA TGTATTCGCT GCCGGATACG ATCTCAGGGA CGAGAAAAGC CTGGATGCAA CCCTGAGAGC ATTCGACAGC ACCGCTGGAC TGAAAGACCT CATGCTGATC CATCTCAACG ACTCGGTGGG CGATCTCGGA TCTGGCCTGG ACAGGCATGA GCACATCGGC ATGGGCAAAA TCGGATTGAA CGGATTCGCA GCGGTGATAA ACGATTACCG CCTCAGAGAG CTGCCCATGA TACTGGAGAC GCCTGTCGAT AAGAGGAGAG ACGATCGTGG GAATCTCGAG GTCGTCCGTG GAATCAGCCG AATAGTGCCC TCATGA
|
Protein sequence | MSVWSEGVIL VRFGVHISIA GGIHLSVRRA IELGCDTYQI FTSNPRGWHT KQLSDDVIEA FKSQLSRSGI WPVVGHMPYL PNLASPRESV YSRSVQALKD ELMRCSALGI PYLVTHMGSH LGSGRDSGIS RIVGAIEAAL PDSNGTKILL ETTSGSKNSI GGRFEDLADV LERVGSDLLG ICLDTCHVFA AGYDLRDEKS LDATLRAFDS TAGLKDLMLI HLNDSVGDLG SGLDRHEHIG MGKIGLNGFA AVINDYRLRE LPMILETPVD KRRDDRGNLE VVRGISRIVP S
|
| |