Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_1397 |
Symbol | |
ID | 4463020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 1493787 |
End bp | 1496612 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639700415 |
Product | DNA polymerase Pol2 |
Protein accession | YP_843812 |
Protein GI | 116754694 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0417] DNA polymerase elongation subunit (family B) |
TIGRFAM ID | [TIGR00592] DNA polymerase (pol2) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.61356 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACTTTC AGATCCTGGA TGCAAATTAC GTCTACGACA TGAGCGGCTA TCCTGTGGTA CAGCTGTTTG GCATCAGAGA TAATGGGGAT AGCATCATCT GTCGCGTCTC GGGATTCAGG CCGTATTTTT ACGCAAGCGC TCATGATATC ACCAGTGCAG CGGAAGCTGT CGAATCCATG GACCTCGATG TGGAGATCGT CGAGCGATAC GAGCCCATCG GGTACCAGTC GAAGCCAATA AAGATGCTGA GGATAATCGC GAAGGACCCG AAGTCTGTCA GGGAGATAAG GGACAGGGTG AGAGAGATTC CCTCTGTGAG GGCAGTCTAC GAGACCGACA TCCTGTTCAA GAACAGGTTT CTCATTGATA CAGGTCTCGG CGGCATGGCG TGGGTTAAAG TCCTGGATGT TGAGAGAGAT TCGACCTCAG CGAAGGAGTC GAGGGATGGC CTTTCGTGGT ATGGGCTCGC AAATTACAGA GAGGTTGAGA GAGATTCGAC CTCAGCGAAG GAGTCGAGGG TGCTCGAAGG ATGCGAGATG CGCGAAAGGC GCCGGTTTAT CGTTGATGTG CCTGTGGAGA AGCTGGCGCC GGAGGATCTG GAGAGCACTG CGCCCCTCAG GTTCATGTCC TTCGATATCG AATGTCTCCC ACAGAATGGG GAGATGCCGA GGCCGGAGAG CTCCCCTGTG ATACTGATAA GTATGGCGTT TCATCCCCCT TACCACGGGA TGAGCGATCT CGTTCTCGTG GGGAGAGAAC TCGAATGTGA TAGGCCGGAT GTTGAGGGGT GTGCAGACGA GAGAGCCCTT ATCTCCAGAT TTGTGTCCGT GATCGATGAT TACGATCCCG ACATAATCGC GGGCTACAAC TCGAACGAGT TCGACATTCC ATATCTAAGA GAGCGTGCAT CCAGGCTCGG CATCGAGATG AATGTGGGTC GTGACGGAAG CACGTGGCTC ATCAGGAGCA TGGGCAGCAA CAAAAACGTC GCTGTGACAG GCCGGGTTGT TGTTGACCTG CTTCCGATCA TCAGGTCCTC ATTCAGCCTG AAGCAGTACA CTCTGAGAAA TGCTGCCATG GAGCTGATCG GGGAGGAGAA GCGAGATATG GACCCTGCCA GGATGGAATC GATCTGGCTT GGGGGAGATG GGCTGGCCGA TCTAATACGC TACTCGAGGC GGGATGCTGT GCTTGTGATG CAGCTCCTCC TCCGCCTGAG GCTGATGGAT AAATACATAG CGCTGGCGAG GGTGAGCGGA TCGCTGCTCC AGGACATAGT CAACGGCGGC CAGAGCGGCA TGGTGGAAAA TCTCATTCTC AGAAGGTTCC GATCGCATAA GAGAGTCCTC CCGCCGAAGC CAGACTCGGA GGAGTCGGGT GAGCGGTTCA CGGATGCAGA TGAGCTTAAG GGTGGTGCGG TGCTCCCCCC GGTGAAGGGC CTTGTGGAGA ATGTGGTCAT CTTGGACTAC AAATCGCTCT ACCCCACGAT AATGATGGCC CACAACCTCT GCTACTCAAC GGTGGTCACA AAGGAAAGAC CTCCAGAGGT TGTTAAATCG CCATCCGGCG GGTATTTCGC ATCTCCCTCT GTCTGCAAGG GGATCGTGCC GGAGATACTG AGGGAGCTGC TGGAGAAGAG GACGGAGACG AAGATGCTCA TGAAGAGCGC AGGAGAGGAC GAGCGCGCAT TTCTCGATGC AAAGCAGTAC GCGCTCAAGA TCCTGCTGAA CAGCTTCTAC GGGTACTCCG GATACGCAAG GGCGAGGCTG TACAGTCTGA CTCTGGCGAA CGCTGTCACA AGCTTCGGGA GACACAACAT ACTGAGGACA AAGGAGATGA TCGAGGAGAT CGGCTCAGTC TACATTGTTG ACGGGAAAGC TCTGCTTCCC GAGGAGACTT CCGTCTCCTC CAATCCGAAA CCCCCCGACA CAATTGAACT TAATTCCGCC GGAAGACGCT ACGACCTCTC TGTTGTCTAC GGTGATACAG ACAGCGTCTT CGTGAGGATA TCATCAGGTT ACAGCATCAC CCCCGAGGAT GCGGAGCTGA TAGGAAGAAA GATCGCTGAG ACGATCACAT CGAAGCTCCC TAAGCCGATG GAGCTCGTAT TCGAGGCGTT TGCCAGGCGC GCGATATTCC TGGCGAAGAA AAGGTATGCC CTTTGGCTCT TCGAAAGAGT TTCGACGCAG TCGAAACCTC TCGACGGAGT CGAGGGATCT TCGAAGACCC CATCCGAAGG AAGTGGGATC ATATGGAGGG ACAGGATAAA GGTCAGGGGC ATGGAGACCG TGCGGCGGGA CTGGTGCGAC CTCACATCAA AGACCCTGAG GAGATGCCTG GAGCTGATTC TGAAGGAAGG CAGGGTGGAT GATGCAGTTC AGCACGTGAG GGACGTGATC CAGCGGCTAA GGGATATGGA CATCAGGCGG GACAGAGAGC TCCTTGATGA TCTTGTGCTC ACAAGGCGAT TCACAAAGGA TCCATCCTCA TACAGGAACA AGCAGCCACA CATCCAGCTA GTCGAGAAGA TGAGAAGGCG CGGCGGAAGG GTCCCCGGGG TGGGAGACCG GGTCCCGTTC ATAATAGTCA AGGGAGGCAA AAAGACGCTC TTCGTCGACA GGGCTGAGGA TCCGGAGTAT GCGGTGGAGA ACAACCTTCC AATAGATACA GAATACTATA TTGAGAAGCA GCTTCTTCCG CCTGTCCTCA GGCTCTTCAT GCCGTTTGAT GTGGACAGAG AGAGCCTTCT CCAGTGCAGA TCTCAGAAGA ACCTTCTCCA TTTCGATCAG AAGGCTCAGA GGCAGCGAAC CCTGCTGGAT TTCTAG
|
Protein sequence | MHFQILDANY VYDMSGYPVV QLFGIRDNGD SIICRVSGFR PYFYASAHDI TSAAEAVESM DLDVEIVERY EPIGYQSKPI KMLRIIAKDP KSVREIRDRV REIPSVRAVY ETDILFKNRF LIDTGLGGMA WVKVLDVERD STSAKESRDG LSWYGLANYR EVERDSTSAK ESRVLEGCEM RERRRFIVDV PVEKLAPEDL ESTAPLRFMS FDIECLPQNG EMPRPESSPV ILISMAFHPP YHGMSDLVLV GRELECDRPD VEGCADERAL ISRFVSVIDD YDPDIIAGYN SNEFDIPYLR ERASRLGIEM NVGRDGSTWL IRSMGSNKNV AVTGRVVVDL LPIIRSSFSL KQYTLRNAAM ELIGEEKRDM DPARMESIWL GGDGLADLIR YSRRDAVLVM QLLLRLRLMD KYIALARVSG SLLQDIVNGG QSGMVENLIL RRFRSHKRVL PPKPDSEESG ERFTDADELK GGAVLPPVKG LVENVVILDY KSLYPTIMMA HNLCYSTVVT KERPPEVVKS PSGGYFASPS VCKGIVPEIL RELLEKRTET KMLMKSAGED ERAFLDAKQY ALKILLNSFY GYSGYARARL YSLTLANAVT SFGRHNILRT KEMIEEIGSV YIVDGKALLP EETSVSSNPK PPDTIELNSA GRRYDLSVVY GDTDSVFVRI SSGYSITPED AELIGRKIAE TITSKLPKPM ELVFEAFARR AIFLAKKRYA LWLFERVSTQ SKPLDGVEGS SKTPSEGSGI IWRDRIKVRG METVRRDWCD LTSKTLRRCL ELILKEGRVD DAVQHVRDVI QRLRDMDIRR DRELLDDLVL TRRFTKDPSS YRNKQPHIQL VEKMRRRGGR VPGVGDRVPF IIVKGGKKTL FVDRAEDPEY AVENNLPIDT EYYIEKQLLP PVLRLFMPFD VDRESLLQCR SQKNLLHFDQ KAQRQRTLLD F
|
| |