Gene Mthe_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1397 
Symbol 
ID4463020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1493787 
End bp1496612 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content55% 
IMG OID639700415 
ProductDNA polymerase Pol2 
Protein accessionYP_843812 
Protein GI116754694 
COG category[L] Replication, recombination and repair 
COG ID[COG0417] DNA polymerase elongation subunit (family B) 
TIGRFAM ID[TIGR00592] DNA polymerase (pol2) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.61356 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACTTTC AGATCCTGGA TGCAAATTAC GTCTACGACA TGAGCGGCTA TCCTGTGGTA 
CAGCTGTTTG GCATCAGAGA TAATGGGGAT AGCATCATCT GTCGCGTCTC GGGATTCAGG
CCGTATTTTT ACGCAAGCGC TCATGATATC ACCAGTGCAG CGGAAGCTGT CGAATCCATG
GACCTCGATG TGGAGATCGT CGAGCGATAC GAGCCCATCG GGTACCAGTC GAAGCCAATA
AAGATGCTGA GGATAATCGC GAAGGACCCG AAGTCTGTCA GGGAGATAAG GGACAGGGTG
AGAGAGATTC CCTCTGTGAG GGCAGTCTAC GAGACCGACA TCCTGTTCAA GAACAGGTTT
CTCATTGATA CAGGTCTCGG CGGCATGGCG TGGGTTAAAG TCCTGGATGT TGAGAGAGAT
TCGACCTCAG CGAAGGAGTC GAGGGATGGC CTTTCGTGGT ATGGGCTCGC AAATTACAGA
GAGGTTGAGA GAGATTCGAC CTCAGCGAAG GAGTCGAGGG TGCTCGAAGG ATGCGAGATG
CGCGAAAGGC GCCGGTTTAT CGTTGATGTG CCTGTGGAGA AGCTGGCGCC GGAGGATCTG
GAGAGCACTG CGCCCCTCAG GTTCATGTCC TTCGATATCG AATGTCTCCC ACAGAATGGG
GAGATGCCGA GGCCGGAGAG CTCCCCTGTG ATACTGATAA GTATGGCGTT TCATCCCCCT
TACCACGGGA TGAGCGATCT CGTTCTCGTG GGGAGAGAAC TCGAATGTGA TAGGCCGGAT
GTTGAGGGGT GTGCAGACGA GAGAGCCCTT ATCTCCAGAT TTGTGTCCGT GATCGATGAT
TACGATCCCG ACATAATCGC GGGCTACAAC TCGAACGAGT TCGACATTCC ATATCTAAGA
GAGCGTGCAT CCAGGCTCGG CATCGAGATG AATGTGGGTC GTGACGGAAG CACGTGGCTC
ATCAGGAGCA TGGGCAGCAA CAAAAACGTC GCTGTGACAG GCCGGGTTGT TGTTGACCTG
CTTCCGATCA TCAGGTCCTC ATTCAGCCTG AAGCAGTACA CTCTGAGAAA TGCTGCCATG
GAGCTGATCG GGGAGGAGAA GCGAGATATG GACCCTGCCA GGATGGAATC GATCTGGCTT
GGGGGAGATG GGCTGGCCGA TCTAATACGC TACTCGAGGC GGGATGCTGT GCTTGTGATG
CAGCTCCTCC TCCGCCTGAG GCTGATGGAT AAATACATAG CGCTGGCGAG GGTGAGCGGA
TCGCTGCTCC AGGACATAGT CAACGGCGGC CAGAGCGGCA TGGTGGAAAA TCTCATTCTC
AGAAGGTTCC GATCGCATAA GAGAGTCCTC CCGCCGAAGC CAGACTCGGA GGAGTCGGGT
GAGCGGTTCA CGGATGCAGA TGAGCTTAAG GGTGGTGCGG TGCTCCCCCC GGTGAAGGGC
CTTGTGGAGA ATGTGGTCAT CTTGGACTAC AAATCGCTCT ACCCCACGAT AATGATGGCC
CACAACCTCT GCTACTCAAC GGTGGTCACA AAGGAAAGAC CTCCAGAGGT TGTTAAATCG
CCATCCGGCG GGTATTTCGC ATCTCCCTCT GTCTGCAAGG GGATCGTGCC GGAGATACTG
AGGGAGCTGC TGGAGAAGAG GACGGAGACG AAGATGCTCA TGAAGAGCGC AGGAGAGGAC
GAGCGCGCAT TTCTCGATGC AAAGCAGTAC GCGCTCAAGA TCCTGCTGAA CAGCTTCTAC
GGGTACTCCG GATACGCAAG GGCGAGGCTG TACAGTCTGA CTCTGGCGAA CGCTGTCACA
AGCTTCGGGA GACACAACAT ACTGAGGACA AAGGAGATGA TCGAGGAGAT CGGCTCAGTC
TACATTGTTG ACGGGAAAGC TCTGCTTCCC GAGGAGACTT CCGTCTCCTC CAATCCGAAA
CCCCCCGACA CAATTGAACT TAATTCCGCC GGAAGACGCT ACGACCTCTC TGTTGTCTAC
GGTGATACAG ACAGCGTCTT CGTGAGGATA TCATCAGGTT ACAGCATCAC CCCCGAGGAT
GCGGAGCTGA TAGGAAGAAA GATCGCTGAG ACGATCACAT CGAAGCTCCC TAAGCCGATG
GAGCTCGTAT TCGAGGCGTT TGCCAGGCGC GCGATATTCC TGGCGAAGAA AAGGTATGCC
CTTTGGCTCT TCGAAAGAGT TTCGACGCAG TCGAAACCTC TCGACGGAGT CGAGGGATCT
TCGAAGACCC CATCCGAAGG AAGTGGGATC ATATGGAGGG ACAGGATAAA GGTCAGGGGC
ATGGAGACCG TGCGGCGGGA CTGGTGCGAC CTCACATCAA AGACCCTGAG GAGATGCCTG
GAGCTGATTC TGAAGGAAGG CAGGGTGGAT GATGCAGTTC AGCACGTGAG GGACGTGATC
CAGCGGCTAA GGGATATGGA CATCAGGCGG GACAGAGAGC TCCTTGATGA TCTTGTGCTC
ACAAGGCGAT TCACAAAGGA TCCATCCTCA TACAGGAACA AGCAGCCACA CATCCAGCTA
GTCGAGAAGA TGAGAAGGCG CGGCGGAAGG GTCCCCGGGG TGGGAGACCG GGTCCCGTTC
ATAATAGTCA AGGGAGGCAA AAAGACGCTC TTCGTCGACA GGGCTGAGGA TCCGGAGTAT
GCGGTGGAGA ACAACCTTCC AATAGATACA GAATACTATA TTGAGAAGCA GCTTCTTCCG
CCTGTCCTCA GGCTCTTCAT GCCGTTTGAT GTGGACAGAG AGAGCCTTCT CCAGTGCAGA
TCTCAGAAGA ACCTTCTCCA TTTCGATCAG AAGGCTCAGA GGCAGCGAAC CCTGCTGGAT
TTCTAG
 
Protein sequence
MHFQILDANY VYDMSGYPVV QLFGIRDNGD SIICRVSGFR PYFYASAHDI TSAAEAVESM 
DLDVEIVERY EPIGYQSKPI KMLRIIAKDP KSVREIRDRV REIPSVRAVY ETDILFKNRF
LIDTGLGGMA WVKVLDVERD STSAKESRDG LSWYGLANYR EVERDSTSAK ESRVLEGCEM
RERRRFIVDV PVEKLAPEDL ESTAPLRFMS FDIECLPQNG EMPRPESSPV ILISMAFHPP
YHGMSDLVLV GRELECDRPD VEGCADERAL ISRFVSVIDD YDPDIIAGYN SNEFDIPYLR
ERASRLGIEM NVGRDGSTWL IRSMGSNKNV AVTGRVVVDL LPIIRSSFSL KQYTLRNAAM
ELIGEEKRDM DPARMESIWL GGDGLADLIR YSRRDAVLVM QLLLRLRLMD KYIALARVSG
SLLQDIVNGG QSGMVENLIL RRFRSHKRVL PPKPDSEESG ERFTDADELK GGAVLPPVKG
LVENVVILDY KSLYPTIMMA HNLCYSTVVT KERPPEVVKS PSGGYFASPS VCKGIVPEIL
RELLEKRTET KMLMKSAGED ERAFLDAKQY ALKILLNSFY GYSGYARARL YSLTLANAVT
SFGRHNILRT KEMIEEIGSV YIVDGKALLP EETSVSSNPK PPDTIELNSA GRRYDLSVVY
GDTDSVFVRI SSGYSITPED AELIGRKIAE TITSKLPKPM ELVFEAFARR AIFLAKKRYA
LWLFERVSTQ SKPLDGVEGS SKTPSEGSGI IWRDRIKVRG METVRRDWCD LTSKTLRRCL
ELILKEGRVD DAVQHVRDVI QRLRDMDIRR DRELLDDLVL TRRFTKDPSS YRNKQPHIQL
VEKMRRRGGR VPGVGDRVPF IIVKGGKKTL FVDRAEDPEY AVENNLPIDT EYYIEKQLLP
PVLRLFMPFD VDRESLLQCR SQKNLLHFDQ KAQRQRTLLD F