Gene Mpal_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2254 
Symbol 
ID7272551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2403431 
End bp2407282 
Gene Length3852 bp 
Protein Length1283 aa 
Translation table11 
GC content61% 
IMG OID643570866 
ProductDNA polymerase II large subunit 
Protein accessionYP_002467270 
Protein GI219852838 
COG category[L] Replication, recombination and repair 
COG ID[COG1933] Archaeal DNA polymerase II, large subunit 
TIGRFAM ID[TIGR00354] DNA polymerase, archaeal type II, large subunit
[TIGR01443] intein C-terminal splicing region
[TIGR01445] intein N-terminal splicing region 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.413677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCAG CCTATCATAA GAGGCTCCAG GATGGCCTCT ACGAGGCGAT CAGTGTTGCA 
GAGCAAGCGA GGGCTCTGGG GATCGACCCC AAGACCAGGG TCGAGATCCC GATAGCCAAT
GACTTGGCCG ACCGGGTCGA AGCGTTGCTC GGAATCACAG GCGTCGCGGC ACGGATTCGG
GAACTCGAAC AGACGATGTC CCGAGAGGAG GCTTCCCTCC GGATCGGCGA CGACTTCGTC
GCCAAGCGCT TTGGAGAGAC GACTCGGGAG GAGATCCTCG ATCATGCGAT CAGGACCGGG
ATGGCCCTGC TGACCGAAGG GGTGGTGGCG GCCCCCACCG AGGGAATCGG GAATATCTGT
CTTGGCAGGA ACGATGACGG AACCGATTAC CTAAAGATCT ATTATGCCGG CCCGATCCGG
AGCGCCGGCG GAACAGCACA GGCGCTCTCG GTGCTGGTCG GCGACTATGT CAGGCAGCAA
CTTGGGATCG GGAGGTTCAT CCCGCGCCCC GAGGAGGTCG AACGGTACAT CGAGGAGATC
AAGCAGTATA ACTCGATCAT GTCGCTCCAG TACCTGCCGA GCGAGAAGGA GATCAGAACG
ATCGTCTCCA ACTGTCCGGT CTGCATCGAT GGTGAGGCAA CCGAGCAGGA GGAGGTCTCA
GGGTACCGGA ACTTGGAACG GGTCGAGACG AACGTTGTCC GGGGCGGGAT GGCGCTGGTC
GTCGCCGAAG GGATGGCCCT CAAGGCCCCG AAGATCCAGA AGAACGTCCG GAAGATGAAG
ATGGAGGGCT GGGAATGGCT CGACGAACTG ATCAGCGGGA CGGTGAAGAC TGGTGACGAC
GACGATGAGA TCGGGGTTAA ACCCAAGGAC AAATACATCC GCGACCTGAT CGGCGGTCGG
CCGGTCTTCT CATATCCTAT GCGTAAAGGG GGTTTCCGGT TGCGGTATGG CAGGGCACGG
AACACCGGGT TCGCGGCGGC CGGCCTCCAC CCGGCGACGA TGCACCTACT CGGCGATTTT
CTGGCGGTCG GTACCCAGAT GAAGGTCGAA CGACCAGGCA AGGCCGCGGG GATCGTGCCG
GTGGATACCA TCGAGGGGCC GACGGTCAGG CTGGTGAACG GGGACGTCCT CCGGGTCGAT
GATCTGGCGC TCGCCCACCA GATCTCCGGG TCGGTGGAGC GGATCCTCGA TGTCGGTGAG
ATGCTTGTCT CGTACGGTGA GTTTCTGGAG AACAACCATC TGCTGGTACC CTGCGGATAC
TGCGATGAGT GGTGGCAGTT GGAGAGCAAC GGAGCAAGCC GCCCGACGGA CGAGACCGAG
GCGATCCTGC TGGCCCTCGA CGGGGGATAC CTCCATCCAG AGTATACACA GATGTGGGAC
GACATCACGC CAGAACAACT GATTACTCTC TCAGACTGGG TCAGCCGGAC CGGGGCGGTC
ACCAGAGCCG GGCTCGTCCT TCCCTGCGAA GCGGAAGGGA AAGCGATCCT CGAGGAGTTG
CTGGTTCCAC ACACAGTCAG GGACGACCGG ATCATCATCT CCCGTCATCT GGTGCTGATA
GCAGCCCTCG GGCTCGACCT GCACCTGCAG AGGCGAGGAG TCTGGGCGGA TGCGCCGACC
GACGGCCATA ATGCATTGTC GCTCGCCTGC CACCTCGCCG GGTTTGCGAT GCGGCAGAAG
GGAGGGACGC GGATCGGGGG TCGGATGGGC AGGCCCGGCA AGTCCAAGGC GCGGGAGATG
AAGCCTCCGC CCCACGCACT CTTCCCGGTC GGCGAGGCCG GCGGCGCCCG TCGTTCATTC
CAGGAGGCCT CGACCTACGC TCCGGAACGA AACCGGGACG GTGGAGTGAT CACTGCAGAG
GTTGGGGAGC GGCGGTGCCC CGCATGCAAG ACGATCACCT ACAAGAACCG ATGCACCTGC
GGGGCTCACA CTGAACCTGT CTTCCGGTGC CCGAAGTGCA ACATAGAGAT CAACGCGGAC
AGGTGCCCCC GGTGCGATGG CGGGACGGTC TGCACCCAGA CGGTCTCGAT CAATGTTAAG
AACGACTATG CTACGGTCCT TGAGGAACTC GGCCTCAGGA CTGGGATGGT CCCGCTCGTC
AAGGGGGTAA AGGGACTGAT CTCACGGGAG CGGGTCGTCG AAGACCTGGC CAAGGGGGTC
CTCCGGGCCC GGCACAGCCT CTATGTCTTC AAGGACGGGA CGGTCAGGTA CGATATGATC
GATCTTCCCC TGACCCATAT CCGGACCGAC GAGTGCGGAG TGACCCCCGA ACAACTGGTC
GCCCTCGGGT ACACCCAGGA TGTCTACGGG GTGCCGCTGA CCGACCCGAC GCAGGTGGTG
GAACTCCGGC CTCAGGACAT CCTGGTCTCA GAGAAGTGTG CCGTCTGGCT GCTGGAGGTG
ACGGCTTTCA TCGACGATCT GCTGGAGAAG GTCTACCACC TCCCCCGATT CTACAATGTC
ACGTCCCGGG CCGACCTGAT CGGCCACCTG GTAATAGGAC TTGCACCGCA CACCAGTGCC
GGAGTACTGG CCAGGCTGGT CGGGTTCACA AAAGCGAATG TCGGCTATGC CCACCCGTTC
TTCCATGCTG CGAAGAGGCG GAACTGTTTC TATGGAGATA CGGTCATCGA GGTCTATGAT
TATCGCTCCT GGACGAAGGT GCCGATCCGG CAGTTCGTCC TCGAGAACTT TGACCTTTCA
AATCCAGGGC TCGACCATTT CGGCACATTC TACTCTGATC CAAAGAGGTC GTTTCTGACC
CGATCGGTAG ATACGCAGGG AACGATGCAT CTCAAGAAGA TCACCTCGGT TTCGGTGCAC
CGGGCCCCGC CCGCCCTGAT CCGGTTCGGG ACCTCCCGGG GAAAGGTAGT CTCGGTCACC
CCCGAGCATG CGATGCTGAT CTGGGACGTG GACTACCTCA AAAAGATCCG GGCCGACGAA
GTTAAGATCG GAGACGCTGT CCCGGTCTAC GAAGGCGGGC AGGTGCTCGC CGACCGGATC
ACCGAACTGG ACATCGTTCC CTGCCCCGAC GATAGGGTCT TCTGCCTGAC GGTGGCCGAT
GACCACACCC TGGTCGCGGA CGGAATCTTC ACCGGCCAGT GCGACGGGGA TGAGGACTGT
GTGATGATGC TCCTCGACGG GCTGATCAAC TTCTCCCGTT CGTTTCTGCC AGAGACTAGG
GGCGGATCGA TGGATGCTCC GCTGGTGCTC ACCTCCAGGC TCGACCCGGC CGAGATCGAC
AAAGAGTCGC TGAACGTCGA TGTCGGGTCC AGTTATCCAC TGGAATTCTA CCTGGCCGCC
CAGCAGTACA CCCACCCAAA AGACCTCGAT GCCCTGATCG ATCGGGTGGA GCGGCGGCTC
GGCACCCCAG CCCAGCTGGA GGGGTTCATG TTCACCCATG ATACTTCCAA TATCTCGGAA
GGGCCGCTCG AGTCCACCTA TACGATCCTC GAGTCGATGG TCGACAAGCT CGGCGCCGAA
CTCGACCTGG CCGAGAAGAT CCGGGCCGTC GACGCCGACG ATGTGGCAGA GCGAGTGCTC
AAGACCCACT TCATGCCCGA CCTGATGGGA AACCTATCCG CCTTCTCCAA GCAGAAGTTC
AGGTGCACCC GGTGCGGCTC CAAGTACCGT CGGATGCCGC TCGCCGGACG GTGCATCAAG
TGCGGGAATA CGATCATCCC GACTGTGCAT GAGGGGTCAG TGAAGAAATA CCTGGAGATC
TCCAAGGGGA TCTGTAACAA GTATGCGGTC TCGGAGTACA CCCGCCAGCG GGTCGAGGTG
CTGGACATGG CGATCCAGTC CACCTTCGGG GCGGCGAAGG AGCAGCAGCT CGGCCTCGCG
GACTTTATGT GA
 
Protein sequence
MMAAYHKRLQ DGLYEAISVA EQARALGIDP KTRVEIPIAN DLADRVEALL GITGVAARIR 
ELEQTMSREE ASLRIGDDFV AKRFGETTRE EILDHAIRTG MALLTEGVVA APTEGIGNIC
LGRNDDGTDY LKIYYAGPIR SAGGTAQALS VLVGDYVRQQ LGIGRFIPRP EEVERYIEEI
KQYNSIMSLQ YLPSEKEIRT IVSNCPVCID GEATEQEEVS GYRNLERVET NVVRGGMALV
VAEGMALKAP KIQKNVRKMK MEGWEWLDEL ISGTVKTGDD DDEIGVKPKD KYIRDLIGGR
PVFSYPMRKG GFRLRYGRAR NTGFAAAGLH PATMHLLGDF LAVGTQMKVE RPGKAAGIVP
VDTIEGPTVR LVNGDVLRVD DLALAHQISG SVERILDVGE MLVSYGEFLE NNHLLVPCGY
CDEWWQLESN GASRPTDETE AILLALDGGY LHPEYTQMWD DITPEQLITL SDWVSRTGAV
TRAGLVLPCE AEGKAILEEL LVPHTVRDDR IIISRHLVLI AALGLDLHLQ RRGVWADAPT
DGHNALSLAC HLAGFAMRQK GGTRIGGRMG RPGKSKAREM KPPPHALFPV GEAGGARRSF
QEASTYAPER NRDGGVITAE VGERRCPACK TITYKNRCTC GAHTEPVFRC PKCNIEINAD
RCPRCDGGTV CTQTVSINVK NDYATVLEEL GLRTGMVPLV KGVKGLISRE RVVEDLAKGV
LRARHSLYVF KDGTVRYDMI DLPLTHIRTD ECGVTPEQLV ALGYTQDVYG VPLTDPTQVV
ELRPQDILVS EKCAVWLLEV TAFIDDLLEK VYHLPRFYNV TSRADLIGHL VIGLAPHTSA
GVLARLVGFT KANVGYAHPF FHAAKRRNCF YGDTVIEVYD YRSWTKVPIR QFVLENFDLS
NPGLDHFGTF YSDPKRSFLT RSVDTQGTMH LKKITSVSVH RAPPALIRFG TSRGKVVSVT
PEHAMLIWDV DYLKKIRADE VKIGDAVPVY EGGQVLADRI TELDIVPCPD DRVFCLTVAD
DHTLVADGIF TGQCDGDEDC VMMLLDGLIN FSRSFLPETR GGSMDAPLVL TSRLDPAEID
KESLNVDVGS SYPLEFYLAA QQYTHPKDLD ALIDRVERRL GTPAQLEGFM FTHDTSNISE
GPLESTYTIL ESMVDKLGAE LDLAEKIRAV DADDVAERVL KTHFMPDLMG NLSAFSKQKF
RCTRCGSKYR RMPLAGRCIK CGNTIIPTVH EGSVKKYLEI SKGICNKYAV SEYTRQRVEV
LDMAIQSTFG AAKEQQLGLA DFM