Gene Information |
Locus tag | Mpal_2254 |
Symbol | |
ID | 7272551 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2403431 |
End bp | 2407282 |
Gene Length | 3852 bp |
Protein Length | 1283 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643570866 |
Product | DNA polymerase II large subunit |
Protein accession | YP_002467270 |
Protein GI | 219852838 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1933] Archaeal DNA polymerase II, large subunit |
TIGRFAM ID | [TIGR00354] DNA polymerase, archaeal type II, large subunit; [TIGR01443] intein C-terminal splicing region; [TIGR01445] intein N-terminal splicing region |
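The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes the stop codon; the function names are hypothetical and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(2403431, 2407282)
print(length)                  # 3852
print(protein_length(length))  # 1283
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.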
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.413677 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCAG CCTATCATAA GAGGCTCCAG GATGGCCTCT ACGAGGCGAT CAGTGTTGCA GAGCAAGCGA GGGCTCTGGG GATCGACCCC AAGACCAGGG TCGAGATCCC GATAGCCAAT GACTTGGCCG ACCGGGTCGA AGCGTTGCTC GGAATCACAG GCGTCGCGGC ACGGATTCGG GAACTCGAAC AGACGATGTC CCGAGAGGAG GCTTCCCTCC GGATCGGCGA CGACTTCGTC GCCAAGCGCT TTGGAGAGAC GACTCGGGAG GAGATCCTCG ATCATGCGAT CAGGACCGGG ATGGCCCTGC TGACCGAAGG GGTGGTGGCG GCCCCCACCG AGGGAATCGG GAATATCTGT CTTGGCAGGA ACGATGACGG AACCGATTAC CTAAAGATCT ATTATGCCGG CCCGATCCGG AGCGCCGGCG GAACAGCACA GGCGCTCTCG GTGCTGGTCG GCGACTATGT CAGGCAGCAA CTTGGGATCG GGAGGTTCAT CCCGCGCCCC GAGGAGGTCG AACGGTACAT CGAGGAGATC AAGCAGTATA ACTCGATCAT GTCGCTCCAG TACCTGCCGA GCGAGAAGGA GATCAGAACG ATCGTCTCCA ACTGTCCGGT CTGCATCGAT GGTGAGGCAA CCGAGCAGGA GGAGGTCTCA GGGTACCGGA ACTTGGAACG GGTCGAGACG AACGTTGTCC GGGGCGGGAT GGCGCTGGTC GTCGCCGAAG GGATGGCCCT CAAGGCCCCG AAGATCCAGA AGAACGTCCG GAAGATGAAG ATGGAGGGCT GGGAATGGCT CGACGAACTG ATCAGCGGGA CGGTGAAGAC TGGTGACGAC GACGATGAGA TCGGGGTTAA ACCCAAGGAC AAATACATCC GCGACCTGAT CGGCGGTCGG CCGGTCTTCT CATATCCTAT GCGTAAAGGG GGTTTCCGGT TGCGGTATGG CAGGGCACGG AACACCGGGT TCGCGGCGGC CGGCCTCCAC CCGGCGACGA TGCACCTACT CGGCGATTTT CTGGCGGTCG GTACCCAGAT GAAGGTCGAA CGACCAGGCA AGGCCGCGGG GATCGTGCCG GTGGATACCA TCGAGGGGCC GACGGTCAGG CTGGTGAACG GGGACGTCCT CCGGGTCGAT GATCTGGCGC TCGCCCACCA GATCTCCGGG TCGGTGGAGC GGATCCTCGA TGTCGGTGAG ATGCTTGTCT CGTACGGTGA GTTTCTGGAG AACAACCATC TGCTGGTACC CTGCGGATAC TGCGATGAGT GGTGGCAGTT GGAGAGCAAC GGAGCAAGCC GCCCGACGGA CGAGACCGAG GCGATCCTGC TGGCCCTCGA CGGGGGATAC CTCCATCCAG AGTATACACA GATGTGGGAC GACATCACGC CAGAACAACT GATTACTCTC TCAGACTGGG TCAGCCGGAC CGGGGCGGTC ACCAGAGCCG GGCTCGTCCT TCCCTGCGAA GCGGAAGGGA AAGCGATCCT CGAGGAGTTG CTGGTTCCAC ACACAGTCAG GGACGACCGG ATCATCATCT CCCGTCATCT GGTGCTGATA GCAGCCCTCG GGCTCGACCT GCACCTGCAG AGGCGAGGAG TCTGGGCGGA TGCGCCGACC GACGGCCATA ATGCATTGTC GCTCGCCTGC CACCTCGCCG GGTTTGCGAT GCGGCAGAAG GGAGGGACGC GGATCGGGGG TCGGATGGGC AGGCCCGGCA AGTCCAAGGC GCGGGAGATG AAGCCTCCGC CCCACGCACT CTTCCCGGTC GGCGAGGCCG GCGGCGCCCG TCGTTCATTC 
CAGGAGGCCT CGACCTACGC TCCGGAACGA AACCGGGACG GTGGAGTGAT CACTGCAGAG GTTGGGGAGC GGCGGTGCCC CGCATGCAAG ACGATCACCT ACAAGAACCG ATGCACCTGC GGGGCTCACA CTGAACCTGT CTTCCGGTGC CCGAAGTGCA ACATAGAGAT CAACGCGGAC AGGTGCCCCC GGTGCGATGG CGGGACGGTC TGCACCCAGA CGGTCTCGAT CAATGTTAAG AACGACTATG CTACGGTCCT TGAGGAACTC GGCCTCAGGA CTGGGATGGT CCCGCTCGTC AAGGGGGTAA AGGGACTGAT CTCACGGGAG CGGGTCGTCG AAGACCTGGC CAAGGGGGTC CTCCGGGCCC GGCACAGCCT CTATGTCTTC AAGGACGGGA CGGTCAGGTA CGATATGATC GATCTTCCCC TGACCCATAT CCGGACCGAC GAGTGCGGAG TGACCCCCGA ACAACTGGTC GCCCTCGGGT ACACCCAGGA TGTCTACGGG GTGCCGCTGA CCGACCCGAC GCAGGTGGTG GAACTCCGGC CTCAGGACAT CCTGGTCTCA GAGAAGTGTG CCGTCTGGCT GCTGGAGGTG ACGGCTTTCA TCGACGATCT GCTGGAGAAG GTCTACCACC TCCCCCGATT CTACAATGTC ACGTCCCGGG CCGACCTGAT CGGCCACCTG GTAATAGGAC TTGCACCGCA CACCAGTGCC GGAGTACTGG CCAGGCTGGT CGGGTTCACA AAAGCGAATG TCGGCTATGC CCACCCGTTC TTCCATGCTG CGAAGAGGCG GAACTGTTTC TATGGAGATA CGGTCATCGA GGTCTATGAT TATCGCTCCT GGACGAAGGT GCCGATCCGG CAGTTCGTCC TCGAGAACTT TGACCTTTCA AATCCAGGGC TCGACCATTT CGGCACATTC TACTCTGATC CAAAGAGGTC GTTTCTGACC CGATCGGTAG ATACGCAGGG AACGATGCAT CTCAAGAAGA TCACCTCGGT TTCGGTGCAC CGGGCCCCGC CCGCCCTGAT CCGGTTCGGG ACCTCCCGGG GAAAGGTAGT CTCGGTCACC CCCGAGCATG CGATGCTGAT CTGGGACGTG GACTACCTCA AAAAGATCCG GGCCGACGAA GTTAAGATCG GAGACGCTGT CCCGGTCTAC GAAGGCGGGC AGGTGCTCGC CGACCGGATC ACCGAACTGG ACATCGTTCC CTGCCCCGAC GATAGGGTCT TCTGCCTGAC GGTGGCCGAT GACCACACCC TGGTCGCGGA CGGAATCTTC ACCGGCCAGT GCGACGGGGA TGAGGACTGT GTGATGATGC TCCTCGACGG GCTGATCAAC TTCTCCCGTT CGTTTCTGCC AGAGACTAGG GGCGGATCGA TGGATGCTCC GCTGGTGCTC ACCTCCAGGC TCGACCCGGC CGAGATCGAC AAAGAGTCGC TGAACGTCGA TGTCGGGTCC AGTTATCCAC TGGAATTCTA CCTGGCCGCC CAGCAGTACA CCCACCCAAA AGACCTCGAT GCCCTGATCG ATCGGGTGGA GCGGCGGCTC GGCACCCCAG CCCAGCTGGA GGGGTTCATG TTCACCCATG ATACTTCCAA TATCTCGGAA GGGCCGCTCG AGTCCACCTA TACGATCCTC GAGTCGATGG TCGACAAGCT CGGCGCCGAA CTCGACCTGG CCGAGAAGAT CCGGGCCGTC GACGCCGACG ATGTGGCAGA GCGAGTGCTC AAGACCCACT TCATGCCCGA CCTGATGGGA AACCTATCCG CCTTCTCCAA GCAGAAGTTC AGGTGCACCC 
GGTGCGGCTC CAAGTACCGT CGGATGCCGC TCGCCGGACG GTGCATCAAG TGCGGGAATA CGATCATCCC GACTGTGCAT GAGGGGTCAG TGAAGAAATA CCTGGAGATC TCCAAGGGGA TCTGTAACAA GTATGCGGTC TCGGAGTACA CCCGCCAGCG GGTCGAGGTG CTGGACATGG CGATCCAGTC CACCTTCGGG GCGGCGAAGG AGCAGCAGCT CGGCCTCGCG GACTTTATGT GA
|
Protein sequence | MMAAYHKRLQ DGLYEAISVA EQARALGIDP KTRVEIPIAN DLADRVEALL GITGVAARIR ELEQTMSREE ASLRIGDDFV AKRFGETTRE EILDHAIRTG MALLTEGVVA APTEGIGNIC LGRNDDGTDY LKIYYAGPIR SAGGTAQALS VLVGDYVRQQ LGIGRFIPRP EEVERYIEEI KQYNSIMSLQ YLPSEKEIRT IVSNCPVCID GEATEQEEVS GYRNLERVET NVVRGGMALV VAEGMALKAP KIQKNVRKMK MEGWEWLDEL ISGTVKTGDD DDEIGVKPKD KYIRDLIGGR PVFSYPMRKG GFRLRYGRAR NTGFAAAGLH PATMHLLGDF LAVGTQMKVE RPGKAAGIVP VDTIEGPTVR LVNGDVLRVD DLALAHQISG SVERILDVGE MLVSYGEFLE NNHLLVPCGY CDEWWQLESN GASRPTDETE AILLALDGGY LHPEYTQMWD DITPEQLITL SDWVSRTGAV TRAGLVLPCE AEGKAILEEL LVPHTVRDDR IIISRHLVLI AALGLDLHLQ RRGVWADAPT DGHNALSLAC HLAGFAMRQK GGTRIGGRMG RPGKSKAREM KPPPHALFPV GEAGGARRSF QEASTYAPER NRDGGVITAE VGERRCPACK TITYKNRCTC GAHTEPVFRC PKCNIEINAD RCPRCDGGTV CTQTVSINVK NDYATVLEEL GLRTGMVPLV KGVKGLISRE RVVEDLAKGV LRARHSLYVF KDGTVRYDMI DLPLTHIRTD ECGVTPEQLV ALGYTQDVYG VPLTDPTQVV ELRPQDILVS EKCAVWLLEV TAFIDDLLEK VYHLPRFYNV TSRADLIGHL VIGLAPHTSA GVLARLVGFT KANVGYAHPF FHAAKRRNCF YGDTVIEVYD YRSWTKVPIR QFVLENFDLS NPGLDHFGTF YSDPKRSFLT RSVDTQGTMH LKKITSVSVH RAPPALIRFG TSRGKVVSVT PEHAMLIWDV DYLKKIRADE VKIGDAVPVY EGGQVLADRI TELDIVPCPD DRVFCLTVAD DHTLVADGIF TGQCDGDEDC VMMLLDGLIN FSRSFLPETR GGSMDAPLVL TSRLDPAEID KESLNVDVGS SYPLEFYLAA QQYTHPKDLD ALIDRVERRL GTPAQLEGFM FTHDTSNISE GPLESTYTIL ESMVDKLGAE LDLAEKIRAV DADDVAERVL KTHFMPDLMG NLSAFSKQKF RCTRCGSKYR RMPLAGRCIK CGNTIIPTVH EGSVKKYLEI SKGICNKYAV SEYTRQRVEV LDMAIQSTFG AAKEQQLGLA DFM
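The relationship between the two sequences above can be illustrated on the first three 10-base blocks of the gene sequence. This is a minimal sketch, not the database's own pipeline: it assumes the blocks are separated by whitespace only, uses the standard amino-acid assignments that translation table 11 shares with table 1, and does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The helper names are hypothetical.

```python
# Amino-acid assignments for NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence in this record.
prefix = "ATGATGGCAG CCTATCATAA GAGGCTCCAG"
print(translate(prefix))    # MMAAYHKRLQ
print(gc_fraction(prefix))  # 0.5
```

The translated prefix matches the first ten residues of the protein sequence. The GC fraction of this short prefix differs from the 61% reported for the full gene, which is expected for a 30-base window.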