Gene Information |
Locus tag | Mpal_0112 |
Symbol | |
ID | 7272282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 132236 |
End bp | 133924 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643568769 |
Product | PHP domain protein |
Protein accession | YP_002465228 |
Protein GI | 219850796 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1796] DNA polymerase IV (family X) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGTGTA CCAACAGGCA GGCGGCCGAT ACGATCCGGT TCATCAGCCA GCTGCTTGAG ATCAGAGGAG AGGACCCGTT CCGGGTCAGG GCATTCCAAC GGGCTGCCAT GGCGATCGAA GGGCTTGGGG AGCCGGTCTG TAGCATGGCG CCCGAACAGG TGCTGGCGGT CCCGGGGATC GGGCCGCACA CCGCGGCCCA GATCAGGGAA CTCTGCGCCG GCGAAGAGAG CAGCCTGCTT CAGGACCTGC AGCAGTCGAT CCCGGCTTCG GTGATCGCCC TCCTCGAACT GGATCAGGTG GGGCCGAAGA CCGTCCACCG GCTCTGGCAT GAACTCGGGA TCTCGACGAT CGAAGACCTG GAGAAGGCCG CCCTGGCCCA CCAGGTCAGC ACCCTGAAGG GGTTCGGGGC GAAGAAGGAG GAGGAACTGG TGAAGGCGAT CGCACGCCAC CGGCGGTCGA CCGGGCGGAT GACCAGAGTT GCTGCAGAGG CTGTGCTCAG AGAGGTGACG GCTCTGCTGG TTGAAGGGAC CTACACGATC GCAGGATCGT ATCGGCGAGG GAGGGCCACG ATCGGGGATA TCGACATCGT CTCCACCGAA GCGCCGGCCG AGGTGAACTC GCGGCTGGTC TCCCTGGCAG ACGAGGTGAT CGATCAGGGT GAAAAGCGAA CCTCGATCCG GGTCATGGGG CAGCGGGTCG ATGTCAGGTT CTGTACCCCG GACCAGTGCG GCCCGATGCT CCTCTACCTG ACCGGGTCGA AGGACTTCAA CATCAAACTC CGCGAGATCG CACTCTCTGC CGGTTGGCGG CTGAACGAGT ACGGGGTCGA GGAGCGGACG ACCGGTGCAC ACAGAACGGC GGCGACCGAG GAGGAGATCT TCTCGATGCT CGGAATGGAT CCCGTTCCGC CGGAACTCAG GGAGGACCGG GGCGAGGTGG AGGCAGCCCG GGAACACACG CTGCCCACCC TCGTGATGGC GGACGACCTC CAGGGAGATC TGCATGTTCA CTCCCGCTGG TCTGACGGCT CGATGACGAT TGCTGAGCTG GCCACCGCCG GAGAGGAACG GGGGTATCAG TACATCATCT GTTCTGACCA CTCGGCCTCA CTGGGGATCG CTCACGGTCT CTCTGCTGCA GAGGTTAGAG ATCAGCAACA CGAGATCGAG ATGGTGAACC GCACTTCATC CTGCCAGGTG CTGGCCGGGA TCGAGGTGGA CATCCTCTCC AACGGGTCGC TCGGCCTCCC CGACCGGACA CTGCAGGACC TGGACCTGGT GGTAGCCTCG ATCCATTCCG GGTTTCATCA GGAGGAAGAC CAGATAACCC GGCGGATCAT CGGTGCGATC GAGAACGAAC ATGTGGATAT CATCGGCCAT CCGACCGGGC GGGTACTCGG GCAGCGGGAT CCGTACGCGG TCGACCTGCA CCGGGTGATC GAGGCCGCCG CTCTCCACCG GACCGCCCTG GAGATCAATG CTTCCCCATT CAGGTTGGAC ATGGACGACA CCGCAGTCAG AGAAGCCCGT GATGGGGGGA TCGTCATCTC TCTCGGCAGC GATGCGCATG CGCGTTCTGA ACTGGAGAAC ATGTCCTACG GGCTTTTGAT GGCCCGCCGC GGATGGTGCC GGCCGGCAGA CCTGCTGAAC ACCAGAGACC TTTCGTCGGT GCTGGAGTGG GCCGCATGA
|
Protein sequence | MGCTNRQAAD TIRFISQLLE IRGEDPFRVR AFQRAAMAIE GLGEPVCSMA PEQVLAVPGI GPHTAAQIRE LCAGEESSLL QDLQQSIPAS VIALLELDQV GPKTVHRLWH ELGISTIEDL EKAALAHQVS TLKGFGAKKE EELVKAIARH RRSTGRMTRV AAEAVLREVT ALLVEGTYTI AGSYRRGRAT IGDIDIVSTE APAEVNSRLV SLADEVIDQG EKRTSIRVMG QRVDVRFCTP DQCGPMLLYL TGSKDFNIKL REIALSAGWR LNEYGVEERT TGAHRTAATE EEIFSMLGMD PVPPELREDR GEVEAAREHT LPTLVMADDL QGDLHVHSRW SDGSMTIAEL ATAGEERGYQ YIICSDHSAS LGIAHGLSAA EVRDQQHEIE MVNRTSSCQV LAGIEVDILS NGSLGLPDRT LQDLDLVVAS IHSGFHQEED QITRRIIGAI ENEHVDIIGH PTGRVLGQRD PYAVDLHRVI EAAALHRTAL EINASPFRLD MDDTAVREAR DGGIVISLGS DAHARSELEN MSYGLLMARR GWCRPADLLN TRDLSSVLEW AA
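The numeric fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming the CDS ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a (possibly space-grouped) sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Coordinates taken from the Gene Information table.
length = gene_length(132236, 133924)
print(length, expected_protein_length(length))  # 1689 562
```

Passing the full "Gene sequence" string to `gc_percent` gives a value that can be compared with the reported GC content, which appears to be rounded to a whole percent.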