Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpal_2577 |
Symbol | |
ID | 7270704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 2703203 |
End bp | 2704411 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643571171 |
Product | metal dependent phosphohydrolase |
Protein accession | YP_002467570 |
Protein GI | 219853138 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.725882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA TCAAAGACCC AGTCCACGGG TATGTCGAGG TGGATGACCA CCTCCTCCCA CTCCTCGACG CCCCGGTGAT GCAGCGGCTT CGTGCAGTCC GGCAGCTTGG GTTCTCGTAC CTGGTCTATC CTGGGGCCAA CCACACCCGC TTCGAACACT CGCTCGGGAC GATGCACCTG GCTGGGCTGA TGGCGCGGCA GCTGGACCTC CCTGCCGAGG AGACACTACT GGTGAAGACC GCGGCCCTGC TCCACGACAT CGGCCACGGT CCGTACTCCC ATGCGATCGA ACCGTTCATG GAGGAGTACA CCGGGCGGAG CCATCACCAT ATCAGGGAGG TTCTGGTCAG CACCGGTACC TGGGATCTGA TAAATGACGC CGGGATCGAT CCTGAGGCGG TCTGCGCCCT GATCGAGGGG GAGCACCCGC TCGGCGGGAT CATCCACGGG GACCTGGACG TGGACCGGAT GGACTATCTG CTGCGGGATG CCCACTACAC AGGGGTGCCA TACGGCACCG TCGACGCCCA CCGATTGATC CGATCGGTGA TGTTGACCGA CAGGGGCATG GTCCTGCGGG AGAGCGGGAT CAATGCCGCC GAGTCCCTGC TGATCGCACG GACCCTGATG CGCCCGGCGG TCTACTTTCA TCATGTAGGA CGGATCGCCG AGGGGATCTT CGGGCTGGCG CTCTATTACC ATATCAGTAC CGGCGTTGAC CAGACTGATA TGGCGGCTCT GGTCCGGATG GACGATGGGG CCTGCATCCA GACACTCAGG AACTCCGCCG CCCAGAGGGC GCAGCTCCTG GTGGCCCGGC TGCTCCAGCG GGATCTCTTC AAGCGGGCGC TGTACGTTGG TAAGGACCAG ATCAATACCG CTGCCCTCCC GGCGGTCCGG CCACAGCACG AGGAACGGGC GATCGCAGCT GCAATCGCCG ATCAGGCTGG GGTCAGGGAG GAGGAGGTGC TGGTCGACAT CCCCGAGTTC CCGCACGAGA TGTCGATGGC CGTTCAGGTC AGGCACCGGC ATCTGACTGT GCAGCTCGAG GAACTCTCCC CGATGTTGAA GACCCTGAAC GAGACCCGGC AGGGGCAGTG GAGGCTCGGG GTGTACGCCC CGGCCGCAGT TTCAGAGCAG GTCGGATTGG CGGCGGCTGC TATCCTCCAC ATCAGAAAAC CGACCGCCCA GGGGAAGCTC ATCCTCTGA
|
Protein sequence | MKIIKDPVHG YVEVDDHLLP LLDAPVMQRL RAVRQLGFSY LVYPGANHTR FEHSLGTMHL AGLMARQLDL PAEETLLVKT AALLHDIGHG PYSHAIEPFM EEYTGRSHHH IREVLVSTGT WDLINDAGID PEAVCALIEG EHPLGGIIHG DLDVDRMDYL LRDAHYTGVP YGTVDAHRLI RSVMLTDRGM VLRESGINAA ESLLIARTLM RPAVYFHHVG RIAEGIFGLA LYYHISTGVD QTDMAALVRM DDGACIQTLR NSAAQRAQLL VARLLQRDLF KRALYVGKDQ INTAALPAVR PQHEERAIAA AIADQAGVRE EEVLVDIPEF PHEMSMAVQV RHRHLTVQLE ELSPMLKTLN ETRQGQWRLG VYAPAAVSEQ VGLAAAAILH IRKPTAQGKL IL
|
| |