Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1526 |
Symbol | |
ID | 5104054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1487501 |
End bp | 1488472 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640507413 |
Product | DHH superfamily phosphohydrolase |
Protein accession | YP_001191606 |
Protein GI | 146304290 |
COG category | [R] General function prediction only |
COG ID | [COG2404] Predicted phosphohydrolase (DHH superfamily) |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.390938 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTACT ACGCCATAGT GCATAACGAC TTTGATGGGA CTGCCTCGGC CAGCGTTTAC GCGAGAGCTG TCAATTCCCT CCCGAGAAAC ATCTGGTTCA CTGAGCCAAC TAAACTTCAC GAGGTTTTAG CCAAGTTAGA GTTGAGGGGA GTCTCCAGCG TGATGATAGC AGACCTAGGT ATCAATGAGT CCACCTTTCC TTCGATAGTT GAGGCTGTGA AACGTCTTAG AAGTGAAGGT GCCACAATAC AATGGTTTGA TCATCATGTT TGGAAGGAGG AGTGGAAATC GAAGCTCAAG GAGGTAGGGG TAGAAGTCTA CCACGATGTT ACTACCTGCG GTGCAGGCGT GGTAAACAAG GTCATGAACC CCAATGACGA GGTATCCAGG AGATTAGCCT CTGCGGACTG CTCCGTGGAT ATATGGCTTC ATGACGATCC ACTTGGTGAA AAATTGAGAA GGATTGTGGA GAATGACAGA AGGTTTGAAT GGAAGAAGAA ATTGCTTGAG ACCTTTTATG GTGGAACCCT TTGGAACGAC GAGTTCCAAA AAATCTTGGA GACTAGAATT AACGAGGAAT TGAAAGGATA TCAAAGGATC TGGAAATATG TGAAGGTGTT GGACGTTGAA GGTGCTAAGG TAGTGGTTGC GATAAGGTGG AAGGGTCCGC CTGACATAAG CTATGCCTCT CAGTTCCTTA TGACGAGAAC AGGGGCAGAC ATATTCGTTT CAGCTAATGG GAAGGCAGTT TCGTTCAGGA GCAATACGAT AGATGTGAGG AGGTTTGCAG CTGGACTAGG TGGCGGAGGA CATCCTCTTG CCGCAGGAGC ATCCCTTAGA ATTCCCCTGC TCTATAGGTT TTTAAGATGG ATAGGCGTTA GAGGGCCTGT GATCGATTGG GTCTCAAGAG TAGTAATTGA CGTAATAAGG AAGGAGGGGC TAGTTAAGTA CGAGAGAAAA CCAGCCCATT AG
|
Protein sequence | MDYYAIVHND FDGTASASVY ARAVNSLPRN IWFTEPTKLH EVLAKLELRG VSSVMIADLG INESTFPSIV EAVKRLRSEG ATIQWFDHHV WKEEWKSKLK EVGVEVYHDV TTCGAGVVNK VMNPNDEVSR RLASADCSVD IWLHDDPLGE KLRRIVENDR RFEWKKKLLE TFYGGTLWND EFQKILETRI NEELKGYQRI WKYVKVLDVE GAKVVVAIRW KGPPDISYAS QFLMTRTGAD IFVSANGKAV SFRSNTIDVR RFAAGLGGGG HPLAAGASLR IPLLYRFLRW IGVRGPVIDW VSRVVIDVIR KEGLVKYERK PAH
|
| |