Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0772 |
Symbol | |
ID | 5103461 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 705829 |
End bp | 706809 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640506677 |
Product | metallophosphoesterase |
Protein accession | YP_001190871 |
Protein GI | 146303555 |
COG category | [R] General function prediction only |
COG ID | [COG2129] Predicted phosphoesterases, related to the Icc protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0765989 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0617226 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTACG TGGGCCTGTT TAAGAGAAAC AATACACAAG AGACGTCCAA TCAAATCAAA ATCCTTTACA CCACTGACAT TCACGGATCT GATGTAATTT TCAAGAAATT CTTGAATGCT GGTAAAATTT ACAAGGTGAA CTACCTTATC ATAGGCGGAG ATATAGCAGG GAAGTCCCTG ACCCCTATAG TCGACATCGG GGAAGGCAAA TACATGATAG ATGACAAGAT AGTGGGAAGA GAGGGACTAA AGGAGATAAC AGATGAGATC AGGAAACAGG GAAACTATTA CGTAATAGTG GACAGGAAAG ATCTTCAAGA GATGAAAGAC GACAAGCGCA AGGTAGATGA GGCTTTCAAA ACATCAATGA TAGAGGTAGT AAGAAATTGG TCCAGAATCG CAGAGGAGAA GTTGAAGGAC GTGGATATCC CGCTTTATGT AAATCTTGGT AACGATGACC CTCTCTACTT GTTCGATGTA ATTGCTGAAA GCAAGGTTAT GAGGAAATGT GAGGGAGAGG TTATTCACCT TGGTGAGCAT GAGATGATTT CCTTTGGTTA CGTGAACCCA ACTCCCTGGA ACACACCGAG GGAGATGTCA GAGGAGAAAA TTTACGAAGT TCTAAAGCAA GAGACAAAGA AAATATCTGA CATGGAGAAG GCCATTTTCA ACATTCATGC ACCTCCCTAC AACACGAACC TTGATAGTGC TCCCTTGCTG ACCCCTGACC TGAAGCCTGT TATAAAGGGA GGGGAGGTGG TCATGTCACA CGTGGGCTCG GTCTCGGTGA GGAAGGTTAT CGAAGAGGAG CAACCCCTTC TGGGTCTCCA TGGCCACATT CACGAGTCTA GGGGATTCGA TAAGTTAGGG AGGTCGCTGG TTCTTAACCC AGGTAGTGAG CACAATGAAG GTATCTTACA CGCAGCATAC ATTATCCTGG AGAAAGGTAA AATAAAGGCT CACCAGTTCA TTATTGGATG A
|
Protein sequence | MMYVGLFKRN NTQETSNQIK ILYTTDIHGS DVIFKKFLNA GKIYKVNYLI IGGDIAGKSL TPIVDIGEGK YMIDDKIVGR EGLKEITDEI RKQGNYYVIV DRKDLQEMKD DKRKVDEAFK TSMIEVVRNW SRIAEEKLKD VDIPLYVNLG NDDPLYLFDV IAESKVMRKC EGEVIHLGEH EMISFGYVNP TPWNTPREMS EEKIYEVLKQ ETKKISDMEK AIFNIHAPPY NTNLDSAPLL TPDLKPVIKG GEVVMSHVGS VSVRKVIEEE QPLLGLHGHI HESRGFDKLG RSLVLNPGSE HNEGILHAAY IILEKGKIKA HQFIIG
|
| |