Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0875 |
Symbol | |
ID | 5103521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 808765 |
End bp | 809793 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640506778 |
Product | metallophosphoesterase |
Protein accession | YP_001190971 |
Protein GI | 146303655 |
COG category | [R] General function prediction only |
COG ID | [COG2129] Predicted phosphoesterases, related to the Icc protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTTGC CAGCAACCCT TTTTAATAGA ATAACTTTTA GACAATTCAT GCCCCTGTTT AAAAGAAGAG GAAACAACGA AAGTGTAGGC GGAAAGAACA AAACAAGGAT TCTATTCACC TCCGACCTTC ATGGGTCCGA GACTGCTTTT AGGAAATTTC TGAATGCCGG AGTGATGCAG AAAGTTGATT GCCTTATCAT AGGTGGAGAT ATTGCAGGAA AGTCATTGGT ACCTATCATA AATAAGGGAA ATGGTTATTT TGTTGTTGAG GATAGGGAGA TCAGCGGTAG CTCTCTTAAC AACGTTGTCG CGGAATTCCG AAAGAGCGGC GCGTATTATG CGATTCTCTC TAAGGCTGAG CATGAGGAAT TAATCAATAA CAAAAAGAAA CTTGACGAGC TATTTCACGA AAAGATGAAG GAAAACCTAA GGAGTTGGGT TGAGATTGCT CAGGAAAAGC TGAAGGAAAG ACGGATTCCA GTCTTCATCA ACCTAGGTAA CGATGATCCT TCGTTTCTCT TTCAAGTTAT TGAGGAGAGC GAATTAATGA GAAAAAGTGA AGGAAATATA ATAGACATAG GCGGTCACGA GATGATATCC TTCGGATATG TTAATCCCAC ACCTTGGAGA ACACCTAGAG AAATGTCCGA GGACGAGCTG ATGTATAATC TGAGGGGAAT GGCTGAGAAG TTAGAGAGGC CAGAAAAGGC AATTTTCAAT TTTCACGCTC CTCCATATAA TACTTCCCTT GATAATGCAC CGTTACTCTC TGCCGACTTG AAACCAGTAG TAAAGGGAGG CGATGTGGTC ATGACTCACG TAGGTTCTAA GGCTATTCGC AAGATAATAG AGGAATATCA ACCAATGCTA GGCATACACG GACATATTCA CGAGTCCAGA GCGTTTGATA AGATCGGAAG GACAGTGATA ATAAATCCAG GTAGTGAGTT TAACCAGGGC ATACTTCACT CTACACTTAT CTTGCTAGAA GATGGGAGAG TTAAAGGTAA CCAGTTTATA GTGGGCTGA
|
Protein sequence | MFLPATLFNR ITFRQFMPLF KRRGNNESVG GKNKTRILFT SDLHGSETAF RKFLNAGVMQ KVDCLIIGGD IAGKSLVPII NKGNGYFVVE DREISGSSLN NVVAEFRKSG AYYAILSKAE HEELINNKKK LDELFHEKMK ENLRSWVEIA QEKLKERRIP VFINLGNDDP SFLFQVIEES ELMRKSEGNI IDIGGHEMIS FGYVNPTPWR TPREMSEDEL MYNLRGMAEK LERPEKAIFN FHAPPYNTSL DNAPLLSADL KPVVKGGDVV MTHVGSKAIR KIIEEYQPML GIHGHIHESR AFDKIGRTVI INPGSEFNQG ILHSTLILLE DGRVKGNQFI VG
|
| |