Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0995 |
Symbol | |
ID | 5104544 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 918319 |
End bp | 919245 |
Gene Length | 927 bp |
Protein Length | 308 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640506894 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
Protein accession | YP_001191087 |
Protein GI | 146303771 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0341301 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0115779 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAACT TAAACGTGCT CAGGCTTTCT CACGTGTGCG TGAGAGTAAC AGACCTAGAG AGGGCTGAGA ACTTCTATGT CAATCTCCTT GGCTTCGTGG AGACGCAGAA GGATGGAGAC TACCTCTATC TCAGGGGGAT CGAGGAGGGA CAACACCACA GTCTAGTTCT GAAGAAGGCA AGCTCTCCAG GCCTGTGCTA TATTGCGTTC AGGGTTAGGG AGGGACTCGA TAAGGTGAGG GAACTGGGTA ACTCCACGAG ATTCAAGGAG AAGGGGGTGG AGGACTCCAT ACTTGTGGAG TCGCCTGGAG GAGTTCCCCT CCTCTTCTAT CAGGACATGG AGTATGTGGG TGATTTGAGG CTGAAGTTCT ATCTTCATCG AGGCGTATCC CCGGTGAAGT TAGCCCACGT GAATTACGTG GTAGGTAACC TTGAGAGGGA GGAGAGATTT TTCAAGGACC TCGGGTTCGT GGAAACCGAA CGTTTCCTGG ACAAGACAGG GAGGAAAACA GTGGTCTGGC TCACAAGGAG GGGTAACTCC CACGAAGTGG CAATAGCTGA GTCCCAGAGG AAAGTTCCAG GCTTCCATCA CGAGACCTTC TACGTACATG ACGTAAGGGA CGTGATTAGG GCAGCGGATT TGCTTGCATC CATGGGGTAT TGGGACAACA TAGAAAGGGG CCCGGGAAGG CATGGGGCCA CAGAGGGGTA CTACATCTAT CTCAGGGACC TGGACATGAA TAGGCTGGAG TTCTTTACTA ACGATTATGA GGTGTTAGAC CCAGATAAGT GGAAGACAGT GGAATGGACC CACGATCAGT TTAGGTTTAG GAGCGATTTC TGGGGAAGAC CAATCCCAGA TTCCTGGCTC AAGGAGTGGA TGCCCGTGGA GAACCTTCAC GGAGAGTTAA GGGGGTGGGA AGCATGA
|
Protein sequence | MTNLNVLRLS HVCVRVTDLE RAENFYVNLL GFVETQKDGD YLYLRGIEEG QHHSLVLKKA SSPGLCYIAF RVREGLDKVR ELGNSTRFKE KGVEDSILVE SPGGVPLLFY QDMEYVGDLR LKFYLHRGVS PVKLAHVNYV VGNLEREERF FKDLGFVETE RFLDKTGRKT VVWLTRRGNS HEVAIAESQR KVPGFHHETF YVHDVRDVIR AADLLASMGY WDNIERGPGR HGATEGYYIY LRDLDMNRLE FFTNDYEVLD PDKWKTVEWT HDQFRFRSDF WGRPIPDSWL KEWMPVENLH GELRGWEA
|
| |