Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1775 |
Symbol | |
ID | 5104775 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1717749 |
End bp | 1718693 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640507673 |
Product | 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
Protein accession | YP_001191854 |
Protein GI | 146304538 |
COG category | [R] General function prediction only |
COG ID | [COG2514] Predicted ring-cleavage extradiol dioxygenase |
TIGRFAM ID | [TIGR02295] 3,4-dihydroxyphenylacetate 2,3-dioxygenase |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0882804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACA TATTACGCGT CTCCCATGTA GTTTACAGGG TCACTGATCT GGACAGGGCC CTCTTTTTCT ATAGGGACCT CCTGGGCTTC GTGGAGACGG AGAGAAATGG AAACGAGGCT TATTTACGCG GAGTTGAGGA GGGACAACAT CACAGTCTTG TTTTAAGGAA AGCTGACTCC CCCGGTTTAT CCTACGCCTC CTTGAGGGTG AGAAAACCCG AGGTTCTGGA TCAGGCCAGG GAGAAGTTCG ATGAGATTGG CATAAGGTAC AGGAGAATGA AGGAAAGGGG AGTGGAGGAG GCAATCCTCT TTGAGGACCC GCAGGGTCTA CCTATTCTCC TGTATCACGA CATGGAGTAC GTGGGAGATA GAAGGCTCAA GTTCCACGAG TACAGGGGAG TGACCCCCGT AAGGATAGAT CACATCAATT TCATGGTAAG GGACCTAGAC GTTGAGGTTG AGTTCTACAC CAAGGTCTTT GGATTCACTG AGACCGAGAC GTTCCTGGAT AGGGATGGGA AAAAGATGGT CTCCTGGATG ACCAAGATCG GTCACTCGCA CGAGATTGCC ATCGCCAGAA GTTCCAGGAA CGTTCCGGGG TTTCATCACG CAACCTTCTA CGTTCATGAC GTGAGGGATA TCATAAGGGC TGCGGACCTA GTCTCCTCGG CTCAACTTTG GGACAGCCTA GAGAGGGGAC CTGGAAGGCA CGGGGTTACC CAGGGGTTTT ACGTTTACCT CAGGGATCAG GACAGGAATA GGATAGAGTT CTTCACGGGC GATTACTTCG TTCTAGATCC CGATAAGTGG AAACCCATAG CCTGGACCTG GGACCAGCTG AGGTACAGGT CAGACTTCTG GGGAAGGGAG GTGCCAGAGA CCTGGCTCAA GGAGTGGGTT CCCGTGGAGG ATATCACGGG TAAATTACGG GGGTGGAATA ATTGA
|
Protein sequence | MLDILRVSHV VYRVTDLDRA LFFYRDLLGF VETERNGNEA YLRGVEEGQH HSLVLRKADS PGLSYASLRV RKPEVLDQAR EKFDEIGIRY RRMKERGVEE AILFEDPQGL PILLYHDMEY VGDRRLKFHE YRGVTPVRID HINFMVRDLD VEVEFYTKVF GFTETETFLD RDGKKMVSWM TKIGHSHEIA IARSSRNVPG FHHATFYVHD VRDIIRAADL VSSAQLWDSL ERGPGRHGVT QGFYVYLRDQ DRNRIEFFTG DYFVLDPDKW KPIAWTWDQL RYRSDFWGRE VPETWLKEWV PVEDITGKLR GWNN
|
| |