Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0439 |
Symbol | |
ID | 5105556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 390681 |
End bp | 391988 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506345 |
Product | deoxyribodipyrimidine photo-lyase type I |
Protein accession | YP_001190540 |
Protein GI | 146303224 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTGTG CCTTTGTGTT CAGGAGAGAC CTTAGGCTAG ATGACAACAC TTGCCTTCTG AGAGCGCTTC AGGAATGCGA TGAAGTGGTT CCAGTATTCG TGTTGGATCC CAGGCAACTG GGCGATAATC CATACAAGTC CGCCTTCGCC TTGGGCTTCA TGGTTGATTC CCTCCTGGAT CTTGACATGC AGTTGAAGCA TCGTAGTTCA AGGCTCCACA TTCTGCAGGG ATATCCTGAG AAGGTGTTGC CAGAGCTCAA GGTTGAGGCA ATATACTTCA ACGAAGATTA CACTCCTTTC AGCCTGAACA GGGATAACGC AATAAGGGAG ACAATGCGTG GAAGGGTTAA GTCATGCGAA GACCTTCTCC TGACACCAAA GGACTTCTTC GTAAGAAAGG GAAAACCCTA CACAGTTTTC ACGCACTTTT ATAACGATGC GAGGAAGCTC GAGGTGAGGA AACCCATGAA GAACGACATG AGGAATTACC TCACTCTCGA TCTCCCTGGG ACGGAGGTCC TGAAGCTGGA GGTCGAGAGG GGTATCCCAG GCGGGAGACA GGAGGGGCTC AAGAGGCTGG AAAGGGCCAG AAACCTGAAC TACTCCATGC GTAACTTCCC AGGAGTTGAA GGTACTACGA AGCTCTCGCC ATATATCAAG TTTGGGGTTG TCTCACCGAG GGAGGTGTAC TGGGCGGTCA ACGAGGAGAT AAGGAGGCAA CTGTACTGGA GGGACTTCTA CACGCTTCTG GCCTACTATA ATCCCCACGT GTTCGGTCAT TCGTACAAGA GGGAGTACGA CTGTATACCC TGGAAGTGGA ATGAGGCTCA TCTTGAGGCA TGGAAGCAGG GTAAGACGGG TTATCCCATA GTTGACGCGG GGATGAGGGA ACTTAACGAG ACTGGATTCA TGCATAACAG AACCAGGATG ATAACGGCCT CATTTCTCGT GAAGGTATTG CATGTGGATT GGAGGATAGG GGAGAGATAC TTCGCTACAA AACTAGTTGA CTACGACCCA TCAGTAAATA ACGGAAATTG GCAATGGGTG GCCTCAACTG GTGCGGATTA CATGTTTAGG GTATTCAACC CTTGGTTGCA ACAGAGGAAG TTTGACCCAG ATGCGGTGTA CATAAAGACG TGGGTACCAG AACTGAAGGA TCTTCCAGCC GAGAAGATTC ACGAGATTTA TAGGTTCAAG GTTTCAGGCT ATCCCTCCCC CATAGTGGAT TATAGTGAGG AAGTCAAGAA AGCTAGGAAG ATGTACGAAG ACTCGGTGGC GTTATGCAGT AAGAGGGGTC TCTTTTAG
|
Protein sequence | MPCAFVFRRD LRLDDNTCLL RALQECDEVV PVFVLDPRQL GDNPYKSAFA LGFMVDSLLD LDMQLKHRSS RLHILQGYPE KVLPELKVEA IYFNEDYTPF SLNRDNAIRE TMRGRVKSCE DLLLTPKDFF VRKGKPYTVF THFYNDARKL EVRKPMKNDM RNYLTLDLPG TEVLKLEVER GIPGGRQEGL KRLERARNLN YSMRNFPGVE GTTKLSPYIK FGVVSPREVY WAVNEEIRRQ LYWRDFYTLL AYYNPHVFGH SYKREYDCIP WKWNEAHLEA WKQGKTGYPI VDAGMRELNE TGFMHNRTRM ITASFLVKVL HVDWRIGERY FATKLVDYDP SVNNGNWQWV ASTGADYMFR VFNPWLQQRK FDPDAVYIKT WVPELKDLPA EKIHEIYRFK VSGYPSPIVD YSEEVKKARK MYEDSVALCS KRGLF
|
| |