Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1864 |
Symbol | |
ID | 5104132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 1808730 |
End bp | 1809728 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640507750 |
Product | diphthamide biosynthesis protein |
Protein accession | YP_001191928 |
Protein GI | 146304612 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1736] Diphthamide synthase subunit DPH2 |
TIGRFAM ID | [TIGR00322] diphthamide biosynthesis protein 2-related domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.866923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTACA TTTTTGATGA AGACCTTCTC AAGTCCGAGA TCAGGAAGAG AGGGGCTAGG AGGGTTCTCC TTCAGTTCCC CGAGGGTTTA AGGTATTTCT CCACGGAGTT GGTGGAGAGG TTAAGGGAAT CCCTTCCAGA CGTTGAGTTC GTGATATCGG GAGAACCGAG CTGGGGGGCC TGTGACATAG CTGAAGACGA AGCCTCCCTT CTCAAGGTCG ACCTCCTCAT CCATTTCGGC CACTCTCCTT ATACCTGGTA TTACCCCAGG TTTCCAACCC TCTTCGTTAA GGCTGAGAGC ACGGCTCAAG TGGAGAGGGA GACCCTGGAC AAGCTAGTTG ATGTCCTTCG CGAGAGAGGA GCTAACTCGG TCGCCCTAAC CTCGACCGTT CAGCACGGGA AACTACTGAA CCAGGTGAAG GAGCACCTTT CCCCCCACTT CCACGTGGAG GTTGGAAGGC CTTCCTCACC TTTCATGGGG GATGGACAGG TCCTGGGATG TGACTACAAG TCTGCCCAGG TTGAGGCCGA CGTGCACGTA AACATCTCAG GTGGGGTTTT CCACGCCCTC GGACTGGGAC TGGCCACGGG TAAACCGGTC ATCAAGCTTG ACCCCTACAC GAGATCTGTG GAGGACCTAA CTCCTCAGGT TTTCAAGGTC CTCAAGGTGA GATATTCCAA GATCATGGAG GCCATGGACG CGAGGACCTG GGTCATTGTG CAGGGATTGA AGGTTGGCCA GAACAGGCCC CTCATGGTTA AGTCCCTAGA GTCCAGGCTC AAGTCCCTGG GGAAGACAAC CTACGTGGTC ACAAGCAAGG TTCTGAACCA GGACTCCCTC AGAAACCTAG ATAGGAGCTA CATCGACGCC TTCGTGGTCA CATCGTGTCC AAGATTACCC ACGGATGACC TCTACCTTTA CGAGAAGCCC GTGTTGACAC CTGGAGAGGC GAAAATGATT ATAACCAATA AACTAGAACC ATACATATTT CCGTGGTAA
|
Protein sequence | MSYIFDEDLL KSEIRKRGAR RVLLQFPEGL RYFSTELVER LRESLPDVEF VISGEPSWGA CDIAEDEASL LKVDLLIHFG HSPYTWYYPR FPTLFVKAES TAQVERETLD KLVDVLRERG ANSVALTSTV QHGKLLNQVK EHLSPHFHVE VGRPSSPFMG DGQVLGCDYK SAQVEADVHV NISGGVFHAL GLGLATGKPV IKLDPYTRSV EDLTPQVFKV LKVRYSKIME AMDARTWVIV QGLKVGQNRP LMVKSLESRL KSLGKTTYVV TSKVLNQDSL RNLDRSYIDA FVVTSCPRLP TDDLYLYEKP VLTPGEAKMI ITNKLEPYIF PW
|
| |