Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0646 |
Symbol | |
ID | 5103806 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 591207 |
End bp | 592544 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640506550 |
Product | phosphomethylpyrimidine kinase |
Protein accession | YP_001190745 |
Protein GI | 146303429 |
COG category | [H] Coenzyme transport and metabolism [S] Function unknown |
COG ID | [COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase [COG1992] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00097] phosphomethylpyrimidine kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0762615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.306983 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAAAGC CCGTAGTTAT GGCCATTGGC GGATTTGACA GTGGAGGAGG GGCGGGGGTA GAGAGCGACA TTAAGGTTTT GGAGTCAATA GGTGTTCATG GGGTTGGTGC AATAACCGCA GTCACAGCTC AGAACACCTT AGGGATTAAG CACGTCACTG TTGTAGACCA CAACTCTCTC AGGAAACAGA TTGAGACCCT TCTGGAGGAC TTCAAGGTGA GCTCAGGTAA GACTGGAATG ATTGTTAACG GTGAGCAAAT GAAAGTCGTA TTTGAGGCGG TGAATTTCCC ACTGGTGGTG GATCCGGTAA TCTATGCTAA GGATGGGACA AAGCTCATAG AGGATCTAGA AGCCTTCAAA AAGTTTCTTC TCCCTCGGGC CACGGTGATA ACTCCCAACG CCGTAGAGGC GGGTATACTC CTCGGGATGA AGGTAGAAAC TCTGCAGGAT CAGATAACTG CATCAAAGCT CATTCACGAG AGGTTCTCAG TGCCCTACGT GGTGGTGAAA GGGGGACACG TGAAGTCCTC AGAGAGCGTT GATGTGCTGT ACGATGGCAA AGAGGTGATC CAGTTGAGTT CCCCAAGATT GCCCGGAAGG AATACCCACG GAACCGGGAG CATTTTCGCA TCGAGCATTG CCGGAATGCT AGCCAAGGGG TTTCCTATGA AGGAGGCGGT AAGACGGGCC AAGAGTATCA CGGAGGAAAG TATTCGATAT GGTCTTGAAA TAGGCAGGGG AATAGGGCCA GCGGATCCCA TGGTCCCGCT GGAGAAGATA GCCATGAAGG CAGGGGTTAT GAAGGATATG GAGATTTTTG CCGAGTTCGT TGAGAGAGAA AAAAACTTCT ATCTCCTAGT GCCAGAGGTT CAGTCAAATC TTGCCCATTC CATTGACCCG AAATACGTTA CCGGAATTGA GGACATAGCC ACGTTCAGGG GAAGGATCAT CAGGGAGTGG GGTGGAAGGG TGAGGGTAGG GTTCCCAGTG GCCTTCGGCT ATCCCACACA CACTGCAAGA TTACTATTGT CAATAATAAA TAAACAGGGA GTTGGGGATA CGCTCATTAA CATTAGATAC GATCCCAAGA TTGTGGAATT ATTGAAGAGA ATCGGATACG AGGTTGTGGA GGTCCATAGG GAGCTAGAGC CCCAAGGTCA AGAGGGGAAA ACCATGAGTT GGATAGTGGA TCACGTTTAC GAGAGCTTGG GCAAAATTCC TAACGTGATT TTCGATAGGG GGATGATAGG CAAAGAGGCT ATGATAAGGC TCTGGACTTC ATCAATAGAG GAAATGATGG AATCCCTGAC CAGTCTTTTG AGGGAGATAG GGAAATGA
|
Protein sequence | MKKPVVMAIG GFDSGGGAGV ESDIKVLESI GVHGVGAITA VTAQNTLGIK HVTVVDHNSL RKQIETLLED FKVSSGKTGM IVNGEQMKVV FEAVNFPLVV DPVIYAKDGT KLIEDLEAFK KFLLPRATVI TPNAVEAGIL LGMKVETLQD QITASKLIHE RFSVPYVVVK GGHVKSSESV DVLYDGKEVI QLSSPRLPGR NTHGTGSIFA SSIAGMLAKG FPMKEAVRRA KSITEESIRY GLEIGRGIGP ADPMVPLEKI AMKAGVMKDM EIFAEFVERE KNFYLLVPEV QSNLAHSIDP KYVTGIEDIA TFRGRIIREW GGRVRVGFPV AFGYPTHTAR LLLSIINKQG VGDTLINIRY DPKIVELLKR IGYEVVEVHR ELEPQGQEGK TMSWIVDHVY ESLGKIPNVI FDRGMIGKEA MIRLWTSSIE EMMESLTSLL REIGK
|
| |