Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1575 |
Symbol | |
ID | 5104020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1524514 |
End bp | 1525482 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640507461 |
Product | phosphomevalonate kinase |
Protein accession | YP_001191654 |
Protein GI | 146304338 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG3890] Phosphomevalonate kinase |
TIGRFAM ID | [TIGR01220] phosphomevalonate kinase, ERG8-type, Gram-positive branch |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00534382 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCAAAGCT TTGAAGTGAG TGCTCCGGGG AAGGTCCTGT GGATCGGGAG CTACTCGGTG GTATTTGGTG GGATATCGCA TGTCATTGCA ATTGACAAGA GGGTTAGGTG TAGATGTGAG GAGTCGGAAA GACTAGAATT TATAACGTCG TATGGTAACT TCTCCGAAGG CCAGAACGAA CTCATAGACA GTGTGCTCAA CGAGGTTAGA ACAATCTACG ATATTCCCCG CTTGAGGGTA TATCTCATAA ACGATCCTGC ATTTCAGATA GATGGGAAGA AAACTGGGTT AGGGAGCTCA TCTGCAGGGA CAGTGGCATT GACGGCTTGC CTCAGTTACG CCGTAACTGG AAAGTTTGAT GTGGACCTGG TGTACAAACT TTCACAGAGG GCGAACTATA GACGACAGAA GGGAATTGGT AGCGGTTTCG ATATAGCTGC GGCGACCTAC GGTAGCGTGA TATACAGGAG ATACAATGAC ATCAACAAGG TTGATTCTGT GGTGGAAAGG CTAGACATCC CACAAAACAT ACAGATACTT CTAGGTTTTA CGGGAAGGAG TGCCAGCACA GTGAATCTAG TAAGGAGGTT TGAGGACACC AAGAACAATC CAAGGTTTAA GGAACTAATG AGTGAGATTG AGATAGATAA TGAAATCGCA ATAAAGCTGT TGAGGTTAGG GAAAATTGAT GCTGCAGTTC CACACATAAA GCTGGCTAGA CAGAACTTGA ACCTGCTTTC GAAAGAGGTG GTAGGAGTGG AAATTGAAAC AGAAGAGGAT AGAAAGCTCA TGAGCTTAGC CGAGAAAAAC GGCGCCTTGA TATCCCTGAT GCCCGGGGCA GGTGGAGGAG ATCTCATACT AGCTCTAGGG GAGAACCTTG CGAGAGTCAA GGAGACTTGG GAGAGGATGA GCATTAGAAC CATTAATGTG AAACAAGATG AAGGTGTGAA AATTGAAGCT AGAAGCTGA
|
Protein sequence | MQSFEVSAPG KVLWIGSYSV VFGGISHVIA IDKRVRCRCE ESERLEFITS YGNFSEGQNE LIDSVLNEVR TIYDIPRLRV YLINDPAFQI DGKKTGLGSS SAGTVALTAC LSYAVTGKFD VDLVYKLSQR ANYRRQKGIG SGFDIAAATY GSVIYRRYND INKVDSVVER LDIPQNIQIL LGFTGRSAST VNLVRRFEDT KNNPRFKELM SEIEIDNEIA IKLLRLGKID AAVPHIKLAR QNLNLLSKEV VGVEIETEED RKLMSLAEKN GALISLMPGA GGGDLILALG ENLARVKETW ERMSIRTINV KQDEGVKIEA RS
|
| |