Gene Msed_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0184 
Symbol 
ID5103928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp148170 
End bp149381 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content45% 
IMG OID640506089 
Productphosphomethylpyrimidine kinase 
Protein accessionYP_001190285 
Protein GI146302969 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0351] Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase 
TIGRFAM ID[TIGR00097] phosphomethylpyrimidine kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000409945 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.070974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAA GACCCATTGC GATGACAATA GCAGGAAGTG ACTCAGGAGG AGGGGCTGGA 
GTTCAGGCAG ACCTCAAGAC CTTTACATCT CTTGGCGTGT TCGGCGTATC TGTCATAACT
GGTTTGACTG CCCAGAATAC AGCTAGAGTC ACCAAGGTCC TGGAAGTCCC CCCAGAGTTT
GTGGAGTCCC AATTCGATAC TATAATGGAG GACTTTCAGG TTAAGTATGC CAAGACAGGA
ATGCTTGCAT CGAGCAGGAT AGTTGACGCT GTGGAGAGAA AGCTGACACA ATATGGAATT
AACCTAGTCC TAGACCCAGT TATGATATCA AAGAGTGGTT ACCCTCTAGT AACCGAGGAA
GTGGTAAGGG ATATAGTGAG GCTAGCTAGA AAATCCCTGA TAATAACCCC CAACAAATAT
GAGGCGGAAA GACTAACTGG ATTTAGGATA AGAACTCGAG ATGATCTAAG AAATACAGCA
TTGCACCTCT ATAAGAGCTT GGGTGTAAAC GTTGTGGTAA AGGGAGGGAA AGCCATTGGA
GGATATGATT TCGCAGTCGT TGATGGCGAT GAGATTGAGC TACGTGGAGA ATTAATAAAT
ACCGATAATC TTCACGGGAG TGGTGATGTA TTTTCCGCCT CAATTACGGC CTTTCTGAGC
AAGGGTCTTA ATCTACGCGA CGCGTTAAGG GAGGCTAAGA AAGTTGTAAG TGAGGCAATC
AAATTCTCTC TTGCAATTGG TCACGGGAAC GGGCCAGTGG ATCCTTTCTC CTCTGTGGAG
AGGGTGGTTA AGATTAACCA AGCCAGGGAG GATCTCGAGA GACTTGTGGA ATTTCTTGAA
AGGAACAAGG AAATCGTTAA GAAAATGATA ACTCATGAGG AAAAAATGAA CATTGGTGTC
CTAACAGAGT ATGGGGATTT CGCAACTTTA GCCGGGGGGA TCATAAGGTA CATTGACTGG
ATTAAGGTAG ATGGTCCCAT TGTGGTGAAC TGGTACTACA ATATAGTACA CAAGGCCTTG
AAACAAACTG GCAAGAGGCT TGGTATTTTG GTGTCCTTGA CAAACGAGAT ATTAAATGCT
TGTGAGGGCG GTAAACTGAA AATTTCTGAA AGTGGAATTT ACGGCGATCT GGTAATGATA
GATGGGAGGG CAGTCTTGGT GGGAAACAGT TTAAGTGAGA TTATGGAGAA ACTGGAGGTC
CTGAGGAATT GA
 
Protein sequence
MMQRPIAMTI AGSDSGGGAG VQADLKTFTS LGVFGVSVIT GLTAQNTARV TKVLEVPPEF 
VESQFDTIME DFQVKYAKTG MLASSRIVDA VERKLTQYGI NLVLDPVMIS KSGYPLVTEE
VVRDIVRLAR KSLIITPNKY EAERLTGFRI RTRDDLRNTA LHLYKSLGVN VVVKGGKAIG
GYDFAVVDGD EIELRGELIN TDNLHGSGDV FSASITAFLS KGLNLRDALR EAKKVVSEAI
KFSLAIGHGN GPVDPFSSVE RVVKINQARE DLERLVEFLE RNKEIVKKMI THEEKMNIGV
LTEYGDFATL AGGIIRYIDW IKVDGPIVVN WYYNIVHKAL KQTGKRLGIL VSLTNEILNA
CEGGKLKISE SGIYGDLVMI DGRAVLVGNS LSEIMEKLEV LRN