Gene Msed_0019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0019 
Symbol 
ID5104248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp17075 
End bp18016 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content41% 
IMG OID640505913 
ProductAIR synthase-like protein 
Protein accessionYP_001190120 
Protein GI146302804 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.213041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00139533 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAAGCTAA AGGAGTTAGG AGAGCACAGT TTCATTGAAA CAGTCATTTC CAAGTACGTA 
AATTCGGATG TGAACCTGGA CGTCTACAGG CAAGGAAATG TTGTTCTGAA AATAGACGGA
TTTCCTATTA AATATACAAT GCCCTTTATG GACTTTTATG ATCTAGGATG GAAGGCAGTA
GTTGCCGCTA TGAGTGATCT AGTCTCCTAT GGCGCTACAC CTCTAGTTGT TCTATCGTCT
TTCGGGCTTA GTCCTGAAAT AGAGTCTAAG AACGCCGAGA TGATGATAAT GGGAATAAGT
GACGCCTCAA AATATTACGG AGCTACCTAT GGCGGAGGGG ATACAAATTC TTCAGAAGAC
TCTGGCTGGA TCGATATAGC GATATTTGGT AATCCTGTTT GTAACACAAG ACCCAAGGCA
GAACCCGGAG ACTTAGTTTA TGTTACAGGA GAGTTAGGAA GAACTACAGG CGCCTTTCTC
TGGTATAGTT CGGGTGGGAA GTTTCCCCTA CCCTTAGACT CAGTGATCAA GCTTAGACAT
CCTGTAATCA ACAGGGCCAT TTTGCGTGCT CATAGGGAGC TCTGTAGCGT TGTGGCACTT
GGAACTGATG TGAGTGATGG GATTCTAGTT AGTCTTAACA AAATAGCCCA CTATATTGGA
CACGGAATTG ACATTGCTAA TATTCCATTA GTGGAATATA TACAGGAGTT AGTTGATAAG
AACATAGTTA GTCTAAATGA AATATTAAAA TCTGGAGGAG AAGAATATGA GACGATCTTC
GTAGTGAAGA AGGAGACATC ATCCATTTTT CTGGACGCAA TGAAAAGGTA TGGCGTGATC
GTTAAAGAAA TTGGAAGAGT AATTGATAGT GAACCAGAAA TTAGATTGAA TTCAAAGAAA
TACGAGGTTC ACGGTTGGGA TAACTTCAAG GGTTGGTTTT AG
 
Protein sequence
MKLKELGEHS FIETVISKYV NSDVNLDVYR QGNVVLKIDG FPIKYTMPFM DFYDLGWKAV 
VAAMSDLVSY GATPLVVLSS FGLSPEIESK NAEMMIMGIS DASKYYGATY GGGDTNSSED
SGWIDIAIFG NPVCNTRPKA EPGDLVYVTG ELGRTTGAFL WYSSGGKFPL PLDSVIKLRH
PVINRAILRA HRELCSVVAL GTDVSDGILV SLNKIAHYIG HGIDIANIPL VEYIQELVDK
NIVSLNEILK SGGEEYETIF VVKKETSSIF LDAMKRYGVI VKEIGRVIDS EPEIRLNSKK
YEVHGWDNFK GWF