Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0019 |
Symbol | |
ID | 5104248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 17075 |
End bp | 18016 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640505913 |
Product | AIR synthase-like protein |
Protein accession | YP_001190120 |
Protein GI | 146302804 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.213041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00139533 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAAGCTAA AGGAGTTAGG AGAGCACAGT TTCATTGAAA CAGTCATTTC CAAGTACGTA AATTCGGATG TGAACCTGGA CGTCTACAGG CAAGGAAATG TTGTTCTGAA AATAGACGGA TTTCCTATTA AATATACAAT GCCCTTTATG GACTTTTATG ATCTAGGATG GAAGGCAGTA GTTGCCGCTA TGAGTGATCT AGTCTCCTAT GGCGCTACAC CTCTAGTTGT TCTATCGTCT TTCGGGCTTA GTCCTGAAAT AGAGTCTAAG AACGCCGAGA TGATGATAAT GGGAATAAGT GACGCCTCAA AATATTACGG AGCTACCTAT GGCGGAGGGG ATACAAATTC TTCAGAAGAC TCTGGCTGGA TCGATATAGC GATATTTGGT AATCCTGTTT GTAACACAAG ACCCAAGGCA GAACCCGGAG ACTTAGTTTA TGTTACAGGA GAGTTAGGAA GAACTACAGG CGCCTTTCTC TGGTATAGTT CGGGTGGGAA GTTTCCCCTA CCCTTAGACT CAGTGATCAA GCTTAGACAT CCTGTAATCA ACAGGGCCAT TTTGCGTGCT CATAGGGAGC TCTGTAGCGT TGTGGCACTT GGAACTGATG TGAGTGATGG GATTCTAGTT AGTCTTAACA AAATAGCCCA CTATATTGGA CACGGAATTG ACATTGCTAA TATTCCATTA GTGGAATATA TACAGGAGTT AGTTGATAAG AACATAGTTA GTCTAAATGA AATATTAAAA TCTGGAGGAG AAGAATATGA GACGATCTTC GTAGTGAAGA AGGAGACATC ATCCATTTTT CTGGACGCAA TGAAAAGGTA TGGCGTGATC GTTAAAGAAA TTGGAAGAGT AATTGATAGT GAACCAGAAA TTAGATTGAA TTCAAAGAAA TACGAGGTTC ACGGTTGGGA TAACTTCAAG GGTTGGTTTT AG
|
Protein sequence | MKLKELGEHS FIETVISKYV NSDVNLDVYR QGNVVLKIDG FPIKYTMPFM DFYDLGWKAV VAAMSDLVSY GATPLVVLSS FGLSPEIESK NAEMMIMGIS DASKYYGATY GGGDTNSSED SGWIDIAIFG NPVCNTRPKA EPGDLVYVTG ELGRTTGAFL WYSSGGKFPL PLDSVIKLRH PVINRAILRA HRELCSVVAL GTDVSDGILV SLNKIAHYIG HGIDIANIPL VEYIQELVDK NIVSLNEILK SGGEEYETIF VVKKETSSIF LDAMKRYGVI VKEIGRVIDS EPEIRLNSKK YEVHGWDNFK GWF
|
| |