Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0892 |
Symbol | |
ID | 4461849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 968073 |
End bp | 969080 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639699911 |
Product | thiamine biosynthesis protein |
Protein accession | YP_843320 |
Protein GI | 116754202 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0301] Thiamine biosynthesis ATP pyrophosphatase |
TIGRFAM ID | [TIGR00342] thiazole biosynthesis/tRNA modification protein ThiI |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.122669 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACATGTG AGGGCGGGAG GATATACGTC CACACCTCGG ATGAAAGAGC GCCCTCCACG ATCTCCAAAG TTTTTGGCGT GGTTTCGGTC AGCCCCGCCT ACAGCGTGAG TCCCAGGATG GCTGATATCT CCAAGCTCGC TGTGGATATC GCAATGATGC GCTCGCCGGG GAGCTTTGCC ATAAGGGCGA GGCGGGCTGG AGGTGAGATG CCCAGCGGGA GGATAGCCGT GGAGGTCGGA GCTGCGGTCC AGCGTGCCAC CGGTGCCGAT GTCGATCTTG ATAATCCGGA TCTTGAGATA TTCATCGAGG CCCGCCCGGA CAGGGCGCTG GTCTTCACAG AGATCGTGAG GGGCGTTGGA GGCCTCCCCC TGGGATCGCA GTCCAGGATG CTCGCGCTGA TCTCCGGCGG CATAGACTCT CCAGTGGCGG CATGGCTTAT CATGCGTCGC GGATGCCCTG TAGCGCTGCT CCACCTCGAT GTCTCGCCAT ATGCTGACTC CATAGAGCAG GTGGTCAGAC AGGCAGAGGT GCTTCAGAGC TGGATGTCTG GAAGGCGGCT CGATCTGGCG ATCGTGCGCA ATGCACGCGC GATCGAGATG ATCTCATCAA GATACCCGAG GGAGACCTGC GTTCTGTGCA GGCGTCTGAT GTACCACATC TCCACTCTCG TGATGAAGCG GCTGAGAGCA AAGGGCATCG TGACAGGATA CTCCCTCGGG CAGGTGGCGT CCCAGACCCC TGAGAACATA ATGGCAGAGC AGGTTGGGAT CGAGGCGCCA GTGTACCATC CACTGATCGC GATGGACAAG ACCGAGATCA TCGAGCTGGC GCGCAGGATA GGCACATACG ACATATCAAT CGGATCACAG CAATGCCGGG CAGCGCCGAA GAAGCCTGTG ACGAGGGCAA GGTTAGAGGA GATACTGAGA ATCGAGGGGG AGCTTGGTCT GAGAGATCTC GCAATGGAGC TCGCTGACGG CGTGGAGATT GTAAGAATAA GGAGATAA
|
Protein sequence | MTCEGGRIYV HTSDERAPST ISKVFGVVSV SPAYSVSPRM ADISKLAVDI AMMRSPGSFA IRARRAGGEM PSGRIAVEVG AAVQRATGAD VDLDNPDLEI FIEARPDRAL VFTEIVRGVG GLPLGSQSRM LALISGGIDS PVAAWLIMRR GCPVALLHLD VSPYADSIEQ VVRQAEVLQS WMSGRRLDLA IVRNARAIEM ISSRYPRETC VLCRRLMYHI STLVMKRLRA KGIVTGYSLG QVASQTPENI MAEQVGIEAP VYHPLIAMDK TEIIELARRI GTYDISIGSQ QCRAAPKKPV TRARLEEILR IEGELGLRDL AMELADGVEI VRIRR
|
| |