Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0223 |
Symbol | |
ID | 6166202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 198483 |
End bp | 199577 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641667388 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001793624 |
Protein GI | 171184705 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.772616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTAATAC TTATAAGACG GATACGGCAG GTGGCTGTGG AGTATAAGAG GGGGGATATC GAGCGTCTCT TGAAGGAGGA TTTGTGGGTT TTGGGGAGGA GGGCCTACGA GATCAGGCGG AGGCTGTACG GGGATAGGAC GACCTTCATC TCCAACATGG TGCTCAACTA CACAAACGTC TGCGTAATCG GCTGCTCCTT CTGCGCCTTC TACCGGCCGC CTGGACACCC GGAGGCCTAC GGCTATACGC CGGAGGAGGC CGCGAAGCGT GTGTTAGCCG TAGACGCTAG ATACGGCATT AGGCAGGTCC TGATCCAGGG CGGGATTAAC CCGGAGATCG GCATTGAGTA CTTCGAGGAG CTCTTCCGCG CGATAAAGAG GAGGGTCCCC CACGTGGCTA TCCACGCGCT ATCGCCCCTG GAGGTGGACT ACCTCTCGCG GAGGGAGCGC GCCACCTACA GGGAGGTGCT GGAGCGCCTG AGGGAGGCCG GCATGGAGTC CATGCCGGGG GGCGGCGGAG AGATACTGGT GGATAGGGTG AGGAGGGAGC TGGCGCCGCG GAAGATAGAC AGCGCCACTT GGCTCAGAAT TATGGAGGAG GCGCATAAGC TCGGCATCCC TACGTCGGCT ACCATGATGT ACGGCCACGT GGAGACCCTC AGCGATATCG CGGAGCACCT CTACAAAATC GCAGAGCTGC AGGAAAAGAC GAGGGGCTTC ATGGCGTTTA TAGCCTGGAA CTTCGACCCC GGCACCAGCG AGCTTGGCAA GCGCGTTAGG TACCCAAAGA CCTCGGCCTC GCTTCTGAGG ATGGTCGCCG TGGCGAGGAT AGTGTTCAGG GAGTTGATCC CCCACATCCA GAGCGGTTGG CTCACCACGG GGCCCGAGAC CGCCCAGCTG GCCATGTACT TCGGGGCGGA CGACTTCGGG GGGACCCTAT ACGAGGAGAA GGTGCTTGAA TGGAAGCGCG CCGAGGCGCA GATAGACAGG AGGGAGGACG TGGTAGATAT CATAAGGTCG GCGGGCTTCA CCCCAGCGGA GCGGGACAAC ATGTACAACG TCGTGAAGGT ATATGGCCAA GGTGGTCAGG ATTAG
|
Protein sequence | MLILIRRIRQ VAVEYKRGDI ERLLKEDLWV LGRRAYEIRR RLYGDRTTFI SNMVLNYTNV CVIGCSFCAF YRPPGHPEAY GYTPEEAAKR VLAVDARYGI RQVLIQGGIN PEIGIEYFEE LFRAIKRRVP HVAIHALSPL EVDYLSRRER ATYREVLERL REAGMESMPG GGGEILVDRV RRELAPRKID SATWLRIMEE AHKLGIPTSA TMMYGHVETL SDIAEHLYKI AELQEKTRGF MAFIAWNFDP GTSELGKRVR YPKTSASLLR MVAVARIVFR ELIPHIQSGW LTTGPETAQL AMYFGADDFG GTLYEEKVLE WKRAEAQIDR REDVVDIIRS AGFTPAERDN MYNVVKVYGQ GGQD
|
| |