Gene Information |
Locus tag | Tneu_0222 |
Symbol | |
ID | 6165941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 196484 |
End bp | 197545 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641667387 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001793623 |
Protein GI | 171184704 |
COG category | [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
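The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 196484
end_bp = 197545

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp)      # 1062
print(protein_length_aa)   # 353
```

Under these assumptions the computed values match the reported Gene Length (1062 bp) and Protein Length (353 aa).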
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.255233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGGCGA GGGGGCTTTC CAGGAACGAC GCCGTTTATT TAATGCGTGA GGTTGACCTC TTCACTCTTG CGGAGGCGGC CCACGCCGTG ACGCAGAAGT TCTATGGCGA CGTCGTGACG TTTGTCAACA ACGTGGTGAT AAACTATACG AATATCTGCG TCGCCAAGTG CCCCATATGC GCCTTCTACA GATCCCCGGG CCACCCCGAG GCCTACACCC GTAAGGCTGA GGAGGTGGCG GCTCTCGTGG AGCGGTTCGC CGTCGAATAC GGCGTTACTG AGCTACACAT CAACGGGGGC TTCAACCCCC TCCTCCCGCC GGAGTACTTC GACGAGCTGT TCAAGGCCGT TAAACGCCGC GTGCCCCACG TCGTGGTTAA GGGCCCCACC ATGGCCGAGG CCGCCTACTA CGCCGGGCTG TGGAAGATGA GCGTGAGGGA GGTGCTCTCC AGGTGGAAAG AGGCGGGCCT CGACGCCATC TCGGGAGGCG GCGCCGAGAT ATTCGCCGAT GAGGTGAGAA AAGTAGTGGC TCCCCACAAG ATCTCCGGCG AGGAGTGGCT CAGGGTAGCT GAGGTGGCCC ACGAGCTGGG CATACCGAGC AACGCCACAG TTCTCTACGG CCACGTAGAG GCAGTGGAGC ATCTGGTAGA CCACATCTTC AGGGTGAGGG AGCTCCAGGA GAAGACGGGG GGCCTCCTCC TCTTTATACC CGTCAAGTTC AACCCCCTAA ATACGGAGCT CCACAGAAGG GGGGTGGTAA AGGCCCCAGC GCCCTCCACC TACGACGTGA AGGTGGTGGC GTTGGCTAGG CTGATCCTGC TGGATAGGCT TAAGGTAGCC GCCTACTGGC TGTCGGTCGG CAAGAAGCTG GCCTCTACCC TCCTGCTGGC GGGGGCAAAC GACTTGGTGG GCACCATGTA CAACGAGGCT GTTTTGACGT CGGCTGGGGC AAAACACAGC GCTACCGTGG AGGAGCTCGC AGCCATCGCC AGGGAGGCCG GGAAGACGCC GGCTCTACGC GACACCTTCC ACCGCAGGAT AAAGCCCCTA GATGGGCCCT AG
Protein sequence | MTARGLSRND AVYLMREVDL FTLAEAAHAV TQKFYGDVVT FVNNVVINYT NICVAKCPIC AFYRSPGHPE AYTRKAEEVA ALVERFAVEY GVTELHINGG FNPLLPPEYF DELFKAVKRR VPHVVVKGPT MAEAAYYAGL WKMSVREVLS RWKEAGLDAI SGGGAEIFAD EVRKVVAPHK ISGEEWLRVA EVAHELGIPS NATVLYGHVE AVEHLVDHIF RVRELQEKTG GLLLFIPVKF NPLNTELHRR GVVKAPAPST YDVKVVALAR LILLDRLKVA AYWLSVGKKL ASTLLLAGAN DLVGTMYNEA VLTSAGAKHS ATVEELAAIA REAGKTPALR DTFHRRIKPL DGP
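The gene and protein sequences above can be related to each other, and to the reported GC content, with a short script. The sketch below is one possible approach, not the database's own method. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG even though the gene lies on the minus strand), and it relies on translation table 11 assigning the same amino acids to codons as the standard code, differing only in which start codons are permitted. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first three and last two 10-base blocks of the gene sequence are used to keep the example short.

```python
# Amino acid assignments for all 64 codons in TCAG order (standard code;
# translation table 11 uses the same assignments for elongation and stop codons).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces between 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence in this record.
first_blocks = "ATGACGGCGA GGGGGCTTTC CAGGAACGAC"
last_blocks = "GATGGGCCCT AG"

print(translate(first_blocks))  # MTARGLSRND
print(translate(last_blocks))   # DGP
```

The two printed fragments match the first and last residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_content` should, under the same assumptions, give the complete protein and a GC fraction to compare against the reported 61%.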