Gene Tneu_0223 details


Gene Information

Locus tag: Tneu_0223
Symbol:
ID: 6166202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 198483
End bp: 199577
Gene Length: 1095 bp
Protein Length: 364 aa
Translation table: 11
GC content: 60%
IMG OID: 641667388
Product: radical SAM domain-containing protein
Protein accession: YP_001793624
Protein GI: 171184705
COG category: [H] Coenzyme transport and metabolism; [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
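
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not appear in the protein. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 198483
end_bp = 199577

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1095, matching "Gene Length"
print(protein_length)  # 364, matching "Protein Length"
```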


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.772616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTAATAC TTATAAGACG GATACGGCAG GTGGCTGTGG AGTATAAGAG GGGGGATATC 
GAGCGTCTCT TGAAGGAGGA TTTGTGGGTT TTGGGGAGGA GGGCCTACGA GATCAGGCGG
AGGCTGTACG GGGATAGGAC GACCTTCATC TCCAACATGG TGCTCAACTA CACAAACGTC
TGCGTAATCG GCTGCTCCTT CTGCGCCTTC TACCGGCCGC CTGGACACCC GGAGGCCTAC
GGCTATACGC CGGAGGAGGC CGCGAAGCGT GTGTTAGCCG TAGACGCTAG ATACGGCATT
AGGCAGGTCC TGATCCAGGG CGGGATTAAC CCGGAGATCG GCATTGAGTA CTTCGAGGAG
CTCTTCCGCG CGATAAAGAG GAGGGTCCCC CACGTGGCTA TCCACGCGCT ATCGCCCCTG
GAGGTGGACT ACCTCTCGCG GAGGGAGCGC GCCACCTACA GGGAGGTGCT GGAGCGCCTG
AGGGAGGCCG GCATGGAGTC CATGCCGGGG GGCGGCGGAG AGATACTGGT GGATAGGGTG
AGGAGGGAGC TGGCGCCGCG GAAGATAGAC AGCGCCACTT GGCTCAGAAT TATGGAGGAG
GCGCATAAGC TCGGCATCCC TACGTCGGCT ACCATGATGT ACGGCCACGT GGAGACCCTC
AGCGATATCG CGGAGCACCT CTACAAAATC GCAGAGCTGC AGGAAAAGAC GAGGGGCTTC
ATGGCGTTTA TAGCCTGGAA CTTCGACCCC GGCACCAGCG AGCTTGGCAA GCGCGTTAGG
TACCCAAAGA CCTCGGCCTC GCTTCTGAGG ATGGTCGCCG TGGCGAGGAT AGTGTTCAGG
GAGTTGATCC CCCACATCCA GAGCGGTTGG CTCACCACGG GGCCCGAGAC CGCCCAGCTG
GCCATGTACT TCGGGGCGGA CGACTTCGGG GGGACCCTAT ACGAGGAGAA GGTGCTTGAA
TGGAAGCGCG CCGAGGCGCA GATAGACAGG AGGGAGGACG TGGTAGATAT CATAAGGTCG
GCGGGCTTCA CCCCAGCGGA GCGGGACAAC ATGTACAACG TCGTGAAGGT ATATGGCCAA
GGTGGTCAGG ATTAG
 
Protein sequence
MLILIRRIRQ VAVEYKRGDI ERLLKEDLWV LGRRAYEIRR RLYGDRTTFI SNMVLNYTNV 
CVIGCSFCAF YRPPGHPEAY GYTPEEAAKR VLAVDARYGI RQVLIQGGIN PEIGIEYFEE
LFRAIKRRVP HVAIHALSPL EVDYLSRRER ATYREVLERL REAGMESMPG GGGEILVDRV
RRELAPRKID SATWLRIMEE AHKLGIPTSA TMMYGHVETL SDIAEHLYKI AELQEKTRGF
MAFIAWNFDP GTSELGKRVR YPKTSASLLR MVAVARIVFR ELIPHIQSGW LTTGPETAQL
AMYFGADDFG GTLYEEKVLE WKRAEAQIDR REDVVDIIRS AGFTPAERDN MYNVVKVYGQ
GGQD
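
The record lists translation table 11, and the gene sequence starts with GTG while the protein starts with M. A minimal sketch of how the two sequences relate is given below, using only short excerpts of the gene sequence (its first 60 and last 15 bases). It assumes that an initiator codon at position 0 is read as methionine and that translation stops at the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only three of the table 11 initiator codons are handled.

```python
# Standard codon-to-amino-acid assignments, which NCBI tables 1 and 11 share.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only the three most common table 11 initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiator codon at the start is read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_60 = "GTGTTAATAC TTATAAGACG GATACGGCAG GTGGCTGTGG AGTATAAGAG GGGGGATATC"
last_15 = "GGTGGTCAGG ATTAG"

print(translate(first_60))  # MLILIRRIRQVAVEYKRGDI
print(translate(last_15))   # GGQD
```

Applying `gc_fraction` to the full gene sequence would give the value to compare with the reported GC content. Note that `translate` treats its input as starting at a codon boundary, so the last 15 bases are translated as an internal fragment.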