Gene Tneu_0222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0222 
Symbol 
ID6165941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp196484 
End bp197545 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content61% 
IMG OID641667387 
Productradical SAM domain-containing protein 
Protein accessionYP_001793623 
Protein GI171184704 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.255233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGA GGGGGCTTTC CAGGAACGAC GCCGTTTATT TAATGCGTGA GGTTGACCTC 
TTCACTCTTG CGGAGGCGGC CCACGCCGTG ACGCAGAAGT TCTATGGCGA CGTCGTGACG
TTTGTCAACA ACGTGGTGAT AAACTATACG AATATCTGCG TCGCCAAGTG CCCCATATGC
GCCTTCTACA GATCCCCGGG CCACCCCGAG GCCTACACCC GTAAGGCTGA GGAGGTGGCG
GCTCTCGTGG AGCGGTTCGC CGTCGAATAC GGCGTTACTG AGCTACACAT CAACGGGGGC
TTCAACCCCC TCCTCCCGCC GGAGTACTTC GACGAGCTGT TCAAGGCCGT TAAACGCCGC
GTGCCCCACG TCGTGGTTAA GGGCCCCACC ATGGCCGAGG CCGCCTACTA CGCCGGGCTG
TGGAAGATGA GCGTGAGGGA GGTGCTCTCC AGGTGGAAAG AGGCGGGCCT CGACGCCATC
TCGGGAGGCG GCGCCGAGAT ATTCGCCGAT GAGGTGAGAA AAGTAGTGGC TCCCCACAAG
ATCTCCGGCG AGGAGTGGCT CAGGGTAGCT GAGGTGGCCC ACGAGCTGGG CATACCGAGC
AACGCCACAG TTCTCTACGG CCACGTAGAG GCAGTGGAGC ATCTGGTAGA CCACATCTTC
AGGGTGAGGG AGCTCCAGGA GAAGACGGGG GGCCTCCTCC TCTTTATACC CGTCAAGTTC
AACCCCCTAA ATACGGAGCT CCACAGAAGG GGGGTGGTAA AGGCCCCAGC GCCCTCCACC
TACGACGTGA AGGTGGTGGC GTTGGCTAGG CTGATCCTGC TGGATAGGCT TAAGGTAGCC
GCCTACTGGC TGTCGGTCGG CAAGAAGCTG GCCTCTACCC TCCTGCTGGC GGGGGCAAAC
GACTTGGTGG GCACCATGTA CAACGAGGCT GTTTTGACGT CGGCTGGGGC AAAACACAGC
GCTACCGTGG AGGAGCTCGC AGCCATCGCC AGGGAGGCCG GGAAGACGCC GGCTCTACGC
GACACCTTCC ACCGCAGGAT AAAGCCCCTA GATGGGCCCT AG
 
Protein sequence
MTARGLSRND AVYLMREVDL FTLAEAAHAV TQKFYGDVVT FVNNVVINYT NICVAKCPIC 
AFYRSPGHPE AYTRKAEEVA ALVERFAVEY GVTELHINGG FNPLLPPEYF DELFKAVKRR
VPHVVVKGPT MAEAAYYAGL WKMSVREVLS RWKEAGLDAI SGGGAEIFAD EVRKVVAPHK
ISGEEWLRVA EVAHELGIPS NATVLYGHVE AVEHLVDHIF RVRELQEKTG GLLLFIPVKF
NPLNTELHRR GVVKAPAPST YDVKVVALAR LILLDRLKVA AYWLSVGKKL ASTLLLAGAN
DLVGTMYNEA VLTSAGAKHS ATVEELAAIA REAGKTPALR DTFHRRIKPL DGP