Gene HY04AAS1_0810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_0810 
Symbol 
ID6743616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp751822 
End bp752916 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content36% 
IMG OID642750611 
ProductRadical SAM domain protein 
Protein accessionYP_002121475 
Protein GI195953185 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0537641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGAGC TTTTAGAGGA TAAAGAAGTA TCTTTTCTTT TGGAAAAAGC CTTAAACAAA 
GAAAGATTGA CAGAGGAAGA GGCTGTTTTT TTATATAAAA ATGCTCCGTT AAGCGCTTTA
GGGTATATAG CAAACGAGTT AAACAAAGAA AAAAATCAAG ACAGAGCTTT TTTTATAGTA
AACAGATATT TAAACCCTAC AAACATATGT GTATACAAAT GCAAATTTTG TAGCTTTGGT
GTTTCAAAGT CAGACGAAAG AGCTTTTGAA CTTAGCATAG GCGAGGTGTT AAGAAAAATA
GAAAACTCTT ACAAAAACGG TATAACAGAG GTACACATAG TTGGCGGATT GCCACCGCAT
TGGGAAAGAG AAGATTACGT AAACCTAATA AAAGTTGTCA AAGAAAACTT TCCAAACATA
GTCATAAAAG CTTATACAGC GGTGGAAATA GACCACATAG CAAAAATATC AAAATCTACT
TACGAAGATG TGCTCCTTGA ATTAAAAGAA GTTGGCTTGT CTTTATTGCC AGGAGGCGGT
GCTGAAATAT TTGCCGATAG GGTGAGAAAC ATAATAGCAC CAAACAAAGC CAACGCGGAA
GAATACCTTG AAATACATGA AACTGCCCAT AGACTAGGTA TACCATCAAA CGTTACGATG
CTTTATGGAC ATATAGAAAC CATAGAAGAA AGAGTAGATC ATATGAAAAG AATAAGAGAT
TTGCAGGGCA AAACCGGAGG TTTTCAAGTA TTTATACCTT TAGCTTATCA TCCAAAAGGG
ACATCTCTCG GTGGCGAGAG GACATCTTCT GTGGATGATC TTAAAACCAT AGCGATGTCA
AGGATTTTTC TAGATAACTT CGATAACATA AAAGCATATT GGATAACCTT AGGAGAAAAG
TTAGCTCAGA TAGCTCTAAA TTTTGGCGCA AACGATATAG ATGGAACTTT AGAAGAAGAA
CTCGTTGTGC ATGCGGCTGG TTCTACAGAA ACTTACGGTA AAACGGTAGA CAAGCTTGTA
AGCATTATAA AAGGAGCTTC CAAGATTCCT GTACAAAGAG ACTCCTTTTA TAATATAATA
AAAGTTTATA ATTGA
 
Protein sequence
MIELLEDKEV SFLLEKALNK ERLTEEEAVF LYKNAPLSAL GYIANELNKE KNQDRAFFIV 
NRYLNPTNIC VYKCKFCSFG VSKSDERAFE LSIGEVLRKI ENSYKNGITE VHIVGGLPPH
WEREDYVNLI KVVKENFPNI VIKAYTAVEI DHIAKISKST YEDVLLELKE VGLSLLPGGG
AEIFADRVRN IIAPNKANAE EYLEIHETAH RLGIPSNVTM LYGHIETIEE RVDHMKRIRD
LQGKTGGFQV FIPLAYHPKG TSLGGERTSS VDDLKTIAMS RIFLDNFDNI KAYWITLGEK
LAQIALNFGA NDIDGTLEEE LVVHAAGSTE TYGKTVDKLV SIIKGASKIP VQRDSFYNII
KVYN