Gene HY04AAS1_0090 details

Gene Information

Locus tag: HY04AAS1_0090
Symbol: purH
ID: 6742873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 84037
End bp: 85557
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 37%
IMG OID: 642749874
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002120760
Protein GI: 195952470
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGCAC TTATATCTGT ATACGATAAA ACCGGTATTT TAGAACTGGC TAAGGAGCTT 
TTGAACCAAG GGTATGAGAT TCTATCCAGT GGTGGGACAT ACACTTATCT AAAAAATGCT
GGTGTTGATG CCATAGAGGT ATCTGAGGTA ACAGGTTTTA GAGAGATTTT AGGTGGTAGA
GTTAAGACGC TTCATCCCGC TATACACGGT GGTATTTTGT TTAGAGAAGA CGTAGAAAAA
GATTTAGAAG AAATAAAAGA AAACTCTATA GAACCTATTG ATATCGTAGT GGTGAACTTG
TATCCTTTTG AAAAGAAGAT GAAAGAGCTA AAAGATATAG ATGCTCTTGT GGAGTTTATA
GACATAGGGG GTCCTACTCT TGTAAGAGCC GCTGCTAAAA ATCACAAACG AGTTAGTGTA
CTTACAGATA TCGAAGATTA CGGATGGTTT ATAGAAAAGC TCAAAATGAA TGCTGTATCT
CAGCAAGACA GAAAATACTT AGCTTTGAAA GCTTTTTGGT TAACATCTTA CTATGATGCT
GTTATAGCTA GTTATTTTTC CAAAGTATTT GGCTTTTCAG AAAAAGATTT TAAGCATCAT
ACTGTACCTA TGTTTTTGAG AGATGAATTG AGATACGGTG AAAATCCACA TCAGCAAGCT
TACCTATATG AAAATCCGTT GGAAGAAAAT GGTATTGTAA GAGCTGATGT GCTTCAAGGT
AAAAAGATGT CTTACAACAA CTATCTTGAT GCCGATTCTG TTGTAAAGCT CATGTCAGAG
TTTTCTAATC CTTGCTGTGC TATCGTAAAA CACAACAATC CAAGCGGTAT AACCACAGAC
AACAATATTC TGGAAGCTTA CAAAAAAGCT TTTCAATGTG ATCCTGAGGC GGCGTTTGGT
GGTATCGTAG CTTTTAACAA GGTTGTAGAT AAAGACGTTG CTAAGGCTAT TACAGAGCAT
TTTTATGAGA TAGTTATAGC TCCTGAGTTT ACCGAAGAAG CTGTTGAAGA GTTTTCCAAA
AAGAAGAATT TAAGGTTAGT AAGGTATAAA AACTACAATC AGAATATTGA TCTAAGGAGT
ATATCTGGCG GGTTTTTAGT GCAGGACATA GATGATAAGC TTTATGAGTC TATAGAGATA
GTTTCATTGA GAAGACCTAC TGAGCAAGAG TTAGAGGATG CTATATTTGC TTGGAAAGTG
GCAAAATGGA CAAAATCAAA TGCTATAGTG ATAGCTAAAA ACAATCAAAC CATAGGTATA
GGCGCTGGTC AGGTGTCGAG GGTAGATTCT CTTAGAAGCG CTATAAGAAA AGCAAAAAAC
TTTTCCCATG ATTTAAAAGG CGCCGTAGTG GCCTCAGACG CTTTTTTCCC GTTTAGAGAT
AGCATAGATA TAGCTGCTGA AGAAGGAATA TCTGGTACAA TACAACCTGG TGGTTCTATA
AGAGACAAAG AAGTTATAGA GGCTGTAAAT GAGCATAACA TGTTTATGAT ATTTACCCAT
ATGAGGCATT TCAGACATTG A
 
Protein sequence
MRALISVYDK TGILELAKEL LNQGYEILSS GGTYTYLKNA GVDAIEVSEV TGFREILGGR 
VKTLHPAIHG GILFREDVEK DLEEIKENSI EPIDIVVVNL YPFEKKMKEL KDIDALVEFI
DIGGPTLVRA AAKNHKRVSV LTDIEDYGWF IEKLKMNAVS QQDRKYLALK AFWLTSYYDA
VIASYFSKVF GFSEKDFKHH TVPMFLRDEL RYGENPHQQA YLYENPLEEN GIVRADVLQG
KKMSYNNYLD ADSVVKLMSE FSNPCCAIVK HNNPSGITTD NNILEAYKKA FQCDPEAAFG
GIVAFNKVVD KDVAKAITEH FYEIVIAPEF TEEAVEEFSK KKNLRLVRYK NYNQNIDLRS
ISGGFLVQDI DDKLYESIEI VSLRRPTEQE LEDAIFAWKV AKWTKSNAIV IAKNNQTIGI
GAGQVSRVDS LRSAIRKAKN FSHDLKGAVV ASDAFFPFRD SIDIAAEEGI SGTIQPGGSI
RDKEVIEAVN EHNMFMIFTH MRHFRH
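
The listed lengths are consistent with each other: End bp - Start bp + 1 = 85557 - 84037 + 1 = 1521 bp, and 1521 / 3 = 507 codons, one of which is the terminal stop codon, giving 506 aa. A minimal Python sketch of that check follows. The helper names (clean_sequence, gc_fraction, expected_protein_length) are hypothetical and are not part of the source database; only the first row of the gene sequence is used, as a small sample.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group a sequence into 10-mers."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in a CDS minus the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


# Coordinates and length taken from the record above.
start_bp, end_bp = 84037, 85557
gene_length = end_bp - start_bp + 1

# First row of the gene sequence, copied from the record, as a small sample.
first_row = "ATGAGAGCAC TTATATCTGT ATACGATAAA ACCGGTATTT TAGAACTGGC TAAGGAGCTT"
sample = clean_sequence(first_row)

print(gene_length)                           # span covered by the coordinates
print(expected_protein_length(gene_length))  # codons excluding the stop
print(len(sample))                           # bases in the sample row
print(f"{gc_fraction(sample):.2f}")          # GC fraction of the sample row only
```

Passing the whole "Gene sequence" block through clean_sequence and gc_fraction in the same way would give a value to compare against the reported GC content.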