Gene HY04AAS1_0407 details


Gene Information

Locus tag: HY04AAS1_0407
Symbol:
ID: 6743201
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Hydrogenobaculum sp. Y04AAS1
Kingdom: Bacteria
Replicon accession: NC_011126
Strand:
Start bp: 353528
End bp: 354538
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 34%
IMG OID: 642750200
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_002121075
Protein GI: 195952785
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
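
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state the convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(353528, 354538)
    print(length, protein_length(length))
```

With the coordinates from this record, the sketch gives 1011 bp and 336 aa, matching the Gene Length and Protein Length fields.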


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACTT ACAAAGAAGC AGGTGTTGAT ATAGAAAAAG CAGATCGTTT TGTAGGCTTT 
TTAAAAGAAA GACTTAACAA TCTTAACAAA AACCTAAAAC AAGCATTACC CTTTGGGGCT
TTTGCGGCAG GTTTTTTGGT AGAAGATTGC GATTTGGTAA TTACATCCAC CACAGACGGG
GTTGGCACAA AGCTAAAAAT AGCCCAAAAC GTAAATATAC ACAACACAGT AGGCATAGAC
TTAGTAGCAA TGAACGTAAA CGATATAATT ACCACTGGCT CAAAGCCAAT AGCATTTTTA
GATTATATAG CCATAGGCAT GATTGAAGGG TCTACAATAA ACCCGCTGAT AGAAGGCATT
ATAACAGGTT GCGAAGAAGC AAATACACCT CTTGTGGGTG GAGAAACTGC GGAAATGCCT
TCTTTCTACA AAGACGGTGA ATACGATTTA GCTGGTTTTT GCATAGGTAT TTGTAAAAAA
GATGAGATTG TTACAGGGCA AGATGTAAAA GAGAATGATA TTATAATTGC TATACCATCT
TCTGGATTTC ACAGCAACGG TTTTTCCCTT GTAAGATATA TATTAGAAAA GCACAATATT
AAATACAATG ATTATATAAA AGAGTTTGGA AAAGAACTTT GGGAAATATT ACTAACACCT
ACAAGGATAT ACGTAAAAGA TGTTTTGGAG CTTAAAAACA AAATAAAGAT AAAAGCTATG
GCTCATATAA CAGGCGGTGG AATACCGGGA AACATAACGA GGGTTATACC ATATGGTTTA
AGAGCCGTGA TATCAGCTTA TCCGGTACCG GATTTATTTT TATGGTTTCA AAAGCTTGGA
AACATAAAAA AAGAAGAAAT GTACAAAACT TTTAATATGG GAGTAGGGTT TATGATTATT
ATCGAAGAAA AAGATAAAGA GGTTGCTTTG AACACTATAA AAGATTCTTT TGTTGTAGGG
TATATAGAAC AATCAAAAGA TAATAGCAAA ATTGTTTTAA ATGACATATA G
 
Protein sequence
MSTYKEAGVD IEKADRFVGF LKERLNNLNK NLKQALPFGA FAAGFLVEDC DLVITSTTDG 
VGTKLKIAQN VNIHNTVGID LVAMNVNDII TTGSKPIAFL DYIAIGMIEG STINPLIEGI
ITGCEEANTP LVGGETAEMP SFYKDGEYDL AGFCIGICKK DEIVTGQDVK ENDIIIAIPS
SGFHSNGFSL VRYILEKHNI KYNDYIKEFG KELWEILLTP TRIYVKDVLE LKNKIKIKAM
AHITGGGIPG NITRVIPYGL RAVISAYPVP DLFLWFQKLG NIKKEEMYKT FNMGVGFMII
IEEKDKEVAL NTIKDSFVVG YIEQSKDNSK IVLNDI
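
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal Python sketch that strips the whitespace, computes the GC fraction, and translates a CDS in frame. It is an illustration, not the method used to produce this record. The helper names are hypothetical, and the codon table holds the standard amino-acid assignments, which translation table 11 shares for elongation; alternative start codons are not handled.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same assignments and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First two blocks of the gene sequence listed above.
    print(translate("ATGAGCACTT ACAAAGAAGC"))
```

Applied to the first two blocks of the gene sequence, `translate` returns `MSTYKE`, the first six residues of the protein sequence.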