Gene HY04AAS1_1023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_1023 
Symbol 
ID6743838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp957940 
End bp959046 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content33% 
IMG OID642750831 
ProductPhosphoribosylaminoimidazole carboxylase 
Protein accessionYP_002121687 
Protein GI195953397 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000261565 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAGTTG GTATAATAGG CGATGGACAA CTTGCCATGA TGAGCGTAAT GGAAGGGCTC 
ATGATGGATA TAGATTTTGC TGTCTTATCT TTTGAAAAAG ACCCTCCAGC TTCTTACGTA
ACTAAACATG TTTTCAAAGA AAATGAAGTG GAAGAGTTTG TGGCTTTTAG CGATGTGATA
ACCTATGAGT TTGAGCATTT TAACAAAAAG ATATTTGACT GCACGAAACT ACTGGACAAG
CTTTATCCTG GTGTAAAACC AATAGAGTTA AAACAAAACA GGCTTTTAGA AAAAAAGTTT
TTAAAAGACC ACAACTTTCC AACAGTCCCT TTTTACGAAG CTAAAAATAC AGATGAACTT
TTTGAGATTG TAGAAAGTTT AAACAAAGAA GCGGTAGTAA AAACAATATC AAACGGCTAC
GATGGTAAAG GACAGTATGT AATACATACA AGAGAAGATT TGGAACTTTT AAAAGATAAA
CTAAAAGATT CTAAAGACAG TTTTTTAATA GAGGAGTTTT GTTATTTTGA TTTTGAAATG
TCTTTGATAG CTGGTATATC AAAAGATAGA ATCGTCTTCA TGCCAATGAC AAAAAATATA
CATAAAAACG GTATTTTGTT ATACAACCAT ACAGATTTTT TTACAAACGA AGTGCAAGAA
AAAGCTAAAG CTATAACCTC AAGGCTTCTA AAGGCCCTTG GTATAGAAAA AGGTGTTTTA
GCGGTGGAAT TTTTTGTAAA AGCCAAAGAC GTTTATATAA ACGAATTTGC CCCAAGAGTA
CACAACACCG GTCATCACAC GTTGAACGAT GCCGAATACT CCCAGTTTGA ACTGCTTTTA
AGAACAATGT TAGATATGCC AATATACTCT CCCTCTCTTA TAACACAAGG TGGGATGATA
AACATAATAG GCAATATAAA TCTTACAAAA GAACTAAAAG ATGGTATATT GTCTTTAGAA
GGTGCTAGTC TTTATTGGTA TAGGAAAACA CCAAGAGAAG GCAGGAAATT AGGACATATA
AACGTTGTTG GGAGAGATGT TGAAGAAGTA AGAGCTAAGC TTAGAAATTT ATCTAAACTT
TTATACCCTT CGTTAAATAT ATGGTAA
 
Protein sequence
MRVGIIGDGQ LAMMSVMEGL MMDIDFAVLS FEKDPPASYV TKHVFKENEV EEFVAFSDVI 
TYEFEHFNKK IFDCTKLLDK LYPGVKPIEL KQNRLLEKKF LKDHNFPTVP FYEAKNTDEL
FEIVESLNKE AVVKTISNGY DGKGQYVIHT REDLELLKDK LKDSKDSFLI EEFCYFDFEM
SLIAGISKDR IVFMPMTKNI HKNGILLYNH TDFFTNEVQE KAKAITSRLL KALGIEKGVL
AVEFFVKAKD VYINEFAPRV HNTGHHTLND AEYSQFELLL RTMLDMPIYS PSLITQGGMI
NIIGNINLTK ELKDGILSLE GASLYWYRKT PREGRKLGHI NVVGRDVEEV RAKLRNLSKL
LYPSLNIW