Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_1023 |
Symbol | |
ID | 6743838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 957940 |
End bp | 959046 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 642750831 |
Product | Phosphoribosylaminoimidazole carboxylase |
Protein accession | YP_002121687 |
Protein GI | 195953397 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00000261565 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGTTG GTATAATAGG CGATGGACAA CTTGCCATGA TGAGCGTAAT GGAAGGGCTC ATGATGGATA TAGATTTTGC TGTCTTATCT TTTGAAAAAG ACCCTCCAGC TTCTTACGTA ACTAAACATG TTTTCAAAGA AAATGAAGTG GAAGAGTTTG TGGCTTTTAG CGATGTGATA ACCTATGAGT TTGAGCATTT TAACAAAAAG ATATTTGACT GCACGAAACT ACTGGACAAG CTTTATCCTG GTGTAAAACC AATAGAGTTA AAACAAAACA GGCTTTTAGA AAAAAAGTTT TTAAAAGACC ACAACTTTCC AACAGTCCCT TTTTACGAAG CTAAAAATAC AGATGAACTT TTTGAGATTG TAGAAAGTTT AAACAAAGAA GCGGTAGTAA AAACAATATC AAACGGCTAC GATGGTAAAG GACAGTATGT AATACATACA AGAGAAGATT TGGAACTTTT AAAAGATAAA CTAAAAGATT CTAAAGACAG TTTTTTAATA GAGGAGTTTT GTTATTTTGA TTTTGAAATG TCTTTGATAG CTGGTATATC AAAAGATAGA ATCGTCTTCA TGCCAATGAC AAAAAATATA CATAAAAACG GTATTTTGTT ATACAACCAT ACAGATTTTT TTACAAACGA AGTGCAAGAA AAAGCTAAAG CTATAACCTC AAGGCTTCTA AAGGCCCTTG GTATAGAAAA AGGTGTTTTA GCGGTGGAAT TTTTTGTAAA AGCCAAAGAC GTTTATATAA ACGAATTTGC CCCAAGAGTA CACAACACCG GTCATCACAC GTTGAACGAT GCCGAATACT CCCAGTTTGA ACTGCTTTTA AGAACAATGT TAGATATGCC AATATACTCT CCCTCTCTTA TAACACAAGG TGGGATGATA AACATAATAG GCAATATAAA TCTTACAAAA GAACTAAAAG ATGGTATATT GTCTTTAGAA GGTGCTAGTC TTTATTGGTA TAGGAAAACA CCAAGAGAAG GCAGGAAATT AGGACATATA AACGTTGTTG GGAGAGATGT TGAAGAAGTA AGAGCTAAGC TTAGAAATTT ATCTAAACTT TTATACCCTT CGTTAAATAT ATGGTAA
|
Protein sequence | MRVGIIGDGQ LAMMSVMEGL MMDIDFAVLS FEKDPPASYV TKHVFKENEV EEFVAFSDVI TYEFEHFNKK IFDCTKLLDK LYPGVKPIEL KQNRLLEKKF LKDHNFPTVP FYEAKNTDEL FEIVESLNKE AVVKTISNGY DGKGQYVIHT REDLELLKDK LKDSKDSFLI EEFCYFDFEM SLIAGISKDR IVFMPMTKNI HKNGILLYNH TDFFTNEVQE KAKAITSRLL KALGIEKGVL AVEFFVKAKD VYINEFAPRV HNTGHHTLND AEYSQFELLL RTMLDMPIYS PSLITQGGMI NIIGNINLTK ELKDGILSLE GASLYWYRKT PREGRKLGHI NVVGRDVEEV RAKLRNLSKL LYPSLNIW
|
| |