Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0925 |
Symbol | |
ID | 7401297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 923929 |
End bp | 925536 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643707991 |
Product | phosphoribosylglycinamide formyltransferase |
Protein accession | YP_002565593 |
Protein GI | 222479356 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase [TIGR00639] phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000561233 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGATCG CCGGACTCGC GAGCAACCGG GGACGGAACC TCAGACACAT CGCCGACGCC GCGCCGGGTG ATGCGGAGCT GTCGGTCGTC CTGACCAACC GCGAACAGGC GCCCGTGCTG GAGGCCGCCA CGGAACGCCG GATCCCGACC GAGGTCGTCG AGCGCGAAGA CGGGGAGTCG CGCGAGGCCC ACGAGCGACG GATCCTCGAC CGACTCGCCG ATTACGACTT CGATCTCGTC TGTCTGGACG GGTACATGCG CGTGCTCACC GACGAGTTCC TCGACGCAGC CCCAACGACG CTGAACGTCC ACCCGTCGCT TCTCCCCGCG TTCCCCGGTA CGGACGCCCA CGAACAGGTG ATCGACGCCG GCGTCCGCAC CACCGGCTGT ACCGTCCACG TCGTCACCGA GGCGGTCGAC GCCGGCCCGA TCGTCACGCA GGAGCCGGTA CCCGTCTACG AGGGCGACGA CGCCGAGGCG CTGAAAGGCC GAGTACTTCA CGACGCCGAG TTCACGGCGT ACCCGCGAGC AGTGCGGTGG TTCGCGGAGG ACCGAGTGAC GATCGAGCGC GGGGGCGACG ACGACGCTCC CGTGAACGTG ACCGTTGACG GCGACACCGG CGGCGACTTC CCCGAGCGGC GGTTCGTCTC CGAGGAGCGC GCCGACACGC TGCGGTACGG CGAGAACCCC CATCAGGACG CCGCGCTCTA CGTCGACGAC GGCTGCGAGG AGGCGAGTGT CGTCGGCGCC GATCAGCTGA ACCCCGGCGC GAAGGGGATG GGGTACAACA ACTACAACGA CGCCGACGGC GCGTTGAACC TCGTCAAGGA GTTCGACGAG CCCGCCGCCG CCGTGATCAA GCACACGAAC CCGGCCGGCT GCGCAACGAG CGACACGGTC GCTGACGCGT ACGACCGCGC GCTCCGCACT GACGCGAAGT CTGCGTTCGG CGGTATTGTC GCGCTGAATC GCGAGTGCGA CGCCGACACC GCCGACGCCA TCGTCGACTC GTTCAAGGAG GTCGTCGTCG CGCCCGGCTA CACCGACAGC GCGCTCGATG TGCTCCGGGA GAAGAAGAAC CTCCGCGTGC TCGATGTCGG CCCCCTCGGT GAGGGTGATG AGCGCTTCTC CGAACGGTTC ACGGAGAAGC CGGTCGTCGG CGGGCGGCTG GTTCAAGAAC GGGACCGCCA GTCGCCGACC GCCGGCGACC TCGACGTGGT CACCGAGCGC GAGCCCACCG ACGAACAGCT AGCAACGATG GTGTTCGCGT GGAAGACGCT CAAACACGTG AAATCGAACG GGATCTTGTT CGCGACCGGC ACCGAAACGG TCGGCGTCGG GATGGGGCAG GTGTCTCGCG TCGACGCCGT CACGCTGGCG GCGATGAAGG CGGAGAAGGA CGCCGAGGGG AAATCTGCAG AGGGCGCGGT GATGGCCTCG GACGCCTTCT TCCCGTTCCC GGACGCGATC GAAGAGGCGG CGGAGGCCGG GATCGAGGCC GTGATCCAGC CCGGCGGCTC GGTCAACGAC GAGGACGTGA TCGCCGCGGC CGACGAACAC GACATGGCGA TGGCGTTCAC CGGCTCGCGC TGCTTCCGAC ACGACTGA
|
Protein sequence | MKIAGLASNR GRNLRHIADA APGDAELSVV LTNREQAPVL EAATERRIPT EVVEREDGES REAHERRILD RLADYDFDLV CLDGYMRVLT DEFLDAAPTT LNVHPSLLPA FPGTDAHEQV IDAGVRTTGC TVHVVTEAVD AGPIVTQEPV PVYEGDDAEA LKGRVLHDAE FTAYPRAVRW FAEDRVTIER GGDDDAPVNV TVDGDTGGDF PERRFVSEER ADTLRYGENP HQDAALYVDD GCEEASVVGA DQLNPGAKGM GYNNYNDADG ALNLVKEFDE PAAAVIKHTN PAGCATSDTV ADAYDRALRT DAKSAFGGIV ALNRECDADT ADAIVDSFKE VVVAPGYTDS ALDVLREKKN LRVLDVGPLG EGDERFSERF TEKPVVGGRL VQERDRQSPT AGDLDVVTER EPTDEQLATM VFAWKTLKHV KSNGILFATG TETVGVGMGQ VSRVDAVTLA AMKAEKDAEG KSAEGAVMAS DAFFPFPDAI EEAAEAGIEA VIQPGGSVND EDVIAAADEH DMAMAFTGSR CFRHD
|
| |