Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_0027 |
Symbol | purH |
ID | 5160622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | + |
Start bp | 24181 |
End bp | 25722 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640551941 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_001233175 |
Protein GI | 148259048 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.493241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA GAGTCAACAT CCGACGGGCC CTCATCTCGG TGTCCGACAA GGCGGGGCTG GTCGAGCTCG GCCGCGCGCT GGCGGCAGCG GGCGTGGAAA TCCTCTCGAC CGGCGGTTCC GCCCGCGCGC TGCGCGAGGC CGGGATCGCC GTTGTCGAGG TGGCGGATTA CACGGGTGTT CCCGAAATGC TGGATGGGCG GGTCAAGACG CTGGTGCCCA AGATCCATGG CGGCCTGCTC GGCCGGCGCG ACCTGCCGGA GCATCTGGCG CAGATGCAGC GGCACGACAT CCCGCCGATC GACCTGCTCG CGGTCAATCT CTACCCGTTC GAGGAAACGG TCGCGAAGGG CTCGGATTTC GAAACCTGCG TCGAGAACAT CGATATCGGC GGCCCGGCGC TGATCCGCGC GGCGGCGAAG AACCACGATT CGGTGGCGGT TCTCACCAGC CCGGCGCAGT ATGACGACCT CATCGCGGCG CTAGCCGCCG GCGGAACGAC GCTGGAGCAG CGCCGCCGCC TCGCCGCCGC CGCCTATGCC CGCACCGCCG CCTACGACGC CGCGATTTCC GCCTGGTTCG CGCAGCAGAC CGGCGAGATG TTCCCGGCGC ACCTCGCCCT GGCCGGCGCG CGGCAGCAGA TGCTGCGCTA CGGCGAGAAC CCGCACCAGT CCGCTGCGTT CTATCGCACC GGCAACCGCC CCGGCGTTGC CACCGCGCGG CAGTTGCAGG GCAAGGAACT CTCCTACAAC AACATCAACG ACACCGATGC CGCTTTCGAA TGCGTCGCCG AGTTCGACCG GCCGGCGGTG GTGATCGTCA AGCACGCCAA TCCGTGCGGC GTCGCCCTCG GCGCCGATCT TGCCGAGGCC TGGGACCGCG CGCTGGACTG CGACCCGGTT TCGGCGTTTG GCGGCATCAT CGCGGTCAAC CGCCCGCTCG ATGTTGCGGC AGCCGAGAAG ATGGCGAGCA TCTTCTCCGA GGTGATCATC GCGCCGGACG CCGCACCTGA CGCGGTTGAA CTGCTTGCCC GCAAGAAGAA TCTCCGCCTG CTGCTCACCG GCGGCCTGCC CGACCCGGCG GAACCGGGCC TTGCCTGGCG CAGCGTTGCC GGTGGTTTCC TGGCCCAGAC CCGCGACGCC GGGAGGATTG GCCGCGACGA TCTGAAGGTC GTCACCCAGC GCGCGCCGAC CAACGCCGAG TTCGCCGATC TGCTGTTCGC CTTCCGTGTG GCCAAGCATG TGAAGTCGAA TGCGATCATC TACGCGAAAG CAGGGGCGAC CACGGGCATC GGCGCGGGGC AGATGAGCCG CGTCGATTCC TCGCGCATCG CCGCACAGAA GGGTGGGGAG AAGATTCCGG GTTCGGTCGT CGCGTCCGAC GCGTTCTTCC CCTTCGCCGA CGGTCTTGTG GCCGCGATCG AGGCAGGGGC GACGGCGGTG ATCCAGCCCG GCGGCTCGAT CCGCGACAAC GAGGTGATCG AGGCGGCAGA TGCCGCCGGG ATTGCCATGG TGTTCACCGG CATGCGCCAT TTCAGGCATT GA
|
Protein sequence | MNDRVNIRRA LISVSDKAGL VELGRALAAA GVEILSTGGS ARALREAGIA VVEVADYTGV PEMLDGRVKT LVPKIHGGLL GRRDLPEHLA QMQRHDIPPI DLLAVNLYPF EETVAKGSDF ETCVENIDIG GPALIRAAAK NHDSVAVLTS PAQYDDLIAA LAAGGTTLEQ RRRLAAAAYA RTAAYDAAIS AWFAQQTGEM FPAHLALAGA RQQMLRYGEN PHQSAAFYRT GNRPGVATAR QLQGKELSYN NINDTDAAFE CVAEFDRPAV VIVKHANPCG VALGADLAEA WDRALDCDPV SAFGGIIAVN RPLDVAAAEK MASIFSEVII APDAAPDAVE LLARKKNLRL LLTGGLPDPA EPGLAWRSVA GGFLAQTRDA GRIGRDDLKV VTQRAPTNAE FADLLFAFRV AKHVKSNAII YAKAGATTGI GAGQMSRVDS SRIAAQKGGE KIPGSVVASD AFFPFADGLV AAIEAGATAV IQPGGSIRDN EVIEAADAAG IAMVFTGMRH FRH
|
| |