Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apre_1108 |
Symbol | purH |
ID | 8397895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerococcus prevotii DSM 20548 |
Kingdom | Bacteria |
Replicon accession | NC_013171 |
Strand | - |
Start bp | 1188990 |
End bp | 1190495 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 644995455 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_003152856 |
Protein GI | 257066600 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGAGCTT TACTATCAGT TACTGACAAG ACAGGAATAG AAAAACTAGC CAAAGACCTT AGGGACTTGG GAGTAAGTTT GGTTTCAACA GGAGGCACCT ACAAAAAGAT CAAAGATAGC GGAGTAGATG TATCAGAGAT TGAGGAAATA ACAAACTTTC CAGAAATACT AGAAGGAAGG GTAAAGACCC TATCTCCTTA TGTACATGGA GGAATTCTTT ATAAGAGGGA TGAAGCTAGT CATGTTTCAA CTGTAGAAGA GTTGGGGATA AAGGCAATTG ATATAGTAGT TGTAAATTTA TACGAATTCC AAAAGGCCCT TGATAAGGGA AATCCAGAAG AGATAATCGA AAATATCGAC ATCGGTGGCC CATCCATGGT TAGGTCTGCT GCCAAAAACC ATAAAGATGT CTTAATTGTA ACAGATCCAA GTGATTATGA TGAACTAATC GAAAGACTTA AAAACGATGA TATAGACCTA GCTTATAGGC AAAGACTTGC TATGAAGGCC TTTAGCCTTA CAGCATTCTA CGATTCAGTA ATAGCAAGGT ACTTCACAAA ACTTACTGGA GAAGAATCTA AATATAAGAC CTACGGCTTT GAGAAAGAAA CAGATTTACG TTATGGGGAA AATCCAGGAC AAGAGGCAAG TTTATACAAT GATCCATTTG TCACAGGACT TATGGAAGAT ATAGAAGTAA TTCACGGCAA GGAAATGAGT TATAACAACT ACAATGATCT AAACCCAGCC CTAGAGCTTG CCCAAGAGCT AGGAGATAAT GCAGTAGTTG CTCTTAAACA CCAATCACCA TGCGGGGTTG CTGTAGGAAG TGATGTCTAT GATTCATACA TTAAGGCCTT CGAGTGCGAC AGCCAATCAA TATTTGGAGG AATCCTTGCA GTAAATGGAG TAGTTGATGA GAAAGCAGCT TCGAAAATGC ATGAAATATT CCTAGAAATA ATAGCAGCAA AAGACTTTAC AAAAGAGGCT CTAGAAATTC TTACAAAGAA GAAAAATCTA AGGCTCGTTA AAGTCGACTT TGCTAATGAA AGTGTAAGAG AAGAAATCAG ATATCTTAAT GGAAAAGTCC TAATTCAAGG AAAAGACTTC GGCAAGGACG AAGTAAATAT AGTAACTGAC AAAAAGCCTA GTGAAGAAGA AATCAAAGAC CTCTTATTTG CCCAAAAGGT GGTAAAATAT GTCAAATCAA ATGCCATTGT AGTAGCCAAG GGAATGAAGA CCCTAGGTTG TGGGGCAGGT CAACAATCTA GAGTTTGGGC GCTTGAATCT ATCAAAGATC ACTTTAAGGA TAGGGACTTT GAGGGAGCAG TCCTTGGATC AGATGCCTTC TTCCCATTTT CAGATACAGT AGAGCTTGCC CACGAGATGG GAATTAGCTC AATCATCCAA CCAGGTGGAT CAATAAGAGA CGAAGACTCA ATCGATAAAT GTAATGAATA TGGTATGAGC ATGGTATTTA GCAAATCACG TCACTTCAAA CATTAA
|
Protein sequence | MRALLSVTDK TGIEKLAKDL RDLGVSLVST GGTYKKIKDS GVDVSEIEEI TNFPEILEGR VKTLSPYVHG GILYKRDEAS HVSTVEELGI KAIDIVVVNL YEFQKALDKG NPEEIIENID IGGPSMVRSA AKNHKDVLIV TDPSDYDELI ERLKNDDIDL AYRQRLAMKA FSLTAFYDSV IARYFTKLTG EESKYKTYGF EKETDLRYGE NPGQEASLYN DPFVTGLMED IEVIHGKEMS YNNYNDLNPA LELAQELGDN AVVALKHQSP CGVAVGSDVY DSYIKAFECD SQSIFGGILA VNGVVDEKAA SKMHEIFLEI IAAKDFTKEA LEILTKKKNL RLVKVDFANE SVREEIRYLN GKVLIQGKDF GKDEVNIVTD KKPSEEEIKD LLFAQKVVKY VKSNAIVVAK GMKTLGCGAG QQSRVWALES IKDHFKDRDF EGAVLGSDAF FPFSDTVELA HEMGISSIIQ PGGSIRDEDS IDKCNEYGMS MVFSKSRHFK H
|
| |