Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0133 |
Symbol | purH |
ID | 3785781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 137927 |
End bp | 139489 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637810203 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_410834 |
Protein GI | 82701268 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTA AACAGGCCCT GATCAGTGTT TCAGATAAAA GCGGTATTGT CGAGTTTGCT CAAGCGCTGC ACAAACTTGG AGTAACCATT CTGTCAACGG GCGGCACCGC CAGACTTTTG AGGGATGCCG GCGTTGCCGT GACCGAGGTC GGGAGCTACA CCGGCTTTCC CGAGATGCTG GATGGGCGGG TCAAGACCCT GCATCCAAAA ATACACGCGG GAATTCTGGC CAGACGGGAT TTGTCCGAGC ATGTGTCCGC GCTGGAAAAG GCGGAAATTC CTGCCATCGA TCTGGTAGTA GTGAATCTCT ATCCTTTCAG CCAGACAGTG GCGCAGCCGG ATTGCAGTCT GGAAGAGGCC ATTGAAAATA TCGATATCGG CGGCCCGACC ATGGTGCGCG CTGCGGCCAA GAACTACCAG AGCGTGGCCG TAGTCACCGA TCCGGCGGAT TACCCCGCGT TACTTGACGA GATGAAGACT GCCGGCGGCG AGGTTACGCC TGAGTTCCGG TTCCGGCTGG CGTGCAAGGC GTTTTCGCAT ACAGCCGCCT ATGATGGGGC GATCAGCAAC TACCTCACCT CCATCGAGGG GGAAAATGCA CAACGTCGTA CCTTTCCGGA ACGCCTGAAT CTCAATTTCA GCCTGGTGCA GCCTCTGCGC TACGGGGAAA ACCCGCATCA GCAGGCCGCC TTTTACCGCG ATCCGCAACT CGTTCCCGGC AGCCTTGCAA GCTACAGGCA GTTGCAGGGC AAGGAGCTCT CTTACAACAA CATCGCGGAC GCGGATGCCG CCTGGGAATG CGTAAAGACG TTTGATTCCC CGGCCTGTGT CATCATCAAG CATGCCAATC CTTGCGGCGT GGCAATCAGC GATTCGCCGC TCGCGGCTTA CAAGCTTGCA TTTGCCACCG ATCCCACTTC CGCATTCGGC GGCATTATCG CTTTCAATCG CACGCTGGAC GCCTCGGCCG CCGAGGCGGT GATGAGCCAG TTTGTCGAAG TGATCATCGC GCCGCAAATG ACCGACGAGG CAAGGCAGAT GCTCGCGCGC AAAGCCAATG TGCGCGTGCT GACCGTGCCG CTCCAGGCGG GGAACAACGC CCACGACTTC AAGCGAGTGG GTGGAGGATT GCTGGTGCAG ACTCCGGACA ATCTCAACGT TACCCCTAAC CAATTGAAGG TGGTGACCGA AGTCCAGCCA ACAGCGCAGC AATTGCAGGA CCTGCTGTTT GCCTGGCGGG TGGCGAAATT CGTCAAATCG AATGCCATCG TCTTTTGTGC CAACGGCCGC ACGCTTGGCG TGGGCGCCGG ACAGATGAGC AGGGTGGATA GCGCGCGCAT TGCCTCCATC AAAGCCGGGA ACGCGAACCT CACCCTGGCA GGCTCGGTAG TGGCATCCGA TGCCTTCTTC CCTTTCCGCG ACGGACTGGA TGTCGTCGTC CAGGCGGGGG CGGTGGCGGT CATCCAGCCG GGGGGCAGTG TGCGGGATGA AGAGGTTATC GCTGCGGCGG ATGAACAAGG GGTGGCAATG GTATTTACCG GCGTGCGCCA TTTCAGGCAT TGA
|
Protein sequence | MSIKQALISV SDKSGIVEFA QALHKLGVTI LSTGGTARLL RDAGVAVTEV GSYTGFPEML DGRVKTLHPK IHAGILARRD LSEHVSALEK AEIPAIDLVV VNLYPFSQTV AQPDCSLEEA IENIDIGGPT MVRAAAKNYQ SVAVVTDPAD YPALLDEMKT AGGEVTPEFR FRLACKAFSH TAAYDGAISN YLTSIEGENA QRRTFPERLN LNFSLVQPLR YGENPHQQAA FYRDPQLVPG SLASYRQLQG KELSYNNIAD ADAAWECVKT FDSPACVIIK HANPCGVAIS DSPLAAYKLA FATDPTSAFG GIIAFNRTLD ASAAEAVMSQ FVEVIIAPQM TDEARQMLAR KANVRVLTVP LQAGNNAHDF KRVGGGLLVQ TPDNLNVTPN QLKVVTEVQP TAQQLQDLLF AWRVAKFVKS NAIVFCANGR TLGVGAGQMS RVDSARIASI KAGNANLTLA GSVVASDAFF PFRDGLDVVV QAGAVAVIQP GGSVRDEEVI AAADEQGVAM VFTGVRHFRH
|
| |