Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3583 |
Symbol | purH |
ID | 4599462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3800072 |
End bp | 3801649 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639778191 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_924770 |
Protein GI | 119717805 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0699465 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCCGAAC CCGTGTCCAC GCCCGAGCAC CGCATCCCGA TCCGCCGCGC CCTCGTCTCC GTCTACGACA AGACCGACCT CGAGGACCTG GTCCGCGGCC TGCACGACGC CGGCGTCGAG CTGGTGTCGA CGGGCGGCTC AGCGAAGCTG ATCGAGGGCC TGGGCCTCCC GGTCACCAAG GTCGAGGACC TGACCGGCTT CCCGGAGTGC CTCGACGGCC GGGTCAAGAC GCTGCACCCG CGCGTGCACG CGGGGATCCT CGCCGACCGG CGCCTGGACT CCCACGTCCA GCAGCTCGCC GACCTCGGGG TGGAGCCGTT CGACCTGGTG GTCTCCAACC TCTACCCGTT CCGCGAGACC GTCGCGTCGG GCGCCACGCC CGACGAGTGC GTGGAGCAGA TCGACATCGG CGGGCCCTCG ATGGTCCGGG CCGCCGCCAA GAACCACCCG TCCGTGGCGA TCGTGACCTC GCCGGAGCGG TACGCCGACG TGCTGGCGGC CGTCGCCGCA GGCGGGTTCA CCCTCGAGCA GCGCAAGGTG CTGGCAGCCG AGGCGTTCAC CCACACCGCG GCCTACGACG TCGCGGTCGC GGGCTGGTTC GCCTCGACGT ACGTGCCGGC CGAGGACGGC TGGCCCGAGT TCGCCGGGGA GACCTGGCAG AAGGCCGCCG TGCTGCGGTA CGGCGAGAAC CCGCACCAGG ACGCCGCCCT CTACACCGAT TCGTCAGGGG GCGGCGGTCT GGCCGGGGCC GAGCAGCTGC ACGGCAAGGA GATGTCCTAC AACAACTACG TCGACACCGA CGCGGCGCGG CGCGCGGCGT ACGACTTCGA CGAGCCTGCC GTCGCGATCA TCAAGCACGC CAACCCGTGT GGCATCGCCG TCGGCGCCGA CGTCGCCGAG GCCCACCGCC GCGCCCACGA GTGCGACCCG GTCAGCGCCT TCGGCGGCGT GATCGCGGTC AACCGGCCCG TCTCGGTCGA GATGGCCCGC CAGGTGGCCG ACGTGTTCAC CGAGGTGATC GTCGCGCCGT CGTACGACGA GGGCGCGGTC GAGATCCTGC AGGGCAAGAA GAACATCCGC ATCCTGCGCT GCGCCGACCC GGCCGAGGAG CGCTCCACCG AGCTGCGCCA GATCAGCGGC GGCGTGCTCG TGCAGGTGCG TGACCACGTC GACGCGACGG GCGACGACCC GTCGACCTGG ACGCTGGCCG CGGGGGAGCC CGCCTCGGCG GAGGTGCTCG CCGACCTCGC GTTCGCCTGG ACGGCGTGCC GCGCCGCGAA GTCCAACGCG ATCCTGCTCG CCAAGGACGG CGCCTCGGTC GGCATCGGCA TGGGCCAGGT CAACCGGGTC GACTCCTGCC GGCTCGCCGT CTCGCGGGCC GGGGACCGGG CCGCGGGATC GGTCGCCGCC TCCGACGCGT TCTTCCCCTT CGAGGACGGC CCGCAGATCC TCATCGACGC CGGCGTCACC GCGATCGTGC AGCCGGGCGG CTCGGTCCGT GACGAGCTCA CGGTCGAGGC GGCCAAGGCC GCCGGCGTCA CCATGTACTT CACCGGCACC CGGCACTTCT TCCACTGA
|
Protein sequence | MSEPVSTPEH RIPIRRALVS VYDKTDLEDL VRGLHDAGVE LVSTGGSAKL IEGLGLPVTK VEDLTGFPEC LDGRVKTLHP RVHAGILADR RLDSHVQQLA DLGVEPFDLV VSNLYPFRET VASGATPDEC VEQIDIGGPS MVRAAAKNHP SVAIVTSPER YADVLAAVAA GGFTLEQRKV LAAEAFTHTA AYDVAVAGWF ASTYVPAEDG WPEFAGETWQ KAAVLRYGEN PHQDAALYTD SSGGGGLAGA EQLHGKEMSY NNYVDTDAAR RAAYDFDEPA VAIIKHANPC GIAVGADVAE AHRRAHECDP VSAFGGVIAV NRPVSVEMAR QVADVFTEVI VAPSYDEGAV EILQGKKNIR ILRCADPAEE RSTELRQISG GVLVQVRDHV DATGDDPSTW TLAAGEPASA EVLADLAFAW TACRAAKSNA ILLAKDGASV GIGMGQVNRV DSCRLAVSRA GDRAAGSVAA SDAFFPFEDG PQILIDAGVT AIVQPGGSVR DELTVEAAKA AGVTMYFTGT RHFFH
|
| |