Gene Noca_3583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3583 
SymbolpurH 
ID4599462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3800072 
End bp3801649 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content72% 
IMG OID639778191 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_924770 
Protein GI119717805 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0699465 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCCGAAC CCGTGTCCAC GCCCGAGCAC CGCATCCCGA TCCGCCGCGC CCTCGTCTCC 
GTCTACGACA AGACCGACCT CGAGGACCTG GTCCGCGGCC TGCACGACGC CGGCGTCGAG
CTGGTGTCGA CGGGCGGCTC AGCGAAGCTG ATCGAGGGCC TGGGCCTCCC GGTCACCAAG
GTCGAGGACC TGACCGGCTT CCCGGAGTGC CTCGACGGCC GGGTCAAGAC GCTGCACCCG
CGCGTGCACG CGGGGATCCT CGCCGACCGG CGCCTGGACT CCCACGTCCA GCAGCTCGCC
GACCTCGGGG TGGAGCCGTT CGACCTGGTG GTCTCCAACC TCTACCCGTT CCGCGAGACC
GTCGCGTCGG GCGCCACGCC CGACGAGTGC GTGGAGCAGA TCGACATCGG CGGGCCCTCG
ATGGTCCGGG CCGCCGCCAA GAACCACCCG TCCGTGGCGA TCGTGACCTC GCCGGAGCGG
TACGCCGACG TGCTGGCGGC CGTCGCCGCA GGCGGGTTCA CCCTCGAGCA GCGCAAGGTG
CTGGCAGCCG AGGCGTTCAC CCACACCGCG GCCTACGACG TCGCGGTCGC GGGCTGGTTC
GCCTCGACGT ACGTGCCGGC CGAGGACGGC TGGCCCGAGT TCGCCGGGGA GACCTGGCAG
AAGGCCGCCG TGCTGCGGTA CGGCGAGAAC CCGCACCAGG ACGCCGCCCT CTACACCGAT
TCGTCAGGGG GCGGCGGTCT GGCCGGGGCC GAGCAGCTGC ACGGCAAGGA GATGTCCTAC
AACAACTACG TCGACACCGA CGCGGCGCGG CGCGCGGCGT ACGACTTCGA CGAGCCTGCC
GTCGCGATCA TCAAGCACGC CAACCCGTGT GGCATCGCCG TCGGCGCCGA CGTCGCCGAG
GCCCACCGCC GCGCCCACGA GTGCGACCCG GTCAGCGCCT TCGGCGGCGT GATCGCGGTC
AACCGGCCCG TCTCGGTCGA GATGGCCCGC CAGGTGGCCG ACGTGTTCAC CGAGGTGATC
GTCGCGCCGT CGTACGACGA GGGCGCGGTC GAGATCCTGC AGGGCAAGAA GAACATCCGC
ATCCTGCGCT GCGCCGACCC GGCCGAGGAG CGCTCCACCG AGCTGCGCCA GATCAGCGGC
GGCGTGCTCG TGCAGGTGCG TGACCACGTC GACGCGACGG GCGACGACCC GTCGACCTGG
ACGCTGGCCG CGGGGGAGCC CGCCTCGGCG GAGGTGCTCG CCGACCTCGC GTTCGCCTGG
ACGGCGTGCC GCGCCGCGAA GTCCAACGCG ATCCTGCTCG CCAAGGACGG CGCCTCGGTC
GGCATCGGCA TGGGCCAGGT CAACCGGGTC GACTCCTGCC GGCTCGCCGT CTCGCGGGCC
GGGGACCGGG CCGCGGGATC GGTCGCCGCC TCCGACGCGT TCTTCCCCTT CGAGGACGGC
CCGCAGATCC TCATCGACGC CGGCGTCACC GCGATCGTGC AGCCGGGCGG CTCGGTCCGT
GACGAGCTCA CGGTCGAGGC GGCCAAGGCC GCCGGCGTCA CCATGTACTT CACCGGCACC
CGGCACTTCT TCCACTGA
 
Protein sequence
MSEPVSTPEH RIPIRRALVS VYDKTDLEDL VRGLHDAGVE LVSTGGSAKL IEGLGLPVTK 
VEDLTGFPEC LDGRVKTLHP RVHAGILADR RLDSHVQQLA DLGVEPFDLV VSNLYPFRET
VASGATPDEC VEQIDIGGPS MVRAAAKNHP SVAIVTSPER YADVLAAVAA GGFTLEQRKV
LAAEAFTHTA AYDVAVAGWF ASTYVPAEDG WPEFAGETWQ KAAVLRYGEN PHQDAALYTD
SSGGGGLAGA EQLHGKEMSY NNYVDTDAAR RAAYDFDEPA VAIIKHANPC GIAVGADVAE
AHRRAHECDP VSAFGGVIAV NRPVSVEMAR QVADVFTEVI VAPSYDEGAV EILQGKKNIR
ILRCADPAEE RSTELRQISG GVLVQVRDHV DATGDDPSTW TLAAGEPASA EVLADLAFAW
TACRAAKSNA ILLAKDGASV GIGMGQVNRV DSCRLAVSRA GDRAAGSVAA SDAFFPFEDG
PQILIDAGVT AIVQPGGSVR DELTVEAAKA AGVTMYFTGT RHFFH