Gene Oant_1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOant_1087 
SymbolpurH 
ID5379484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOchrobactrum anthropi ATCC 49188 
KingdomBacteria 
Replicon accessionNC_009667 
Strand
Start bp1135850 
End bp1137466 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content61% 
IMG OID640833739 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001369636 
Protein GI153008421 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTCA GCTCCAAGCA TATTCCCGCT CCCGATCTTC ATCGGGTGCG CCGCGCCCTC 
CTTTCCGTTT CCGACAAAAC CGGCCTGATC GATTTCGCCA AAGCGCTTCA CGCGCAAGGT
GTCGAAATCC TTTCGACCGG CGGCACCGCC AAGTCGATTG CCGCTGAAGG CATTCCGGTG
AAGGATGTAT CCGAAGTAAC CGGCTTCCCG GAAATCATGG ACGGACGCGT CAAGACGCTG
CATCCGGCAG TCCATGGCGG CCTGCTCGCC GTGCGCAACG ATCGGGAACA CGTCGCGGCC
ATGGAAGAAC ACGGCATCGG CGGCATCGAT CTTGCCGTCA TCAATCTCTA TCCGTTCGAG
GAAGTCCGCT TCAAGGGCGG CGACTACGAC ACGACCGTCG AAAACATCGA CATTGGTGGC
CCGGCTATGA TCCGCGCTTC CGCCAAGAAC CACGCCTATG TCGCAACGGT CGTCGATCCT
GCCGATTATG CCGACGTCGT CGCTGAACTG GAAAAGCATG CGGGCTCCCT GCCGCTTGCC
TTCCGCAAGA AGCTTGCCGC CAAGGCCTTC TCGCGCACGG CAGCCTATGA CGCAGCCATT
TCCAACTGGT TTGCCGAAGC GATCAACGAA GAAACTCCGG TCTATCGCTC GGTCGCAGGC
AAGCTGCATT CCGTCATGCG CTACGGAGAA AACCCGCACC AGACGGCTGG CTTTTATCTC
ACTGGCGAGA AGCGCCCCGG TGTTGCCACC GCGACCCAGC TTCAGGGCAA GCAGCTCTCC
TACAACAACA TCAACGACAC CGATGCGGCT TTCGAACTCG TCGCAGAATT CGATCCTGCC
CGCACCGCAG CCGTCGCCAT CATCAAGCAC GCCAATCCTT GCGGCGTTGC AGAAGCAGCC
ACCATCAAGG AAGCTTATTT GAAGGCACTC GCCTGCGATC CGGTTTCGGC GTTCGGCGGT
ATTGTTGCGC TCAATAAAAC GCTCGATGAG GAAGCCGCTG AAGAGATCGT AAAGATCTTC
ACCGAAGTCA TTATCGCTCC GGATGCCACC GAAGGTGCAC AGGCCATCGT TGCTGCCAAG
AAGAACCTCC GTCTGCTCGT CACCGGCGGC CTGCCGGACC CGCGCGCCAA GGGCATCGCC
GCCAAGACAG TCGCCGGTGG TTTGCTGGTC CAGTCGCGCG ACAATGGTGT GGTCGACGAT
CTCGATCTCA AGGTCGTCAC CAAGCGTGCG CCGACCGAAG CCGAACTCAA CGATATGAAA
TTCGCCTTCC GCGTCGGCAA GCATGTGAAG TCGAACGCCA TCGTCTATGT GAAGGACGGC
GCAACGGTCG GCATCGGCGC AGGCCAGATG AGCCGCGTGG ATTCAGCCCG CATCGCAGCC
CGCAAGGCTG AAGACGCGGC AGAAGCCGCC GGTCTTGCAG AACCGCTCAC CAAGGGCTGC
GTGGTCGCTT CCGACGCGTT CTTCCCGTTT GCCGACGGTC TGCTTTCCGC CGTTCAGGCC
GGTGCAACCG CGGTCATCCA GCCGGGAGGT TCCATGCGGG ACGATGAAGT GATCGCCGCT
GCCGACGAAC ATGGCATCGC CATGGTCATG ACGGGGATGC GTCACTTCCG CCATTAG
 
Protein sequence
MAVSSKHIPA PDLHRVRRAL LSVSDKTGLI DFAKALHAQG VEILSTGGTA KSIAAEGIPV 
KDVSEVTGFP EIMDGRVKTL HPAVHGGLLA VRNDREHVAA MEEHGIGGID LAVINLYPFE
EVRFKGGDYD TTVENIDIGG PAMIRASAKN HAYVATVVDP ADYADVVAEL EKHAGSLPLA
FRKKLAAKAF SRTAAYDAAI SNWFAEAINE ETPVYRSVAG KLHSVMRYGE NPHQTAGFYL
TGEKRPGVAT ATQLQGKQLS YNNINDTDAA FELVAEFDPA RTAAVAIIKH ANPCGVAEAA
TIKEAYLKAL ACDPVSAFGG IVALNKTLDE EAAEEIVKIF TEVIIAPDAT EGAQAIVAAK
KNLRLLVTGG LPDPRAKGIA AKTVAGGLLV QSRDNGVVDD LDLKVVTKRA PTEAELNDMK
FAFRVGKHVK SNAIVYVKDG ATVGIGAGQM SRVDSARIAA RKAEDAAEAA GLAEPLTKGC
VVASDAFFPF ADGLLSAVQA GATAVIQPGG SMRDDEVIAA ADEHGIAMVM TGMRHFRH