Gene Hoch_4917 details


Gene Information

Locus tag: Hoch_4917
Symbol:
ID: 8547324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 6792323
End bp: 6793963
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 71%
IMG OID: 646389590
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_003269299
Protein GI: 262198090
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
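
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 6792323
END_BP = 6793963
GENE_LENGTH_BP = 1641
PROTEIN_LENGTH_AA = 546


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 6793963 - 6792323 + 1 gives 1641 bp, and 1641 / 3 - 1 gives 546 aa, matching the recorded values.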


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.421893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATACAGA TCCAACGAGC AATTCTCAGC GTCTCGGACA AACGAGGCCT CGTCGCCTTG 
GCGCGCGGCC TCAGCGCCCA CGGCGCAACC CTGCTCTCCA CCGGCGGAAC CGCGCGCGCG
CTGCGCGAAG CCGGCATCGA GCCGCTCTCG GTCTCCGAGT ACACGGGCGC GCCCGAGATC
TTCGGCGGCC GCGTCAAGAC GCTGCACCCC AAGATCCACG GCGGCCTGCT CGCGCGTCCG
GCGCACGAAG AGGACTCACG TCAGATGGCC GAGCACGACA TCGCGCCCAT CGATCTGGTC
GTGGTCAACC TCTACCCCTT CGAGGCCACG GTGGCGCGCG AGGGCGTGAC CACGGCCGAG
GCCATCGAAA ACATCGACAT CGGTGGTCCG ACGATGATCC GCGCCGCGGC CAAGAACCAT
CAGCGCGTGG CCGTCGTCGT CGATCCCGAT GACTACGCGG AGGTGCTGCG CGAGCTCGAC
GAGAGCGGCG GCGCGCTGTC GCACGAGACC CGCGCGCGCC TGGCCCGCAA AGCCTTCGCC
CACACCGCCG CCTACGACAC CGCGATCTCG GCCTATCTCG CCAGCTCCTC GCTCGCCGGC
GACAGCGCGC CCGCCGCGTC GCCGCCCGCC ACCGCCGCCC ACAGCGCGGA CGGCGCCGAC
GAGCCCGCGA CCTTCCCCGC GTCGCTCACG GTGACCTGGA AACGCACGCA GACCCTGCGC
TACGGCGAAA ATCCCCACCA ACGCGCCGCC TTCTACGAAG AGATGATTCA ACCGCTCGGC
GTCCGCGGCC ACCGCCCGCG CCTGCCCGCG GCCGACATCC TCCAGGGCAA AGCGCTCTCC
TATAACAACG TCCTCGACAC CGACGCCGCC CTGGCCTGCT GCCTCGAGTT CACGGCCCCC
TGCGCCGTGG TCGTCAAGCA CACCAACCCC TGCGGCGTGG CCCTGGCCGA CGACATCGCG
AGCGCCTACG AGCAGGCCCG CGCCACCGAT CCCACCTCGT CCTTTGGCGG CATCGTCGCG
GTCAACCGCG AGGTCTCCGA GGAGCTGGCC GAGCTGCTCG CCGAGACCTT CCTCGAGTGC
GTGATCGCGC CCGGCTTCTC CGAGGCCGCG CGCGCGCGCC TGGCCAAGAA GAAGAACCTG
CGCCTGCTGG CCACCGGCCC GTGGCTGGCC TCCGATTCCG ACCTCGCCTG GAGCCTGCGC
TCGGTCGCCG GCGGGGTTCT CGTCCAGGAG GCCGATTTCA CCCTCGCGGC TGCGCGCAAC
GGCAAGGTGG TGAGCGCGCG CGCCCCCGAC GAGGCCGAGC TCGACACCCT CGACTTCGCC
TGGCGCGTGG GCAAGCACGT CAAATCCAAC GCCATCGTCT TCTGCGCCGG CACCCGCACC
CTGGGCGTCG GCGCCGGGCA GATGAGCCGC GTGGACGCCG CCCGCATCGC CCGCGACAAA
GCGCTCGGCG ACCTGAGCGG CAGCTGCGTG GCCTCGGACG CCTTCTTTCC CTTCCGCGAC
GGCGTCGACG CGCTCGCCGA AGCCGGCGCC CGCGCCGTCA TCCAGCCCGG CGGTTCGGTG
CGCGACGAGG AGGTCATCGC CGCGGCCGAC GAACACGGCA TGGCCATGGT GCTCACGGGA
ATGCGACACT TCCGCCACTG A
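
The gene sequence is listed in space-separated blocks of 10 bases. A minimal sketch of how such a listing could be joined into one string and its GC content recomputed is shown here; the helper names are hypothetical, and the record's GC content figure is presumably rounded, so an exact match should not be expected.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence above; paste the full listing to
    # recompute the value for the whole gene.
    first_line = ("GTGATACAGA TCCAACGAGC AATTCTCAGC "
                  "GTCTCGGACA AACGAGGCCT CGTCGCCTTG")
    seq = clean_sequence(first_line)
    print(len(seq), round(gc_percent(seq), 1))
```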
 
Protein sequence
MIQIQRAILS VSDKRGLVAL ARGLSAHGAT LLSTGGTARA LREAGIEPLS VSEYTGAPEI 
FGGRVKTLHP KIHGGLLARP AHEEDSRQMA EHDIAPIDLV VVNLYPFEAT VAREGVTTAE
AIENIDIGGP TMIRAAAKNH QRVAVVVDPD DYAEVLRELD ESGGALSHET RARLARKAFA
HTAAYDTAIS AYLASSSLAG DSAPAASPPA TAAHSADGAD EPATFPASLT VTWKRTQTLR
YGENPHQRAA FYEEMIQPLG VRGHRPRLPA ADILQGKALS YNNVLDTDAA LACCLEFTAP
CAVVVKHTNP CGVALADDIA SAYEQARATD PTSSFGGIVA VNREVSEELA ELLAETFLEC
VIAPGFSEAA RARLAKKKNL RLLATGPWLA SDSDLAWSLR SVAGGVLVQE ADFTLAAARN
GKVVSARAPD EAELDTLDFA WRVGKHVKSN AIVFCAGTRT LGVGAGQMSR VDAARIARDK
ALGDLSGSCV ASDAFFPFRD GVDALAEAGA RAVIQPGGSV RDEEVIAAAD EHGMAMVLTG
MRHFRH
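
The protein sequence can be related back to the gene sequence by translating it codon by codon. The following is a sketch rather than a reference implementation: it assumes that translation table 11 uses the standard codon-to-amino-acid assignments, and that an alternative start codon such as the leading GTG here is read as methionine. The set of start codons in the code is a simplification (table 11 permits others), and the function name is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the most common bacterial start codons are handled.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS listing, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in COMMON_STARTS:
            aa = "M"  # an initiator codon is read as methionine
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = ("GTGATACAGA TCCAACGAGC AATTCTCAGC "
                  "GTCTCGGACA AACGAGGCCT CGTCGCCTTG")
    print(translate_cds(first_line))
```

Applied to the first line of the gene sequence, this yields MIQIQRAILSVSDKRGLVAL, the first 20 residues of the protein sequence. The final line of the gene sequence (ATGCGACACT TCCGCCACTG A) translates to MRHFRH followed by the TGA stop codon.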