Gene Hoch_3759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3759 
Symbol 
ID8546152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5165405 
End bp5166475 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content73% 
IMG OID646388429 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_003268152 
Protein GI262196943 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.261432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.953602 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCA CCTACAAGGA CGCTGGCGTG GACATCGAAG AAGGTGCCCG CCTGGTCGAC 
GCCATCGCGC CGCTGGCCCG GGCCACCGCC CGCCCCGAAG TGCTCGGCGG CATCGGCGGC
TTCGCCGGCC TGTGTGCCCT GCCCCCGGGC TATCGCCAGC CCATCTTGGT GAGCAGCACC
GACGGCGTCG GCACCAAGCT CAAGTCGGCC CTGGCCACCG GCCGCCACCG CGGCATCGGC
ATCGACCTGG TCGCCATGTC GGTCAACGAC GTCATCGTCA CCGGCGCCGA TCCCCTGCTC
TTCCTCGATT ACTTCGCCAC CAGCCGCCTC GAGCTCGCGG TCGCGCGCGA GGTGGTCGCC
GGCATCGCCG AGGGCTGCAC CCAGGCCGGC TGTGCCCTGG TCGGCGGCGA GACCGCCGAG
ATGCCCGGCA TCTACAGCCC CGGCGACTAC GACGTCGCCG GCTTCTGCGT GGGCGTCGTC
GAGCGCGACC AGATCCCGAG CGCCGATACC CTGCAAGCCG GCGATCTCGT CATCGGCTTG
CCGTCCTCGG GCCTGCACGC CAACGGCCAC TCCCTGGCCC GCAAGCTGCT GCTCGAGCGC
TTCTCCTACG ACGACGCGCC CGCGGCGCTG GCGGGCCAGA CCATCGCCGA CGTGCTGCTG
CAGCCGACCC TGATCTACGC CTGGGCCTTC GCCGCGCTGC GCGAAGCCGG CCTCGCCGCC
CTGGGCGCCG CCCACATCAC CGGCGGCGGC CTGATCGAGA ACCCGCCGCG CCTGCTGCGC
ACCAGCACCG GCGCCGAGCG CGACGATCTC GCCCTGCGCT TCGACACCGA CACCTGGCAG
ATGCCCGCTG TCATGCAGCT CATCGCCGAA GCCGGCGTCG AGGAGGACGA GATGCGGCGC
ACCTTCAACA TGGGCATCGG CATGGTCCTC GTGGTCCGCG CGGCCGATGC CGAGCGCGTG
CTCGCTGTCC TGGGCCGCGC CGAGCAAGCC GCGGGCGAGC GCGCGCCACG CGTCATCGGC
GCCCTCGAAG CCCGGCCCGC GGGCGCCGCC GCGGTGCGGT TCGCGCCATG A
 
Protein sequence
MAITYKDAGV DIEEGARLVD AIAPLARATA RPEVLGGIGG FAGLCALPPG YRQPILVSST 
DGVGTKLKSA LATGRHRGIG IDLVAMSVND VIVTGADPLL FLDYFATSRL ELAVAREVVA
GIAEGCTQAG CALVGGETAE MPGIYSPGDY DVAGFCVGVV ERDQIPSADT LQAGDLVIGL
PSSGLHANGH SLARKLLLER FSYDDAPAAL AGQTIADVLL QPTLIYAWAF AALREAGLAA
LGAAHITGGG LIENPPRLLR TSTGAERDDL ALRFDTDTWQ MPAVMQLIAE AGVEEDEMRR
TFNMGIGMVL VVRAADAERV LAVLGRAEQA AGERAPRVIG ALEARPAGAA AVRFAP