Gene Gdia_1896 details

Gene Information

Locus tag: Gdia_1896
Symbol: purH
ID: 6975319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2110434
End bp: 2112011
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 72%
IMG OID: 643391422
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002276271
Protein GI: 209544042
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
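
The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 2110434
end_bp = 2112011
gene_length_bp = 1578
protein_length_aa = 525


def span_length(start, end):
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Both comparisons hold for the values listed in this record.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

A mismatch in either comparison would point to a different coordinate convention or to a partial or frameshifted CDS rather than to an error in the arithmetic.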


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.0249314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCAGA CGCCCGTGCC GGTCCGCCGC GCCCTGATTT CCGTTTCCGA CAAGGCGGGG 
TTGCTCGACC TCGCCCGCGC CCTGATCGCC CATGGGGCGG AAATCCTCTC GACCGGGGGC
TCGGCCCGTG CGCTGCGCGA GGCCGGACTG AAGGTGACCG AGGTGTCGGA CCATACCGGC
TTTCCCGAAA TTCTCGACGG GCGGGTGAAG ACCCTGGTGC CGCAGATCCA TGGCGGCATC
CTGGGCCGCC GCGACCTGCC GGCCCATCTG GCGCAGATGG ACGAACACGG GATCGCGCCG
ATCGACCTGG TCGCGGTGAA CCTCTACCCG TTCGAGGCCA CGGTGGCCTC CGGCGCGGGG
GAAGAGGACT GCATCGAGAA CATCGATATC GGCGGCCCCG CCCTGATCCG CGCGGCGGCC
AAGAACCACG GCCATGTCGT GGTGCTGACC GATCCGGCGC AGTACGGCGC GGTGATCGAC
GCGCTGGCCC AGGGCGGCAC CACGCTTGCG GCGCGGCGCG CGCTGGCCGG CGCGGCCTAT
GCCCGCACCG CCGCCTATGA TTCCGCCATC GCGGCGTGGT TCGCCGTGCA GCGCGGCGAC
GTGCTGCCGG AACGCCTGGC CGTCGCGGGC CTGCGCCGCG AAAGCCTGCG CTATGGCGAA
AATCCGCACC AGCAGGCGGC CTTCTATGCC GACGGCAGCA GCCGGCCCGG TGTGGCCACC
GCCCGCCAGG TGCAGGGCAA ATCCCTGTCC TACAACAACC TGAACGATAC CGATGCGGCG
TTCGAGGCCG TGGCGGAATT CGACGGCCCG GCGGTGGTGA TCGTCAAGCA CGCCAATCCC
TGCGGCGTCG CCACCGCCGA TACGCTGTCG GCGGCCTGGG ACCTGGCGCT GCGCTGCGAT
CCGGTCTCGG CCTTCGGCGG CATCGTCGCG CTGAACCGCA CGCTGGACGC CGACGCCGCC
GCGCGCATCG CGGCCATCTT CACCGAGGTC ATCGTCGCCC CCGACGCGAC GGAGGAGGCC
CAGGCGATCC TGGCGAAGAA GAAGAACCTG CGCCTGCTGC TGACCGGCGC GATGCCCGAC
CCGTCCGTGG GCGGGGTGGC CATCCGTTCG GTCGCCGGCG GCTTCCTGGC GCAGACCCGC
GACAATGGCC GGATCGTCCC CGCCGGCCTG AAGGTGGTGA CCCGCCGCGC CCCGACCGAG
GCCGAGATGG CGGATCTGAT CTTCGCCTTC CGCGTCGGCA AGCATGTGAA GTCGAACGCC
ATCGTCTATG CCAAGGGCCA GGCGACCGCC GGCATCGGCG CGGGGCAGAT GAGCCGCGTG
GACTCGGCGC GCATCGCCGC GATCAAGGGG GCGGAAGCCG CCCGGGCCGC CGGCCTGGAC
CAGCCGCTGA CGACGGGCAG CGTGGTGGCG TCGGACGCGT TTTTCCCCTT CGCCGACGGG
CTGGAGGCCG CGATCGCGGC CGGCGCCACG GCGGTGATCC AGCCGGGCGG ATCGATCCGC
GATGACGAGG TCATCGCCGC CGCCGACCGG GCGGGCATCG CCATGGTGTT CACAGGTATG
CGCCACTTCC GGCACTGA
 
Protein sequence
MTQTPVPVRR ALISVSDKAG LLDLARALIA HGAEILSTGG SARALREAGL KVTEVSDHTG 
FPEILDGRVK TLVPQIHGGI LGRRDLPAHL AQMDEHGIAP IDLVAVNLYP FEATVASGAG
EEDCIENIDI GGPALIRAAA KNHGHVVVLT DPAQYGAVID ALAQGGTTLA ARRALAGAAY
ARTAAYDSAI AAWFAVQRGD VLPERLAVAG LRRESLRYGE NPHQQAAFYA DGSSRPGVAT
ARQVQGKSLS YNNLNDTDAA FEAVAEFDGP AVVIVKHANP CGVATADTLS AAWDLALRCD
PVSAFGGIVA LNRTLDADAA ARIAAIFTEV IVAPDATEEA QAILAKKKNL RLLLTGAMPD
PSVGGVAIRS VAGGFLAQTR DNGRIVPAGL KVVTRRAPTE AEMADLIFAF RVGKHVKSNA
IVYAKGQATA GIGAGQMSRV DSARIAAIKG AEAARAAGLD QPLTTGSVVA SDAFFPFADG
LEAAIAAGAT AVIQPGGSIR DDEVIAAADR AGIAMVFTGM RHFRH
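
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join those blocks, compute a GC fraction, and translate the gene sequence. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `join_blocks`, `gc_fraction`, and `translate` are hypothetical and written for this illustration only.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence listed above.
first_line = join_blocks(
    "ATGACCCAGA CGCCCGTGCC GGTCCGCCGC GCCCTGATTT CCGTTTCCGA CAAGGCGGGG"
)
print(translate(first_line))  # prints MTQTPVPVRRALISVSDKAG
```

Applying `join_blocks` to the full gene sequence and passing the result to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and GC content.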