Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1896 |
Symbol | purH |
ID | 6975319 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 2110434 |
End bp | 2112011 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643391422 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002276271 |
Protein GI | 209544042 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0249314 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGA CGCCCGTGCC GGTCCGCCGC GCCCTGATTT CCGTTTCCGA CAAGGCGGGG TTGCTCGACC TCGCCCGCGC CCTGATCGCC CATGGGGCGG AAATCCTCTC GACCGGGGGC TCGGCCCGTG CGCTGCGCGA GGCCGGACTG AAGGTGACCG AGGTGTCGGA CCATACCGGC TTTCCCGAAA TTCTCGACGG GCGGGTGAAG ACCCTGGTGC CGCAGATCCA TGGCGGCATC CTGGGCCGCC GCGACCTGCC GGCCCATCTG GCGCAGATGG ACGAACACGG GATCGCGCCG ATCGACCTGG TCGCGGTGAA CCTCTACCCG TTCGAGGCCA CGGTGGCCTC CGGCGCGGGG GAAGAGGACT GCATCGAGAA CATCGATATC GGCGGCCCCG CCCTGATCCG CGCGGCGGCC AAGAACCACG GCCATGTCGT GGTGCTGACC GATCCGGCGC AGTACGGCGC GGTGATCGAC GCGCTGGCCC AGGGCGGCAC CACGCTTGCG GCGCGGCGCG CGCTGGCCGG CGCGGCCTAT GCCCGCACCG CCGCCTATGA TTCCGCCATC GCGGCGTGGT TCGCCGTGCA GCGCGGCGAC GTGCTGCCGG AACGCCTGGC CGTCGCGGGC CTGCGCCGCG AAAGCCTGCG CTATGGCGAA AATCCGCACC AGCAGGCGGC CTTCTATGCC GACGGCAGCA GCCGGCCCGG TGTGGCCACC GCCCGCCAGG TGCAGGGCAA ATCCCTGTCC TACAACAACC TGAACGATAC CGATGCGGCG TTCGAGGCCG TGGCGGAATT CGACGGCCCG GCGGTGGTGA TCGTCAAGCA CGCCAATCCC TGCGGCGTCG CCACCGCCGA TACGCTGTCG GCGGCCTGGG ACCTGGCGCT GCGCTGCGAT CCGGTCTCGG CCTTCGGCGG CATCGTCGCG CTGAACCGCA CGCTGGACGC CGACGCCGCC GCGCGCATCG CGGCCATCTT CACCGAGGTC ATCGTCGCCC CCGACGCGAC GGAGGAGGCC CAGGCGATCC TGGCGAAGAA GAAGAACCTG CGCCTGCTGC TGACCGGCGC GATGCCCGAC CCGTCCGTGG GCGGGGTGGC CATCCGTTCG GTCGCCGGCG GCTTCCTGGC GCAGACCCGC GACAATGGCC GGATCGTCCC CGCCGGCCTG AAGGTGGTGA CCCGCCGCGC CCCGACCGAG GCCGAGATGG CGGATCTGAT CTTCGCCTTC CGCGTCGGCA AGCATGTGAA GTCGAACGCC ATCGTCTATG CCAAGGGCCA GGCGACCGCC GGCATCGGCG CGGGGCAGAT GAGCCGCGTG GACTCGGCGC GCATCGCCGC GATCAAGGGG GCGGAAGCCG CCCGGGCCGC CGGCCTGGAC CAGCCGCTGA CGACGGGCAG CGTGGTGGCG TCGGACGCGT TTTTCCCCTT CGCCGACGGG CTGGAGGCCG CGATCGCGGC CGGCGCCACG GCGGTGATCC AGCCGGGCGG ATCGATCCGC GATGACGAGG TCATCGCCGC CGCCGACCGG GCGGGCATCG CCATGGTGTT CACAGGTATG CGCCACTTCC GGCACTGA
|
Protein sequence | MTQTPVPVRR ALISVSDKAG LLDLARALIA HGAEILSTGG SARALREAGL KVTEVSDHTG FPEILDGRVK TLVPQIHGGI LGRRDLPAHL AQMDEHGIAP IDLVAVNLYP FEATVASGAG EEDCIENIDI GGPALIRAAA KNHGHVVVLT DPAQYGAVID ALAQGGTTLA ARRALAGAAY ARTAAYDSAI AAWFAVQRGD VLPERLAVAG LRRESLRYGE NPHQQAAFYA DGSSRPGVAT ARQVQGKSLS YNNLNDTDAA FEAVAEFDGP AVVIVKHANP CGVATADTLS AAWDLALRCD PVSAFGGIVA LNRTLDADAA ARIAAIFTEV IVAPDATEEA QAILAKKKNL RLLLTGAMPD PSVGGVAIRS VAGGFLAQTR DNGRIVPAGL KVVTRRAPTE AEMADLIFAF RVGKHVKSNA IVYAKGQATA GIGAGQMSRV DSARIAAIKG AEAARAAGLD QPLTTGSVVA SDAFFPFADG LEAAIAAGAT AVIQPGGSIR DDEVIAAADR AGIAMVFTGM RHFRH
|
| |