Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1443 |
Symbol | purH |
ID | 7083526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1608874 |
End bp | 1610463 |
Gene Length | 1590 bp |
Protein Length | 529 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643698461 |
Product | bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_002355098 |
Protein GI | 217969864 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTGA CCCAAGCCCT GATCAGCGTC TCCGACAAAC GTGGCGTGCT CGACTTCGCC CGCAAGCTCT CCGCGCTCGG CATCAAGCTG CTGTCGACCG GCGGCACCGC CAGCCTGCTG CGCGAGGCCG GCCTGCCGGT GACCGACGTC TCCGAGCACA CCGGCTTCCC CGAGATGCTG GACGGCCGGG TCAAGACCCT GCACCCGAAG GTGCATGGCG GCATCCTCGC CCGCCGCGAT CTCGCCGAAC ACATGGACAC CATCGCCGCC CACGACATCG GCCGCATCGA CCTGGTGGTG GTCAATCTCT ACCCCTTCCA GCAGACCGTG GCCAAGCCCG ACTGCACGCT GGAAGACGCG ATCGAGAACA TCGACATCGG CGGCCCCACC ATGGTGCGCG CCGCGGCCAA GAACCACGGC AACGAGCAGG GCGGCGTCGG CATCGTCACC GACCCCGAGG ACTACGGCTG CATCATCGAA GAGCTCGAGG CCAACGCCGG CAAGCTCAGC CACAAGACCC GCTTCGCGCT CGCGGTGAAG GCCTTCACCC ACACCGCGCG CTACGACTCG GCGATCTCCA ACTACCTCAC CGCGCTCGTC ACCAACGAGG CCGGCGACGT GTCGCTGCAG ACCTATCCCG AGCGCCTGCA GCTCGCCTTC GACAAGGTGC AGGACCTGCG CTACGGCGAG AACCCGCACC AGACCGCGGC CTTCTACCGC CAGCCCGGCG CGGCCGAGGG CGGCGTGGCC GGCTACACCC AGCTGCAGGG CAAGGAGCTG TCCTACAACA ACATCGCCGA CGCCGACGCG GCCTGGGAAT GCGTGAAGGC CTTCGACGGC TCGGCGGCGG CCTGCGTCAT CGTCAAGCAC GCCAATCCCT GTGGCGTGGC CGTCGCCGCC AGCCCGCTCG AGGCCTACAA GAAGGCCTTC TCCACCGACC CCACCTCGGC CTTCGGCGGC ATCATCGCGT TCAACGGCGA GGTCGACCGT GCCGCGGCCG AGGCCGTTTC GGCACAGTTC CTCGAGGTGC TGATCGCGCC GTCCTACACC GCCGACGCGC TCGAGCTGCT CGCGAGCAAG AAGAACGTGC GCGTGCTCAC CTGCGCGCTC GGACAGCCTG CCGGTGCCTT CGACTACAAG CGCGTCGGTG GCGGCCTGCT GGTGCAGAGC GCCGACGAGG CCCGCATCCA GATCGCGGAC CTCAAGGTCG TCACGAAGCG GGCGCCGACG GAAGCCGAGA TGCGCGACAT GCTCTTCGCC TGGCGCGTGG CCAAGTACGT CAAGTCCAAC GCCATCGTGT ACTGCAAGGA CGGCATGACC ATCGGCGTCG GTGCCGGCCA GATGAGCCGC GTCGACTCGG CGCGCATCGC CAGGATCAAG GCCGAGAACG CCGGTCTGCA GATCGCCGGC TGCGTGGTCG CCTCGGACGC CTTCTTCCCC TTCCGCGACG GCCTCGACGT GCTCGCCCAG GCGGGTGCGA CCGCGGTGAT CCAGCCCGGC GGCTCGATGC GCGACGAAGA GGTGATCGCG GCAGCCAACG AGCAGGACAT CGCCATGGTG TTCACCGGCT TCCGTCACTT CCGTCACTAA
|
Protein sequence | MNVTQALISV SDKRGVLDFA RKLSALGIKL LSTGGTASLL REAGLPVTDV SEHTGFPEML DGRVKTLHPK VHGGILARRD LAEHMDTIAA HDIGRIDLVV VNLYPFQQTV AKPDCTLEDA IENIDIGGPT MVRAAAKNHG NEQGGVGIVT DPEDYGCIIE ELEANAGKLS HKTRFALAVK AFTHTARYDS AISNYLTALV TNEAGDVSLQ TYPERLQLAF DKVQDLRYGE NPHQTAAFYR QPGAAEGGVA GYTQLQGKEL SYNNIADADA AWECVKAFDG SAAACVIVKH ANPCGVAVAA SPLEAYKKAF STDPTSAFGG IIAFNGEVDR AAAEAVSAQF LEVLIAPSYT ADALELLASK KNVRVLTCAL GQPAGAFDYK RVGGGLLVQS ADEARIQIAD LKVVTKRAPT EAEMRDMLFA WRVAKYVKSN AIVYCKDGMT IGVGAGQMSR VDSARIARIK AENAGLQIAG CVVASDAFFP FRDGLDVLAQ AGATAVIQPG GSMRDEEVIA AANEQDIAMV FTGFRHFRH
|
| |