Gene Tmz1t_1443 details

Gene Information

Locus tag: Tmz1t_1443
Symbol: purH
ID: 7083526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 1608874
End bp: 1610463
Gene Length: 1590 bp
Protein Length: 529 aa
Translation table: 11
GC content: 68%
IMG OID: 643698461
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002355098
Protein GI: 217969864
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
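
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length(1608874, 1610463)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1590 529
```

Both results agree with the Gene Length and Protein Length fields of the record.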


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGTGA CCCAAGCCCT GATCAGCGTC TCCGACAAAC GTGGCGTGCT CGACTTCGCC 
CGCAAGCTCT CCGCGCTCGG CATCAAGCTG CTGTCGACCG GCGGCACCGC CAGCCTGCTG
CGCGAGGCCG GCCTGCCGGT GACCGACGTC TCCGAGCACA CCGGCTTCCC CGAGATGCTG
GACGGCCGGG TCAAGACCCT GCACCCGAAG GTGCATGGCG GCATCCTCGC CCGCCGCGAT
CTCGCCGAAC ACATGGACAC CATCGCCGCC CACGACATCG GCCGCATCGA CCTGGTGGTG
GTCAATCTCT ACCCCTTCCA GCAGACCGTG GCCAAGCCCG ACTGCACGCT GGAAGACGCG
ATCGAGAACA TCGACATCGG CGGCCCCACC ATGGTGCGCG CCGCGGCCAA GAACCACGGC
AACGAGCAGG GCGGCGTCGG CATCGTCACC GACCCCGAGG ACTACGGCTG CATCATCGAA
GAGCTCGAGG CCAACGCCGG CAAGCTCAGC CACAAGACCC GCTTCGCGCT CGCGGTGAAG
GCCTTCACCC ACACCGCGCG CTACGACTCG GCGATCTCCA ACTACCTCAC CGCGCTCGTC
ACCAACGAGG CCGGCGACGT GTCGCTGCAG ACCTATCCCG AGCGCCTGCA GCTCGCCTTC
GACAAGGTGC AGGACCTGCG CTACGGCGAG AACCCGCACC AGACCGCGGC CTTCTACCGC
CAGCCCGGCG CGGCCGAGGG CGGCGTGGCC GGCTACACCC AGCTGCAGGG CAAGGAGCTG
TCCTACAACA ACATCGCCGA CGCCGACGCG GCCTGGGAAT GCGTGAAGGC CTTCGACGGC
TCGGCGGCGG CCTGCGTCAT CGTCAAGCAC GCCAATCCCT GTGGCGTGGC CGTCGCCGCC
AGCCCGCTCG AGGCCTACAA GAAGGCCTTC TCCACCGACC CCACCTCGGC CTTCGGCGGC
ATCATCGCGT TCAACGGCGA GGTCGACCGT GCCGCGGCCG AGGCCGTTTC GGCACAGTTC
CTCGAGGTGC TGATCGCGCC GTCCTACACC GCCGACGCGC TCGAGCTGCT CGCGAGCAAG
AAGAACGTGC GCGTGCTCAC CTGCGCGCTC GGACAGCCTG CCGGTGCCTT CGACTACAAG
CGCGTCGGTG GCGGCCTGCT GGTGCAGAGC GCCGACGAGG CCCGCATCCA GATCGCGGAC
CTCAAGGTCG TCACGAAGCG GGCGCCGACG GAAGCCGAGA TGCGCGACAT GCTCTTCGCC
TGGCGCGTGG CCAAGTACGT CAAGTCCAAC GCCATCGTGT ACTGCAAGGA CGGCATGACC
ATCGGCGTCG GTGCCGGCCA GATGAGCCGC GTCGACTCGG CGCGCATCGC CAGGATCAAG
GCCGAGAACG CCGGTCTGCA GATCGCCGGC TGCGTGGTCG CCTCGGACGC CTTCTTCCCC
TTCCGCGACG GCCTCGACGT GCTCGCCCAG GCGGGTGCGA CCGCGGTGAT CCAGCCCGGC
GGCTCGATGC GCGACGAAGA GGTGATCGCG GCAGCCAACG AGCAGGACAT CGCCATGGTG
TTCACCGGCT TCCGTCACTT CCGTCACTAA
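
The record reports a GC content of 68%. As a rough illustration of how such a figure can be derived, the sketch below computes the fraction of G and C bases in a sequence string. It assumes the reported value refers to the gene sequence shown here, which the record does not state explicitly. `gc_content` is a hypothetical helper name. Whitespace between the 10-base blocks is ignored.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of a sequence.

    Whitespace and any non-ACGT characters are skipped, so the blocked
    layout used in this record can be pasted in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Example on the final line of the gene sequence only (a fragment, not the
# whole gene); pass the full sequence to compare against the record.
fragment = "TTCACCGGCT TCCGTCACTT CCGTCACTAA"
print(f"{gc_content(fragment):.1%}")
```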
 
Protein sequence
MNVTQALISV SDKRGVLDFA RKLSALGIKL LSTGGTASLL REAGLPVTDV SEHTGFPEML 
DGRVKTLHPK VHGGILARRD LAEHMDTIAA HDIGRIDLVV VNLYPFQQTV AKPDCTLEDA
IENIDIGGPT MVRAAAKNHG NEQGGVGIVT DPEDYGCIIE ELEANAGKLS HKTRFALAVK
AFTHTARYDS AISNYLTALV TNEAGDVSLQ TYPERLQLAF DKVQDLRYGE NPHQTAAFYR
QPGAAEGGVA GYTQLQGKEL SYNNIADADA AWECVKAFDG SAAACVIVKH ANPCGVAVAA
SPLEAYKKAF STDPTSAFGG IIAFNGEVDR AAAEAVSAQF LEVLIAPSYT ADALELLASK
KNVRVLTCAL GQPAGAFDYK RVGGGLLVQS ADEARIQIAD LKVVTKRAPT EAEMRDMLFA
WRVAKYVKSN AIVYCKDGMT IGVGAGQMSR VDSARIARIK AENAGLQIAG CVVASDAFFP
FRDGLDVLAQ AGATAVIQPG GSMRDEEVIA AANEQDIAMV FTGFRHFRH
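
The protein sequence follows from the gene sequence by translation with table 11. A minimal sketch of that step is given below. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in the start codons it permits. Alternative start codons are not handled, which is an acceptable simplification here because this gene starts with ATG. `CODON_TABLE` and `translate` are hypothetical names chosen for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the blocked layout is accepted.
    Alternative start codons (e.g. GTG, TTG) are NOT converted to M.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
print(translate("ATGAACGTGA CCCAAGCCCT GATCAGCGTC TCCGACAAAC GTGGCGTGCT CGACTTCGCC"))
# MNVTQALISVSDKRGVLDFA
print(translate("TTCACCGGCT TCCGTCACTT CCGTCACTAA"))
# FTGFRHFRH
```

The two outputs match the first 20 and the last 9 residues of the protein sequence above. The second fragment is in frame because the preceding lines contain a multiple of three bases.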