Gene Tfu_2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_2572 
SymbolpurH 
ID3581499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp3030555 
End bp3032123 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content69% 
IMG OID637686288 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_290628 
Protein GI72162971 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACACAGC AGGCCATTCG GCGTGCGTTG ATCAGCGTGT ACGACAAGAC CGGGGTCGAG 
GAGCTAGGAC GCGGCCTCGC TGAAGCAGGG GTGGAGATCG TCTCCACCGG TTCCACTGCT
GCCCGGCTGA CCGCAGCCGG GGTCGCAGTC ACCCCGGTGG AATCCGTCAC CGGTTTTCCC
GAGTGTTTCG AAGGCCGGGT GAAGACCCTG CACCCGAAGG TGCACGCGGG ACTGCTCGCT
GACCGGACCA AGGCCGAGCA CCGGGCGCAG CTCGCCGAAC TCGACATCGC GCCGTTCGAC
CTGGTCGTGG TCAACCTCTA CCCTTTCGCG GACACGGTTG CGTCCGGCGC TTCCCCTGAG
GAGTGCATCG AGAAGATCGA CATCGGCGGT CCGGCGATGG TGCGGGCCGC GGCAAAGAAC
CACGCGAGTG TCGCGGTGGT CGTGGACCCG GCCCGCTACG GCGACGTCCT GAAAGCGGTG
CGCAGCGGCG GGTTCACCCT GGAGGAGCGC AAGCGGTTGG CGGCGGCGGC TTTCGCGCAC
ACGGCGAGCT ACGACGCTGC CGTAGCGGCA TGGTTCGCCG AGGCCTACGC CCCCGACGAG
GTGGCGAAGG ACTCGGGGTG GCCCGAGTTC ACGGCTGTCA CCTACCAGCG GCAGACGACG
CTGCGCTACG GCGAGAACCC CCACCAGAGC GCAGCACTGT ACCGGCCGGC GAGCGCCAGC
GGCGAGGGCC TGGCCGGGGC GCGGCAGTTG CACGGCAAGG AGATGTCGTA CAACAACTAC
GTGGACAGCG ACGCTGCCCT GCGCGCCGCC TACGACTTCA CTGAGCCGTG CGTGGCGATT
ATCAAACATG CCAACCCGTG CGGGATCGCT GTAGGAGAAA ATATCGCCGA AGCACATCGC
AAGGCGCACG CCTGCGATCC GGTGTCGGCG TTTGGTGGGG TGATCGCCGC TAACCGCGTT
GTTGACGAGG CCATGGCCGC GCAGGTCGCC GAGGTGTTCA CCGAGGTTGT CGTGGCGCCC
GGGTTCAGCC CTGAGGCCGT GGAGATCCTC ACGCGCAAGA AGAACATCCG CCTGCTGGAG
GTGGCGGAGC CGGACCGCGG GGCCCGGCGG GAGATGCGGC AGATCAGCGG CGGGCTGCTG
ATGCAGGACG CCGACCTGGT CGACGCGCCC GGGGATGATC CTGCGCAGTG GCAGTTGCGG
GCCGGACCAG CCGCGGATGA GGCGACCCTG GCCGATCTGG CCTTCGCGTG GCGTGCGGTG
CGGGCCGTGA AATCCAACGC GATCCTGCTG GCTGCCGACC GGGCCACGGT GGGCGTGGGC
ATGGGCCAGG TGAACCGGGT GGACTCGGCT CGCCTCGCGG TGACACGCGC CGGGGAGCGG
GTGAAGGGCT CCGTAGCCGC GAGTGACGCG TTCTTCCCCT TCCCTGACGG ACTGGAAGTG
CTGGCCGAGG CAGGCGTGCG GGCGATCGTG CAGCCGGGAG GTTCGGTGCG GGACGACGAA
GTCATCGCCG CTGCCGAGCG TGCCGGGGTG ACCTTGTACT TCACCGGAAC CCGGCACTTC
TTCCACTGA
 
Protein sequence
MTQQAIRRAL ISVYDKTGVE ELGRGLAEAG VEIVSTGSTA ARLTAAGVAV TPVESVTGFP 
ECFEGRVKTL HPKVHAGLLA DRTKAEHRAQ LAELDIAPFD LVVVNLYPFA DTVASGASPE
ECIEKIDIGG PAMVRAAAKN HASVAVVVDP ARYGDVLKAV RSGGFTLEER KRLAAAAFAH
TASYDAAVAA WFAEAYAPDE VAKDSGWPEF TAVTYQRQTT LRYGENPHQS AALYRPASAS
GEGLAGARQL HGKEMSYNNY VDSDAALRAA YDFTEPCVAI IKHANPCGIA VGENIAEAHR
KAHACDPVSA FGGVIAANRV VDEAMAAQVA EVFTEVVVAP GFSPEAVEIL TRKKNIRLLE
VAEPDRGARR EMRQISGGLL MQDADLVDAP GDDPAQWQLR AGPAADEATL ADLAFAWRAV
RAVKSNAILL AADRATVGVG MGQVNRVDSA RLAVTRAGER VKGSVAASDA FFPFPDGLEV
LAEAGVRAIV QPGGSVRDDE VIAAAERAGV TLYFTGTRHF FH