Gene TBFG_10975 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10975 
SymbolpurH 
ID5221649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp1072028 
End bp1073599 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content67% 
IMG OID640605726 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001286920 
Protein GI148822166 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones282 
Plasmid unclonability p-value0.306281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones212 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACCG ACGACGGAAG ACGGCCGATC CGCCGTGCGC TGATCAGCGT GTACGACAAG 
ACCGGGCTGG TAGACCTGGC ACAGGGCCTG AGCGCGGCCG GCGTCGAGAT CATCTCGACT
GGGTCAACGG CCAAGACCAT TGCCGACACC GGGATTCCGG TGACCCCCGT GGAGCAGCTG
ACCGGCTTTC CCGAGGTGCT CGATGGCCGG GTCAAGACAC TGCACCCGCG AGTGCATGCC
GGGCTGCTGG CTGACCTGCG CAAGTCCGAG CACGCCGCGG CCCTCGAGCA ACTCGGGATC
GAGGCTTTCG AACTCGTTGT AGTCAACTTG TATCCGTTCA GCCAGACCGT CGAATCCGGC
GCCAGTGTCG ACGACTGCGT CGAGCAGATT GATATCGGCG GGCCGGCGAT GGTGCGGGCC
GCCGCCAAAA ACCATCCCAG CGCGGCGGTG GTCACCGATC CGCTTGGGTA CCATGGCGTG
CTTGCCGCAC TGCGCGCCGG CGGATTCACC CTCGCCGAGC GCAAAAGGCT GGCGTCGTTA
GCGTTTCAGC ATATAGCCGA GTACGACATC GCCGTCGCGA GCTGGATGCA ACAGACCCTA
GCGCCCGAAC ATCCTGTTGC CGCCTTTCCG CAGTGGTTCG GCCGAAGCTG GCGCCGCGTG
GCGATGCTGC GCTACGGCGA GAACCCGCAC CAACAGGCCG CTCTCTACGG CGACCCCACC
GCCTGGCCGG GGCTGGCCCA GGCCGAGCAA CTGCACGGAA AAGACATGTC CTACAACAAC
TTCACCGATG CGGACGCAGC CTGGCGGGCC GCCTTCGACC ACGAACAAAC GTGCGTGGCG
ATCATCAAGC ACGCCAACCC GTGCGGCATC GCAATCTCGT CCGTTTCGGT CGCCGACGCG
CATCGCAAGG CTCACGAATG CGATCCGCTG AGCGCCTACG GCGGGGTCAT CGCCGCCAAT
ACCGAGGTCA GTGTCGAAAT GGCCGAGTAT GTGAGCACCA TCTTCACCGA AGTCATCGTC
GCGCCTGGCT ACGCCCCCGG GGCCCTCGAT GTGCTGGCCC GCAAGAAGAA CATCCGGGTG
CTGGTAGCCG CCGAGCCACT GGCCGGTGGC AGCGAGTTGC GTCCGATCAG CGGTGGACTG
CTGATACAGC AGAGCGACCA GCTTGACGCG CACGGTGACA ACCCGGCGAA CTGGACCTTG
GCGACCGGGT CACCTGCGGA CCCCGCGACG CTGACCGACC TGGTCTTCGC GTGGCGAGCC
TGCCGTGCGG TCAAGTCGAA CGCGATAGTG ATAGCTGCCG ACGGCGCCAC CGTCGGCGTC
GGGATGGGTC AGGTCAACCG TGTCGACGCC GCCCGGTTGG CCGTCGAACG CGGCGGCGAG
CGGGTTCGCG GCGCGGTGGC AGCCTCGGAT GCGTTCTTCC CCTTTCCCGA CGGCCTGGAA
ACGTTGGCCG CCGCGGGGGT CACCGCGGTC GTCCACCCCG GTGGCTCGGT GCGCGACGAG
GAAGTGACCG AAGCGGCGGC CAAGGCCGGT GTCACCCTAT ATCTCACCGG GGCGCGGCAC
TTCGCGCACT GA
 
Protein sequence
MSTDDGRRPI RRALISVYDK TGLVDLAQGL SAAGVEIIST GSTAKTIADT GIPVTPVEQL 
TGFPEVLDGR VKTLHPRVHA GLLADLRKSE HAAALEQLGI EAFELVVVNL YPFSQTVESG
ASVDDCVEQI DIGGPAMVRA AAKNHPSAAV VTDPLGYHGV LAALRAGGFT LAERKRLASL
AFQHIAEYDI AVASWMQQTL APEHPVAAFP QWFGRSWRRV AMLRYGENPH QQAALYGDPT
AWPGLAQAEQ LHGKDMSYNN FTDADAAWRA AFDHEQTCVA IIKHANPCGI AISSVSVADA
HRKAHECDPL SAYGGVIAAN TEVSVEMAEY VSTIFTEVIV APGYAPGALD VLARKKNIRV
LVAAEPLAGG SELRPISGGL LIQQSDQLDA HGDNPANWTL ATGSPADPAT LTDLVFAWRA
CRAVKSNAIV IAADGATVGV GMGQVNRVDA ARLAVERGGE RVRGAVAASD AFFPFPDGLE
TLAAAGVTAV VHPGGSVRDE EVTEAAAKAG VTLYLTGARH FAH