Gene Acel_0384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_0384 
SymbolpurH 
ID4485795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp397253 
End bp398848 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content66% 
IMG OID639729151 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_872144 
Protein GI117927593 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGT TCGATATTCC CGATACCGCT GAGGCGGGCC GGCGCCCGAT CCGCCGGGCG 
CTGGTCTCGG TGTATGACAA GACCTCCCTC GTTGAATTGG GCCGCGGTCT CGCGGCGGCC
GGAGTCGAAA TCGTCTCAAC GGGAAGCACG GCGAGTGTCC TTGAAGAGGC AGGTGTTCCC
GTCGTCCGGG TCGAAACGGT CACCGGATTT CCGGAATGCC TCGACGGCCG CGTCAAGACG
CTGCATCCCG CGATTCATGC CGGGCTCCTT GCGGATGTCA CCCGGCCGGA ACACGCTCGG
CAACTCAGTG ACCTCGGTGT GCGGCCCTTT GACCTGCTCG TGGTCAACCT CTACCCGTTC
TCCGAGACCG TGGCGCGGGG AGCGTCAGCT GCGGAGGTCA TCGACCAGAT CGATATCGGC
GGTCCGGCGA TGGTCCGCGC CGCGGCGAAG AATCACCACT GTGTCGCCGT CATCACCTCA
CCCGCGCGCT ATCAGGACGT CTTGGATGCC GTGCATCACG GCGGTTTCAC GGACGCCGAG
CGCCGGGCGC TCGCCGTTGA GGCGTTCGTG CACACGGCGT CCTATGACAT CGCCGTCGCG
ACCTGGATGG GTGGTTCGTA CACGCCGACA GATGAGGGAA GCGGATTTCC CGAATGGCTC
GGGGCAAGTT ACCGGCGCGC TGAGATATTG CGGTACGGCG AGAATCCGCA TCAACGGGCG
GCCCTCTACC GGGCGGACCA CGCCGCTCCC GGGCTGGCGC AGGCGCGGGT TCTGCACGGC
AAGGCCATGT CGTACAACAA CTACGTGGAC ACCGACGCCG CGCATCGGGC GGCGTACGAC
TTCACCGAAC CGTGCGTCGC CATTGTGAAA CACGCAAACC CGTGCGGCAT AGCCGTCGGC
CGGGACATCG CGGAAGCACA CCGTAAAGCC CACGCGTGTG ACCCCGTCTC GGCATATGGC
GGCGTCATTG CGGCGAATCG GGAAGTCAGC GTCGCCATGG CGGAACAGAT TGCGGACATT
TTCACGGAAG TGGTCTGTGC GCCGAGTTAC GCCGAGGGAG CCCTCGACAT TCTCACTGCG
AAGAAGAATC TGCGCATTCT GCTCTGTCCC GGACTGCACC CTGCGCACCG GGTGCGGGAA
TTCCGCCGGA TCAGCGGGGG GCTTCTGGTG CAGACCGTGG ACAACCTCGA CGCCGAGGGT
GACGACCCGG CGAATTGGCA ACTGCGCGCC GGGGAGCCGG CGGACGACCA GACGCTCGCG
GATTTGGAGT TCGCCTGGCG GGCGGTGCGG TCGGTAAAGT CGAACGCCAT TCTGCTTGCC
GCCGACCGCG CATCGGTCGG CGTCGGAATG GGCCAGGTCA ATCGGGTGGA CGCCGCCCGC
CTGGCGGTGC AACGGGCCGG CGACCGGGCG AAAGGCGCGG TCGCCGCATC GGACGCGTTC
TTCCCGTTCG CCGACGGCCT GCAGGTTCTG ATCGACGCCG GAGTCCGAGC CGTGGTGGAG
CCCGGCGGGT CCGTGCGTGA CGATGAGGTC GTCGAGGCTG CACGCCGAGC CGGTATCACC
CTGTACTTCA CCGGAAGCCG TCATTTCTCC CACTGA
 
Protein sequence
MSTFDIPDTA EAGRRPIRRA LVSVYDKTSL VELGRGLAAA GVEIVSTGST ASVLEEAGVP 
VVRVETVTGF PECLDGRVKT LHPAIHAGLL ADVTRPEHAR QLSDLGVRPF DLLVVNLYPF
SETVARGASA AEVIDQIDIG GPAMVRAAAK NHHCVAVITS PARYQDVLDA VHHGGFTDAE
RRALAVEAFV HTASYDIAVA TWMGGSYTPT DEGSGFPEWL GASYRRAEIL RYGENPHQRA
ALYRADHAAP GLAQARVLHG KAMSYNNYVD TDAAHRAAYD FTEPCVAIVK HANPCGIAVG
RDIAEAHRKA HACDPVSAYG GVIAANREVS VAMAEQIADI FTEVVCAPSY AEGALDILTA
KKNLRILLCP GLHPAHRVRE FRRISGGLLV QTVDNLDAEG DDPANWQLRA GEPADDQTLA
DLEFAWRAVR SVKSNAILLA ADRASVGVGM GQVNRVDAAR LAVQRAGDRA KGAVAASDAF
FPFADGLQVL IDAGVRAVVE PGGSVRDDEV VEAARRAGIT LYFTGSRHFS H