Gene Acry_0027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0027 
SymbolpurH 
ID5160622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp24181 
End bp25722 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content68% 
IMG OID640551941 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001233175 
Protein GI148259048 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.493241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA GAGTCAACAT CCGACGGGCC CTCATCTCGG TGTCCGACAA GGCGGGGCTG 
GTCGAGCTCG GCCGCGCGCT GGCGGCAGCG GGCGTGGAAA TCCTCTCGAC CGGCGGTTCC
GCCCGCGCGC TGCGCGAGGC CGGGATCGCC GTTGTCGAGG TGGCGGATTA CACGGGTGTT
CCCGAAATGC TGGATGGGCG GGTCAAGACG CTGGTGCCCA AGATCCATGG CGGCCTGCTC
GGCCGGCGCG ACCTGCCGGA GCATCTGGCG CAGATGCAGC GGCACGACAT CCCGCCGATC
GACCTGCTCG CGGTCAATCT CTACCCGTTC GAGGAAACGG TCGCGAAGGG CTCGGATTTC
GAAACCTGCG TCGAGAACAT CGATATCGGC GGCCCGGCGC TGATCCGCGC GGCGGCGAAG
AACCACGATT CGGTGGCGGT TCTCACCAGC CCGGCGCAGT ATGACGACCT CATCGCGGCG
CTAGCCGCCG GCGGAACGAC GCTGGAGCAG CGCCGCCGCC TCGCCGCCGC CGCCTATGCC
CGCACCGCCG CCTACGACGC CGCGATTTCC GCCTGGTTCG CGCAGCAGAC CGGCGAGATG
TTCCCGGCGC ACCTCGCCCT GGCCGGCGCG CGGCAGCAGA TGCTGCGCTA CGGCGAGAAC
CCGCACCAGT CCGCTGCGTT CTATCGCACC GGCAACCGCC CCGGCGTTGC CACCGCGCGG
CAGTTGCAGG GCAAGGAACT CTCCTACAAC AACATCAACG ACACCGATGC CGCTTTCGAA
TGCGTCGCCG AGTTCGACCG GCCGGCGGTG GTGATCGTCA AGCACGCCAA TCCGTGCGGC
GTCGCCCTCG GCGCCGATCT TGCCGAGGCC TGGGACCGCG CGCTGGACTG CGACCCGGTT
TCGGCGTTTG GCGGCATCAT CGCGGTCAAC CGCCCGCTCG ATGTTGCGGC AGCCGAGAAG
ATGGCGAGCA TCTTCTCCGA GGTGATCATC GCGCCGGACG CCGCACCTGA CGCGGTTGAA
CTGCTTGCCC GCAAGAAGAA TCTCCGCCTG CTGCTCACCG GCGGCCTGCC CGACCCGGCG
GAACCGGGCC TTGCCTGGCG CAGCGTTGCC GGTGGTTTCC TGGCCCAGAC CCGCGACGCC
GGGAGGATTG GCCGCGACGA TCTGAAGGTC GTCACCCAGC GCGCGCCGAC CAACGCCGAG
TTCGCCGATC TGCTGTTCGC CTTCCGTGTG GCCAAGCATG TGAAGTCGAA TGCGATCATC
TACGCGAAAG CAGGGGCGAC CACGGGCATC GGCGCGGGGC AGATGAGCCG CGTCGATTCC
TCGCGCATCG CCGCACAGAA GGGTGGGGAG AAGATTCCGG GTTCGGTCGT CGCGTCCGAC
GCGTTCTTCC CCTTCGCCGA CGGTCTTGTG GCCGCGATCG AGGCAGGGGC GACGGCGGTG
ATCCAGCCCG GCGGCTCGAT CCGCGACAAC GAGGTGATCG AGGCGGCAGA TGCCGCCGGG
ATTGCCATGG TGTTCACCGG CATGCGCCAT TTCAGGCATT GA
 
Protein sequence
MNDRVNIRRA LISVSDKAGL VELGRALAAA GVEILSTGGS ARALREAGIA VVEVADYTGV 
PEMLDGRVKT LVPKIHGGLL GRRDLPEHLA QMQRHDIPPI DLLAVNLYPF EETVAKGSDF
ETCVENIDIG GPALIRAAAK NHDSVAVLTS PAQYDDLIAA LAAGGTTLEQ RRRLAAAAYA
RTAAYDAAIS AWFAQQTGEM FPAHLALAGA RQQMLRYGEN PHQSAAFYRT GNRPGVATAR
QLQGKELSYN NINDTDAAFE CVAEFDRPAV VIVKHANPCG VALGADLAEA WDRALDCDPV
SAFGGIIAVN RPLDVAAAEK MASIFSEVII APDAAPDAVE LLARKKNLRL LLTGGLPDPA
EPGLAWRSVA GGFLAQTRDA GRIGRDDLKV VTQRAPTNAE FADLLFAFRV AKHVKSNAII
YAKAGATTGI GAGQMSRVDS SRIAAQKGGE KIPGSVVASD AFFPFADGLV AAIEAGATAV
IQPGGSIRDN EVIEAADAAG IAMVFTGMRH FRH