Gene Rcas_4336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4336 
SymbolpurH 
ID5541849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5588663 
End bp5590180 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content62% 
IMG OID640896442 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001434378 
Protein GI156744249 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.166921 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCGCGC TTGTAAGCGT TTCCGATAAG CGTGGCATCG AAGCATTTGC CGCCGGTCTC 
GTCGAATTCG GCTTCGAGAT TATCTCGACC GGTCATACGG CGCGAACGCT TGCCACGGCT
GGTGTTCCAG TTCGACCGGT CAGTGACGTG ACGGGTTTTC CCGAAATCCT TGGCGGGCGG
GTGAAAACGC TCCATCCCGC CATTCACGCT GGCATTCTGG CGCGCCGCGA TGATCCAGAC
CATATGGCGG CGCTCGATGT TCATGGTATT GCGCCGATCG ATCTCGTTGT CGTCAATCTC
TATCCCTTTA GTGAGACGAT CACCCGTCCC GATGTCACTC TTGCAGAAGC GATCGAACAG
ATTGACATCG GCGGTCCTGC TATGGTTCGC GCCGCCGCCA AGAACCACCC ATCGGTGCTG
GTGGTGGTCA GCCCCGACGA CTACGACGCG GTGCTGACGG CGTTGCGCAC CGAAACGGTG
ACGCCAGAGT TGCGGCGGCG CCTGGCAGCG CGTGCCTTTG CTCACACTGC TGCGTATGAT
GCCGCCATCG CGGGGTATTT GTCTGGGGAA CTCTTCCCGG AGACACTGCC ACTGGCGTTT
CGTAAGGCGC AGGATCTGCG TTACGGCGAA AATCCGCATC AGCGCGCGGC GCTCTACGGT
GATTTCCATG CCTTCTTCGA GCAACTGCAC GGACGTGAGT TGTCGTATAT CAATATTCTC
GATATTGCTG CGGCTCAGTC GCTGATCGAG GAGTTCGACC CGGCAGCGGG CGCGGCGCTG
GCGATTGTCA AGCATACGAA TCCGTGTGGC GCGGGAGTGG GCGCAACGCC GCTCGAAGCC
TGGGAGAAAG CGTTCGCCAC CGACCGGGAA GCGCCATTTG GCGGTATTAT CGCGGTCAAC
CAGATGCTCG ATCTTCCGTT GGCGCAGGCG ATTGACGAGA TTTTCTCCGA GATCGTCATT
GCGCCCGCCT TCGCCGATGA TGCACTGGCA TTGCTGCGCA AGAAGAAAAA CCGGCGTTTG
ATGCGTGCTC TGCGCCCGGT CGGGCAGTCG CGCGGGCTGG TATACCATAG CGTGCCGGGT
GGCATCCTGG CGCAGGAGCC GGACCTTGCG CCGCTTGATG AGGAACCGTT CGAGGTGGTG
ACACAGCGCA CGCCGACCGA TGCCGAACGC GCTGCGCTAC AGTTTGCCTG GCGGATCGTG
AAGCATGTCA AGTCGAATGC GATCGTCTTT GCCGCTGCCG ATCGCACCCT GGGGATCGGC
GCCGGGCAGA TGAGCCGGGT CGATAGTACG CGGGTGGCGG TGTGGAAGGC GCAGAATGCC
GGTCTCTCGC TCGCCGGGTC GGTCATTGCC AGTGATGCGC TCTTCCCGTT CCCCGATAGC
GTCGAGATCG CGGCGGAGGC TGGAGCAACG GCAGTGATTC AGCCCGGCGG GTCGGTGCGC
GACGACGAGG TGATTGCTGC CGCCAACCGG CTTGGTATGG CGATGGTGTT CACCGGAAGA
CGCCACTTCT TGCACTAG
 
Protein sequence
MRALVSVSDK RGIEAFAAGL VEFGFEIIST GHTARTLATA GVPVRPVSDV TGFPEILGGR 
VKTLHPAIHA GILARRDDPD HMAALDVHGI APIDLVVVNL YPFSETITRP DVTLAEAIEQ
IDIGGPAMVR AAAKNHPSVL VVVSPDDYDA VLTALRTETV TPELRRRLAA RAFAHTAAYD
AAIAGYLSGE LFPETLPLAF RKAQDLRYGE NPHQRAALYG DFHAFFEQLH GRELSYINIL
DIAAAQSLIE EFDPAAGAAL AIVKHTNPCG AGVGATPLEA WEKAFATDRE APFGGIIAVN
QMLDLPLAQA IDEIFSEIVI APAFADDALA LLRKKKNRRL MRALRPVGQS RGLVYHSVPG
GILAQEPDLA PLDEEPFEVV TQRTPTDAER AALQFAWRIV KHVKSNAIVF AAADRTLGIG
AGQMSRVDST RVAVWKAQNA GLSLAGSVIA SDALFPFPDS VEIAAEAGAT AVIQPGGSVR
DDEVIAAANR LGMAMVFTGR RHFLH