Gene RoseRS_0206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0206 
SymbolpurH 
ID5207141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp250856 
End bp252373 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content62% 
IMG OID640593836 
Productbifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 
Protein accessionYP_001274592 
Protein GI148654387 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.951642 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGTGCAC TGTTGAGCGT TTCTGATAAG CGCGGCATCG AGGCATTCGC TGCCGGTCTG 
GTTGAACTCG GGTACGAGAT TGTTTCGACC GGCAACACGG CGCGAACGCT TGCGGCCGCC
GGTATTCCGG TTCGACCGGT CAGTGATGTC ACCGGTTTTC CTGAGATTCT CGGCGGGCGG
GTGAAAACGC TCCACCCTGC CATTCATGCT GGCATCCTGG CGCGTCGTGA CGATCCTGGA
CACATGGCGG CGCTGGATGT CCACGGTATT GCGCCGATCG ATATTGTCGC TGTTAATCTC
TACCCCTTCA GCGAGACGAT TGCCCGTCCC GATGTTTCCT TTGCCGAAGC AATCGAGCAG
ATCGATATCG GCGGTCCTGC CCTGGTACGC GCCGCTGCCA AGAACCACGA CTCGGTGCTG
GTGGTGGTCA GTCCTGATGA TTATGATCCG GTGCTGACGG CGCTGCGCTC CGAAGCGGTG
ACGCCTAACC TGCGCCGGCG TCTGGCAGCG CGCGCGTTTG CTCATACAGC AGCGTATGAT
GCTGCGATTG CTGCGTATCT GTCCGATGAA CCCTTCCCGG AGACGCTGCC GCTGGCGTTC
CGCAAGGCGC AGGATTTACG CTACGGTGAG AATCCACATC AGCGCGCTGC GCTTTATGGC
GAGTTTCACA CCTTCTTCGA GCAACTGCAC GGTCGTGAGT TGTCGTATAT CAACATCCTT
GATATTGCTG CGGTACAGGG GTTGATCGAG GAGTTCGATC CACAGGAAGG CGCCGCACTG
GCAATCGTCA AGCATACGAA CCCGTGCGGC GTCGGCATCG GTGCAACGCC GCTCGAAGCC
TGGGAAAAGG CATTTGCGAC CGACCGCGAG GCGCCGTTCG GCGGCATCAT TGCGGTGAAC
CAGACGCTTG ATCTGCCGCT GGCGCAGGCG ATTGACGAGA TTTTCTCCGA GATTGTCATT
GCGCCAGCGT TCGCCGATGA TGCGCTGGCG CTGCTGCGGA AGAAGAAGAA CCGCCGCCTG
ATGCGTGCGC TGCGCCCCGT TCGCCTTGCC CGCGGGCTGG CATACCACAG CGTGCCCGGC
GGTATCCTGG CGCAGGAGCC AGACCTTGCG CCGCTCGATG AGGAGCCGTT CCAGGTTGTG
ACACAGCGCG CTCCGACTGA GACGGAACGG GCTGCGCTGC GCTTTGCCTG GCGCGTGGTG
AAGCACGTCA AATCGAACGC GATAGTGTTT GCTGCTGCCG ACCGGACGTT AGGCATCGGC
GCCGGGCAGA TGAGTCGCGT CGATAGTACG CGGGTGGCGG TGTGGAAAGC GCAGAACGCT
GGTCTCTCGC TCGCCGGTTC GGTCATCGCC AGCGATGCGC TGTTCCCGTT CCCCGATAGT
GTCGAGATTG CAGCGGCGGC GGGAGCAACA GCGGTTATTC AGCCCGGCGG ATCGGTGCGC
GATGATGAGG TGATCGCCGC CGCCAACCGG CTCGGCATGG CGATGGTGTT CACCGGCAGA
CGACATTTTC TGCACTGA
 
Protein sequence
MRALLSVSDK RGIEAFAAGL VELGYEIVST GNTARTLAAA GIPVRPVSDV TGFPEILGGR 
VKTLHPAIHA GILARRDDPG HMAALDVHGI APIDIVAVNL YPFSETIARP DVSFAEAIEQ
IDIGGPALVR AAAKNHDSVL VVVSPDDYDP VLTALRSEAV TPNLRRRLAA RAFAHTAAYD
AAIAAYLSDE PFPETLPLAF RKAQDLRYGE NPHQRAALYG EFHTFFEQLH GRELSYINIL
DIAAVQGLIE EFDPQEGAAL AIVKHTNPCG VGIGATPLEA WEKAFATDRE APFGGIIAVN
QTLDLPLAQA IDEIFSEIVI APAFADDALA LLRKKKNRRL MRALRPVRLA RGLAYHSVPG
GILAQEPDLA PLDEEPFQVV TQRAPTETER AALRFAWRVV KHVKSNAIVF AAADRTLGIG
AGQMSRVDST RVAVWKAQNA GLSLAGSVIA SDALFPFPDS VEIAAAAGAT AVIQPGGSVR
DDEVIAAANR LGMAMVFTGR RHFLH