Gene Sala_3123 details

Gene Information

Locus tag: Sala_3123
Symbol: purH
ID: 4082709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 3274335
End bp: 3275993
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 68%
IMG OID: 638011508
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_618159
Protein GI: 103488598
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
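
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and that the
    final codon is a stop codon, which encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3274335, 3275993)
    print(length)                     # 1659
    print(protein_length_aa(length))  # 552
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.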


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCTCTTGG GGAGTGACCC GACTAGGGGA GCCGCGTCTT TCCCGTCTCG CAAAAGGCCG 
CCCTCCATGA CCGACCTGAT TCCCGTCCGC CGCGCGCTCT TGTCCGTCAG CGACAAGGCG
GGGCTTGCCG ATCTGGCCGC GGCGCTCGTC CGCCACGGGG TCGAACTGGT GTCGACCGGG
GGGACTGCGA AGGCATTGCG CGAGGCGGGT CATAGCGTGC TCGATGTCGC CGATTTGACC
GGCTTTCCCG AGATGATGGA CGGCCGCGTC AAGACGCTGC ACCCGGCGGT GCATGGCGGC
ATATTGGCGG TGCGCGACGA CGAGCGCCAC GTCGCCGCGA TGGACGCGCA CGGGATCGGC
GCGATCGATC TGGTCGTCGT CAATCTCTAC CCCTTCGCCG CGACCGTCGC GAAGGGCGCG
GCGCGCGACG AGATCATCGA AAATATAGAC ATCGGCGGCC CCGCGATGGT GCGCTCGGCA
GCGAAGAACC ATGCGTTCGT CGGCATCGTC ACCGAGCCCG AGGATTATGC CGCGGTGATC
GCGGAGATGG ACGCCAACGG CGGCGCGATG ACGCTGGACC TGCGCAAGCG GCTCGCCGCG
ACCGCCTTTG CCCACACCGC CACCTATGAC GGGACGATCG CGAGCTGGTT CGCCTTTGCC
GACCAGGGCA AGCTGTTTCC CGACACGCTG CCGCTGACCG CCAAGCTGTC GGCCGAACTG
CGCTATGGCG AAAATCCGCA CCAAAAGGCC GCGCTTTACC TGCCCGCCGG TCCCGCCGGG
CGCGGGATAG CGCAAGCCGA ACAGGTGCAG GGCAAGGAAC TCAGCTACAA CAATATCAAC
GACGCCGATG CCGCGCTCGA ACTCGTCGCG GAGTTTCGCG AGGCCGATCC GACCTGCGTG
ATCGTCAAGC ACGCCAATCC GTGCGGCGTC GCGACCGCCG CGAGTTTGAG CCAGGCCTAT
GACGCGGCGC TGAAATGCGA CGATGTGTCG GCGTTCGGCG GGATCATCGC GGTCAACCGA
CCACTCGACG GGCCGACGGC GGAGGCGATC AGCGGCATTT TCACCGAGGT CGTCTGCGCC
CCCGACGCCG ATGCCGATGC CCGTGCGGTG TTCGCGAAGA AGAAGAACCT CCGCCTGCTG
CTCACCGGCG ACTTGCCCGA TCCGGCGCGC GGCGGGTTGA TGCTGAAGAC GATCGCCGGC
GGCTGGCTCG CGCAGAGCCG CGACAACGGC CGCATCACCC GCGCCGACCT GAAGGTCGTG
ACCGACCGCG CGCCGACCGA GGAAGAACTG GCCGACGCGC TATTCGCGTG GACGGTTGCC
AAGCATGTGA AGTCGAACGC GATCGTCTAT GCCAAGGGCG GCGCAACCGC GGGCATCGGC
GCGGGGCAGA TGAACCGCCG CGACAGCGCG CGCATTGCCG CGGCGAAAGC GCGCGAAGCG
GCCGAATCCC ATGGCTGGGC AAGCCCGCGC ACCATTGGCA GCGCGGTCGC CAGCGACGCC
TTCTTCCCCT TTGCCGACGG GTTGCTCGCG GCGGTCGAGG CGGGCGCGAC CTGCGTGATC
CAGCCCGGCG GATCGATCCG CGACGATGAG GTGATCGCAG CCGCGAACAA AGCCGGGCTG
GCGATGGTCT TCACCGGAAT GCGGCATTTC CGGCATTGA
 
Protein sequence
MLLGSDPTRG AASFPSRKRP PSMTDLIPVR RALLSVSDKA GLADLAAALV RHGVELVSTG 
GTAKALREAG HSVLDVADLT GFPEMMDGRV KTLHPAVHGG ILAVRDDERH VAAMDAHGIG
AIDLVVVNLY PFAATVAKGA ARDEIIENID IGGPAMVRSA AKNHAFVGIV TEPEDYAAVI
AEMDANGGAM TLDLRKRLAA TAFAHTATYD GTIASWFAFA DQGKLFPDTL PLTAKLSAEL
RYGENPHQKA ALYLPAGPAG RGIAQAEQVQ GKELSYNNIN DADAALELVA EFREADPTCV
IVKHANPCGV ATAASLSQAY DAALKCDDVS AFGGIIAVNR PLDGPTAEAI SGIFTEVVCA
PDADADARAV FAKKKNLRLL LTGDLPDPAR GGLMLKTIAG GWLAQSRDNG RITRADLKVV
TDRAPTEEEL ADALFAWTVA KHVKSNAIVY AKGGATAGIG AGQMNRRDSA RIAAAKAREA
AESHGWASPR TIGSAVASDA FFPFADGLLA AVEAGATCVI QPGGSIRDDE VIAAANKAGL
AMVFTGMRHF RH
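
The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The sketch below illustrates that relationship. It is an assumption-laden illustration, not the database's own pipeline: `translate_cds` is a hypothetical helper, and the amino-acid string and start-codon set are transcribed from the published table 11 definition.

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest,
# matching the layout of the published genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, ignoring whitespace in the input.

    The first codon is read as M if it is a table 11 start codon;
    translation stops at the first stop codon.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate_cds("TTGCTCTTGG GGAGTGACCC GACTAGGGGA"))  # MLLGSDPTRG
```

The printed residues match the first ten residues of the protein sequence listed above.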