Gene Swit_0447 details


Gene Information

Locus tag: Swit_0447
Symbol: purH
ID: 5197905
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 472481
End bp: 474082
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 69%
IMG OID: 640579986
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001260953
Protein GI: 148553371
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
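
The start, end, gene length and protein length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 472481, 474082
length = gene_length(start_bp, end_bp)
print(length, protein_length(length))
```

With the values from this record, the computed lengths match the listed 1602 bp and 533 aa.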


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.635969
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGACCC CGACCACGCC CGCCCCCGAC AAAGTCGCCA TCAAGCGCGC GCTGCTGTCG 
GTCTCCGACA AGAGCGGGCT GGTCGAGCTG GGCCGGGCGC TGGCGGGGTG GGGCGTCGAA
CTGGTCTCGA CCGGCGGCAC CGCCAAGGCG CTGCGCGACG CCGGGCTCGA CGTGAAGGAC
ATCTCCGACA TCACCGGCTT CCCGGAGATG ATGGACGGGC GGGTCAAGAC GCTGCACCCG
ATGGTCCATG GCGGCCTACT CAGCGTCCGC GACGATCCCG AACATGCGAA AGCGATGACC
GATCACGGCA TCGGCGCGAT CGACCTCGTG GTGGTCAACC TCTATCCCTT CGCGCAGACC
GTCGCGAAGG GCGCCGGCCG CGACGAGATC ATCGAGAATA TCGACATCGG CGGACCGTCG
ATGGTCCGCT CGGCGGCCAA GAACCACGCC TATGTGACGA TCGCGACCGA TCCGGCCGAC
TATGCCGAGA TCATCGCCAG CGGCGGCACG ACCGACTTCG CGCTGCGCAA GCGCTTCGCC
GCCAAGGCGT TCGCCGCGAC CGCGACCTAT GATGCGATGA TCTCCTCCTG GTTCGCCCAT
GCCGACCAGG GCCAGTTCTT CCCGGAGGCG CTCTCCATCC CGGTCCGCAA GGCCGAGGAG
CTGCGCTACG GCGAGAACCC CCACCAGCAG GCGGCGCTCT ACCTGCCGGT CGGCCCGTCC
GCGCGCGGCA TCGCCCAGGC GACCCAGGTG CAGGGCAAGG AGCTGAGCTA CAACAATTAC
AACGACGCCG ACGCCGCGCT CGAACTGGTC AGCGAATTCC GCGACGGCCC GCCGACCGTC
GTCATCGTCA AGCATGCCAA CCCCTGCGGC GTCGCCAGCG CCGACACGCT GATCGAGGCC
TATGAGGCGG CGCTCGCCTG CGACAGCGTC TCGGCCTTCG GCGGCATCAT CGCGGTCAAC
CGGCCGCTCG ACGGCAAGAC CGCCGAGGCG ATCAGCGGCA TCTTCACCGA GGTCGTCGCC
GCGCCCGACG CCGACGACGA CGCCAAGGCG GTGTTCGCGA AGAAGAAGAA CCTCCGCCTG
CTGCTGACCG GCGAACTTCC CGATCCGGCG CGCGCTGGCA TGACGATGAA GAGCATCGCC
GGCGGCGTCC TGCTCCAGTC ACGCGACAAT GGCCGGATCG GCCTCGACGA TCTCAAGGTG
GTGACGAAGC GCGCGCCGAC CGATCAGGAG CTCAAGGACT GCCTGTTCGC CTGGACGGTC
GCCAAGCACG TCAAGTCGAA CGCGATCGTC TACGCCAGGG GCGGATCGAC CGCCGGCGTC
GGCGCCGGCC AGATGAACCG GCTCGAATCG GCGCGCATCG CCGCCTGGAA GGCGAAGGAC
GCCGCCGAGA AGGCGGGCTG GGCGACGCCG CGCACGATCG GCTCGGCGGT CGCGTCGGAT
GCCTTCTTCC CCTTCGCCGA CGGCCTGCTG GCGGCGGTCG AGGCCGGGGC GACGGCGGTG
ATCCAGCCGG GCGGATCGAT CCGCGATGCC GAGGTGATCG CGGCGGCGGA CGAGGCCGGG
CTCGCGATGG TCTTCACGGG CATGCGCCAT TTCCGGCATT GA
 
Protein sequence
MKTPTTPAPD KVAIKRALLS VSDKSGLVEL GRALAGWGVE LVSTGGTAKA LRDAGLDVKD 
ISDITGFPEM MDGRVKTLHP MVHGGLLSVR DDPEHAKAMT DHGIGAIDLV VVNLYPFAQT
VAKGAGRDEI IENIDIGGPS MVRSAAKNHA YVTIATDPAD YAEIIASGGT TDFALRKRFA
AKAFAATATY DAMISSWFAH ADQGQFFPEA LSIPVRKAEE LRYGENPHQQ AALYLPVGPS
ARGIAQATQV QGKELSYNNY NDADAALELV SEFRDGPPTV VIVKHANPCG VASADTLIEA
YEAALACDSV SAFGGIIAVN RPLDGKTAEA ISGIFTEVVA APDADDDAKA VFAKKKNLRL
LLTGELPDPA RAGMTMKSIA GGVLLQSRDN GRIGLDDLKV VTKRAPTDQE LKDCLFAWTV
AKHVKSNAIV YARGGSTAGV GAGQMNRLES ARIAAWKAKD AAEKAGWATP RTIGSAVASD
AFFPFADGLL AAVEAGATAV IQPGGSIRDA EVIAAADEAG LAMVFTGMRH FRH
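
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The sketch below shows one way to do that and to recompute a GC fraction such as the one listed in the record. It is an illustration only: `clean_sequence` and `gc_fraction` are hypothetical helper names, and the database may compute its GC content figure differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, as printed in the record.
first_line = "ATGAAGACCC CGACCACGCC CGCCCCCGAC AAAGTCGCCA TCAAGCGCGC GCTGCTGTCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence instead of `first_line` gives a value that can be compared against the GC content field.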