Gene Nwi_0158 details

Gene Information

Locus tag: Nwi_0158
Symbol: purH
ID: 3674174
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrobacter winogradskyi Nb-255
Kingdom: Bacteria
Replicon accession: NC_007406
Strand:
Start bp: 183485
End bp: 185077
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 66%
IMG OID: 637711695
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_316778
Protein GI: 75674357
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
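
The start, end, gene length, and protein length fields are related by simple arithmetic. The sketch below shows that relationship in Python. It assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions agree with the values in this record, but neither is stated by the record itself. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the single terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(183485, 185077)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```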


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACC GACCCCGCCG CGTGACCCGC GCCTTGCTTT CCGTTTCCGA CAAGACCGGC 
CTGACCGAGT TCGCCCGCGC GCTTGCCGAC CTCGGCGTCG AACTGGTCTC GACCGGCGGC
ACCGCCAAGG AAATCGCGGC GGCGGGATTG AAGGTCAGTG ACGTCTCCGA CCTGACGGGT
TTTCCCGAAA TGATGGACGG CCGGGTCAAG ACGCTGCATC CGAAGGTGCA TGGCGGCCTG
CTCGCCATCC GCGACAACGC CGATCATGCG AAGTCCATGA AGGACCACGG CATCGCCCCG
ATCGATCTTT TGGTCGTCAA TCTCTATCCG TTCGAATCAA CGGTCGATAA AGGCGCCGCC
TGCGAGGAGT GCATCGAGAA TATCGACATC GGCGGCCCCG CGATGATTCG CGCCGCCGCC
AAGAACCATG ATGACGTAGC GGTCGTGGTC GAACCGCAGG ATTATCAGGC GGTGCTTGAC
GAACTCAAGG CCAACGCGGG CGCGACGACG TTGAGCTTGC GCAAGCGCCT CGCCGCGAAA
GCCTACGCCC GGACCGCCGC CTATGATGCG GCGATCTCCA ACTGGTTCGC GGTGCAGCTT
GAGACCGATG CGCCCGACTA TCGCGCCGTC GGCGGCCGTC TCGCGCAGAG GTTGCGCTAT
GGCGAGAATC CGCACCAGAC CGCCGCGTTC TATCGCACGC CGGAGCGGCG CGCCGGCGTA
GCCACCGCGC GGCAGTTGCA GGGCAAGGAA CTGTCCTACA ACAACATCAA CGACACCGAC
GCGGCTTACG AGTGCGTCGC CGAATTCGAT GCGGCGCGCA CCGCGGCCTG CGTCATCATC
AAGCACGCCA ATCCCTGCGG TGTCGCGGAA GGCTCAAGCC TCACCGAAGC CTATCGCCGG
GCGCTCGCCT GTGACCAGAC CTCGGCCTAT GGCGGCATCA TAGCCTTCAA CCGCACCATC
GACGCCGACG CCGCCAATGC GGTGGCCGGC ATCTTCACCG AAGTCATCAT CGCGCCCGAT
GCGACCGAGG AGGCGATCGC GGTCATCGGC AAGCGCAAGA ACCTGCGGCT GCTGCTGGCC
GGCGGCCTGC CCGATCCGCG CGCGCGCGGC CTGACCGCGA AGACGGTCGC CGGCGGGCTT
CTGGTGCAGG GCCGTGACAA CGCCGTCATT GATGATATGT CACTGAAGGT CGTCACGAAG
CGTCCGCCGA CCGAGGCGGA GATGCGCGAC CTGCGGTTCG CCTTCCGTGT CGCCAAGCAC
GTCAAGTCGA ACACCATCGT CTATGCCAGG GATCTCGCCA CCGTCGGCAT CGGCGCGGGC
CAGATGAGCC GCGTCGATTC CGCGCGCATC GCCGCGCGCA AGGCGGAAGA TGCGGCGCGC
GATCTGAAGC TCGCCGAGCC CTTGACCAAA GGCTCGGTCG TGGCGTCGGA TGCGTTCTTT
CCCTTCGCCG ACGGCATGCT CGCCTGTATC AAAGCCGGCG CCACCGCGGT CATCCAGCCC
GGCGGCTCCA TGCGCGACGA GGAGGTGATC AAGGCCGCCG ACGAGCATGG CATCGCCATG
GTGTTCACCG GCGTCAGGCA TTTCCGTCAT TAG
 
Protein sequence
MTDRPRRVTR ALLSVSDKTG LTEFARALAD LGVELVSTGG TAKEIAAAGL KVSDVSDLTG 
FPEMMDGRVK TLHPKVHGGL LAIRDNADHA KSMKDHGIAP IDLLVVNLYP FESTVDKGAA
CEECIENIDI GGPAMIRAAA KNHDDVAVVV EPQDYQAVLD ELKANAGATT LSLRKRLAAK
AYARTAAYDA AISNWFAVQL ETDAPDYRAV GGRLAQRLRY GENPHQTAAF YRTPERRAGV
ATARQLQGKE LSYNNINDTD AAYECVAEFD AARTAACVII KHANPCGVAE GSSLTEAYRR
ALACDQTSAY GGIIAFNRTI DADAANAVAG IFTEVIIAPD ATEEAIAVIG KRKNLRLLLA
GGLPDPRARG LTAKTVAGGL LVQGRDNAVI DDMSLKVVTK RPPTEAEMRD LRFAFRVAKH
VKSNTIVYAR DLATVGIGAG QMSRVDSARI AARKAEDAAR DLKLAEPLTK GSVVASDAFF
PFADGMLACI KAGATAVIQP GGSMRDEEVI KAADEHGIAM VFTGVRHFRH
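
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and how a GC fraction, such as the one reported in the GC content field, could be computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listed above, copied as printed.
first_line = "ATGACCGACC GACCCCGCCG CGTGACCCGC GCCTTGCTTT CCGTTTCCGA CAAGACCGGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence to the same two functions would give a value to compare against the reported GC content.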