Gene Pnap_3358 details

Gene Information

Locus tag: Pnap_3358
Symbol: purH
ID: 4690068
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polaromonas naphthalenivorans CJ2
Kingdom: Bacteria
Replicon accession: NC_008781
Strand:
Start bp: 3557518
End bp: 3559116
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 65%
IMG OID: 639836371
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_983576
Protein GI: 121606247
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
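
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3557518
END_BP = 3559116


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Both results agree with the Gene Length and Protein Length fields of the record.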


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCCT TGATTTCCGT TTCCGATAAA ACCGGCATCC TCGAATTTGC CCAGGCGCTG 
CACGCGCTGG GCATCAAGCT GCTGTCCACC GGCGGCACCG CCAAACTCTT GCTTGACGCA
GGTTTGCCCG TGACCGAAGT CGCGGAACAC ACCGGTTTTC CGGAAATGCT CGATGGCCGC
GTCAAGACGC TGCACCCGAA AGTCCATGGC GGCCTGCTGG CCCGGCGCGA CCTGCCCGAG
CACATGGCGG CGCTGCAAAC GCACGGCATC GCCACCATCG ACCTGCTGGT GGTCAACCTC
TATCCGTTTG AAGCCACGGT GGCCAAGCCC GGCTGCACGC TGGAGGATGC GATTGAAAAT
ATCGACATCG GCGGCCCGGC GATGGTGCGC TCGGCGGCCA AGAACTGGAA GGATGTCGGC
GTGCTGACCG ATGCGTCGCA GTACGCCGGC GTGCTGGCCG AACTCCAGGC CGACGGCAAG
CTGACCGACA AGACCAGGTT TGCCCTGTCG GTGGCCGCCT TCAACCGCAT CAGCCAGTAC
GACGGCGCGA TCAGCGACTA CCTGTCGTCA GTCGCGTTTG ACGCGGGCAA GCTCTCGGCG
GCCTATGTGC CCGAGCGCGC GCTGTTCCCC GGCCAGTCCA ACAGCCAGTT CACCAAGCTG
CAGGACTTGC GCTATGGCGA AAACTCGCAC CAGCAGGCCG CGCTGTACCG CGACCTGTAC
CCCGCGCCGG GCTCGCTGGT GACGGCCAAA CAGCTGCAGG GCAAGGAACT GTCGTACAAC
AACATCGCCG ACGCCGACGC CGCCTGGGAA TGCGTCAAGA GCTTCACCGA GGCGGCCTGC
GTCATCGTCA AGCATGCCAA TCCCTGCGGC GTGGCCGTGG GCGCGGACGC GCTTGAAGCC
TACAGCAAGG CTTTCAAGAC CGACCCGACC TCGGCGTTTG GCGGCATCAT TGCGCTGAAC
TGCCCGCTGG ACGAGCGCGC GGCGCTGCAG ATTTCGAAGC AGTTCGTCGA AGTACTGATG
GCGCCAAGCT TCACGCCCGA GGCGCTGGAA GTCTTCAAGA CCAAGGTCAA CGTGCGGATT
CTGCAGATCG AACTGCCGCC CGGCGGCGAC ACGGCCTGGA AGCAAGGCCG CAACCTGATC
GACGTCAAGC GCGTCGGCTC GGGCCTCTTG ATGCAAACCG CCGACAACCA CGAGTTGGCC
CTGGCCGACC TGAAGGTGGT CAGCAAGCTG CAGCCGACAC CGGCCCAACT GCAGGATTTG
CTGTTTGCCT GGAAGGTCGC CAAGTACGTC AAGTCCAACG CCATCGTGTT CTGCGCCGGC
GGCATGACCA TGGGCGTCGG TGCCGGCCAG ATGAGCCGCC TCGACTCGGC CCGCATCGCC
AGCATCAAGG CCGAGCATGC CGGCCTGTCG CTGGCGGGCA CGGCGGTGGC GAGCGATGCC
TTCTTCCCGT TCCGCGACGG GCTCGACGTG GTGGTCGATG CGGGCGCGAG CTGCGTGATC
CAGCCCGGCG GCTCGATGCG CGACCAGGAA GTGATTGACG CGGCCGACGA GCGCGGCGTG
GTCATGGTGC TGTCCGGCGT GCGGCATTTC AGGCATTGA
 
Protein sequence
MNALISVSDK TGILEFAQAL HALGIKLLST GGTAKLLLDA GLPVTEVAEH TGFPEMLDGR 
VKTLHPKVHG GLLARRDLPE HMAALQTHGI ATIDLLVVNL YPFEATVAKP GCTLEDAIEN
IDIGGPAMVR SAAKNWKDVG VLTDASQYAG VLAELQADGK LTDKTRFALS VAAFNRISQY
DGAISDYLSS VAFDAGKLSA AYVPERALFP GQSNSQFTKL QDLRYGENSH QQAALYRDLY
PAPGSLVTAK QLQGKELSYN NIADADAAWE CVKSFTEAAC VIVKHANPCG VAVGADALEA
YSKAFKTDPT SAFGGIIALN CPLDERAALQ ISKQFVEVLM APSFTPEALE VFKTKVNVRI
LQIELPPGGD TAWKQGRNLI DVKRVGSGLL MQTADNHELA LADLKVVSKL QPTPAQLQDL
LFAWKVAKYV KSNAIVFCAG GMTMGVGAGQ MSRLDSARIA SIKAEHAGLS LAGTAVASDA
FFPFRDGLDV VVDAGASCVI QPGGSMRDQE VIDAADERGV VMVLSGVRHF RH
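
The relationship between the two sequences above can be spot-checked by translating the gene sequence. The following is a minimal sketch in plain Python, not the method used by the source database: it builds the codon table from the usual TCAG ordering (translation table 11 assigns the same amino acids as the standard code and differs only in permitted start codons), and the `translate` helper is a hypothetical name. Spaces and line breaks in the 10-base blocks are ignored.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by
# translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence shown above.
print(translate("ATGAACGCCT TGATTTCCGT TTCCGATAAA"))  # MNALISVSDK
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the two are consistent.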