Gene Pnec_1581 details


Gene Information

Locus tag: Pnec_1581
Symbol: purH
ID: 6183700
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Polynucleobacter necessarius subsp. necessarius STIR1
Kingdom: Bacteria
Replicon accession: NC_010531
Strand:
Start bp: 1381096
End bp: 1382676
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 46%
IMG OID: 641672101
Product: bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_001798272
Protein GI: 171464159
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
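
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1381096, 1382676)
print(length_bp)                  # 1581
print(protein_length(length_bp))  # 526
```

Both results agree with the Gene Length and Protein Length fields of the record.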


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.292698
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value: 0.687224
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCGTA CAGCCCTTCT CTCCGTCTCC GATAAAAATG GCATCGTTCC TTTCGCTAAG 
TCCCTGCATG AGCAAGGCAT CAAACTCATT TCAACTGGCG GCACCGCAAA GCTATTAGCT
GAAAATGGCC TGCCCGTTGT TGAAATTTCT TCGCTAACAA AGTTTCCGGA AATGCTCGAT
GGTCGTGTAA AAACCCTGCA CCCGATGGTC CATGGCGGCT TGCTGGCTCG CAGAGATTTT
CCAGAACATA TGGCTGCCTT GAAAGAGTAT GGTATTAATA CGATCGATAT GCTTGTCATC
AACCTATACC CCTTCAATGA GACCGTCGCC AAGGAAAATT GTTCATTTGA AGATGCAGTG
GAGAATATTG ATATTGGTGG TCCTGCGATG TTGCGTGCTG CAGCAAAAAA TCATCAAGAC
GTTACTGTAT TAATTTCTCC AGAAGATTAC GCCCCTGTAT TGGCTGAAAT GAAAGCCAAT
CAAAATAGTG TGTCTTACAA AACAAATTTG GCTTTAGCTA AAAAAGTGTT CGCTCATACC
GCACAGTATG ATGGCGCCAT TGCCAACTAC CTATCCGCAT TGGGTGACGA CTTAGATCAC
AAGGCGCGTT CCGCTTATCC AGAAACCCTG CATCTTGCCT TTGAAAAAGT ACAAGAGATG
CGTTACGGCG AGAATCCACA TCAAGCTGCA GCGTTCTATA AGGACATCTA TCCTGTAGAT
GGCGCTCTAG CTAATTACAA ACAGTTACAA GGAAAAGAAC TTTCTTACAA CAACATTGCT
GATGCTGATT CAGCTTGGGA ATGTGTAAAA AGCTTTACTG GCAATGCCGG TGGTGCCGCA
GCTTGCGTAA TCATCAAGCA TGCCAATCCT TGTGGTGTAG CTGTGGGCGC CAGCGCTCTT
GAGGCATACC AAAAGGCATT TAAGACCGAC CCAAGTTCAG CCTTTGGCGG CATTATTGCT
TTTAACGTTT CTTGCGATGG TGCAGCGGCA GAAGCGATCT CCAAACAGTT TGTAGAGGTA
CTAATTGCTC CTAGCTTTAG CGATGAAGCC AAGACAATCT TTGCTGCCAA ACAAAATATG
CGTCTTTTAG AGATTCCATT AGGCACCGCA TTTAATACGT TTGATTTCAA ACGCGTTGGT
GGTGGCTTGC TCGTGCAATC GCCTGATGCT AAAAACGTAC TCGAAAATGA AATGTGTGTT
GTTAGCAAAC GTCTACCAAC TCCAAGCGAA ATGCACGACA TGATGTTTGC ATGGCGCGTT
GCTAAATTTG TGAAGTCTAA TGCCATCATC TATTGCGCGA ATGGCATGAC TCTCGGTATT
GGTGCAGGCC AAATGAGTCG TGTTGACTCC GCACGTATGG CCAGCATTAA GGCTAAGAAT
GCCGGCTTGA GCTTAAAAGG ATCTGCAGTG GCCAGTGACG CATTCTTCCC ATTTCGCGAC
GGATTAGATG TGGTTGTTAA TGGCGGCGCA AGCTGTGCGA TTCAACCTGG CGGTAGCATG
CGTGACGATG AAATCATTGC AGCCGCAGAT GAACACGGTA TTGCCATGAT CTTTACTGGC
ACACGTCATT TCCGTCACTA A
 
Protein sequence
MIRTALLSVS DKNGIVPFAK SLHEQGIKLI STGGTAKLLA ENGLPVVEIS SLTKFPEMLD 
GRVKTLHPMV HGGLLARRDF PEHMAALKEY GINTIDMLVI NLYPFNETVA KENCSFEDAV
ENIDIGGPAM LRAAAKNHQD VTVLISPEDY APVLAEMKAN QNSVSYKTNL ALAKKVFAHT
AQYDGAIANY LSALGDDLDH KARSAYPETL HLAFEKVQEM RYGENPHQAA AFYKDIYPVD
GALANYKQLQ GKELSYNNIA DADSAWECVK SFTGNAGGAA ACVIIKHANP CGVAVGASAL
EAYQKAFKTD PSSAFGGIIA FNVSCDGAAA EAISKQFVEV LIAPSFSDEA KTIFAAKQNM
RLLEIPLGTA FNTFDFKRVG GGLLVQSPDA KNVLENEMCV VSKRLPTPSE MHDMMFAWRV
AKFVKSNAII YCANGMTLGI GAGQMSRVDS ARMASIKAKN AGLSLKGSAV ASDAFFPFRD
GLDVVVNGGA SCAIQPGGSM RDDEIIAAAD EHGIAMIFTG TRHFRH
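
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It is an illustration under stated assumptions, not the database's own procedure: the helper names are hypothetical, and the codon table is the standard one, whose elongation codon assignments translation table 11 shares.

```python
from itertools import product

# Standard amino-acid assignments listed in TCAG codon order (TTT, TTC, TTA, ...).
# Assumption: table 11 uses the same assignments for elongation codons.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq_text.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGATCCGTA CAGCCCTTCT"))  # MIRTAL
```

Passing the full gene sequence to `translate` and the result of `clean` on the protein sequence gives two strings that can be compared directly.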