Gene Pisl_0094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_0094 
SymbolpurP 
ID4616976 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp87632 
End bp88699 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content53% 
IMG OID639783175 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase 
Protein accessionYP_929620 
Protein GI119871613 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.279967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACGTC AGACTTATAT CTTTGTCGCC ATACAAAAAT TCAAAAGTCT TCAACATCTG 
GAGCACATGT CTGTATTAAA GAGATATGAC TTAGAAAAAC TTGCTGTAGC AACAGTGGCG
TCTCATACTG CATTACAAAT TCTGAGGGGG GCAAAGAAAT ATGGTTTTAG GACTATTGCG
ATAGCGGGTA GAGCCGACGT CGCCGAATTT TACCAACAGT TTAACTTCAT AGACGAAGTG
TGGACTGTAG ACTTTAGAAA TTTTGTAAAA GCGGCTGAAA AACTGGTAGA AGCCAATGCA
GTTTTTATAC CACATGGCTC TTACGTAGAA TACGTCGGCT GGAGACAGGC GCTGGAGGCG
CCGGTTCCCA CCCTCGGCTG TAGAGAGTTA ATCAAGTGGG AGGCAGATCA GTACAAGAAG
ATGGAGCTCC TCCAGAGAGG CGGCATCCCC ACGTCGAGGG TCTACAAAAC GCCGGAGGAG
GTGGATAGGC CCGTCATAGT TAAGCTCTTC GGCGCCAAGG GGGGCAGGGG GTACTTCCTC
GCTAGAGATA GAGAGGAGCT GAGGAGGAGG CTGGCCGGCT TGAGCGACTA CATTATCCAG
GAGTACGTAT TCGGCGTGCC GGCCTACTAC CATTACTTCT CCTCGCCGGT CTACGGCAGG
GTGGAGGTCT TTGGCGCAGA CATTAGGTAC GAGTCTAACG TAGACGGGAG GACCTTCGGC
TGGGTAGAGC CCACTTTCGT CGTCGTGGGC AACCTCCCCC TGGTTCTTAG GGAGTCTTTA
CTCCCAACGA TATGGAAGTA CGGCGTCCAG TTTGCCAAAG CCGTCGAGGA GGTGGTCGGC
TGCAGACTTG CGGGGCCCTA CTGCCTGGAG TCTATAATAA GAGACGACAT GTCAATCTCA
GTCTTCGAGT TCTCTGGGCG GATTGTGGCT GGGACGAACA TATACATGGG CTACGGCTCG
CCCTACTCAG TCCTCTACTT CGACAGGCCT ATGGACATGG GGGAGAGGAT AGCCCACGAG
ATAAGGGAGG CCGCCCGGAG GGGCCGCCTA GAGGATCTCT TCACGTAG
 
Protein sequence
MLRQTYIFVA IQKFKSLQHL EHMSVLKRYD LEKLAVATVA SHTALQILRG AKKYGFRTIA 
IAGRADVAEF YQQFNFIDEV WTVDFRNFVK AAEKLVEANA VFIPHGSYVE YVGWRQALEA
PVPTLGCREL IKWEADQYKK MELLQRGGIP TSRVYKTPEE VDRPVIVKLF GAKGGRGYFL
ARDREELRRR LAGLSDYIIQ EYVFGVPAYY HYFSSPVYGR VEVFGADIRY ESNVDGRTFG
WVEPTFVVVG NLPLVLRESL LPTIWKYGVQ FAKAVEEVVG CRLAGPYCLE SIIRDDMSIS
VFEFSGRIVA GTNIYMGYGS PYSVLYFDRP MDMGERIAHE IREAARRGRL EDLFT