Gene Pars_2265 details


Gene Information

Locus tag: Pars_2265
Symbol: purP
ID: 5056179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2026871
End bp: 2027878
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 54%
IMG OID: 640469817
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase
Protein accession: YP_001154461
Protein GI: 145592459
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
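
The gene and protein lengths listed above can be cross-checked from the start and end coordinates. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2026871
end_bp = 2027878

# Inclusive span: both the first and the last base are counted.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1008 335"
```

Under these assumptions the result agrees with the listed Gene Length (1008 bp) and Protein Length (335 aa).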


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0561077
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0630751
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCAGA TTTTGAAAAG ATACGACCTG GACAAGCTTG CGGTTGCCAC AATCGCATCA 
CATACAGCTT TGCAAATCCT CAGAGGGGCA AAAAAATACG GATTTAGAAC AATCGCCATA
GCAAAGAACG AGGACATCGC CCAGTTCTAC AAGCAATTTT TCTTCATAGA TGAGGTGTGG
ACTGGGGATT TCTCCAACTT TAGAAAAACT GCCGAAAGGC TCGTAGCGGA AAACGCACTG
TTGATACCCC ACGGCTCCTA CGTCGAATAT GTCGGCTGGA GACAAGCTCT GGAAGCCCCA
GTCCCTACGC TCGGCTGTAG AGAGTTGTTG AGATGGGAGG CCGACCAGTA CAAAAAGATG
GCGTTGTTGG AAGAGGCTGG GATACCCATC CCCCGGGTGT ACAGATCACC AACGGAGGTG
GACGGGCCTG TTATCGTGAA GTTCTTCGGC GCAAAGGGTG GCAGGGGTTA CTTCGTGGCT
AAGGGCAGGG AGGAACTGGA GGCTAGGCTA AAGGCGCTGG GCGAGGAGTA CATCATACAA
GAGTACCTCT TCGGCGTGCC GGCCTACTAC CACTACTTCG CCTCGCCTGT CTACTCCCGC
ATCGAAGTTT TCGGCGCCGA CATCCGCTAC GAATCCAACG TCGACGGCAG GACCTTCGGC
TGGGCCGAGC CGACCTTCGT CGTGGTGGGC AACCTTTCGC TGGTGCTCCG GGAGTCCCTC
TTGCCCATCA TTCACAAATA CGGAGTCCAG TTCGCCAAGG CCGTGGAGAA GCGGGTTGGA
TGCAGGTTGG CCGGCCCCTA CTGCTTGGAG TCGATAATAA AAGACGACAT GTCCATCGTG
GTGTTTGAGT TCTCTGGGAG GATCGTGGCG GGGACAAACA TCTACATGGG CTACGGCTCG
CCCTACTCCG TCCTCTACTT CGACAAACCA ATGGACATGG GCGAGAGGAT AGCCCACGAA
ATAAGAGAAG CCGCAAAAGC TGGCAAACTA GATCAGCTAT TTACTTAG
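
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before any length or GC calculation. The sketch below is illustrative only and not part of the original record; the helper names `clean_sequence` and `gc_percent` are hypothetical. It is applied here to the first line of the sequence only, so its GC value is not expected to match the 54% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGTCGCAGA TTTTGAAAAG ATACGACCTG GACAAGCTTG CGGTTGCCAC AATCGCATCA"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

To check the reported GC content, pass the complete gene sequence through the same two functions.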
 
Protein sequence
MSQILKRYDL DKLAVATIAS HTALQILRGA KKYGFRTIAI AKNEDIAQFY KQFFFIDEVW 
TGDFSNFRKT AERLVAENAL LIPHGSYVEY VGWRQALEAP VPTLGCRELL RWEADQYKKM
ALLEEAGIPI PRVYRSPTEV DGPVIVKFFG AKGGRGYFVA KGREELEARL KALGEEYIIQ
EYLFGVPAYY HYFASPVYSR IEVFGADIRY ESNVDGRTFG WAEPTFVVVG NLSLVLRESL
LPIIHKYGVQ FAKAVEKRVG CRLAGPYCLE SIIKDDMSIV VFEFSGRIVA GTNIYMGYGS
PYSVLYFDKP MDMGERIAHE IREAAKAGKL DQLFT