Gene Pars_2262 details


Gene Information

Locus tag: Pars_2262
Symbol: purP
ID: 5054848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2025209
End bp: 2026252
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 59%
IMG OID: 640469814
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein
Protein accession: YP_001154458
Protein GI: 145592456
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
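
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2025209
end_bp = 2026252
gene_length_bp = 1044
protein_length_aa = 347


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the record is internally consistent: 2026252 - 2025209 + 1 = 1044 bp, and 1044 / 3 - 1 = 347 aa.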


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.141593
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0244292
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTTT CAGTGGCCGT GTTGGCCAGC CACAGCGCCC TCGACGTGCT AGACGGCGCC 
AAGGACGAGG GGTTGAGGAC GGTAGCTGTG GCGAAGAAGG GCCGCGACCG GGCCTACAGG
GAGTTCCCCG TAGTGGACAA GCTCATAGTT CTTGACGACT ATGTAGACAT CTTGTATATT
GTCGATATGC TTAAGGCTGA AGGCTCCGTG TTTGTGCCGA ACCGCTCATT CGCCGTGTAC
GTCGGCTACG ACAACATAGA GAGGAGGTTC CCCGTCCCCG TCTTCGGGAA CAGATATCTG
CTGAGGTGGG AGGAGCGGAC AGGTCCACAG AGCTACTACC GCCTCCTAGA CGAGGCTGGG
GTCAAAAGAC CTAGGACCTT CCGCCCCGAC GAGGTGGACC GCCCCGTCAT AGTGAAGATG
CCAGAGGCCG AGCGGAGGGT CGAGCGGGGC TTCTTCGTGG CGAGGGACAG AGACGACTTG
TGGAGGAAGG CCAAAAGGCT GGCGGAGGCC GGGATCATAA GGCTCGAGGA CCTAGAAGCC
GCCTCCATTG AGGAGTTGGT GCTGGGGGCC CACTTCAACG CCAACTACTT CTACTCCCCC
CTCCGCAAGA GGCTTGAGCT ACACAGCTTC GACAGGAGGA TCCAGTCTAA CCTAGACGGG
GTATTCCGCC TCCCAGCGAG GGACCAGCTA GACCTCGATC CAGATGTGCG CTACATCGAG
GTGGGCCATG AACCAGCCAC AATTAGGGAA TCCCTCCTCG AAAAGGTCTT CGACATTGGG
TACCGCTTCT TGGAGGCTAC CAGAAGGCTG GTGCCGCCGG GCGTGATCGG CCCCTTCACC
CTACAGTTCA TAGTAACACC CCAGCTAGAC CTCGTGGTGT ACGACGTGGC TCCGCGCATC
GGGGGAGGCA CCAACGTCTA CATAGGGATC GGGGGGCAGT ACTCGAAGCT CTACTTCGGC
AAGCCCATAT CCATTGGGAG GAGGATTGCA ATGGAGATAA GAGAGGCTGC CGAGCAGAAG
AGGCTGGAGG AGGTCACGAC TTGA
 
Protein sequence
MAVSVAVLAS HSALDVLDGA KDEGLRTVAV AKKGRDRAYR EFPVVDKLIV LDDYVDILYI 
VDMLKAEGSV FVPNRSFAVY VGYDNIERRF PVPVFGNRYL LRWEERTGPQ SYYRLLDEAG
VKRPRTFRPD EVDRPVIVKM PEAERRVERG FFVARDRDDL WRKAKRLAEA GIIRLEDLEA
ASIEELVLGA HFNANYFYSP LRKRLELHSF DRRIQSNLDG VFRLPARDQL DLDPDVRYIE
VGHEPATIRE SLLEKVFDIG YRFLEATRRL VPPGVIGPFT LQFIVTPQLD LVVYDVAPRI
GGGTNVYIGI GGQYSKLYFG KPISIGRRIA MEIREAAEQK RLEEVTT
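
The relationship between the two sequence blocks can be illustrated with a short Python sketch. It is not part of the original record; the names `clean`, `translate`, and `gc_fraction` are hypothetical helpers. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the sketch builds the standard table and assumes an ATG start, as in this gene.

```python
# Standard codon assignments, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence and the first 20 residues of the protein.
first_line = "ATGGCGGTTT CAGTGGCCGT GTTGGCCAGC CACAGCGCCC TCGACGTGCT AGACGGCGCC"
assert translate(first_line) == "MAVSVAVLASHSALDVLDGA"
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and GC content.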