Gene Pisl_0095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_0095 
SymbolpurP 
ID4616974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp88692 
End bp89762 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content55% 
IMG OID639783176 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein 
Protein accessionYP_929621 
Protein GI119871614 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATAA ACTGTTGCTT ACTTCTGTCT ATGGTTTCTG TGGCTGTGTT GGCGTCCCAT 
AGCGCATTAG ACGTGTTAGA TGGCGCCAGA GACGAAGGGC TGAAGACAGT GGCTATAGCA
AAGAAGGGGA GGGAGAGGGC CTACAGAGAG TTCCCCGTGG TAGACAAGCT AATTGTATTA
GATGACTATA GAGACATATT GAAGATCGTA GACTTATTAA AGGCCGAGGA GGCGGTTTTT
GTCCCAAATA GATCTTTCGC AGTATATGTG GGCTACGACG CCATAGAGAG AGAGTTCCCA
GTGCCGATCT TCGGGAACCG GTTCCTACTA AGGTGGGAGG AGAGGACGGG GCCTCAGAAC
TACTACCGTT TGCTAGACGA GGCAGGGATA AGGCGGCCTA GGACTTTTAG ACCGGACGAG
GTGGACCGCC CAGTTATCGT CAAAATGCCA GAGGCGGAGA GGAGGGTCGA GAGGGGGTTC
TTCATCGCCC GCGACCGCGA CGACCTATAC AGAAAGGCCA AACGTCTGGC AGACGCCGGA
GTGATAAAGC TGGAGGACTT AGAGCGGTCT TCCATCGAGG AGCTGGTCCT CGGGGCCCAT
TTCAACGCCA ACTACTTCTA CTCAGTCATG AGGAGACGGC TTGAGTTACA CAGTTTCGAC
AGGAGGATAC AGAGCAACCT AGATGGGGTG TTCCGCCTGC CTGCCAGAGA CCAGCTGGAG
GTAGACCCTG AGGTGAGGTA TATAGAGGTG GGCCACGAGC CTGCGACAAT ACGAGAGTCC
CTCCTCGAGA AGGTGTTCGA CGTCGGCTAC AGATTTGTGG AGGCGGCACG CCGTCTCGTC
CCCCCGGGGG TGATCGGGCC TTTCACCCTT CAGTTCATTG TGACACCCCA GCTGGACCTC
GTGGTTTACG ACGTCGCGCC GAGGATAGGC GGCGGCACTA ACGTATACAT TGGGATCGGG
GGGCAGTACT CCAAGCTCTA CTTCGGCAAA CCGATATCGA TAGGGAGGAG GATAGCTATG
GAGATCCGCG AAGCCGCCGA GCAGGGGAGA CTCCCTGAAA TTACTACGTG A
 
Protein sequence
MSINCCLLLS MVSVAVLASH SALDVLDGAR DEGLKTVAIA KKGRERAYRE FPVVDKLIVL 
DDYRDILKIV DLLKAEEAVF VPNRSFAVYV GYDAIEREFP VPIFGNRFLL RWEERTGPQN
YYRLLDEAGI RRPRTFRPDE VDRPVIVKMP EAERRVERGF FIARDRDDLY RKAKRLADAG
VIKLEDLERS SIEELVLGAH FNANYFYSVM RRRLELHSFD RRIQSNLDGV FRLPARDQLE
VDPEVRYIEV GHEPATIRES LLEKVFDVGY RFVEAARRLV PPGVIGPFTL QFIVTPQLDL
VVYDVAPRIG GGTNVYIGIG GQYSKLYFGK PISIGRRIAM EIREAAEQGR LPEITT