Gene Pcal_0449 details


Gene Information

Locus tag: Pcal_0449
Symbol: purP
ID: 4909352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum calidifontis JCM 11548
Kingdom: Archaea
Replicon accession: NC_009073
Strand:
Start bp: 431249
End bp: 432250
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 58%
IMG OID: 640124201
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase
Protein accession: YP_001055348
Protein GI: 126459070
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
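
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions fit the listed values but are not stated in the record. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(431249, 432250)
print(length, protein_length(length))  # 1002 333
```

With these assumptions, 432250 - 431249 + 1 gives the listed 1002 bp, and 1002 / 3 - 1 gives the listed 333 aa.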


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCG CGCTGAAGAG GTACGACTTG GATAAGTTGG CCGTGGCCAC GGTGGCCTCC 
CACACCGCGC TCCAAATCCT CAGGGGAGCC AAGAGGTACG GCTTTAGGAC AATAGCCGTG
GCACAGAGAA ACGCCGACTT CTACCGCCAG TTCTCCTTCA TCGACGAAGT GTGGACCGCC
GACTTCTCCA ACTTTAGGCA CGTCGCTGAG AAGCTCGTGG AGAAAAACGC CTTGTTCATA
CCCCACGGCT CATACGTCGA GTACGTGGGG TGGAGACAGG CGCTGGAGGC CCCAGTGCCC
ACCCTAGGCT GTAGAGAGCT GTTGCGCTGG GAGGCCGACC AGTACAAGAA GATGGAGCTG
CTGGCCGCCG CGGGGATACC CACGCCGAGG TACTACAAGA GGGCCGAAGA GGCCGAGGGC
CCCGTAATAG TCAAACTCTT CGGCGCCAAG GGCGGCAGGG GGTACTTCGT GGCAAAGAAC
AGAGAGGAGT TGGCCAAAAG GATCAAGGCG GTGGAGGGCG ACTACATAAT TCAGGAATAC
GTCTTCGGCG TGCCCGCCTA CTACCACTAC TTCGCCTCGC CGGTGTACAA CAGAGTGGAG
ATCTTCGGCA TGGACATCAG ATACGAGACC AATGTAGATG GGAGAACCTT CGGCTGGGTA
GAGCCCACCT TCGTAGTGGT GGGGAATCTC CCGCTGGTGT TAAGGGAGTC TCTGCTCCCC
GTGGTGCACA AGTACGGAGT CGACTTCGCC AAGGCAGTGA GAGAAAAGGT CAGCTGCGAG
CTGGCGGGAC CCTACTGTCT CGAGAGCATA ATAAGAGACG ACATGACCAT CGTCGTCTTT
GAATTCTCGG GGAGGATCGT GGCTGGGACA AACGTCTACA TGGGCGTGGG CTCCCCCTAC
TCTGTCCTCT ACTTCGACGA GCCCATGGAC ATGGGAGAGA GGATAGCCCA CGAGATAAAA
GAGGCCGCTG CGAGGGGCAT ATTGGAGAAA CTTTTCACAT AA
 
Protein sequence
MSAALKRYDL DKLAVATVAS HTALQILRGA KRYGFRTIAV AQRNADFYRQ FSFIDEVWTA 
DFSNFRHVAE KLVEKNALFI PHGSYVEYVG WRQALEAPVP TLGCRELLRW EADQYKKMEL
LAAAGIPTPR YYKRAEEAEG PVIVKLFGAK GGRGYFVAKN REELAKRIKA VEGDYIIQEY
VFGVPAYYHY FASPVYNRVE IFGMDIRYET NVDGRTFGWV EPTFVVVGNL PLVLRESLLP
VVHKYGVDFA KAVREKVSCE LAGPYCLESI IRDDMTIVVF EFSGRIVAGT NVYMGVGSPY
SVLYFDEPMD MGERIAHEIK EAAARGILEK LFT
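
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC percentage. The following is a minimal sketch using only the Python standard library. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
first_block = "ATGTCCGCCG CGCTGAAGAG GTACGACTTG"
print(translate(first_block))  # MSAALKRYDL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the listed GC content.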