Gene Msed_0017 details


Gene Information

Locus tag: Msed_0017
Symbol: purP
ID: 5105156
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 14526
End bp: 15524
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 44%
IMG OID: 640505910
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase
Protein accession: YP_001190118
Protein GI: 146302802
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
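
The start, end, and length fields in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated by the record itself, and the variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 14526
end_bp = 15524

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (999 bp) and Protein Length (332 aa).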


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.117033
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00153786
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCTATTAC TCACTCTAGC AAGCCATTCC TCGCTTCAGA TTCTTCACGG GGCCAAAAGG 
GAAGGCTTCG AGACTGGGAT AGTTGTAAAT GGCAAGAGGG AAGGTTTTTA CAGAAGATTC
AGCTTCATAG ATAACTTCTA TGTGTATTCC AGTGAAGATG AGGCTGTGGA AAAGATCAAT
GGCACTTCCA ACTCCGTTTT TGTACCACAT GGAAGTCTCA TCGAGTACAT CGGCATGGAG
AGGGTTTCAC GGATAAGGAC GCCCATTTTC GGTAACAGGA ACTTGTTTCC ATGGGAATCA
AATCAATCCA AAAAAATGAA ATTATTGGAG CTAAGCAATA TAAAAACACC GATGAAGTTT
GAAAACCCTG AGGATGTCGA TAGAATGGTT ATCGTGAAAT TGCCAGGTGC CAAGGGTGGA
AGAGGTTACT TCATAGGTAG GAATAAACAG GAGGTCAAGG AGGGGATAAG GAGGCTCCAG
GAAAAAGGGC TGATCAACAA CGTAGAGGAA TTAATAATAC AGGAATACGT GATTGGGATT
CCTATGTACT TTCAATTCTT CTACAGTCCA ATGCTACAGC GTGTGGAGAT GACTGGGATT
GATATCAGGT ATGAGACCAA CGTTGACGGT CTGAGGAGAC TTCCAAGTGA TATTAAAGCC
GAGCCCACGT TCGTGGTCGC AGGCAACATC CCAGCGGTAG CCAGAGAGAG TATATTGCCT
TCAGTATATG AATACGCAGA AAATTTTGTT AAAACTACAA AAAATGTGGT GCCTCCCGGG
GCCATAGGTC CCTTCTGCCT TGAATCCATA GTTACGGATA CCCTTGATGT GGTTGTCTTT
GAGTTCTCGG GAAGGATAGT GGCCGGAACT AACCTTTATG TGGATGGGAG TCCCTACAGT
TGGCTCTATT GGGATGAACC CATGAGCGTC GGAAGAAGGA TAGGGAGAGA AATAAATCTA
GCAATCCAAA AGAACAGGTT AAATGAGGTG ACCACATGA
 
Protein sequence
MLLLTLASHS SLQILHGAKR EGFETGIVVN GKREGFYRRF SFIDNFYVYS SEDEAVEKIN 
GTSNSVFVPH GSLIEYIGME RVSRIRTPIF GNRNLFPWES NQSKKMKLLE LSNIKTPMKF
ENPEDVDRMV IVKLPGAKGG RGYFIGRNKQ EVKEGIRRLQ EKGLINNVEE LIIQEYVIGI
PMYFQFFYSP MLQRVEMTGI DIRYETNVDG LRRLPSDIKA EPTFVVAGNI PAVARESILP
SVYEYAENFV KTTKNVVPPG AIGPFCLESI VTDTLDVVVF EFSGRIVAGT NLYVDGSPYS
WLYWDEPMSV GRRIGREINL AIQKNRLNEV TT
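
The two sequence blocks above are displayed in space-separated groups of ten. As a rough sketch of how the gene sequence could be checked against the protein sequence and the reported GC content, the Python below strips the display spacing, computes a GC percentage, and translates codons. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of any database tooling. The codon table is built from the standard amino acid assignments, which translation table 11 shares; table 11's alternative start codons are not handled here, which should not matter for a gene that starts with ATG.

```python
from itertools import product


def clean(seq_block):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq_block.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Standard codon assignments in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGCTATTAC TCACTCTAGC AAGCCATTCC TCGCTTCAGA TTCTTCACGG GGCCAAAAGG"
dna = clean(first_line)

print(translate(dna))  # MLLLTLASHSSLQILHGAKR
print(gc_percent(dna))
```

The translated fragment matches the first twenty residues of the protein sequence. Passing the full gene sequence through `clean` and then `translate` or `gc_percent` would allow the same comparison against the whole protein and the reported GC content.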