Gene Msed_0016 details


Gene Information

Locus tag: Msed_0016
Symbol: purP
ID: 5105155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 13486
End bp: 14529
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 44%
IMG OID: 640505909
Product: 5-formaminoimidazole-4-carboxamide-1-(beta)-D-ribofuranosyl 5'-monophosphate synthetase-like protein
Protein accession: YP_001190117
Protein GI: 146302801
COG category: [R] General function prediction only
COG ID: [COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases)
TIGRFAM ID:
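
The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which agrees with the 1044 bp given above) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
START_BP = 13486
END_BP = 14529


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1044 347
```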


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000276238
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00501926
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAATAG CGTCGGTAGC CAGCCACTCA GCCCTGGACG TGTTTGATGG TGCTAAGGAC 
GAGGGATTTA GTACAATAGC CCTTTGTAAA AAGGGGAGAG AAAGACCATA TAAGGAATTC
AAGAGAATAG TTGACACTTG CATCATCCTC AACGATTTTA AGGAGATATC CAGTGACGAG
ATTCAGACGA GACTGATCGA GCAAGACGCG CTAATAGTCC CGAACCGAAG TATGGCAGTG
TATCTAGGTT ACGATGCCAT AGAGGCCATG AAGGTCAAGT TCTTCGGTAA TAGAATGATG
CTTAGATGGG AGGAGAGGAC TGGTGAAAAG AATTACTACA GGATACTTGA CGAGGCTAAA
ATAAGGAGGC CTAAGACCCT AAGGCCTGAG GAAGTTGATA GACCTGTTAT AGTCAAAATA
CCCGAGGCTA AGAGAAAGGT AGAAAGGGGA TTTTTCTTCG CAGCTAATAG GGAAGATTTT
CAGGAAAAGT TGAGGAAACT GGAGAAGGAT GGGATAATAG ATCAACAGGG AATATCGCAA
ATGGTCATAG AGGAGTTCAT TTTTGGGGCT TATTTCAATA TCAACTACTT TTACTCGCCA
ATATTTGATA GAGTTGAAAT AATTAGCGTA GATAGAAGGA TACAAAGCGA TTGGGACTCA
TTTTACAGAC TTCCCTCTGA CATACAGAGT AAGCTGGGGA GATATCCAAG ACTAATTGAG
GTGGGCCACG AGCCTGCGAC CATCAGGGAA AGCATGCTCG AAAAGCTGTT CGACGCGGGC
TACTCGTTCG TAGAGACTAC AAGAAAGCTG GAAAAGGGGG GTATAATTGG GCCTTTCACC
CTTCAACTTG CAGTTACTCC AGACTTAGAC ATTGTGGTCT TCGACGTTGC CCCAAGGATA
GGTGGCGGAA CTAACGCGTA CATGGGAATA GGAAGTCAAT ACTCTAAATT ATACTTCGGC
AAGCCCATCA GTCTAGGAAG AAGAATAGCC ATAGAAATCA AGGAAGCTAT ACAGAAGAAG
GAGCTGGACA AGGTTATAAG CTAG
 
Protein sequence
MKIASVASHS ALDVFDGAKD EGFSTIALCK KGRERPYKEF KRIVDTCIIL NDFKEISSDE 
IQTRLIEQDA LIVPNRSMAV YLGYDAIEAM KVKFFGNRMM LRWEERTGEK NYYRILDEAK
IRRPKTLRPE EVDRPVIVKI PEAKRKVERG FFFAANREDF QEKLRKLEKD GIIDQQGISQ
MVIEEFIFGA YFNINYFYSP IFDRVEIISV DRRIQSDWDS FYRLPSDIQS KLGRYPRLIE
VGHEPATIRE SMLEKLFDAG YSFVETTRKL EKGGIIGPFT LQLAVTPDLD IVVFDVAPRI
GGGTNAYMGI GSQYSKLYFG KPISLGRRIA IEIKEAIQKK ELDKVIS
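
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to strip the block spacing, compute GC content, and translate the gene with the codon assignments shared by translation tables 1 and 11. It is an illustration only: the helper names are hypothetical, alternative start codons permitted by table 11 are not handled, and unrecognised codons are rendered as X by assumption.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X: unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAAATAG CGTCGGTAGC CAGCCACTCA"
print(translate(first_blocks))  # MKIASVASHS
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the protein sequence and GC content listed in the record.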