Gene Mbar_A1334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1334 
SymbolpurP 
ID3627446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1645399 
End bp1646469 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content45% 
IMG OID637700225 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase 
Protein accessionYP_304876 
Protein GI73668861 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACAA AACAGCAGGT TCTTGAATTT CTGAAAAATT ACGATTTAGA AAACATTACA 
ATTGCAACAA TTTGCTCCCA CTCTAGCCTT CAAATTTTTG ACGGAGCTCG GAAAGAAGGC
TTCAGAACTC TGGGAATCTG CGTCGGCAAG CCTCCAAAAT TTTATGAGGC TTTTCCCAAA
GCAAAACCCG ATGAGTACCT TGTCGTCGAT AGCTATGCCG ATATAATGAA TAAGGCTGAG
GAGCTTAGAC AGAAAAACAC CATTATTATC CCACACGGCT CAGTAGTTGC TTACCTTGGC
ACGGAAAATT TTGCAGATAT GGCCGTACCT ACTTTTGGAA ACCGGGCTGT GCTCGAATGG
GAATCAGACA GGGATAAAGA GCGTGAGTGG TTGCTCGGAG CAGGCATTCA TATGCCCGGA
AAAGTCGATG ATCCTCACGA TATTAATGGG CCTGTAATGG TCAAGTACGA CGGGGCAAAA
GGAGGAAAAG GTTTCTTCGT CGCAAAAACC TACGAAGAGT TCGAAGAACT TATAGACCGG
ACCCAGAAGT ACACGATTCA GGAATTTATC ACCGGGACCC GTTATTACCT TCATTACTTC
TATTCCCCTA TCCGAAACGA AGGATACACT TTAAGCAAGG GCAGCCTTGA ACTTCTGAGC
ATGGACCGCA GGGTAGAATC CAATGCGGAT GAAATTTTCA GGCTCGGTTC CCCCAGAGAA
CTCGTAGGAG CAGGAATCCG CCCGACATAT GTAGTCACAG GAAATATGCC CCTTGTAGCA
AGAGAATCCC TCTTACCCCT CATCTTCTCT CTTGGAGAAA GGGTAGTTGA AGAATCTCTT
GGCCTCTTTG GCGGGATGAT AGGGTCTTTC TGCCTTGAGA CTGTTTTTAC GGATACGCTA
GAGATTAAAG TGTTTGAGAT CTCGGCCAGA ATTGTTGCAG GAACAAACCT TTACATTTCA
GGTTCTCCTT ATGCCGACCT GATGGAAGAA AACCTCTCAA CCGGAAGAAG AATAGCCAGG
GAAATCAAAA TCGCAGCCCA AACAGGCCAG CTGGATAAGA TAATATCCTA A
 
Protein sequence
MITKQQVLEF LKNYDLENIT IATICSHSSL QIFDGARKEG FRTLGICVGK PPKFYEAFPK 
AKPDEYLVVD SYADIMNKAE ELRQKNTIII PHGSVVAYLG TENFADMAVP TFGNRAVLEW
ESDRDKEREW LLGAGIHMPG KVDDPHDING PVMVKYDGAK GGKGFFVAKT YEEFEELIDR
TQKYTIQEFI TGTRYYLHYF YSPIRNEGYT LSKGSLELLS MDRRVESNAD EIFRLGSPRE
LVGAGIRPTY VVTGNMPLVA RESLLPLIFS LGERVVEESL GLFGGMIGSF CLETVFTDTL
EIKVFEISAR IVAGTNLYIS GSPYADLMEE NLSTGRRIAR EIKIAAQTGQ LDKIIS