Gene Mbur_1029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_1029 
SymbolpurP 
ID3998769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp1111845 
End bp1113002 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content45% 
IMG OID637958805 
Product5-formaminoimidazole-4-carboxamide-1-(beta)-D- ribofuranosyl 5'-monophosphate synthetase-like protein 
Protein accessionYP_565714 
Protein GI91773022 
COG category[R] General function prediction only 
COG ID[COG1759] ATP-utilizing enzymes of ATP-grasp superfamily (probably carboligases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGACA GGAAAGAGAT TATTGAGATT GCGGAGAGCT ATTATACTGA TGACATAAAG 
ATCGGTACAG TTGCTTCTCA TTCAGGATTG GATGTATTTG ACGGTGCCAT CGAGGAAAAT
TTCGAGACCT TTGCAATATG CCAGGCGGGT CGTGAAAAAA CATATACCGA GTACTTCAAG
TCAAAAAGGG ATGCCAATGG CAATGTTGTG CGCGGTATCG TTGATGATCA TGTTGTATAT
GATAAATTCA ATGAGCTCAT GCTTCCAGAG AACCAGCAGA AGCTTGTGGA CGACAATGTT
CTTTTCATAC CTAACAGATC CTTTACTTCT TACTGTGACA TCGATGAGGT CGAGAACGAT
TTCCGTGTAC CAATGGTCGG AAGCAGGAAC ATGCTCCGAA GTGAGGAGCG CGGTATGGAC
CAGGATTATT ACTGGCTTCT TGAGAAGGCT GGTCTCCCAT TCCCTGAAAG GATAAACGAT
CCTGAAGACA TTGATGAGCT TGTAATGGTA AAGCTCCCTC ATGCAGTAAA GAAACTTGAG
CGTGGGTTCT TCACTGCCGG AACTTACAGT GAATATGTGG AGAAGTCCGA GTCCCTTATC
AAACAGGGTG TAATTACAAG GGAAGCCCTT GCGGAAGCAA GGATCGAGCG CTATATTATT
GGTCCGGTCT TCAATTTTGA TATGTTCCAT TCTCCTATCG AGGAAGAAAT GAACAAGACC
GAGATCCTTG GTGTTGACTG GAGGTTCGAG ACAAGTCTGG ACGGTTATGT CAGGCTTCCG
GCACCACAGC AGATGAATCT CGCAGAGCAT CAGTTAACTC CTGAGTACAC AGTATGTGGT
CACAATTCTG CAACACTTCG CGAGTCTCTC CTTGAGGAGG TTTTCAAACT TTCAGAAATG
TATATCAAAG CATCCAAGGA GTTCTATGAC CCCGGGGTCA TTGGTCCTTT CTGTCTTCAG
ACATGCATTG ATAAAGATCT GAACTTCCAC ATTTATGATG TTGCTCCACG GGTTGGCGGT
GGGACGAACG TTCACATGTC TGTTGGCCAT CCATATGCTA ATACATTATG GCGTAAACCT
ATGAGTACTG GAAGGCGCGT TGCCTTTGAG GTACGTCGTG CTATTGAATC CGGGCAATTA
GATAAGATCA TAACATGA
 
Protein sequence
MIDRKEIIEI AESYYTDDIK IGTVASHSGL DVFDGAIEEN FETFAICQAG REKTYTEYFK 
SKRDANGNVV RGIVDDHVVY DKFNELMLPE NQQKLVDDNV LFIPNRSFTS YCDIDEVEND
FRVPMVGSRN MLRSEERGMD QDYYWLLEKA GLPFPERIND PEDIDELVMV KLPHAVKKLE
RGFFTAGTYS EYVEKSESLI KQGVITREAL AEARIERYII GPVFNFDMFH SPIEEEMNKT
EILGVDWRFE TSLDGYVRLP APQQMNLAEH QLTPEYTVCG HNSATLRESL LEEVFKLSEM
YIKASKEFYD PGVIGPFCLQ TCIDKDLNFH IYDVAPRVGG GTNVHMSVGH PYANTLWRKP
MSTGRRVAFE VRRAIESGQL DKIIT