Gene Mbar_A0571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A0571 
Symbol 
ID3626164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp682785 
End bp684401 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content46% 
IMG OID637699462 
Productphosphoribosylaminoimidazolecarboxamide formyltransferase / IMP cyclohydrolase 
Protein accessionYP_304131 
Protein GI73668116 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) 
TIGRFAM ID[TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTAAAGA GGGCACTGCT CAGCGTCTCA GACAAGACAG GAATTGCAGA ATTCGCACGC 
GGGCTTGAAT TACTTGGCGT GAAAATAATT TCAACAGGCG GAACAGCGAA AATTCTTCGC
GACGCCGGCA TAGAAGTTAC CGATGTCTCG GAAGTCACGG GATGTCCGGA GATGATGGGA
GGAAGGGTCA AAACCCTCCA TCCAAGAATT CATGGCGGAC TCTTATGCCT GCGAGAGAGC
AAGGAACAGA TGGCTGAAGC TGAAAGAGAA GACATTTCGC TTATTGACAT GGTGGCTGTA
AACCTCTATC CATTTGAGGT AACAGTCTCA AAAGAAGGCG TCGAACTCGA AGAAGCCATT
GAGAACATAG ACATCGGAGG ACCAACTCTT CTTCGTTCGG CAGCTAAAAA CTACCGTTCG
GTTACAGTAC TCTCAGACCC GTTGGACTAC GGGCGTGTCC TCAAAGAACT TCGCTCAACA
GGGGTAGTTT CGGAGGAAAC TCGAGCTGCC CTTGCAGTTA AGGCTTTCAG ACATACTGCG
GATTATGACG CAACCATTGA TACCTATCTA AGTAGAACCC TGCTCGGCGA AAACGTGCTC
CGCATGAATT TTACCGACGG TGTAAAACTT CGCTATGGGG AAAATTGGCA CCAGAAAGCC
TATTTTTACA AAGATTCCAA AATAGAAGGT CCAACTCTGG CAAAAGCCAC CCAGTTGCAT
GGGAAAGAAC TCTCCTATAA TAATTATGTG GATGCCGATA ACGCCCTTCA GACAGTAAAA
GAACTCGGAA ATGCTCACCC CGCCGTGGCA ATCGTAAAAC ATAATAATCC ATGTGGACTT
GCAACCGGAA ACACACTCCT GCAGGCTCTC CAGGCTGCCT GGGACGGAGA CCCGATTTCA
GCTTATGGAA GTATAATCTG CACCAATGAA ACCTTTGATC TTGAAGCTGC GACTTTTCTT
AACGGAAAAT TTGTAGAGAT AATTCTTGCT CCTGACTTCA ACCCCGATGC TCTTGAATAC
CTGAAAAACA AAAGTGAAAA TCTTAGACTT CTTAAACTGC CTGATCTCAG GGAAGCTTTC
GGGACAGACT ACACATATAA GTATGTAATA GGAGGAATGC TCAAGCAGAG TCGCGATATT
GGCCTCTACG AAAAATGGGA GTCAGTCACA GACATGCCCT ATCCTGAAGA GAAGCGCTCT
CTTTCGGAAT TCTGCCTTAA AGCCTGTAAG TCAACTAAAT CCAATTCCGT GATCCTTGCT
CATGAGTACG AGCCAGGTTT CTTCATGGTA CTTGCTATGG GCGCAGGACA ACCGAATAGG
GTTGACTCAA TTCGCAAACT TGCAGCCACA AAAGCTGTTG AAAATCTCAA GGTGATTTAC
GAACGGGAAA ATCCTGCAAT CTCTTTTGAA GCTTATTGCC AGAAAGTCAT GTCAGAATGC
GTAATGGCCT CAGACGCCTT TTTCCCCTTC GACGACAGCA TAGTTCATGC AGCAGAAAAT
AATATTCGCT ATATAGTTTC CCCAGGTGGA TCAATTCGGG ACGGCGAAGT CATTGCTGCT
GCAAATCGGC TAGGAGTTTC CATGGTCTTT ACAGGCATGC GCCACTTCCT TCATTAA
 
Protein sequence
MVKRALLSVS DKTGIAEFAR GLELLGVKII STGGTAKILR DAGIEVTDVS EVTGCPEMMG 
GRVKTLHPRI HGGLLCLRES KEQMAEAERE DISLIDMVAV NLYPFEVTVS KEGVELEEAI
ENIDIGGPTL LRSAAKNYRS VTVLSDPLDY GRVLKELRST GVVSEETRAA LAVKAFRHTA
DYDATIDTYL SRTLLGENVL RMNFTDGVKL RYGENWHQKA YFYKDSKIEG PTLAKATQLH
GKELSYNNYV DADNALQTVK ELGNAHPAVA IVKHNNPCGL ATGNTLLQAL QAAWDGDPIS
AYGSIICTNE TFDLEAATFL NGKFVEIILA PDFNPDALEY LKNKSENLRL LKLPDLREAF
GTDYTYKYVI GGMLKQSRDI GLYEKWESVT DMPYPEEKRS LSEFCLKACK STKSNSVILA
HEYEPGFFMV LAMGAGQPNR VDSIRKLAAT KAVENLKVIY ERENPAISFE AYCQKVMSEC
VMASDAFFPF DDSIVHAAEN NIRYIVSPGG SIRDGEVIAA ANRLGVSMVF TGMRHFLH