Gene Mbar_A3542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3542 
Symbol 
ID3627564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4544112 
End bp4545260 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content48% 
IMG OID637702368 
Productsulfopyruvate decarboxylase subunit alpha / sulfopyruvate decarboxylase subunit beta 
Protein accessionYP_306991 
Protein GI73670976 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
[COG4032] Predicted thiamine-pyrophosphate-binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0505956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAATC CGGAAGAAGA AGTCATAGCA ATAATGAAAA AGTCAGGTAT CGACCTTGCC 
GCAACTCTGC CCTGCGACAG GATCAAAAAC CTGCTTCCCC TGGTTTCGGA AAATTTTCCT
GAAATCAAGT TAACGAGAGA AGAAAACGGC GTGGGCATTT GTGCCGGGTT TTATCTCGCA
GGGGGAAAGC CCATGATGCT TATCCAGAGT ACCGGGCTTG GAAATATGAT TAATGCCCTA
GAATCTCTGA ACTTAATCTG CAGGATCCCC TTACCTGTTC TGGCAAGCTG GCGCGGGGTT
TACGCTGAAG GCATCGAAGC CCAGGTTCCG CTTGGCGCCC ATCTCCCTGC AATCCTTGAA
GGAGCAGGCC TGGAATATTC AATAATTGAC GAAGCCGAAA AGCTGCCTCT CCTTGAAACC
GTAATAAAAG ACGCTTTTGA GAACCTCAGG CCACATATAG CCCTGATTTC TCCGAAAGTT
TGGGAAAGTT CGGATTGTTG TGCCTGGGAA GCGGCTGGAC TGCCCGAAAA ACCAGAGATT
ATGGAAAGAG CATGTAGTTT TAATCTTACA AGGGAAACCC TCAGGCCATT AATGCTCAGG
AACGATGCGA TTTGTGCAAT TGCTTCTCAA CTCGATGATG AAATTACGGT GACAAATCTG
GGGGTTCCCT GCAAGGAACT TTATGCTTGC AGGGACAGAA AACTCAACTT TTATATGTTT
GGTTCAATGG GGCTCGTCTC GTCGATAGGG TTTGGCCTTG CTCTTAGAAC CGAGAAGACG
GTTGTGACCT TTGACGGAGA CGGCAGCCTG CTTATGAATC CCAACGCCCT GCTCGAAATT
GCCAGAGAAA CTCCGAAGAA CCTGATAGTT ATCTGCCTTG ACAATGGCGC CTATGGCTCC
ACAGGCTCGC AGGAAACCTG TGCTCTCCGC TATATTGACC TGGAAATTTT TGCAACTGCT
TGCGGAATTC GAAATACGGC TAAGGTAAAC AGTCCAGAGG AGTTAATAGA AGCTTTCAGG
AAGTTTCGGG CCATGCAGGA ACTTTCCTTT ATCCATGTAA TCCTGAAGCC TGGAAATACA
AAAGCACCCA ATATCCCCCT GAGCCCTGAA GAGGTTACAA AGCGCTTCAA AGAGGCTTTA
AAAGCTTGA
 
Protein sequence
MANPEEEVIA IMKKSGIDLA ATLPCDRIKN LLPLVSENFP EIKLTREENG VGICAGFYLA 
GGKPMMLIQS TGLGNMINAL ESLNLICRIP LPVLASWRGV YAEGIEAQVP LGAHLPAILE
GAGLEYSIID EAEKLPLLET VIKDAFENLR PHIALISPKV WESSDCCAWE AAGLPEKPEI
MERACSFNLT RETLRPLMLR NDAICAIASQ LDDEITVTNL GVPCKELYAC RDRKLNFYMF
GSMGLVSSIG FGLALRTEKT VVTFDGDGSL LMNPNALLEI ARETPKNLIV ICLDNGAYGS
TGSQETCALR YIDLEIFATA CGIRNTAKVN SPEELIEAFR KFRAMQELSF IHVILKPGNT
KAPNIPLSPE EVTKRFKEAL KA