Gene Mbar_A1198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1198 
Symbol 
ID3624545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1476513 
End bp1478162 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content45% 
IMG OID637700088 
ProductPpx/GppA phosphatase 
Protein accessionYP_304745 
Protein GI73668730 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCCG AGAAAATCTC CGAAGGCAGA GTTGTTGCGT TTATCGATAT AGGCACTAAC 
TCGATCCGGC TTCTTCTGGT CCGGATCAAT CCCAATGGGT CTTACCTGCC CCTGACCAAG
CAGAAGGAAA CGGTCAGACT CGGGGATGGG GAATTCATAG ACAGGATTCT GCAGCCTAAG
GCGATAGAAC GCGCAGTTGT TGTCTGTAAA AAGTTCATGG AACTTGCCAG AGCCTACAGG
GCAGAAGAGG TCATAGCTGT GGCTACCTCA GCAACGCGAG ATGCAAGCAA TAAGGTTCAG
CTTCTCGAAA TGTTGAAAAA AGAAGCAAAC CTGGAAGTTT GTACGATTTC CGGAACTGAA
GAAGCTCGCC TCATTTACCT TGGGGTTTCA AGCGGGCTTC GGCTCGGGAA CTCAAAAGCT
CTGTTCATAG ATATCGGAGG CGGGAGCACT GAAGTATCAG TAGGGGACCA GATCCAATGT
TATTACCTTT ATTCCTTTAA CCTTGGAGCT GTCAGGCTGA CAAATATGTT TTTGCCGGAC
GAAACCGGGC CTGTTTCCGA GGAACAGTAT GAGCAGATTA AAGCTTATAT TCGGCGCAAG
ATTACAGATA TAATAAAAGA CCTTTCCGAG TACGATATAA GCTGTTCAAT TGGAAGTTCG
GGAACAATTG AAAACCTTGC CAGGATCGCT TTTGTCTACC TGCGCAAAAC AGCCCGTGAG
AGTTTTGAGA AACTGGAGTA TGAAGACCTG AAGAAAATAG TCAGGGCAAT GTGCGCCATA
CCCCTCGAAG AGCGGCGCAA ATTCCCGGGA ATCAATGCGC AAAGAGCAGA TATTATTATC
GCAGGGGCTG CAATTATTGA GACCTTTATG GAAGAGATGG AGCTTTCCGA AATCAGGATA
AGTAAACGCG GGCTTCGAGA GGGGCTGCTT GTAGACTATA TCTCAAAGAG TGAATTCTCC
TACATGATTA CGCAGATGTC GGTAAGAAAA CGCAGCATTA TGCAGCTTGG GCTTACCTGC
AATTTCGATG AAGAACATGC TCATACCGTT ACCAGACTTG CTCTTGAGCT CTTTGACAGT
ATTCAGGCTC TTGGAATTTA TAAATTCAGA GAAGGAGAAA GGGAACTTCT TGAATACGGT
TCCACATTGC ACGACATAGG GACATTCCTA TCATATGATA CCCATCAGGC CCATGCCTAT
CACCTTATAA GGGAGAGCAA TCTTCCGGGC TTCCAGCCTG AAGAAATCGA AATAATAGCA
AATCTTGCCT ATTTCCACAG GAAAAATACG CCTAAAAAGA AACATCCCAA TCTTGTCGGG
CTTAATAAAG ATGTAGTCAA AAGTATAAAA GTTCTCAGTG CCCTGCTCCG TATTGCCGAA
GGGCTGGATC GTTCTCACAA TGGAATTATT TCTCATGTCC GGTTTTACAT TGCTTCTACC
GATAGTCTGG TGCTTGAAAT GCACGCCCAG CGGGAGTGCC AGCTTGAAAT CTGGGAAGTA
GAGAAGCAGA AAAAATACTT CAAGAAAATA TTCGGATACA ATCTTCAGTC TAAAGTTCTT
ATAAAGCAAG ATGCTGGAGT TCCCTTAGTT CTTGAAGGAG ATGCTGAGCT TGAGGAACTT
CCAGATGTCA AAATGTCCTC GAAAAGTTAA
 
Protein sequence
MEPEKISEGR VVAFIDIGTN SIRLLLVRIN PNGSYLPLTK QKETVRLGDG EFIDRILQPK 
AIERAVVVCK KFMELARAYR AEEVIAVATS ATRDASNKVQ LLEMLKKEAN LEVCTISGTE
EARLIYLGVS SGLRLGNSKA LFIDIGGGST EVSVGDQIQC YYLYSFNLGA VRLTNMFLPD
ETGPVSEEQY EQIKAYIRRK ITDIIKDLSE YDISCSIGSS GTIENLARIA FVYLRKTARE
SFEKLEYEDL KKIVRAMCAI PLEERRKFPG INAQRADIII AGAAIIETFM EEMELSEIRI
SKRGLREGLL VDYISKSEFS YMITQMSVRK RSIMQLGLTC NFDEEHAHTV TRLALELFDS
IQALGIYKFR EGERELLEYG STLHDIGTFL SYDTHQAHAY HLIRESNLPG FQPEEIEIIA
NLAYFHRKNT PKKKHPNLVG LNKDVVKSIK VLSALLRIAE GLDRSHNGII SHVRFYIAST
DSLVLEMHAQ RECQLEIWEV EKQKKYFKKI FGYNLQSKVL IKQDAGVPLV LEGDAELEEL
PDVKMSSKS