Gene Mbar_A1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A1201 
Symbol 
ID3624548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp1483554 
End bp1485221 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content48% 
IMG OID637700091 
ProductHsp60 
Protein accessionYP_304748 
Protein GI73668733 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02339] thermosome, various subunits, archaeal 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGAC AGCCAATATT CATTTTAAAA GAAGGAAGTA AGCGAACCAG AGGCAGGGAT 
GCCCAGAATA ATAATATTAT GGCTGCAAAA GCAGTAGCAG AAGCAGTCAG GACAACCCTT
GGTCCCAAGG GCATGGATAA AATGCTTGTG GACTCCATGG GCGATGTGGT TATTACAAAC
GACGGGGCAA CCATTCTCAA AGAAATGGAC ATCGAACATC CTGCAGCAAA GATGGTAGTA
GAAGTAGCCA AGACACAGGA TGAGGAAGTA GGAGACGGCA CTACCAGTGC AGCTGTGGTC
GCAGGCCAGC TCTTAAGTAA AGCTGAGGAT TTAATTGAAC AGGAAATCCA CCCGACAATT
ATCGCATCAG GATACAGGCT TGCAGCCGAA AAAGCTGTAG AAGTCCTGAA TTCCCTTGCA
ATGACAGTAG AACTGTCCAA CCGTGACCTG CTGGTCAGCA TTGCTGAAAC AGCAATGACT
GGAAAGGGTG CTGAGTCCTC CAAAAAACTC CTCTCAGAGA TTGCTGTAGA TGCTGTAACA
AGCGTTGTAG ACAAAAATGG AAAGAATAGT GTTGACAAAG ACAACATCAA TGTTGTTAAG
AAAGTCGGTG GCAAGGTCGA GGATTCTGAG CTTATCCGGG GCATGATAAT TGATAAGGAA
AGAATCCACC CCAACATGCC TGAAAAAGTA AAGGACGCAA AAATCATTCT TCTCAACAGT
GCAATCGAAC TGAAGGACAC TGAAGTAGAT GCGGAAATCT CTATAACCTC TCCTGACCAG
CTTCAGTCCT TCCTTGATCA GGAAGAGCAG ATGCTTAAAA AGATCGTCCA GAAGGTTATC
AGCAGCGGAG CAAATGTTGT CTTCTGCCAG AAGGGAATTG AAGAACTTGC CCAGCACTAT
CTTGCAAAAG CAGGTATCTT TGCTGTCCGC AGAGTTAAGA AGAGCGACAT GGAAAAACTC
GCAAGAGCAA CAGGCGGCAA ACTCATCACC AACATGGATG AAATCACTCC TGAAGACCTC
GGATATGCAG CACTCGTTGA AGAGAAAAAG GTTGGTGGAG ACAGCATGAC TTTTGTCACA
GGCTGCGACA ACCCGAAAGC TGTAACAATC CTGCTGCGCG GCGGTACAGA GCATGTTGTT
GATAGTATTG ACAGTGCTCT TGAAGATGCC CTGCGTGTGG TCGGAGTTGC AATCGAAGAT
GAGAAGCTCG TTGCAGGCGG CGGTTCCCCT GAAGTTGAGG TTGCACTCAG GCTCCAGGAA
TACGCAGCAA CTCTCGAAGG CAGAGAACAA CTTGCAGTCA AAGCCTATTC TGAAGCTCTT
GAAATCATTC CAAGAACTCT TGCAGAAAAC GCAGGTTTAG ATCCAATTGA CATGCTCATG
GATCTGCGTT CACAGCACGA GAAAGGTGTA AAGGCCGCAG GACTCAATGT TTACGAAGGC
AAAGTCGTTG ACATGTGGAA AAACTTCGTA GTAGAACCCC TCAGAGTCAA GACCCAGGTT
ATCAATGCAG CTACCGAGTC TGCAGTTATG ATTCTCAGGA TCGACGACGT CATAGCTTCT
ACCCGTGCAG CAGGCCCTGA AGAAGGCGGA ATGCCTCCAG GAGCAATGGG TGGAATGCCG
GGCGGAATGC CACCAGGAAT GGGCGGAATG CCTCCAGGAA TGATGTAA
 
Protein sequence
MAGQPIFILK EGSKRTRGRD AQNNNIMAAK AVAEAVRTTL GPKGMDKMLV DSMGDVVITN 
DGATILKEMD IEHPAAKMVV EVAKTQDEEV GDGTTSAAVV AGQLLSKAED LIEQEIHPTI
IASGYRLAAE KAVEVLNSLA MTVELSNRDL LVSIAETAMT GKGAESSKKL LSEIAVDAVT
SVVDKNGKNS VDKDNINVVK KVGGKVEDSE LIRGMIIDKE RIHPNMPEKV KDAKIILLNS
AIELKDTEVD AEISITSPDQ LQSFLDQEEQ MLKKIVQKVI SSGANVVFCQ KGIEELAQHY
LAKAGIFAVR RVKKSDMEKL ARATGGKLIT NMDEITPEDL GYAALVEEKK VGGDSMTFVT
GCDNPKAVTI LLRGGTEHVV DSIDSALEDA LRVVGVAIED EKLVAGGGSP EVEVALRLQE
YAATLEGREQ LAVKAYSEAL EIIPRTLAEN AGLDPIDMLM DLRSQHEKGV KAAGLNVYEG
KVVDMWKNFV VEPLRVKTQV INAATESAVM ILRIDDVIAS TRAAGPEEGG MPPGAMGGMP
GGMPPGMGGM PPGMM