Gene Mbar_A0259 details

Gene Information

Locus tag: Mbar_A0259
Symbol:
ID: 3624968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosarcina barkeri str. Fusaro
Kingdom: Archaea
Replicon accession: NC_007355
Strand:
Start bp: 308589
End bp: 309776
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 51%
IMG OID: 637699151
Product: putative protease
Protein accession: YP_303823
Protein GI: 73667808
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
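
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(308589, 309776)
print(length)                     # 1188
print(protein_length_aa(length))  # 395
```

Both values agree with the Gene Length and Protein Length fields of the record.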


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTCAA AATATGTTAT CCTGCACAGC AGTGAGATAT TACCACCTTC ACGGGGAGAT 
ATAGGAAGAG CACGCCGAGC AGAAGTATTC CCCTTGGAAG CGGCCCAACC AATCGTCAAG
TTAGAGTACG CCGAACTTAC CAAGCGGGAG AGCAATGATC TGCGCAGAGA TCCCAGAACA
CTTGCCATTG CCAAACCTAT GCCAATGAAA CTGATTGCAC CGGTTGGTAG CCTTGATGCA
CCAACAGCTA CAAAATCCTG GGGAATCGAC GCAGTACGTG CATCGGAATC GCCATTTGAT
GGAACCGGTG TCACTGTAGC TGTGCTAGAT ACTGGAATCG ATCCAAATCA CCCAGCATTT
AAAGGCATGA AGCTGGTTCA GAAGAACTTC ACTACGGAAA TCGATAATGA TATTCATGGG
CATGGCACGC ATTGTGCAGG GACCATTTTT GGCCAGGATG TCAATGGTGT CCGCATCGGC
ATCGCTAGAA AAATTAAATG TGCCCTAATT GGCAAGGTGC TGGGCAAAGA AGGAGGTTCC
TCAGACACGA TCGCCAAGGC CATCCAGTGG GCAGTCCAGG AAGGCGCAAA TGTCATTTCC
ATGTCCCTAG GTATTGATTT TCCGGGCTAT GTAGATTGGC TGGTTCATGA CCAAGGCATG
AATATTAACC CAGCAACATC CCAGGCGCTG GAAGAGTATC GTGCAAACGT CAACCTGTTC
ACCGAGTTAG TGCGCGTCGT GGCAGCACAT GGGGCATTTG GGCAATCTGC AATCATCGTT
GCGGCCAGCG GTAACGAAAG CAATCGGCCT AAATACGAAA TTGCAGTCTC CCCTCCCGCT
GCCGCCACAG GCATCGTTGC CGTTGGCGCA CTGAATAAAT CAGGCAAGGG CTTTAACGTT
GCCGAATTTT CAAATAATCA GGTGAACATT GCCGCCCCTG GCGTTAACAT CATCTCTGCT
AAAGCAGGCA CGAGTGGCCT TATCAGTATG AGTGGGACCA GCATGGCGAC ACCTCACGCT
GCGGGTATTG CTGCCCTATG GGCACAGCGT CAACTGAAAT TGACCGGGAG GATAAATAAC
GTGAGCTTGA TGGCGCAACT TATTGCTAGT GGCACCTTTG ACTCTCTAGT CCCAGGCAGC
GAAGAGGATG ATGTGGGTAC AGGCATCATT CAGGCACCAT TGAAGTGA
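
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC fraction. It assumes the sequence contains only the letters A, C, G and T, and the helper names are hypothetical. How the database rounds its reported GC content is not stated in the record, so no rounding rule is assumed here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C, between 0.0 and 1.0."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# The first block of the gene sequence, used as a small worked example.
first_block = "ATGGATTCAA"
print(gc_fraction(first_block))  # 0.3
```

Passing the full gene sequence text to `gc_fraction` gives the value to compare against the GC content field of the record.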
 
Protein sequence
MDSKYVILHS SEILPPSRGD IGRARRAEVF PLEAAQPIVK LEYAELTKRE SNDLRRDPRT 
LAIAKPMPMK LIAPVGSLDA PTATKSWGID AVRASESPFD GTGVTVAVLD TGIDPNHPAF
KGMKLVQKNF TTEIDNDIHG HGTHCAGTIF GQDVNGVRIG IARKIKCALI GKVLGKEGGS
SDTIAKAIQW AVQEGANVIS MSLGIDFPGY VDWLVHDQGM NINPATSQAL EEYRANVNLF
TELVRVVAAH GAFGQSAIIV AASGNESNRP KYEIAVSPPA AATGIVAVGA LNKSGKGFNV
AEFSNNQVNI AAPGVNIISA KAGTSGLISM SGTSMATPHA AGIAALWAQR QLKLTGRINN
VSLMAQLIAS GTFDSLVPGS EEDDVGTGII QAPLK
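
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a minimal translator, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two tables differ in their permitted start codons, which this sketch does not model). The names `BASES`, `AMINO_ACIDS` and `translate` are hypothetical.

```python
# Codons are ordered with bases T, C, A, G at each position (NCBI convention).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGATTCAA AATATGTTAT CCTGCACAGC"))  # MDSKYVILHS
```

The output matches the first block of the protein sequence, MDSKYVILHS.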