Gene Mpal_2694 details

Gene Information

Locus tag: Mpal_2694
Symbol:
ID: 7272516
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2823917
End bp: 2825833
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 59%
IMG OID: 643571285
Product: ATP-dependent protease Lon
Protein accession: YP_002467681
Protein GI: 219853249
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID: [TIGR00764] lon-related putative ATP-dependent protease
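
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2823917
end_bp = 2825833
gene_length_bp = 1917
protein_length_aa = 638

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codon_count, leftover = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and is excluded from the protein length.
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # True
print(leftover == 0)                                  # True
print(expected_protein_length == protein_length_aa)   # True
```

Under these assumptions the three fields agree: 1917 bp is 639 codons, one of which is the terminal stop codon.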


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACAGTA CTGTAAATGA GGAGCCAACG CACGAGGATA CCATTTTAGA GAACGTGCAG 
GTGGAGACCT CATCCGAGAT TGAGGTACCG GCGCATCTGA TCGATCAGGT GATCGGTCAG
GAGCATGCGG TTGAGGTGAT CAGGAAGGCG GCGATACAAC GCCGTCATGT GATGATGATC
GGTAGCCCGG GGACCGGGAA GTCGATGCTT GCAAAAGCAA TGGCTGAACT GCTCCCCAAA
GAGGAGCTGC AGGACCTGCT GGTCTACCCC AATGCCGAGG ACTCAAACAA TCCTATCATC
AGGACCGTCC CTGCAGGAAA GGCAAAGCAG ATCGTCGGTG CTCATAAAGC TGAGGCCAAG
AAGCGGGCGC AGTTCCGGAA CACGTTGATG ATGTTGCTGA TGGTCGGGAT CATCGGCTAC
TCATTCATCA CGATGCAGTG GCTGATGGGG ATCATCGCTG CAGCCTTCGT CTTCATGGCT
CTCCGGTACA GCACTCCCAA GGATGAGGCG ATGATCCCAA AGTTGCTCGT CTCCAATGAT
ACCACCGCGA CAGCTCCGTT CATCGATGCG ACCGGTTCCC AGGCCGGCGC CTTGCTCGGG
GATGTTCGGC ATGACCCGTT CCAGAGCGGC GGGCTTGAGA CCCCTGCCCA TGACCGTGTG
GAGTCTGGGG CGATCCACCG TGCTAACGGA GGTGTGCTCT TCATCGATGA GATCAATACC
CTCTCGCCGG GTTCACAACA GAACCTGCTG ACAGCACTGC AAGAGGGTGA ATTTCCCATC
ACCGGGCAAA GTGAACGCTC AAGCGGTGCG ATGGTCAGAA CCGAACCGGT CCCGTGCCGA
TTCGTGATGA TCGCAGCCGG CAACCTGGAC GCGGTCCAGG GGATGCATCC GGCCCTCCGG
TCCCGTATCA GGGGGTACGG TTACGAGGTT TTCATGTCCG AGTCGATGGA GGAGACCCCT
GAGAACCGTG AGAAGTTCAT CAGGTTCATT GCCCAGGAGA TCAAGAACGA CGGCAAGATC
CCACACTTCG ACCAGGGTGC AATGGCAGAG GTGCTCAGAG AGGGCCGCCG CCGGTCAGGG
CGCAAAGGGC ACCTGACCTT GAAACTGCGT GACATGGGTG GATTGATCCG GGTGGCCGGG
GACCTGGCCA GGCAGGATGG GGTCGAACTG ACGACCGCTG CCCACGTGCT TGCAGCCAAG
GAGACCGCTC GTTCGATCGA GGATCAGATC TCTGATGAGA ACAGCCGGCG GTTGAAGGAC
TATGATCTCT CGGTGGTGAA GGGGACAAGC ATCGGTCGGG TGAACGGGCT TGCCGTGACC
GGGGCCGACT CGGGCTCGGT GCTCCCGATC ATGGCCGAGG TCACCCTCAG CCAGAGCCAG
TTCGGCCAGG TGATCGCCAC CGGGCTGCTC AAGGAGATCG CCCAGGAGTC GATCACCAAT
GTCTCAGCGA TCATCAAGAA GTTCACCGGG CAGGACATCC AGAAGCTTGA CATCCATATC
CAGTTCATCG GCACCTACGG TGGTGTGGAC GGCGACTCGG CATCGGTTAG TGTGGCCACG
GCTGTGATCA GTGCTATCGA GGGGATCCCG GTCAGGCAGG ATCTCGCGAT GACCGGGTCG
CTCTCGGTTC GTGGGGACGT CCTTCCGATT GGGGGGGTCA CCTACAAGAT CGAGGCTGCG
GCCAAGGCAG GGATCAAGAA GGTACTGATC CCGGCCTCGA ACATGAACGA TGTGATGATC
GAGGAGCGGT ACCGCTCAAT GATCGAGATC GTTCCGGTCT ATCATATCGA GGACGTGCTG
AAGGAGGCCC TGGTCCCGGA GAACGAGGCG GGTTTCCTTT CCAAGATTAA GAACATGGCC
TCGCGGCCGG CCGCCAACCT CCTCGACAAG ACCGGAATCC GTCCAACGGT GATCTGA
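
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage like the one reported in the GC content field. The function names `join_blocks` and `gc_percent` are hypothetical, and whether the database computes its 59% figure with exactly this formula is an assumption. The demo uses only the first two blocks of the sequence, not the full gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence above, used as a short demo.
demo = join_blocks("ATGGACAGTA CTGTAAATGA")
print(len(demo), gc_percent(demo))
```

To check the full record, paste all lines of the gene sequence into `join_blocks` and compare the length with the Gene Length field and the result of `gc_percent` with the GC content field.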
 
Protein sequence
MDSTVNEEPT HEDTILENVQ VETSSEIEVP AHLIDQVIGQ EHAVEVIRKA AIQRRHVMMI 
GSPGTGKSML AKAMAELLPK EELQDLLVYP NAEDSNNPII RTVPAGKAKQ IVGAHKAEAK
KRAQFRNTLM MLLMVGIIGY SFITMQWLMG IIAAAFVFMA LRYSTPKDEA MIPKLLVSND
TTATAPFIDA TGSQAGALLG DVRHDPFQSG GLETPAHDRV ESGAIHRANG GVLFIDEINT
LSPGSQQNLL TALQEGEFPI TGQSERSSGA MVRTEPVPCR FVMIAAGNLD AVQGMHPALR
SRIRGYGYEV FMSESMEETP ENREKFIRFI AQEIKNDGKI PHFDQGAMAE VLREGRRRSG
RKGHLTLKLR DMGGLIRVAG DLARQDGVEL TTAAHVLAAK ETARSIEDQI SDENSRRLKD
YDLSVVKGTS IGRVNGLAVT GADSGSVLPI MAEVTLSQSQ FGQVIATGLL KEIAQESITN
VSAIIKKFTG QDIQKLDIHI QFIGTYGGVD GDSASVSVAT AVISAIEGIP VRQDLAMTGS
LSVRGDVLPI GGVTYKIEAA AKAGIKKVLI PASNMNDVMI EERYRSMIEI VPVYHIEDVL
KEALVPENEA GFLSKIKNMA SRPAANLLDK TGIRPTVI
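
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it uses the standard amino acid assignments, which translation table 11 shares with table 1, and it ignores the alternative start codons that table 11 permits. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string codon by codon, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGACAGTA CTGTAAATGA GGAGCCAACG"))  # MDSTVNEEPT
# Last 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("ACGGT GATCTGA"))  # TVI
```

The two demo outputs match the first ten and last three residues of the protein sequence listed above.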