Gene Mpal_2753 details

Gene Information

Locus tag: Mpal_2753
Symbol:
ID: 7270861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2880563
End bp: 2882248
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 55%
IMG OID: 643571341
Product: peptidase C1A papain
Protein accession: YP_002467736
Protein GI: 219853304
COG category: [R] General function prediction only
COG ID: [COG3291] FOG: PKD repeat
TIGRFAM ID:
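
The coordinate and length fields in this record are related by simple arithmetic. The sketch below is a quick consistency check, assuming the start and end positions are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The variable names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information record.
start_bp = 2880563
end_bp = 2882248
reported_gene_length = 1686     # bp
reported_protein_length = 561   # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both computed values agree with the reported 1686 bp and 561 aa.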


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.757043
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.325796
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCTCT TCAGTGAAGG TGCAGTCTCT GCAGCAGAAA CCCTGGCATC CAACGAATCG 
GTCACAACAC TCAACCCGAT CATGCACCCA AACCAGACGA CCCTGCAGCG GTGGATCGAA
CAGTATGAGG CTGCTGAGAC TGTGCCGATA GATCAGACGA TCAAAGACCG GGCGATCCAG
TCAACTGGTG CACGGTCGCT CCTCTCGGAT CTGCCCTATA TCGCTTCCGA GCGGAATCAG
AATCCGATTG GGAACTGCTG GGTCTGGGCC GGCACTGGTG TACTTGAAGT AGCTCATGCA
GAACAGACCG GGATCAAAGA CCGACTCTCG ATCCAGTACC TCGACTCGAA TTATAATGAT
GGATCCGGCC CTGACTGGGC TGGTGAGGGA GGGTCCCTGA CCGATTTTGT CCAGTTTTAT
AACCAGCAGC ATATCGCAGT CCCCTGGTCC AACTACAACG CGAACTTCCA GGATGGACAG
TTATGGTCCT TGAAGAATAT GCAGACCTGG ATTCCTGCCG CATCGATACA AACCACCCCT
CATTATGACA TCTCCAGCAT CACTGGACAG AAGGTTCAAA CCAGGGGGGT CAGTCAGGTC
ACAGCGATCA CCAATATCAA GGCGGTGTTG GATCAGAACC GAGGTATTTA TCTCGGTTTC
AAACTCCCAG ACGAGGCGGC CTGGGACGAT TTTGAAACCT TCTGGAACGA TCAGGGGGAG
GATGCTGTTT GGGATCCGTC TCCATATGTC AGTAGAACGT GGGACAGCAA CACCGGTGGT
GGTCACGCTG TGCTCTGTAT TGGTTATGAT GACACGGACC CCAACAATCG TTACTGGATC
CTCGTCAACT CCTGGGGGAC TGCCGGGGGC AAAAGGCCAA ACGGCATCTT CCGAATGAAG
ATGGATGTCG ATTACAGTGC TACCTATTAT GGACTGTACA ATCTGCCCGC GATGCAATGG
GAGACCGAGT CAGTGACATT CAGCGCTACC CCGCAGCCCG TCTTCACGAT CACCACAGTC
GCCCCCATAC CAGCGTCGCT CGGACTTTCC CCGAAAAATA ACACAATGAA GAGCGGGGGA
AACCTGTCAC TCGACCTGAA CCTCACCCCA TCAGTGAACG GCCTGTCTGG GTATATCATC
ACGATCACCA GCGATAACCC CCAGGTGGCC TCGATCACCA GTGCAACGCT CCCCGCCTGG
GCTGGTATGA CCAGTATCTC GACACTCCCG TCATCGACCG TCACCGTCAT GGGCTCTGAC
CTCTCCGATC AGGTCCACGG AGCAGCACAG ACCATCCCCC TGGCCAACCT GGGGATCACG
ACCGGAAAGG CAGGGACGGC CACACTGACT GTGAAGGTGA CCGGGCTCAG CGACGACAGT
GGACACCCCA TCCTGACCAC CAGCGTGCCG GCTGTGATCA CAGTGCAGGC GCCGAATATC
TCCTCCGGAG TGATTCCAAT CCAGTCAAAT GGAGACCTGC TGATAGGCAA CCAGCCCACC
GACCCGGATC ATGACGGGCT ATACGAGGAT CTGAACGGGA ATGGGGTGCT GGACTTCAAT
GATATCACGC TCTTCTTCAA TCAGATGGAC TGGATCAGTG GGCATGAACC GTTAGGGGCG
TTCGACTTCA ATGGAAACGG TCAGATCGAC TTCAATGATA TCGTCATCCT CTTCAATTCA
CTTTAA
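
The gene sequence is displayed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any computation. The sketch below shows one way to compute a GC percentage from text in that layout. The helper names `clean_sequence` and `gc_percent` are hypothetical, and how the database rounds its reported GC content is not stated here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence: 8 of the 20 bases are G or C.
first_two_blocks = "ATGATTCTCT TCAGTGAAGG"
print(len(clean_sequence(first_two_blocks)))
print(gc_percent(first_two_blocks))
```

Passing the full gene sequence text to `gc_percent` gives the value to compare against the reported GC content.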
 
Protein sequence
MILFSEGAVS AAETLASNES VTTLNPIMHP NQTTLQRWIE QYEAAETVPI DQTIKDRAIQ 
STGARSLLSD LPYIASERNQ NPIGNCWVWA GTGVLEVAHA EQTGIKDRLS IQYLDSNYND
GSGPDWAGEG GSLTDFVQFY NQQHIAVPWS NYNANFQDGQ LWSLKNMQTW IPAASIQTTP
HYDISSITGQ KVQTRGVSQV TAITNIKAVL DQNRGIYLGF KLPDEAAWDD FETFWNDQGE
DAVWDPSPYV SRTWDSNTGG GHAVLCIGYD DTDPNNRYWI LVNSWGTAGG KRPNGIFRMK
MDVDYSATYY GLYNLPAMQW ETESVTFSAT PQPVFTITTV APIPASLGLS PKNNTMKSGG
NLSLDLNLTP SVNGLSGYII TITSDNPQVA SITSATLPAW AGMTSISTLP SSTVTVMGSD
LSDQVHGAAQ TIPLANLGIT TGKAGTATLT VKVTGLSDDS GHPILTTSVP AVITVQAPNI
SSGVIPIQSN GDLLIGNQPT DPDHDGLYED LNGNGVLDFN DITLFFNQMD WISGHEPLGA
FDFNGNGQID FNDIVILFNS L
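
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G or T.
    """
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence give the first block of the protein.
print(translate("ATGATTCTCT TCAGTGAAGG TGCAGTCTCT"))  # MILFSEGAVS
```

This gene starts with ATG, so the alternative start codons permitted by table 11 do not affect the result for this record.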