Gene Information |
Locus tag | Mpal_2753 |
Symbol | |
ID | 7270861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 2880563 |
End bp | 2882248 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643571341 |
Product | peptidase C1A papain |
Protein accession | YP_002467736 |
Protein GI | 219853304 |
COG category | [R] General function prediction only |
COG ID | [COG3291] FOG: PKD repeat |
TIGRFAM ID | |
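The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; both are assumptions, although they agree with the reported values. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2880563, 2882248)
print(length_bp, protein_length(length_bp))  # prints: 1686 561
```

The result matches the Gene Length (1686 bp) and Protein Length (561 aa) fields.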
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.757043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.325796 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCTCT TCAGTGAAGG TGCAGTCTCT GCAGCAGAAA CCCTGGCATC CAACGAATCG GTCACAACAC TCAACCCGAT CATGCACCCA AACCAGACGA CCCTGCAGCG GTGGATCGAA CAGTATGAGG CTGCTGAGAC TGTGCCGATA GATCAGACGA TCAAAGACCG GGCGATCCAG TCAACTGGTG CACGGTCGCT CCTCTCGGAT CTGCCCTATA TCGCTTCCGA GCGGAATCAG AATCCGATTG GGAACTGCTG GGTCTGGGCC GGCACTGGTG TACTTGAAGT AGCTCATGCA GAACAGACCG GGATCAAAGA CCGACTCTCG ATCCAGTACC TCGACTCGAA TTATAATGAT GGATCCGGCC CTGACTGGGC TGGTGAGGGA GGGTCCCTGA CCGATTTTGT CCAGTTTTAT AACCAGCAGC ATATCGCAGT CCCCTGGTCC AACTACAACG CGAACTTCCA GGATGGACAG TTATGGTCCT TGAAGAATAT GCAGACCTGG ATTCCTGCCG CATCGATACA AACCACCCCT CATTATGACA TCTCCAGCAT CACTGGACAG AAGGTTCAAA CCAGGGGGGT CAGTCAGGTC ACAGCGATCA CCAATATCAA GGCGGTGTTG GATCAGAACC GAGGTATTTA TCTCGGTTTC AAACTCCCAG ACGAGGCGGC CTGGGACGAT TTTGAAACCT TCTGGAACGA TCAGGGGGAG GATGCTGTTT GGGATCCGTC TCCATATGTC AGTAGAACGT GGGACAGCAA CACCGGTGGT GGTCACGCTG TGCTCTGTAT TGGTTATGAT GACACGGACC CCAACAATCG TTACTGGATC CTCGTCAACT CCTGGGGGAC TGCCGGGGGC AAAAGGCCAA ACGGCATCTT CCGAATGAAG ATGGATGTCG ATTACAGTGC TACCTATTAT GGACTGTACA ATCTGCCCGC GATGCAATGG GAGACCGAGT CAGTGACATT CAGCGCTACC CCGCAGCCCG TCTTCACGAT CACCACAGTC GCCCCCATAC CAGCGTCGCT CGGACTTTCC CCGAAAAATA ACACAATGAA GAGCGGGGGA AACCTGTCAC TCGACCTGAA CCTCACCCCA TCAGTGAACG GCCTGTCTGG GTATATCATC ACGATCACCA GCGATAACCC CCAGGTGGCC TCGATCACCA GTGCAACGCT CCCCGCCTGG GCTGGTATGA CCAGTATCTC GACACTCCCG TCATCGACCG TCACCGTCAT GGGCTCTGAC CTCTCCGATC AGGTCCACGG AGCAGCACAG ACCATCCCCC TGGCCAACCT GGGGATCACG ACCGGAAAGG CAGGGACGGC CACACTGACT GTGAAGGTGA CCGGGCTCAG CGACGACAGT GGACACCCCA TCCTGACCAC CAGCGTGCCG GCTGTGATCA CAGTGCAGGC GCCGAATATC TCCTCCGGAG TGATTCCAAT CCAGTCAAAT GGAGACCTGC TGATAGGCAA CCAGCCCACC GACCCGGATC ATGACGGGCT ATACGAGGAT CTGAACGGGA ATGGGGTGCT GGACTTCAAT GATATCACGC TCTTCTTCAA TCAGATGGAC TGGATCAGTG GGCATGAACC GTTAGGGGCG TTCGACTTCA ATGGAAACGG TCAGATCGAC TTCAATGATA TCGTCATCCT CTTCAATTCA CTTTAA
|
Protein sequence | MILFSEGAVS AAETLASNES VTTLNPIMHP NQTTLQRWIE QYEAAETVPI DQTIKDRAIQ STGARSLLSD LPYIASERNQ NPIGNCWVWA GTGVLEVAHA EQTGIKDRLS IQYLDSNYND GSGPDWAGEG GSLTDFVQFY NQQHIAVPWS NYNANFQDGQ LWSLKNMQTW IPAASIQTTP HYDISSITGQ KVQTRGVSQV TAITNIKAVL DQNRGIYLGF KLPDEAAWDD FETFWNDQGE DAVWDPSPYV SRTWDSNTGG GHAVLCIGYD DTDPNNRYWI LVNSWGTAGG KRPNGIFRMK MDVDYSATYY GLYNLPAMQW ETESVTFSAT PQPVFTITTV APIPASLGLS PKNNTMKSGG NLSLDLNLTP SVNGLSGYII TITSDNPQVA SITSATLPAW AGMTSISTLP SSTVTVMGSD LSDQVHGAAQ TIPLANLGIT TGKAGTATLT VKVTGLSDDS GHPILTTSVP AVITVQAPNI SSGVIPIQSN GDLLIGNQPT DPDHDGLYED LNGNGVLDFN DITLFFNQMD WISGHEPLGA FDFNGNGQID FNDIVILFNS L
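The sequences in this record are printed in space-separated groups of ten characters, so the spaces have to be removed before any downstream use. The following is a minimal sketch of that cleanup plus a GC-content calculation; the helper names are hypothetical, and the demo input is only the first two groups of the gene sequence.

```python
def strip_groups(seq: str) -> str:
    """Remove the whitespace that splits a sequence into 10-character groups."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two groups of the gene sequence, used as a short demo input.
demo = strip_groups("ATGATTCTCT TCAGTGAAGG")
print(demo, len(demo), gc_percent(demo))
```

Applied to the full gene sequence, the stripped length would be expected to equal the 1686 bp reported in the Gene Length field, and the GC percentage can be compared against the GC content field.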