Gene Mpal_1732 details


Gene Information

Locus tag: Mpal_1732
Symbol:
ID: 7271296
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 1801841
End bp: 1804462
Gene Length: 2622 bp
Protein Length: 873 aa
Translation table: 11
GC content: 60%
IMG OID: 643570346
Product: peptidase U32
Protein accession: YP_002466762
Protein GI: 219852330
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
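
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene span includes the terminal stop codon; neither convention is stated in the record itself. The function name `cds_lengths` is a hypothetical helper introduced for illustration only.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and that the span ends with
    a stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return gene_length, codons - 1  # drop the stop codon


# Values taken from the record: Start bp, End bp.
gene_len, protein_len = cds_lengths(1801841, 1804462)
print(gene_len, protein_len)
```

Under those assumptions the result agrees with the listed Gene Length (2622 bp) and Protein Length (873 aa).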


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCAGA AACAGAGGGA TGGGCTGCCT GAATTGCTCG CGCCAGCAGG GTCGGAAGAG 
GCGCTGATCG CTGCCATTAA TGCAGGGGCG GATGCCGTGT ACCTGGGCGG GTCGCGTTTT
GGGGCTCGAC AGTTTGCAAC GAACTTTGAT GAAGCGGCCC TGGTCCGTTC GGTAGCCAGA
GCGCATGCAC AGAACGTCGC GGTCTATGTC ACCGTCAACA CCCTGATCCA TGACCGGGAA
CTCGTTGACG TGGCCCGGTA TCTGCTGGTG CTGTACCGTA TGGGGGTGGA TGCGGTGCTG
GTGCAGGATG TCGGAGTGGC AGCCCTGGCC AGATGGCTGG TTCCTGACCT GCCGCTCCAC
GCGTCGACTC AGATGACGAT CTACACCATT GGAGGGGTGC GGTGGGCCGT AGCGCATGGA
TTCTCACGGG TGGTGCTGGC CCGCGAACTC TCCCTTGCAG AGGTGACGGC GATCGCCGCT
GCAGCAGAGG AGGAACATCT CGATATCGGT CTGGAGATAT TTTTACATGG AGCGCTCTGT
TACAGTTACT CAGGACAGTG CCTCCTCTCG TCGGTGATCG GGGGACGGTC CGGAAACCGG
GGGAGCTGCG CACAGCCGTG CAGAAAACCA TACACCCTCC TCCATGGAGA GACAGACCAG
TACGGGAGGC CTGTTCGGAT GCAGGAGGTC CGGGATGATG AGCAGTACCT CCTATCGCCA
AAGGATCTTG CCTGCTACCC GCACCTCGAT CAGGTTGTCC ATTCGCCGAT CACCTCGTTA
AAGATCGAAG GAAGGATGAA GTCCCCCGAA TATGTTGCCA CCGTTGTTGG GATCTACCGT
ACCGCGCTCG ATGCGATCAA AACTGGGGGG TGGAAGCCTG AAGATGCGGT GATCGAACGC
CTGCTGATGG CCTTCAACCG GGGTTTCACG GGTGGGTACC TGCTGGGCGC CCGTCACCGA
TCATTGATGG GGCGAGATCG ACCGGATAAT CGCGGGCTCT TCGTCGGGAC GGTATTGGAA
TCTGATAGCG AACGGGGGGA GGTCATCGTT ACCCTGGAGG GGACGACCGC CCCTGATACA
GGTGACGGGC TGGTGTTCAT CGCTCCTGAC GGTGAGGAGT GCGGTCTGGT GCTGCGGGGT
ACACCGTGGT ACCATCCAGG GGAACAGAAG GTCTCCCTGC CTGTGCAGGA TCGGGTTGTC
GTGGGTTCGA AGGTCTACCT GACAAGAAGT GCAGTGTATA CCCGGGCCAC ACGGCGGCAG
ATCAGCGATG ATGCGGCACA CCCCCCTGCT CGAGTCTCCC TCTCCCTCTC GTTCTGGTTT
GATGAAGAGC GACTGCCGCA CCTCTCCGGC ACGGTCCTCC GCAGGGACCA GATTGCAGTT
CCGGTTTCAG TCATCGGCGA CGAACCGTTT CTGGAGGCAC GGGAACACCC GCTCACCAGC
GACCGGATCC GGGAACAGCT GACCCGGACT GGGGGAACTC CGTTTGTGAT CACCGATCTT
GTGATCGACA ACCCCGGTAC ACTCTTTGGT CGGAGCAGTG CGCTCAACAG GCTCCGGCGC
GACCTCCTCT CCGCTGCGGC CGGTGTAATC GTCAGTTCAT ATCATCCATC AGAGGAGGCA
CAGGCTGCCG CCGAAGCCCG ATGCACCCGG TTTGTTGAGA TGGAGAGCCA GCCGGGGAAT
GTTATACCAG TGTCGACGAT CCCGAATCTG GTCCTCTTTA CAGATACCCT GGAAGCGGTG
GCAGCTGCGG CTGCGGCCGG GTGTCGTCTG ATCTGTTTTG AACCAAAATC ACCCGCCCCC
TGTGGATGTA CTGGTACGGT CCCCTCGATG ATCGATCAAC TCACCAAGGC AGTTAATCTC
CTCCGTACAA CCGGGGCGAA ACTGGTCTGG AAGTGGCCGC AGATCACCCG TCATGAGTTT
TTTAAGATGG CCGGCGATAT CCTGGCTACT CCTCTTATCG AAGACCTCGC AGGTGTGATG
GTCAACGGGA TCGGAGCGGC AACTGCCCTG CAGGAGATGG CCCCCACCCT TCCACTCTAT
GGCGGGCAGG GGTTGAACAT CTGGAACCGA TGTACGGTCA ATGCGCTGAC TGGGTTCCAC
CTGCTGACCC TCTCTGGGGA ACTCTCACGT GAGGAGATTG CAGAACTCAC CGATATCGTC
TCGCGTCTAC CATCCAGCGA TCATTGTCCA CATCTCGCCC TGATCGTTCA GGGGAGTGCC
GGGGCACTGG TGACCGAGGA CTGTCCGGTC GGCACGTCCT GTGGATGTAT CCCGCCGCGT
ACCGGTTCAT GGGCCGTCCG CGATCAGAAA GGCGAGGTAT TCCCTCTCCT CTTCGATGGC
AGTTGCAGGA CCCGCATCCA GAACGCCGTC GAGACCTGTC TGATCGATCA TCTCCTGGCG
ATCGCTGGCA TCGGTGTCAC CGAGGTGGTG ATCGACGCGA GGGGCAGAAC CCCCCTCTAT
GCAGAGGAGA TGACCCGGCT GTACCTGCAG GGGCTCGATC TGGTGGCGAC GGGTGGTCGT
CGGAGTGTAG AGGAACTTGC GCACCTTAAG GAAAAGGTGA AGGTGATCTC CCTTGGTGGC
ATCACCACCG GTCCGTTCCT GCATGGACTC AAAGAGACCT GA
 
Protein sequence
MNQKQRDGLP ELLAPAGSEE ALIAAINAGA DAVYLGGSRF GARQFATNFD EAALVRSVAR 
AHAQNVAVYV TVNTLIHDRE LVDVARYLLV LYRMGVDAVL VQDVGVAALA RWLVPDLPLH
ASTQMTIYTI GGVRWAVAHG FSRVVLAREL SLAEVTAIAA AAEEEHLDIG LEIFLHGALC
YSYSGQCLLS SVIGGRSGNR GSCAQPCRKP YTLLHGETDQ YGRPVRMQEV RDDEQYLLSP
KDLACYPHLD QVVHSPITSL KIEGRMKSPE YVATVVGIYR TALDAIKTGG WKPEDAVIER
LLMAFNRGFT GGYLLGARHR SLMGRDRPDN RGLFVGTVLE SDSERGEVIV TLEGTTAPDT
GDGLVFIAPD GEECGLVLRG TPWYHPGEQK VSLPVQDRVV VGSKVYLTRS AVYTRATRRQ
ISDDAAHPPA RVSLSLSFWF DEERLPHLSG TVLRRDQIAV PVSVIGDEPF LEAREHPLTS
DRIREQLTRT GGTPFVITDL VIDNPGTLFG RSSALNRLRR DLLSAAAGVI VSSYHPSEEA
QAAAEARCTR FVEMESQPGN VIPVSTIPNL VLFTDTLEAV AAAAAAGCRL ICFEPKSPAP
CGCTGTVPSM IDQLTKAVNL LRTTGAKLVW KWPQITRHEF FKMAGDILAT PLIEDLAGVM
VNGIGAATAL QEMAPTLPLY GGQGLNIWNR CTVNALTGFH LLTLSGELSR EEIAELTDIV
SRLPSSDHCP HLALIVQGSA GALVTEDCPV GTSCGCIPPR TGSWAVRDQK GEVFPLLFDG
SCRTRIQNAV ETCLIDHLLA IAGIGVTEVV IDARGRTPLY AEEMTRLYLQ GLDLVATGGR
RSVEELAHLK EKVKVISLGG ITTGPFLHGL KET
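
The protein sequence can be related back to the gene sequence by codon-wise translation. The following is a minimal sketch, not the method used to produce this record. It builds the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in amino-acid assignments), and translates the first and last blocks of the gene sequence shown above. The names `CODON_TABLE` and `translate` are hypothetical and introduced here for illustration.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with
# the standard amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop.

    Whitespace (such as the 10-base grouping used in this record) is
    removed before translation.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence (10 codons).
print(translate("ATGAATCAGA AACAGAGGGA TGGGCTGCCT"))
# Final 12 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AAAGAGACCTGA"))
```

The two calls yield the opening residues (MNQKQRDGLP) and the closing residues (KET) of the protein sequence listed above.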