Gene Mpal_1914 details

Gene Information

Locus tag: Mpal_1914
Symbol:
ID: 7272731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2029210
End bp: 2030127
Gene Length: 918 bp
Protein Length: 305 aa
Translation table: 11
GC content: 66%
IMG OID: 643570528
Product: proline-specific peptidase
Protein accession: YP_002466941
Protein GI: 219852509
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily
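
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; the record does not state either convention, so treat both as assumptions. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 2029210
end_bp = 2030127

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the reported protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the 918 bp and 305 aa reported in the record.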


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.094773
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACACCG GGCCAGGGAC TGATGAGAAC AGAGAGCAGC GTGACGAGGA AGGGTTCATC 
CAGACCCCGG ATGGAAAGGT CTGGTACCGG ATCGTCGGTG GGGGATCCGC TGGAATACCA
CTGCTGGTCC TGCACGGCGG CCCGGGGTTT CCGCATGACT ATCTGGAACC GCTCGAGGCC
CTGGCAGACG AGCGGCCGGT GATCTTCTAC GACCAGCTCG GCTGCGGCCG GTCCGACCGG
CCGGACGACC CGTCCCTCTG GACGATCGAG CGGTATGTCG ACGAGGTGGC GGCGGTCAGG
GAGGCGCTCG GGCTGAAGGC GGTCCACCTG CTCGGGCAGT CGTGGGGGAC GATGCTGGCG
GTGGCCTACC TGGTCCGGGA AGGGCCGACC GGGATCGTCA GTGCAGTCCT CTCTGCCCCC
TATATCAGCA CACCACGCTG GATCGCCGAC CAGCGGGCAT ACCTCGCAGC GATGACCGAG
TCAGTGCAGG AGGCGGTCAG GGTCCACGAG GCTTCAGGAG ACTTTGCTGC GCCAGCCTAT
CAAGAGGCGA TGACGGCCTA CTACCAGGAG CATCTCTGCC GCCTCGAAAC ACGGCCAGAT
TGTCTGCAAC GGAGTATGGA CGGTAGCAGT GCAGCGATCT ACGCACAGAT GTGGGGGCCG
AGCGAGTTCA CCGTGACCGG GACACTCCGG ACAGCAGACC TGACCGACCG CCTGCCCTCG
CTGACGATCC CGGTCCTCTA TACCTGCGGG GAGTTCGATG AGGCGACGCC GGCTACCACC
CGGTTCTATC AGGAGCTGAC CCCCGATGCC GGGATGATCG TCTTGGCCGG CGCCTCGCAT
CAGCACCATC TGGAAGAGCC GGAGCAGTTC CTCGCCGCGG TCCGCCGGTT CCTGGCTGCC
GCTGAAGAGC GGCGATGA
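
The gene sequence is laid out in space-separated blocks of 10 bases, so the spacing has to be removed before the length or the GC content can be checked. The following Python sketch shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tool, and only the first row of the sequence is used here to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for the block layout; upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of 10 bases).
first_row = "ATGCACACCG GGCCAGGGAC TGATGAGAAC AGAGAGCAGC GTGACGAGGA AGGGTTCATC"

seq = clean_sequence(first_row)
print(f"{len(seq)} bases, GC content {100 * gc_fraction(seq):.0f}%")
```

Pasting the full sequence into `clean_sequence` in place of `first_row` would allow the reported gene length and GC content to be checked against the record.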
 
Protein sequence
MHTGPGTDEN REQRDEEGFI QTPDGKVWYR IVGGGSAGIP LLVLHGGPGF PHDYLEPLEA 
LADERPVIFY DQLGCGRSDR PDDPSLWTIE RYVDEVAAVR EALGLKAVHL LGQSWGTMLA
VAYLVREGPT GIVSAVLSAP YISTPRWIAD QRAYLAAMTE SVQEAVRVHE ASGDFAAPAY
QEAMTAYYQE HLCRLETRPD CLQRSMDGSS AAIYAQMWGP SEFTVTGTLR TADLTDRLPS
LTIPVLYTCG EFDEATPATT RFYQELTPDA GMIVLAGASH QHHLEEPEQF LAAVRRFLAA
AEERR
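
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free illustration in Python, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The function name `translate` is an illustrative choice.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, in TCAG order.
# Translation table 11 uses the same assignments; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, as displayed.
print(translate("ATGCACACCG GGCCAGGGAC TGATGAGAAC"))
print(translate("GCTGAAGAGC GGCGATGA"))
```

The two fragments translate to the first ten residues and the last five residues of the protein sequence shown above, with the final TGA codon read as the stop.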