Gene Information |
Locus tag | Mpal_0768 |
Symbol | |
ID | 7270509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | - |
Start bp | 782624 |
End bp | 784729 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643569416 |
Product | Thimet oligopeptidase |
Protein accession | YP_002465853 |
Protein GI | 219851421 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
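The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 782624
end_bp = 784729

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (Is gene spliced = No) and ends with one
# stop codon that does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (2106 bp) and Protein Length (701 aa) fields.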
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.484964 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0509225 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCAC GACTCAGGAC CATCCGGTAT TACCCAGTAA CTGCCGGCGT AGTTGCGCTC CTCGTTGCCC TCCTGATGAT CGCAGGATGC CTCCAGGACG GCACATCGGC AGACTCTGAA CAGAATCGCA CGACACCGGC AGCACCAATC CAGACCCGGT ATTCAACCGG AGAGATTACC AGGATCAATG ATCAGGCAGA AGAGGAGGCT ACTGCCTCAC TCAATGCTAT AGCCGCCATT CCCCCCGGGG AGCGTACGGT CGAGAACACC CTCCTTGCGT ACGACCGGAT ACTATCCGAT TACAACGATG CGATCGGACC ACTCACGCTG ATGCGGTATG TATATCCGGA TCCGGCCATC GCCGCTGAAG GCAGTGAGGT CGCAACATCC TCCCAGATCT TTCTCAATGG AGTGACCACG CGGCGCGACC TGTACAGTGC ACTGAAGGGT CAGGTTCCGC AAACCCCGGA TGAAGTCCGG CTCTATAACG AGACCATCCG CGAATTCGAG CACAACGGGC TGAGGCTTCC GGACGACCGG CTGGCCAGGG TCAGAGAGAT GCGGGCGAAC CTGAGTTCTC TTGAATCACA GTACCTGTTC AACCTGAACA ACGACACTAC CACGCTTGAG TTCACGGAAG AGGAACTGAC CGGCGTCCCG GCAGCAACGA TCGCCACATT CAAAAAGACA CCGCAGGGAA CCTGCATCGT TACAACGAAA TACCCTGACT ACACCTCCGT GATGGCGAAC GCCGACAGGA GCGAAACCCG GAAGAAGATG TATGCCGCCT ACTTTAACCG GCAGGCAGAG GCGAACACCG CCCTCCTTGA ACAGGCTATC GATCTCCGCC GGCAGATTGC GCAGGAACTC GGGTATGCCA CCTGGGCCGA CTTCCAGTTG GACGGGAGGA TGGCGAAGAG CACCGGCACG GTCATGACAT TCCTCACAAC GTTACAAGAA CCCCTGCGGG AGAAGACCAG GGAAGAATTC ACAGGACTTC TAGCGATCAA GAAAGGGCTC GATCCCCAGG CAACGACGGT CGATCCCTGG GATATTATGT ACCTGCAGGA GATCCGGAAG AAACAGCAGT ATGCGTACAA CGACGACGAG GTCCGGGAAT ACTTCCCGAT GGACAACGTG CTGCAGGGCC TCTTCTCGAT CTATGGCACC CTCTTTGGCA TCGGGTTCGA TGAGGTGAAG GGTGCCCCGG TCTGGTCGCC CGAAGTGCGC CTGTTCCGTG TGAGGAACCT CTCCGACAAC GCCACGGTCG GGTACCTGTA TCTCGATCTC TATCCGCGGG ACGGTAAGGA TGCGTGGTTC TCCGAGTCCG ATGTTATCAA AGGAAGGCAG AACAACGGCT CGTATCAGGT CCCGGTTGCT GCAATTATTG CAAATTTCCA GGCCCCGTCA GGAGACAAAC CCTCGCTCCT GACCCCCTAC GATCTGGAGA CGCTCTTCCA TGAAAGCGGG CATGCCATGC ACAGTCTTCT CACCACCGCG CCCTATGGTA CGATGTCCGG GACCAGTGTC GAGTGGGATT TTGTCGAGAC TCCCTCACAG GCGCTTGAGG AGTGGGTCTG GGACCCTCAG CTGCTGGAAT CCATCTCAGG CCACTATACG AATACCTCCC AGAAGATCCC CGCGGACCTC CGCGACCGGG TCATTGCCGC ACAGCAAGCT TCCATGGGAA GTGATTACAG CAACCGTATG GAGAAATCAC TGGAGGATAT GCGTTTCCAC ACGGCTGCAG AACCGGTCAA CGTGACAGAG GTATCATACC AGACCTACGA GGAGGTAATG 
GGCATACCTC AGCTTGCAGG GACGCACCAA CCTGCATCGT TCGACCATAT CATGGACGGG TACGATGCAG GGTATTACAG TTATCTGTGG TCGAAAGTGT ACGCTCTCAG CATCGTTGAT ACATTCAAAC GCGACGGGAT GACCAACCAG ACCACCGGCA TGAAGTTCCG GCAGGAGATA CTTGCCCGGG GTAACATGGA GGACGGCAGC GTGCTCCTGA AAAATTTCCT GGGGAAGGAA CCCGACATGG AGGCTCTGTA TCGGCACATC GGGATTCATA TGTCGCAGCC TGCATCCGGG ACGTAA
|
Protein sequence | MPPRLRTIRY YPVTAGVVAL LVALLMIAGC LQDGTSADSE QNRTTPAAPI QTRYSTGEIT RINDQAEEEA TASLNAIAAI PPGERTVENT LLAYDRILSD YNDAIGPLTL MRYVYPDPAI AAEGSEVATS SQIFLNGVTT RRDLYSALKG QVPQTPDEVR LYNETIREFE HNGLRLPDDR LARVREMRAN LSSLESQYLF NLNNDTTTLE FTEEELTGVP AATIATFKKT PQGTCIVTTK YPDYTSVMAN ADRSETRKKM YAAYFNRQAE ANTALLEQAI DLRRQIAQEL GYATWADFQL DGRMAKSTGT VMTFLTTLQE PLREKTREEF TGLLAIKKGL DPQATTVDPW DIMYLQEIRK KQQYAYNDDE VREYFPMDNV LQGLFSIYGT LFGIGFDEVK GAPVWSPEVR LFRVRNLSDN ATVGYLYLDL YPRDGKDAWF SESDVIKGRQ NNGSYQVPVA AIIANFQAPS GDKPSLLTPY DLETLFHESG HAMHSLLTTA PYGTMSGTSV EWDFVETPSQ ALEEWVWDPQ LLESISGHYT NTSQKIPADL RDRVIAAQQA SMGSDYSNRM EKSLEDMRFH TAAEPVNVTE VSYQTYEEVM GIPQLAGTHQ PASFDHIMDG YDAGYYSYLW SKVYALSIVD TFKRDGMTNQ TTGMKFRQEI LARGNMEDGS VLLKNFLGKE PDMEALYRHI GIHMSQPASG T
|
| |
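The sequences above are displayed in space-separated blocks of 10 characters. As a minimal sketch, the blocks can be joined and the gene sequence translated to compare it against the protein sequence. The helper names `clean_sequence` and `translate` are hypothetical. The codon table is the standard one, which is assumed to be sufficient here because translation table 11 differs from it only in the set of permitted start codons, and this gene starts with ATG.

```python
import itertools

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    for i in range(0, usable, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
head = clean_sequence("ATGCCTCCAC GACTCAGGAC CATCCGGTAT")
print(translate(head))  # MPPRLRTIRY

# Last block of the gene sequence: one codon followed by the TAA stop codon.
print(translate("ACGTAA"))  # T
```

The two fragments translate to the first ten residues and the final residue of the protein sequence listed above. Passing the full cleaned gene sequence to `translate` would allow the same comparison over all 701 residues.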