Gene Information |
Locus tag | Mpe_A1779 |
Symbol | |
ID | 4784446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1914199 |
End bp | 1916232 |
Gene Length | 2034 bp |
Protein Length | 677 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640090350 |
Product | thimet oligopeptidase |
Protein accession | YP_001020973 |
Protein GI | 124266969 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
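The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the record.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the Gene Information table for Mpe_A1779
gene_bp = cds_length(1914199, 1916232)
print(gene_bp)                  # 2034, matches "Gene Length"
print(protein_length(gene_bp))  # 677, matches "Protein Length"
```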
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.117155 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGCCG ACACCTATCC CTTCCCGATC CGCATGAACG CTTCCGTCCT CGCTTTCCTG TCCCGGCTGC TTCGGCTGTC GAGTTGCGTC GCCCTGTTCG GCGTCAGCGG CTGGAGCGCC GCGGCCGACA CCGCGCCCTA CGTCTTTCCG AGCTACCGCG ACGGCGCCGC CGTGAAGGCG CGTTGCGACC GCACGCTGCG CGAGATCGAG GCGCAGGCGC GGCGCATCGC GGCCGGTCGG GGCCCGGACG GCGTGCTGGT CGAGATCGAC CGGCTCGGAC AGACAGTCGA CGACAGCATG TCGCCGGTGT TCTTCCTGGC GAACGTGCAC CCCGACAAGC CGGTGCGCGA CGCGGCCGAG GCCTGCGAGC TGCGCTACCA GGCCTTCACC AGCCGGCTGT ACCAGAACCC CAGGATCTAC CGGCGTCTGC AGGCGCTGCA GCCGGTCGAT GCGATCGACC GGCAGATGCG GGCCGATCTG CTCGCGAGTT TCGAGGACGC CGGGGTCGGC TTGCCGGCCG CGAAGCGCGA CCGCGCACGC GCGCTCAACG ACGAACTGGG CCGCCTGTCG CAGGACTTCG AGCGTCGCCT GCGCGAGGAC AAGACCCGCG TCGCCTTCAC CGACAGCGAG CTCGACGGGG TGCCGGCCAG TGTGTGGAGC ACCGCCCCCC GCGACGCGCA GGGCCGGGTG CTGCTGGGGC TCGACTACCC GACCTACTCG CCGGTGGTGG AGAACGCGCG CAGCCCGGGG GCACGCGAAC GCATGTGGCG GGCCTTCCAG GCGCGCGGCG GCCAGGCGAA CCTGAAGACG CTGGCGCGGC TGGGCGAGAA GCGCCGCAGC TACGCGCGCC TGTTCGGTGT CGAGAGCTAT GCCGATTTCA CGCTGCGCCG TCGCATGGCG CTGAACGTGG GCAACGTGCA GGCCTTCCTC GGCGAGGTGA AGGGCGAGCT GGGCGAGCGC GAGGAACGCG ATCTGTCCGA ACTGCGTGCC GCCAAGGCCG CCGAGTTGAA GACGGCGCCC GACACCACGC CGCTGAAACG CTGGGACGTG GGCTACTACC TCGAGCGCGT CAAGCGCGAG CGCCTGGCGC TCGACCAGGA GAGCTTCCGC CGCTATTTCC CGCCGCAGGC CAGCGTCGAC TTCGTGTTCG CGCTCGCCGG GCGGCTGTTC GGCGTGGGCT TCGAGCCGGT GCCTCAGTCG CTGTGGCATC CCGACGCCAA GGCCTACGTC GTGGTCGACG CCGCCAGCCG CACGCCGCTG GCCACGCTCT ACCTCGATCT CTACCCCCGT GCCGACAAGT ACGGCCACGC GGCGGTGTGG CCGCTGCGCG GCTCGTCGAC CTGGAGCGGG CAGTTGCCGA CGGCCGCGCT GGTGACCAAC TTCGACCGCC AGGGCCTGAC GATCGACGAG CTGGAAACGC TGCTGCACGA GTTCGGCCAT GCGCTGCACG TCACGCTGTC GCACACGCGC TATGCCGCAC AGGCCGGGAC CGCGGTCAAG CTCGACTTCG TCGAGGCGCC ATCGCAGATG CTGGAGGAAT GGGTCTACGA CGCCCAGGTG CTGGCGCTGT TCCAGCAGGT CTGCGCGAGC TGCGAGCCGG TGCCAGCCGA CCTGCTGGCG CGCGCCGTGC AATCGCGCAG CTTCGCCAAG GGGCTGCAGT TCGCGCGCCA GCATCTGTAC GCCAACTACG ACCTCGCGCT GCACGACAAG GACGCGCCGG ACCCGATGGC GCTGTGGGCC CGCATGGAGA GCGCCACGCC GCTCGGCTAC GAGCCCGGCT CGCTGTTCCC GGCCGGCTTC 
TCGCACGTCG CCGGCGGCTA CGGCGCCGGC TACTACGCCT ACCTCTGGAG CCTGGCGATC GCGCAGGATC TGCGCACCGC CTTCGCGGCC GACCCGCTGG ACCCGGCCGT CGGTCGTCGC TACCGTGAGA CGGTGCTGGC CAACGGCGGC CAGGCTCCGC CCGCCGAGCT GGTGGCGCGC TTCCTGGGGC GTGCGCCGAG CAACGCCGCG TTCTTTGAGT GGCTCGAGCG CTGA
|
Protein sequence | MIADTYPFPI RMNASVLAFL SRLLRLSSCV ALFGVSGWSA AADTAPYVFP SYRDGAAVKA RCDRTLREIE AQARRIAAGR GPDGVLVEID RLGQTVDDSM SPVFFLANVH PDKPVRDAAE ACELRYQAFT SRLYQNPRIY RRLQALQPVD AIDRQMRADL LASFEDAGVG LPAAKRDRAR ALNDELGRLS QDFERRLRED KTRVAFTDSE LDGVPASVWS TAPRDAQGRV LLGLDYPTYS PVVENARSPG ARERMWRAFQ ARGGQANLKT LARLGEKRRS YARLFGVESY ADFTLRRRMA LNVGNVQAFL GEVKGELGER EERDLSELRA AKAAELKTAP DTTPLKRWDV GYYLERVKRE RLALDQESFR RYFPPQASVD FVFALAGRLF GVGFEPVPQS LWHPDAKAYV VVDAASRTPL ATLYLDLYPR ADKYGHAAVW PLRGSSTWSG QLPTAALVTN FDRQGLTIDE LETLLHEFGH ALHVTLSHTR YAAQAGTAVK LDFVEAPSQM LEEWVYDAQV LALFQQVCAS CEPVPADLLA RAVQSRSFAK GLQFARQHLY ANYDLALHDK DAPDPMALWA RMESATPLGY EPGSLFPAGF SHVAGGYGAG YYAYLWSLAI AQDLRTAFAA DPLDPAVGRR YRETVLANGG QAPPAELVAR FLGRAPSNAA FFEWLER
|
| |
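The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: it strips the spacing used in the listing, computes the G+C fraction, and translates with the codon assignments of translation table 11, which are the same as the standard code. It does not handle the alternative start codons that table 11 permits (for example GTG or TTG read as M at initiation); this gene starts with ATG, so that does not matter here. All function names are hypothetical.

```python
# Codon assignments shared by the standard code and translation table 11,
# in the conventional NCBI ordering (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as listed above
print(translate("ATGATCGCCG ACACCTATCC CTTCCCGATC"))  # MIADTYPFPI
```

Passing the full gene sequence to gc_content and translate should allow a comparison against the GC content, Protein Length, and Protein sequence fields of this record.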