Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2119 |
Symbol | |
ID | 4784338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 2263950 |
End bp | 2266715 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640090687 |
Product | putative zinc protease |
Protein accession | YP_001021310 |
Protein GI | 124267306 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.105618 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCTCGA AGTACTTGGT CGCCGGCGCG CTCGCGCTGG TCGTGATGCA GGTGTCGCCG GTTGTTGCGG CCGCCAGCAA GCCAATCCCA CCTTCCGGTT CCGTCGCCTC ACCAGGCTTG CCACGCGGGG TGACCGCAGT GACCCAGGTC GAGGGCATCA CCGAGTACCG GCTGACCAAC GGGCTGCAGG TGCTGCTGGT GCCCGATGCG TCCAAGCCCA CGACGACGGT CAACCTGACG TACCACGTCG GTTCTCGCCA CGAGAACTAC GGCGAGACGG GCATGGCGCA CCTGCTCGAG CACCTGATGT TCAAGGGCAC GCCGACCACG CCCAACGTGT GGGGCGAGTT CACCAAGCGC GGCTTGCGGG CCAACGGCAG CACCTGGTTC GACCGTACCA ACTACTTCGC CAGCTTTGCG GCGAACGACG ACAACCTGCG GTGGTTCCTG TCGTGGCACG CGGATGCCAT GGTCCACAGC TTCATCGCGC GCAAGGATCT CGATTCCGAG ATGACGGTGG TGCGCAACGA GATGGAGATG GGCGAGAACA ACCCCGGCCG CATCCTGTAC CAGAAGACGC TGGCCGCGAT GTACGACTGG CACAACTACG GCAAGGACAC GATCGGCGCG CGCAGTGATG TCGAGAACGT CGACATCGCG CGGTTGCAGG CCTTCTACCG TCAGTATTAC CAGCCCGACA ACGCCACGCT GGTCGTCAGC GGTCAGTTCG ACACGGCCCG GGTGCTGGCC TGGGTGCAGC AGTACTTCGG CAAGATCCCG AGGCCTCGAC GCGTGCTGCC CACGCTCTAC ACCCTCGATG CGGCTCAGGA CGGTGAGCGC GCCCTGACGC TGCGCCGCGT CGGCGGCGCG CCGCTGCTGT ACGCGGGCTA TCACGTGCCG GCTGCGCCGG ATCCCGAGTT CGCCGCCATC GAACTGCTCG CGCTGGTGCT GGGCGATGCG CCTTCAGGGC GGCTGCATAA GCGACTGGTC GAGAAGCAAC TGGCCGCAAG CGTGGGGGCG GAGCCGTTCG GCCTGCATGA TCCCGGCGCG GCGCTGTTCG TGGCGCAGCT CGCGCCGGAG CAGGACGTCG AGCGTGCGCG CAGCGAGCTG ATCGCGGTGC TGGAGTCGGT CGCGGCCGAG CCGGTCACGG CCGAGGAACT GGAGCGCGCG CGCGCCAAGT GGCTCAAGGG CTGGGACCTG GCGTTCACGA ACCCGGAGAC GGTGGGCGTT TCGCTGTCGG AGTCGGTCGC GCAGGGCGAC TGGCGCCTGT TCTTCCTGAT CCGCGACCGC GTCAAGGCGA CCACGCTGGA GGACGTGCAG CGCGTCGCCG TCGAGCGGTT GCTGCCGTCG AACCGCACGC TCGCGACCTA TGTCCCCACT GACAAGCCGC AGCGCGCGCC GGCGCCGAAG GCTGTGGACG TGGCGGCGCA ATTCAAGGAC TTCAAGCCGC AGGCGGGCGC CACCGCGGTG GCCGCGTTCG ACACCACCCC GGCCAACATC GACGCGCAGA CGCAGCGTTT CGCGCTGGCC AGCGGCATGA AGGTCGCGCT GCTGCCCAAG CCGACCCGCG GCGGGGCGGT GAATGCGGTG CTCTCTCTGC ACTTCGGCGA CGAGAAAAGC CTGGCCGATC AGGGCGAGGT GCCTGCATTG ACGGCCGCGA TGCTCGACGA GGGCACGGCG AAGCTCTCGC GCCAGCAGAT CCGCGATCGG CTCGACGCGC TGCAGGCCGA GGTGGCCTTT TCCAGCGGCA CCGGTAGCGT GAGCGCGACG ATCGCCACCA AGCGCGAGAA CCTGCCAGCC GTGATCGCGC TGGTGGGCGA ACTGCTCCGC GAGCCTTCCT TCCCGCCTGC GGTGCTGGAG GAGCAGCGCA GTCAGGCGCT GACGGGCGTG GAGCAGCAGC GCAAGGAACC CGAGGCGGTG GTGGCCAACG CGATCGACCG CCATGTGAAC CGGTACCCAC GCAGCGACGT GCGCCATGCG AAGAGCTTCG ACGAACTGGT GGCCGACATC CGCGCGGCCA CGCCGGACCA GTTGCGTGCC TTCCATCGTC GCTTCTACGG CGCCTCGCAT GCCGAGTTCG GCGCGAGCGG CGACCTCGAT GTGCCGGCGG TGAGGCAGGC GCTCGAGGCG GCCTTCGGCG ACTGGAAGAG CAGTGAACCC TACGCGCGGG TGTCCGATCC GCTGGCGCCG GTGGCACCGG CTCGCCTGGT GCTGCCGACG CCCGACAAGC AGAACGCCCA CATGGCAGTG TTCCTGCCGG TGCCGTTGAT GGACAGCGAT CCGGACTACG CACCGCTCAC GCTCGCCAAC CACCTGCTGG GCGGCGGCGG CAGCTCGAGG CTGTGGGTGC GCATCCGCGA GAAGGAGGGC CTGTCCTACG GCGTCTACAG CTACCTGGCG TGGAACCAGG ACGAGCGCAA TTCGCCGTGG CAGGCGCAGG CCATCTTCGC GCCTCAGAAC CGCGCCAAGG TCGAGGCGGC GTTCAGGGAA GAGGTGGCGC GTTCCCTGCA GGACGGCTTC ACTGCAACCG AGCTGCAGGA GGCGCAGCGG GGGCTGATCA GCGCGCGCCG CCTGTCACGC GCGCAGGACG CGCGGCTGGC GGCCGGGCTG GCCAGCAACC TGCGGCTCGA CCGAACCTTC GCGATCTCGC AGCAGGTCGA CGACGCGATC GCTGCGGCGA CCCTGGAGCA GGTGAACGCG GCGCTGCGCA AGTACATCCG GCCCGAGGCC TTCGTCTACG GCTTCGGCGG CGACTTCAAG GAGTAG
|
Protein sequence | MFSKYLVAGA LALVVMQVSP VVAAASKPIP PSGSVASPGL PRGVTAVTQV EGITEYRLTN GLQVLLVPDA SKPTTTVNLT YHVGSRHENY GETGMAHLLE HLMFKGTPTT PNVWGEFTKR GLRANGSTWF DRTNYFASFA ANDDNLRWFL SWHADAMVHS FIARKDLDSE MTVVRNEMEM GENNPGRILY QKTLAAMYDW HNYGKDTIGA RSDVENVDIA RLQAFYRQYY QPDNATLVVS GQFDTARVLA WVQQYFGKIP RPRRVLPTLY TLDAAQDGER ALTLRRVGGA PLLYAGYHVP AAPDPEFAAI ELLALVLGDA PSGRLHKRLV EKQLAASVGA EPFGLHDPGA ALFVAQLAPE QDVERARSEL IAVLESVAAE PVTAEELERA RAKWLKGWDL AFTNPETVGV SLSESVAQGD WRLFFLIRDR VKATTLEDVQ RVAVERLLPS NRTLATYVPT DKPQRAPAPK AVDVAAQFKD FKPQAGATAV AAFDTTPANI DAQTQRFALA SGMKVALLPK PTRGGAVNAV LSLHFGDEKS LADQGEVPAL TAAMLDEGTA KLSRQQIRDR LDALQAEVAF SSGTGSVSAT IATKRENLPA VIALVGELLR EPSFPPAVLE EQRSQALTGV EQQRKEPEAV VANAIDRHVN RYPRSDVRHA KSFDELVADI RAATPDQLRA FHRRFYGASH AEFGASGDLD VPAVRQALEA AFGDWKSSEP YARVSDPLAP VAPARLVLPT PDKQNAHMAV FLPVPLMDSD PDYAPLTLAN HLLGGGGSSR LWVRIREKEG LSYGVYSYLA WNQDERNSPW QAQAIFAPQN RAKVEAAFRE EVARSLQDGF TATELQEAQR GLISARRLSR AQDARLAAGL ASNLRLDRTF AISQQVDDAI AAATLEQVNA ALRKYIRPEA FVYGFGGDFK E
|
| |