Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1023 |
Symbol | |
ID | 4785625 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1091672 |
End bp | 1094239 |
Gene Length | 2568 bp |
Protein Length | 855 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089585 |
Product | cyanophycin synthetase |
Protein accession | YP_001020220 |
Protein GI | 124266216 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0769] UDP-N-acetylmuramyl tripeptide synthase [COG1181] D-alanine-D-alanine ligase and related ATP-grasp enzymes |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR02068] cyanophycin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.434242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGTTT CCCGCATCCG TGCCTTGCGC GGCCCCAACC TGTGGAGCCG GCACACCGCG ATCGAGGCGA TCGTCTCGTG CACCGCGCCC GAATGCGAGA TCGCCGATCC GGTCGGTCTC GAGGCGCGCC TGCGGGCACT GTTCCCGACC ATCGGCGCGC TGCGTACCGC GGTGCCGGGC GTGCCGGTCT CGCTGGCCCA CCTGCTCGAG GCGGCGGCGC TGGCGCTGCA GGCGCAGGCC GGCTGCCCGG TGACCTTCAG CCGCACCTCG GCGACCGTCG ATCCCGGTGT CTTCCAGGTC GTCGTCGAGT ACAGCGAGGA AGAGGTCGGT CGACTGGCCT TCGACCTCGC CCGCGAACTG ATCGATGCGG CACTGCACGC TCGCGGCTTC GATGTCGATG CGGCGATCAC GCGCCTGCGT GAGACCGACG AGGACGTGCG CCTGGGCCCC AGCACCGGCG CGATCGTCGA TGCGGCGCTG GCGCGCAACA TCCCCTACCG CCGCCTGACG CAGGGCAGCA TGGTGCAGTT CGGCTGGGGG TCGCGGCAAC GCCGCATCCA GGCCGCCGAG GTCGACTCGA CCAGCGCTGT CTCCGAGGCC ATCGCGCAGG ACAAGGACCT CACCAAGACG CTGCTGCGTG CCGCCGGCGT GCCGGTGCCG ATCGGCCGGC TGGTGGCGAG CGCCGACGAG GCCTGGGCCG TGGCCCTGGA GATCGGCCTG CCGGTGGTGG TCAAGCCGCA GGACGGCAAC CAGGGCAAGG GGGTCACGGT CAACATCGTC GACCGCGCCC ACCTCGATGT GGCCTTCAAG GCCGCGGCCG AGATCGGCGA CGTGATGGTC GAGAAGTTCC TGCCCGGCAG CGACTTCCGC CTGCTGGTGG TGGGCGACCG GCTGGTGGCG GCGGCGCGCC GCGAGCCGCC GCACGTCATC GGCGACGGCG TGCACACGGT GCGCGAACTG GTCGATCTGG TGAACGCCGA CCCGCGGCGC GGCACCGGGC ATGCCACCTC GCTGACCAAG ATCCGCTTCG ACGAGATCGC CGTCGCGCGC CTGCAGGTGC AGGGGCTGGC GCCCGATTCG GTGCCCGAGA AGGGCCGCCG CGTGATCCTG CGCAACAACG CCAACCTCAG CACCGGCGGC ACCGCCGCCG ACGTGACCGA CGACGTGCAC CCCGAGGTGG CCGCGCGCGC CGTCGCCGCG GCGCAGATGA TCGGACTGCA CATCTGCGGC GTCGATGTGG TGTGCGAGAC GGTGGGTCGC CCGCTCGAAG AGCAGAGCGG CGGCGTGGTC GAGGTCAACG CCGCGCCTGG CCTGCGCATG CACCTGTCGC CCTCGTTCGG CAAGGGTCGC AACGTCGGCG AGGCGATGAT CCAGCACCTG TTCGGCCACG GCGACGACGG CCGCATCCCG GTGGTGGCGG TGACGGGTAC CAACGGCAAG ACCACCACGG CGCGCCTGAT CGCCCACCTG TTCGCGACCA GCGGGCTGCG CGTGGGCATG ACCAACACCG ACGGTGTCTA CGTCGAGGGC CGCCAGATCG ACAGCGGCGA CTGCAGCGGA CCCAAGAGCG CGCGCAACGT GCTGATGCAC CCCGATGTCG ACGCGGCCGT GTTCGAGACC GCACGCGGCG GCGTGTTGCG CGAGGGCCTG GGCTTCGACC GTTGCCAGGT GGCGGTGGTC ACCAACATCG GCAGTGGCGA CCACCTCGGC CTCAACTACA TCACCACCGT CGAGGATCTG GCGGTGCTCA AGCGGGTGAT CGTGCAGAAC GTGGCGCAGA GCGGCTACGC GGTGCTCAAC GCGACCGACG TCAACGTCGC CGCGATGGCC GGCACCTGCC CCGGTGACGT GATTTTCTTC GCGGCCGATC GCCATCACCC GGTGATGGCC ACGCACCGCG CGCAGGGCAA GCGCACCGTC TACGTCGAGG GCGACGCGCT GGTCGCGGCG CAGGGCGCCT GGCGCGAGAA GCTGCTGTTG CGCGACATCC CGCTCACGCG CGGCGGCACG ATCGGCTTCC AGGTCGAGAA CGCCATGGCG GCGGTGGCCG CCGCCTGGGG CGTGGGGCTC GACTGGGACA CGGTGCGCCG CGGCCTCGCG AGTTTCGTCA ACGACGCCGA CAACGCGCCG GGCCGCTTCA ACGTGATGGA CTTCAAGGGC GCGACGGTGA TCGCCGACTA CGGCCACAAC CCCGACGCGA TGCGCGCGCT GGTGGCCGCG GTCGACGCGA TGCCGGCCAC GCGCCGCGCG GTGGTGATCA GCGGCGCCGG CGACCGGCGT GACGACGACA TCCGCGAGCA GACCCAGATC CTCGGTGCCG CGTTCGACGA TGTGCTTCTG TATCAGGACG CCGCGCAGCG CGGCCGTGCC GACGGCGAGG TGATCGCGCT GCTGCGCGAG GGCCTGCGGG GCGCGACGCG CACCCGCCAC GTCGAGGAGA TCCACGGCGA GTTCATCGCC ATCGACACCG CGCTCGATCG GCTGAAGCCG GGCGAGCTCT GCCTGGTGCT GGTCGATCAG GTCGAGGAGG CGATGGCGCA CCTGGCGCGG CGCGTCGCTG CGGGCTGA
|
Protein sequence | MEVSRIRALR GPNLWSRHTA IEAIVSCTAP ECEIADPVGL EARLRALFPT IGALRTAVPG VPVSLAHLLE AAALALQAQA GCPVTFSRTS ATVDPGVFQV VVEYSEEEVG RLAFDLAREL IDAALHARGF DVDAAITRLR ETDEDVRLGP STGAIVDAAL ARNIPYRRLT QGSMVQFGWG SRQRRIQAAE VDSTSAVSEA IAQDKDLTKT LLRAAGVPVP IGRLVASADE AWAVALEIGL PVVVKPQDGN QGKGVTVNIV DRAHLDVAFK AAAEIGDVMV EKFLPGSDFR LLVVGDRLVA AARREPPHVI GDGVHTVREL VDLVNADPRR GTGHATSLTK IRFDEIAVAR LQVQGLAPDS VPEKGRRVIL RNNANLSTGG TAADVTDDVH PEVAARAVAA AQMIGLHICG VDVVCETVGR PLEEQSGGVV EVNAAPGLRM HLSPSFGKGR NVGEAMIQHL FGHGDDGRIP VVAVTGTNGK TTTARLIAHL FATSGLRVGM TNTDGVYVEG RQIDSGDCSG PKSARNVLMH PDVDAAVFET ARGGVLREGL GFDRCQVAVV TNIGSGDHLG LNYITTVEDL AVLKRVIVQN VAQSGYAVLN ATDVNVAAMA GTCPGDVIFF AADRHHPVMA THRAQGKRTV YVEGDALVAA QGAWREKLLL RDIPLTRGGT IGFQVENAMA AVAAAWGVGL DWDTVRRGLA SFVNDADNAP GRFNVMDFKG ATVIADYGHN PDAMRALVAA VDAMPATRRA VVISGAGDRR DDDIREQTQI LGAAFDDVLL YQDAAQRGRA DGEVIALLRE GLRGATRTRH VEEIHGEFIA IDTALDRLKP GELCLVLVDQ VEEAMAHLAR RVAAG
|
| |