Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2394 |
Symbol | |
ID | 4784290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2557291 |
End bp | 2560179 |
Gene Length | 2889 bp |
Protein Length | 962 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640090964 |
Product | hypothetical protein |
Protein accession | YP_001021584 |
Protein GI | 124267580 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTGGT CGCTGCCGTG GTCACGCAAG CCCGACGCAT CGCCTGCTGA CGCCGTAGAT GCGGCCGATG ATGCCTGGGC ACGGCACGTA ACGGCGCTGG CGGTACAGGG TGTCGCCGAG CCGGGCAGCG CGCTCGGCCG GGGCCGGCGC AGACCGGCCA CCCAGGCCGA CCACGATGCG CTCTATGGCG TCGCGCCGTC GTTCGCGGAC TTGCTGCCCT GGGTCGAGTA CCTGCCCGAC ACCAAGTGCA TGTTGCTGGA AGACGGCCAG TCGGTGGCGG CCTTCTTCGA GCTGGCGCCG GTCGGCACCG AGGGCCGCGA GATGGCCTGG CTGTGGCAGG CGCGCGATGC GCTGGAGAAC GCCCTGCAGG ATTCCTTCGA CGAGTTGGAC GACAACCCCT GGGTGGTGCA GCTCTACGCC CAGGACGAGG CCGACTGGGA CAACTATCGG CGCTCCCTGG CGAACTATCT GCAGCCGCGT GCACAGGGCA GCGCGTTCAG CGACTTCTAC CTGCGCTTCT TCGCCCATCA CCTGCGGGCC ATCGCCAAGC CGGGTGGCCT GTTCGAGGAC ACCACGGTGA CGCGCCTGCC GTGGCGCGGC CAAGTCCGGC GCGTGCGCAT GGTGGTCTAC CGCCGCACGT CCGCGGCCCA GACCTCGCGG CGCGGCCAGT CGCCCGAGCA GGCGCTGACC ACGATCTGCG ACCGCCTCGC CGGCGGGCTG GCCAATGCCG GCGTGAAAGC CCGGCGTCTC GGCGCGGCGG ACATCCATGC CTGGCTCCTG CGTTGGTTCA ACCCGAATCC GACCTTGCTC GGCGCCACTG CCGAAGATCG GGAACGCTTC TATGCGCTGA GCCGCTACCC GGAAGAGCGG GAGGAGGGCG AGATCGAACT CGCCAGCGGC ACCGATTTCG CGCAGCGCCT GTTCTTCGGC CAGCCCCGCT CGGACGTGCC CAACGGCCTG TGGTTCTTCG ACGGCATGCC GCATCGGGTG ATCGTGATGG ATCGCCTGCG CACGCCGCCC GTGACGGGCC ATCTGACGGG CGAGACGCGC AAAGGCGGCG ATGCCATGAA CGCGCTGTTC GACCAGATGC CCGAGGACAC GGTGATGTGC CTGACGCTGG TCGCCACACC CCAGGACGTG CTGGAGGCGC ACCTCAACCA CCTCGCCAGG AAGGCCGTCG GCGAGACCCT GGCCTCGGAG CAGACCCGGC AGGACGTGCA GCAGGCGCGC GGGCTGATCG GCAGCGCGCA CAAGCTCTAC CGTGGCGCGC TGGCGTTCTA CCTGCGCGGC CGCGACCTGG CCCAGCTCGA TGCGCGCGGC CTGCAGCTCG TCAACGTGAT GCTCAACGCC GGCCTGCAAC CGGTACGCGA AGAGGACGAG GTGGCGCCGC TGAACAGCTA TCTACGCTGG CTGCCGTGCG TGTTCGATCC GGCGGCCGAC AAGCGCCAGT GGTACACCCA GCTCATGTTC GCGCAACATG CGGCGAACCT GGCGCCGGTC TGGGGCCGCA GTCAGGGCAC GGGGCATCCG GGCATCACGT TCTTCAACCG CGGCGGCGGC CCGATCACCT TCGATCCGTT GAACCGCCTC GACCGGCAGA TGAACGCGCA TCTATTCCTG TTCGGCCCCA CGGGTTCGGG CAAGAGCGCG ACGCTCAACA ACATCCTGAA CCAGGTGACG GCGATCTACC GGCCGCGCCT GTTCATCGTC GAGGCGGGCA ACAGCTTCGG CCTGTTCGGC GACTTCGCGG CACGGCTGGG CCTCACCGTG CATCGGGTGA AGCTCGCGCC GGGCGCGGGC GTCAGTCTGG CTCCGTTCGC CGACGCCTGG CGCCTGGTCG ATACGCCGAG CCAGGTACAG ACGCTGGACG CCGATGCGCT CGACGAAGAC CAGACCGATG CCGGCATGGC CGTGGAAGGC GACGAGCAGC GCGACGTGCT CGGCGAGCTG GAGATCACTG CACGGCTGAT GATTACCGGC GGCGAGGACA AGGAAGAAGC GCGCATGACG CGCGCCGACC GCAGCCTGAT CCGCCAGTGC ATTCTCGATG CGGCCCAGCA TTGCGTGGCG GACAGACGCA CGGTGCTCAC GCGCGATGTG CGCGACGCGC TGCGCGAGCG CGCCCGCGAC GCCACGCTGC CGGAGATGCG GCGCGCACGG CTGCTGGAGA TGGCCGACGC CATGGATATG TTCTGCCAGG ACGTGGACGG CGAGATGTTC GACCGGTCCG GCACGCCGTG GCCCGAGGCG GACATCACCA TCGTGGACCT GGCCACCTTC GCGCGCGAGG GCTACAACGC CCAACTCTCG ATTGCCTACA TCTCGCTCAT CAACACCGTC AACAACATCG CCGAGCGCGA CCAGTTCCTG GGCCGTCCGA TCATCAACGT GACGGACGAA GGCCACATCA TCACGAAGAA CCCGCTGCTC GCGCCCTACG TGGTCAAGAT CACCAAGATG TGGCGCAAGC TCGGCGCCTG GTTCTGGCTC GCCACGCAGA ACCTCGACGA CTTGCCGAAG GCGGCCGAGC CCATGCTCAA CATGATCGAG TGGTGGATCT GCCTGTCGAT GCCACCCGAT GAAGTGGAGA AGATCGCGCG CTTCCGCGAA CTCAACGCTT CGCAGAAGGC GCTGATGCTC TCGGCGCGCA AGGAGGCCGG CAAGTTCAGC GAGGGCGTCA TCCTGTCCAA GTCGATGGAG GTGCTGTTCC GCGCCGTGCC GCCCAGCCTC TACCTGGCGA TGGCGATGAC CGAGCCCGAG GAGAAGGCCG AACGCTTCCA GTTGATGCAG CAGCACGGCA TCAGCGAACT GGATGCCGCC TTCCGCGTGG CCGAGAAGAT CGACCGCGCG CGGGGCATCG AACCTCTGAC GCTGGACACG CTGGCCTGA
|
Protein sequence | MAWSLPWSRK PDASPADAVD AADDAWARHV TALAVQGVAE PGSALGRGRR RPATQADHDA LYGVAPSFAD LLPWVEYLPD TKCMLLEDGQ SVAAFFELAP VGTEGREMAW LWQARDALEN ALQDSFDELD DNPWVVQLYA QDEADWDNYR RSLANYLQPR AQGSAFSDFY LRFFAHHLRA IAKPGGLFED TTVTRLPWRG QVRRVRMVVY RRTSAAQTSR RGQSPEQALT TICDRLAGGL ANAGVKARRL GAADIHAWLL RWFNPNPTLL GATAEDRERF YALSRYPEER EEGEIELASG TDFAQRLFFG QPRSDVPNGL WFFDGMPHRV IVMDRLRTPP VTGHLTGETR KGGDAMNALF DQMPEDTVMC LTLVATPQDV LEAHLNHLAR KAVGETLASE QTRQDVQQAR GLIGSAHKLY RGALAFYLRG RDLAQLDARG LQLVNVMLNA GLQPVREEDE VAPLNSYLRW LPCVFDPAAD KRQWYTQLMF AQHAANLAPV WGRSQGTGHP GITFFNRGGG PITFDPLNRL DRQMNAHLFL FGPTGSGKSA TLNNILNQVT AIYRPRLFIV EAGNSFGLFG DFAARLGLTV HRVKLAPGAG VSLAPFADAW RLVDTPSQVQ TLDADALDED QTDAGMAVEG DEQRDVLGEL EITARLMITG GEDKEEARMT RADRSLIRQC ILDAAQHCVA DRRTVLTRDV RDALRERARD ATLPEMRRAR LLEMADAMDM FCQDVDGEMF DRSGTPWPEA DITIVDLATF AREGYNAQLS IAYISLINTV NNIAERDQFL GRPIINVTDE GHIITKNPLL APYVVKITKM WRKLGAWFWL ATQNLDDLPK AAEPMLNMIE WWICLSMPPD EVEKIARFRE LNASQKALML SARKEAGKFS EGVILSKSME VLFRAVPPSL YLAMAMTEPE EKAERFQLMQ QHGISELDAA FRVAEKIDRA RGIEPLTLDT LA
|
| |