Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0897 |
Symbol | |
ID | 4787220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 945870 |
End bp | 948671 |
Gene Length | 2802 bp |
Protein Length | 933 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640089458 |
Product | ATP-dependent transcriptional regulator-like protein protein |
Protein accession | YP_001020094 |
Protein GI | 124266090 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.571021 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCGT CGTCGCCCCA CCGTCAGCTG CGATCGCACC CGCTCGAGGC CACGACGGAC TGGCCCACGC CGCTGGCCGA CGAGCTGGCC TCGACCGAAC GTGGCTTCCC GTCGGACGTG CGGATGCACA AGCTGTTCAC GCCGCCGGTG TACCCGGGCG CCGTGCCGCG ACAGGCCATC CTCGACCGTG TGCTGCAGGA CGACAGCCTG CGCGTCACGG TGCTGCAGGG GCCCGCCGGC CACGGCAAGT CGACCACCCT GCAGCAGATC AAGACCGCCC ACGAGGCCCG CGGCTGGCGC ACCGCCTGGC TCACCCTCGA CGACGCCGAC AACGACCCGC GACGCTTCGA GTCGCACCTG GTCGCGGTGA TGAGCCTGCT GCACGGCCGT GCCGCGTCGC CCGGCGCCTC GCGCAGCGGC ACCGGCGATG CGCCGCGCGA TCTGGCCGAC TGGATGCTCG ACACCCTGTC GGGCCGCATC ACGCCGGCCT CGATCTTCAT CGACGAGTTC CAGGCGCTGC GCAACGAGGC CCTGCTGCGC TTCTTCCGCT CGGTGCTCGC GCGCCTGCCG GCCAACGTGC ATGTGTTCAT CGGCTCCCGC ACGCTGCCGG AGATCGGCCT GGCCACGCTG ATGGTCAACC GTGTGGCGTC GGTGGTGCGG GCCGACGACC TGCGCTTCAC GCCGGGCGAG GTGACGCAGT TCTTCGCCGA TTCGGCCAGC CTGCAGGTCA GCGCCGGCGA GGTGGACGCG ATCTACCGCC GCACCGAGGG CTGGCCGGCC GGCGTGCAGC TGTTCCGGCT GGCGCTGGTG AGTCCCGAGG TGCGCATGGC GCTGGACGGC GCCGACGACC ACGGGCCGCG CGAGCTGGCC GAATACCTGG CCGACAACGT GATGTCGCTG CAGTCGCCGC GCATGCAGGA GTTCCTGCTG AAGACCTCGC TGCTGCAGCG CCTGTCGGCA CCGCTGTGCA CCGCCGTGAC CGGCTTCGAG GACGCGCAGG AGCTGCTGGT GCGGCTGGAG CGCTCCGGCC TGTTCCTGCG CGCGCTCGAC TCCGACAACC GCTGGTTCCG GTACCACGGC CTGTTCTCGA CCTACCTGGC CGAGACCCTG CAGCGCAACG GCCCCGAAGC GCTGCGGCAG GTGCACAAGA AGGCGGCGCA GTGGTGCCTG GCGCACGAGC TGCCCGAGGA AGCGATCCAC CACGCGTTGT GCTGCAGGAA CTTTCCGCTC GCGGCGTCCA CGCTGACCGA CTGGTCGTCG CAGCTGGTGG CCGGGGCCGA GCTGATCACG CTGGAGCGCT GGCACGACCG CCTGCCCTTC CACGAGGTGG CGCAGCGGCC GGCGCTGGTG ATCCGCGCCG CCTATGCGCT GATGTTCCTG CGCCGCCGCC CCAAGCTGCG GCCGCTGCTG GAGCTGATGG CACCGCAGGC CGGCGGCGGC GACATCGTGC CGACCACCAA CCCCGACCTG TGCCGCGCGA TGTCCTTCCT GCTGGTGGAC GACGACATGG CGGCGGCGGC CGACACCGTC GAGCAGGCCG GCGTGGTGCA GCGCGAGCTG GAGGGCTTCC CGGCCTTCGA GCTGGGAGCC GCCGCGAACG TGCTCGCGCT GGGCAAGGTG GCCAGCGGCG ATTTCGAGGG CGCGCGGCAG GCCCTGGCGC TGGCGCGTGC ACACCTCGGT CGCGGCGGCG GCTCGTTCGT CGGCGGCTAC ACCGCCGCCA TCACCGGCAG CAACCTGCTC GTGCAGGGCC GGCTGCAGGA GGCGCTGGCG CACCTGCGCG ACGAGAACGC GCAGGAGGCG CCGCTCGACA CCTCGGTGGC CGGTGCGGCG CTCGCGGCCT GCCACATGTT CGCGCTGTAC GAGGCCAACG ACCTGGCGAC GCTGGAGTCG CTGGCGCACC GCTTCCAGCG CGAGATCTCC GAGTCGGTGA CGCTCGACTT CATCGCCGCG GCCCACATCG CCATCTCGCG CATGCACGAG GCGCGCGGGC GCTCCGACGA GGCCGTCGCG GTGCTCGACG AACTGGAGCG CATCGGCCAC ACCAGCCCCT GGCAACGCCT GGTGGCGGTG AGCGAGTGGG AGCGCGTGCG GCGTGCGCTG GCGGGCGGCG AGATCGAGCG CGCGGTGGCG CTGGCGACGC GCATCGCCCC GGACTCGCGC GACGACGCGC CGCACTGGAT CCACCTGGCC GAGGACGTGG AAGGCTCGGG CTACGGGTGG ATCCGCCTCG CCATCGCGCG GCACGACCAC GCCGACGCCG CGCAGCGCAT CGCCCGGGAG CGGGCGCGCC AGACCGGTCG GGTCTACCGC GACATCAAGC TGAGCGTGCT GGAGACCCTG CTCCAGCAAC GCATGGGCGC CCGTAACGCC GCCCATCGCT GCCTGCGCAA GGCGCTGCAG CTGGGCCGAC GGGGCCGCTA CGTGCGCTGC CTGCTCGACG AGGGGGACGG CGTCATCGAG CTGCTGCGCG AGGCCTACCA GAACCTGCTG CGAGGTCACG AACCGGGCGG CGGCACCGGT ACCGACCCGG ACCGCGACTA CATCGAGCTG CTGCTCGAGG CCTCGGGGAC CGACCTGGGC CGCCAGGCCG CCGGCAACGC CCTGACCGAG GCACTGTCGG AGCGCGAAAA GGAGATGCTG CGTTTCCTGC TCGACGGCAC CACCAACCGC GAGATCGCCG GACGGCTGTT CGTATCGGAG AACACCGTCA AGTTCCACCT GAAGAACATC TACTCCAAGC TCGGCGTCGG CAACCGATTG CAGGCCATCA ACACGGCGCG GGCGCTGCGG TTGATCGACT GA
|
Protein sequence | MPPSSPHRQL RSHPLEATTD WPTPLADELA STERGFPSDV RMHKLFTPPV YPGAVPRQAI LDRVLQDDSL RVTVLQGPAG HGKSTTLQQI KTAHEARGWR TAWLTLDDAD NDPRRFESHL VAVMSLLHGR AASPGASRSG TGDAPRDLAD WMLDTLSGRI TPASIFIDEF QALRNEALLR FFRSVLARLP ANVHVFIGSR TLPEIGLATL MVNRVASVVR ADDLRFTPGE VTQFFADSAS LQVSAGEVDA IYRRTEGWPA GVQLFRLALV SPEVRMALDG ADDHGPRELA EYLADNVMSL QSPRMQEFLL KTSLLQRLSA PLCTAVTGFE DAQELLVRLE RSGLFLRALD SDNRWFRYHG LFSTYLAETL QRNGPEALRQ VHKKAAQWCL AHELPEEAIH HALCCRNFPL AASTLTDWSS QLVAGAELIT LERWHDRLPF HEVAQRPALV IRAAYALMFL RRRPKLRPLL ELMAPQAGGG DIVPTTNPDL CRAMSFLLVD DDMAAAADTV EQAGVVQREL EGFPAFELGA AANVLALGKV ASGDFEGARQ ALALARAHLG RGGGSFVGGY TAAITGSNLL VQGRLQEALA HLRDENAQEA PLDTSVAGAA LAACHMFALY EANDLATLES LAHRFQREIS ESVTLDFIAA AHIAISRMHE ARGRSDEAVA VLDELERIGH TSPWQRLVAV SEWERVRRAL AGGEIERAVA LATRIAPDSR DDAPHWIHLA EDVEGSGYGW IRLAIARHDH ADAAQRIARE RARQTGRVYR DIKLSVLETL LQQRMGARNA AHRCLRKALQ LGRRGRYVRC LLDEGDGVIE LLREAYQNLL RGHEPGGGTG TDPDRDYIEL LLEASGTDLG RQAAGNALTE ALSEREKEML RFLLDGTTNR EIAGRLFVSE NTVKFHLKNI YSKLGVGNRL QAINTARALR LID
|
| |