Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4197 |
Symbol | |
ID | 5833531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 4666463 |
End bp | 4669165 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641369987 |
Product | hypothetical protein |
Protein accession | YP_001641637 |
Protein GI | 163853594 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.382569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.29939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGATA GTTTTTTCCT GAGCGCGGCA CTGATCGTGC TGGCGCTTGC GCTGTCCGGT CCGGCCGGTC TGCTGCTGGC GCTGCGGCAG CGGCAGCATA TCCGCGTACT CGAGCGGCGC CTCGCTTTGG TGGAGGCGCG TCCGGTCGCC GGCTCCGGGC CGATGGCGGC CCCGATGACG ACCCCGATTC TTGCACCGGA GCCGGCTCGT CCGGAACCGG CGCCTGCCGC GGTGCCACCC GTTCCCTCCC CTGCCCTTCC GCCTGCGCCG CCGCGCGGTG CGGTTCCGGC CGTGGCACCG GAGCGGCCGA TCTCGGCTCC CGCCCCGTCC ATCGAGGAGC GCTTCGGCAC CCGCTGGACT GTGTGGGTCG GCGGCCTCGC CCTCGCCTTC GGCGCGGTGC TGCTCGTCCG CTACTCGGCC GAGCGCGGCC TGTTCGGGCC GGGCGTGCGC ATCGCGGGCG CTCTTGCCCT CGCGCTCAGT CTCGTCGGGC TCGGAGAGTT CCTGCGGCGG CGCCTGCGCG GGTCGGAGCC CGTGCCTCCG ATCTGGCCCG ACGTGCCCGC CATGGTGACC GCGGCGGGCA CGGTCGGCCT GTTCGGTGCG GTCTACGCCG CGCACGCCCT CTATGGCTTC ATCGGGCCGG GCCTGGCCTT TGCCGGACTG GCGGCCACGG GCCTTGCCGC CATGGTCGCC GCCCTGCTGC ACGGGCCGGC TTTGGCCGGC ATCGGCCTCG TCGGTGCGCT CGCCACGCCG CTCCTCGTCG GGGGCGGGGG GACGAGCCTG TGGCCGCTCG CTCTCTACCT GCCGGTGGTG GCGGGCAGCG CCTACGCCTT CGCCTGGCTG AAGGGCTGGC GCGCGCTCGC GGTGGCGGGC GGCCTTGGCG CGGCGGCCTG GGCTCTGTTT CTCACCATCC TGCCCGGTGA GAATGTCGCC GTGCAGGTCC ACCTCGTGCT GCAGCAGGCG CTGGCGATCC TCGTCTTCGC CGTCCTGCCG GGGCGCGGCG TACCGGACGC GGAGGCGCGC CTTGATCGTT TCGCCGCGCT CGCCCTCGCC GCGGCGGCGG GCGTGGCGCT GGCGGTGCTT GGCCTGACGG CCCATGCCGG CAGCGGCCTG GCTTGGGCTG CCGCTGCCCT GGCGGTGATC GTCCTGCCCG CGCTCGCGGG TTTCGTCGCC GCGCCCGCCG CCGCGGGTGG CGCCGTCGCG GCCCTGACGC TCGCCGGCGC CCTCGTGCTC TGGCCGGAGG CGGCCGAGCC CGCGAGCGCC CCCTTCTTCT TCCTCTGGTA CGGCACGGAC CATCCCGGTC TGCTGCTGGC GCTGGCGGCG GCCGGATCGC TCGCAGTCGC GGGTCTCGGC ACGCTGCGGC TCCTGGGCGG CAGCCGCCTG CCCTATGCCA CCGCGCTGGC CTTCGCGGCG GGCGCGGCGG TGGCACCCCT CGCGGCCCTC GCCCTGGCCT ATCTGCGAAT CACGGTCGGC GCGGTGGCGC CGGACTTTGC CGCCGTCGCA GGGGGGCTGG CCGCCGGATT CTGCGGTCTT GCCGTCCTGT GCCGGCGGCA GGGCGACGCC ACCCCCTCCC CGGCCCTCAC GCTCGGGCTC GGCGCCTTCG CCGCGGCGGC GGTGGCGAGC CTGTGCCTCG GCCTCGTCTT CGCTCTCGCG GGCGGCTCGC TGACGCCGGC CCTAGCCGGG GCGGCGCTCG CCACCGGCCT GATCGCCCGG CGCCTCGACA TCCCGGCCCT GCGCTGGTGC GTGGCCGGTC TGGCCGTCAT CGTCGCGGCG CGGCTCGCCT GGGACCCGGC GCTGATCCGG GGCGGCCTGT CGGACTGGCT CATCCTCAAC GGGCTCGTGA CCGCCTATGG CCTGCCGGCC CTATGCTTCG GCCTCGCCGC CTGGGCGATC CGCCGCCCGG ACCGCCCGGC CGACATCCCG GAGCAGGTGG CCCAAGCCTT GAGCCTTCTG CTTTCGGGCC TGTTCGTCTT CCTGGAGATC CGTCACGCGC TCCACGGCGG CACCCTGGCC GATCCGGGTA CGAGCGTGCT GGAACAGGGT CTGACGACGC TGACCTCCCT CGGCTTCTCC CTGGTGCTGG TGCGCTTCTC CGGCCCCTCG GCCTCGCCGG TGATCCGCTT TGCTGGCCTC GCCTTCGCCT GCCTCGCCTT GTTCCAGGGT GCGCTCGGCC TCGGGCTCGC CGCCAACCCG CTCCTGACCG ACGAGCCGGT CACGGGCGGC CTCATCCTGA ACGATGCCGT GCTCGCCTAC GCCCTGCCGG CGGCGGCGGC CTTCGCCTTG GCGCGGGCGG CACGCGGCGT GCGGCCCGTC TGGTTCGTGC GCATGGCCGG CGGTCTGGGG CTGGCCCTGA GCTTCCTCGC CCTGTGCCTC GCCGTGCGCC ACGGCTTCCA GGGCGAGCGG CTCGGTCTCG ACCGCGAGAC GGGACAGGCG GAGTGGTATG CCTACTCGGC GGTGTGGCTC GGCCTGGGCC TCGTCGCCCT CGGCTACGGC ATCCTGCGCG GGTCGGCGAC CGCGCGGCTG GCCTCCGCCG TGCTGGTGGG GCTGGCGACG CTCAAGGTGT TCCTGTTCGA TCTCTCGGGC CTGGAGGGGC CGCTGCGGGC GCTCTCCTTC CTCGGGCTCG GCGGCTGCCT GATCGGCATC GGTCTCGTCT ACCAGCGCCT CGTCTTCGCG CCCGCTCCCC GGCCGGCCGC GCCGGATGCG TGA
|
Protein sequence | MDDSFFLSAA LIVLALALSG PAGLLLALRQ RQHIRVLERR LALVEARPVA GSGPMAAPMT TPILAPEPAR PEPAPAAVPP VPSPALPPAP PRGAVPAVAP ERPISAPAPS IEERFGTRWT VWVGGLALAF GAVLLVRYSA ERGLFGPGVR IAGALALALS LVGLGEFLRR RLRGSEPVPP IWPDVPAMVT AAGTVGLFGA VYAAHALYGF IGPGLAFAGL AATGLAAMVA ALLHGPALAG IGLVGALATP LLVGGGGTSL WPLALYLPVV AGSAYAFAWL KGWRALAVAG GLGAAAWALF LTILPGENVA VQVHLVLQQA LAILVFAVLP GRGVPDAEAR LDRFAALALA AAAGVALAVL GLTAHAGSGL AWAAAALAVI VLPALAGFVA APAAAGGAVA ALTLAGALVL WPEAAEPASA PFFFLWYGTD HPGLLLALAA AGSLAVAGLG TLRLLGGSRL PYATALAFAA GAAVAPLAAL ALAYLRITVG AVAPDFAAVA GGLAAGFCGL AVLCRRQGDA TPSPALTLGL GAFAAAAVAS LCLGLVFALA GGSLTPALAG AALATGLIAR RLDIPALRWC VAGLAVIVAA RLAWDPALIR GGLSDWLILN GLVTAYGLPA LCFGLAAWAI RRPDRPADIP EQVAQALSLL LSGLFVFLEI RHALHGGTLA DPGTSVLEQG LTTLTSLGFS LVLVRFSGPS ASPVIRFAGL AFACLALFQG ALGLGLAANP LLTDEPVTGG LILNDAVLAY ALPAAAAFAL ARAARGVRPV WFVRMAGGLG LALSFLALCL AVRHGFQGER LGLDRETGQA EWYAYSAVWL GLGLVALGYG ILRGSATARL ASAVLVGLAT LKVFLFDLSG LEGPLRALSF LGLGGCLIGI GLVYQRLVFA PAPRPAAPDA
|
| |