Gene Information |
Locus tag | Mext_4502 |
Symbol | |
ID | 5832227 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5023016 |
End bp | 5025841 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641370295 |
Product | bifunctional transaldolase/phosphoglucose isomerase |
Protein accession | YP_001641941 |
Protein GI | 163853898 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0166] Glucose-6-phosphate isomerase [COG0176] Transaldolase |
TIGRFAM ID | [TIGR00876] transaldolase, mycobacterial type |
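The coordinate, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it agrees with the reported 2826 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5023016
END_BP = 5025841


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2826 941
```

Both results match the Gene Length (2826 bp) and Protein Length (941 aa) fields.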
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.270677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGC TGAACGCCCT CTTCGCCGAG CACGAGCAGG CGGTCTGGCT CGATTTCGTG GCCCGCGGCT TCATCGCCAA GGGCGAACTC CAGGCGCTGG TGGAGAAGGA CGACCTGCGC GGCGTCACCT CCAACCCGGC CATTTTCGAG AAGGCGATCG GCCACTCGGC CGAGTACGAC GACAGCCTCA AGGCCGTGCT GAGCCAGGGC GACGCCCGCG TCATCGATCT CTACGAGGGG CTGGCCATCG CCGACATCCA GGCTGCGGCC GACGTCCTGC GCCGGGTCTA CGACACCAGC GACGGCGCCG ACGGCTATGT CAGCCTCGAA GTCTCCCCCT ACCTCGCCCT CGACACCGAG GAGACCCTGA ACGAGGCCCG CCGCCTGCAC GCGGCGGTCG GGCGCGACAA CCTCATGGTC AAGGTGCCGG CCACCCCCGC CGGCCTGCCG GCAATCCGCC AGCTCACGGC GGAGGGCATC TCGGTCAACA TTACGCTGCT GTTCTCGCAG AGTGTCTACG AGGAGGTCGC CAACGCCTTC ATCGACGGGC TGACCGAGTT CGGCGCCAAG GGCGGCGACG TCTCGAAGGT CGCGAGCGTG GCGAGCTTCT TCATCAGCCG CATCGACAGC CTCGTCGACA AGAAACTGGA TGAGGTCGGC GGCTACGAGG ATCTCAAGGG CAAGGTGGCG ATCGCCAACG CCAAGCTCGC CTACCAGCGC TACAAGCGCA TCTTCTCCGG GCCGAAATGG GAGGCGCTGG AGGCCAAGGG GGCGAAGGCG CAGCGCCTGC TCTGGGCCTC CACGGGCACC AAGAACAAGG CCTATTCCGA CGTGCTCTAC GTCGAGGAGC TGATCGGCAA GAACACCGTC AACACCATGC CGCCGGCCAC CATGGACGCG TTCCGCGACC ACGGCCGGGT GCGCGCGACG CTGGAGGAGA ATATCGGCGA GGCCGAGACC GTGATGGCAC GGTTGGCCGA GGCCGGCATC GACATCGAGG CGGTGGCGCG CCAGCTCGTC GAGGAGGGCG TGCAGCTCTT CGTCGATGCC GCCGACGCCC TGCTTGGCGC GGTCGCGGGC AAGCGGGCGG CGCTGCTCGA CCATCGGCTC GACGCGCAGA CCTTCAAGTT CGACGAGCCG CTTCAGGCGG CGACCGACAA GGCGGTCGAA TCCTGGCGCG CGAGCGGCGC GATCCGCCGG CTCTGGGCGC ACGACGCCAC GGTCTGGACT GGCCGTGACG AGGACAAGTG GCTCGGCTGG CTGCGCATCG TCGAGGACGA ACTGGAGCGG GTCGATCTCT ACGAAAGCTT TGCCGAAGAG GTCCGCGCCG AGGGCTTCAC CGATGCGGTG GTGCTCGGCA TGGGCGGCTC CAGCCTCGGC CCGGAGGTGA TCTCCGCCAC CTACGGCCAC CGCGAGGGCT TCCCGAAGCT GCGCATCCTC GACTCGACCG ACCCGGACGA GGTCCGCGCC GTCGAGGCGG CGGTGAAGCT CGAGACGACC CTCTTCATCG TCGCCTCGAA GTCCGGCTCG ACGCTCGAGC CCAACGTGTT CCGCGACTAC TTCCTCGGCC GGATGAAGGA CGTTGTCGGG GCCGACAAGG CCGGTCGGCA TTTCGTCGCC GTGACCGATC CCGGCTCGGC GATGGAAAAG GCCGCCAGGG ACGACAACTT CCGCAAGATC TTCCTGGGCG TGCCGCAGAT CGGCGGGCGC TACTCCGTGC TCTCGGCCTT CGGCCTCGTT CCCGCAGCGG CGGCCGGGGT CGAGATCCGC GAGTTCCTCG ACAGCGCCCG GATGATGGTC 
CGCTCCTGCG GGCCGGCGGT GCCGCCCGCC GTGAATCCCG GCGTGCGCCT CGGCGCGGCG ATGGGTGTCG CGGCCAAGGA TTTCGGGCGC GACAAGATCA CGATCATCGC CTCCCCCGGC ATCGGCACCT TCGGCACCTG GGCCGAGCAG CTCATCGCCG AGTCGACCGG CAAGGAGGGC GTCGGCATCA TCCCCGTCGA GGGCGAGCCG GTCGGCGTGC CGGCCGTCTA CGGCGAGGAC CGGCTGTTCG TGTATCTGCG CCTGACCAGC CAGGCCGATG CGCGCCAGGA CGAGGCGGTG AAGATCCTGG AGAGCGAGGC GCAGCCGGTG GTGCGCATCG ATCTCGACAA GGTGGAACAG CTCCCGCAGG AATTCTTCCG CTTCGAGATC GCGACCGCCG TGGCCGGGGC GGTGCTCGGC ATCAACCCGT TCGACCAGCC CGATGTCGAG GCGAGCAAGA TCGAGACGAA GAAGCTGTTC GCCAGCGCCG AGGAAACCGG CGCCCTGCCG GCCGAGACGC CGATCTTCGA GGACGAGACG GTCGCGCTCT ATGCCGATGC GGCCAACGCG GAGGCCCTGC GCCCCGGCGA GGGCTTCGAG GCGATCGTGG CCGCGCATCT GGCGCGGGTG AAGCCGTGCG ACTACGTCGC CGTGCTCGCC TATGTGGAGC GCAACGAGGC GCATCACGCG GCGTTGCAGG AAGCCCGGCT GACGGTGCGC GACGCGCGTC AGGTCGCGAC TTGCCTGGAA TTCGGCCCGC GCTTCCTCCA CTCGACCGGG CAGGCTTACA AGGGCGGTCC CGCTTCCGGC GTGTTCCTCC AGATCACCGC CGACCCCTCG GCCGATCTGC CGATCCCCGG GCGCAAGCTC GGGTTCAAGA CGGTGATCGC GGCGCAGGCG CGGGGCGATT TCGCCGTGCT GTCCGAGCGC AAGCGCCGGG CGCTGAGAAT CCACCTCAAG GGCGGTGATG TCTCCGGCGG CGTGAAGCGC GTCGCCGCGG CGATCAAGGC AGCGGTCGCC GGATAA
|
Protein sequence | MNALNALFAE HEQAVWLDFV ARGFIAKGEL QALVEKDDLR GVTSNPAIFE KAIGHSAEYD DSLKAVLSQG DARVIDLYEG LAIADIQAAA DVLRRVYDTS DGADGYVSLE VSPYLALDTE ETLNEARRLH AAVGRDNLMV KVPATPAGLP AIRQLTAEGI SVNITLLFSQ SVYEEVANAF IDGLTEFGAK GGDVSKVASV ASFFISRIDS LVDKKLDEVG GYEDLKGKVA IANAKLAYQR YKRIFSGPKW EALEAKGAKA QRLLWASTGT KNKAYSDVLY VEELIGKNTV NTMPPATMDA FRDHGRVRAT LEENIGEAET VMARLAEAGI DIEAVARQLV EEGVQLFVDA ADALLGAVAG KRAALLDHRL DAQTFKFDEP LQAATDKAVE SWRASGAIRR LWAHDATVWT GRDEDKWLGW LRIVEDELER VDLYESFAEE VRAEGFTDAV VLGMGGSSLG PEVISATYGH REGFPKLRIL DSTDPDEVRA VEAAVKLETT LFIVASKSGS TLEPNVFRDY FLGRMKDVVG ADKAGRHFVA VTDPGSAMEK AARDDNFRKI FLGVPQIGGR YSVLSAFGLV PAAAAGVEIR EFLDSARMMV RSCGPAVPPA VNPGVRLGAA MGVAAKDFGR DKITIIASPG IGTFGTWAEQ LIAESTGKEG VGIIPVEGEP VGVPAVYGED RLFVYLRLTS QADARQDEAV KILESEAQPV VRIDLDKVEQ LPQEFFRFEI ATAVAGAVLG INPFDQPDVE ASKIETKKLF ASAEETGALP AETPIFEDET VALYADAANA EALRPGEGFE AIVAAHLARV KPCDYVAVLA YVERNEAHHA ALQEARLTVR DARQVATCLE FGPRFLHSTG QAYKGGPASG VFLQITADPS ADLPIPGRKL GFKTVIAAQA RGDFAVLSER KRRALRIHLK GGDVSGGVKR VAAAIKAAVA G
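The sequences above are printed in space-separated groups of ten characters, so they need cleaning before any analysis. The sketch below, using only the Python standard library, strips the whitespace, computes a GC fraction and translates a short excerpt. The helper names are hypothetical. The codon table is the standard one, which to my understanding has the same codon-to-amino-acid assignments as translation table 11 (the two differ in which start codons are permitted); treat that as an assumption of this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1; stop codons become '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three groups of the gene sequence listed above.
excerpt = clean_sequence("ATGAACGCGC TGAACGCCCT CTTCGCCGAG")
print(translate(excerpt))  # prints: MNALNALFAE
```

The translated excerpt matches the first ten residues of the protein sequence. Passing the full cleaned gene sequence to gc_fraction would give a value to compare against the reported GC content.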