Gene Mext_4502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_4502 
Symbol 
ID5832227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp5023016 
End bp5025841 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content69% 
IMG OID641370295 
Productbifunctional transaldolase/phosoglucose isomerase 
Protein accessionYP_001641941 
Protein GI163853898 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase
[COG0176] Transaldolase 
TIGRFAM ID[TIGR00876] transaldolase, mycobacterial type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.270677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCGC TGAACGCCCT CTTCGCCGAG CACGAGCAGG CGGTCTGGCT CGATTTCGTG 
GCCCGCGGCT TCATCGCCAA GGGCGAACTC CAGGCGCTGG TGGAGAAGGA CGACCTGCGC
GGCGTCACCT CCAACCCGGC CATTTTCGAG AAGGCGATCG GCCACTCGGC CGAGTACGAC
GACAGCCTCA AGGCCGTGCT GAGCCAGGGC GACGCCCGCG TCATCGATCT CTACGAGGGG
CTGGCCATCG CCGACATCCA GGCTGCGGCC GACGTCCTGC GCCGGGTCTA CGACACCAGC
GACGGCGCCG ACGGCTATGT CAGCCTCGAA GTCTCCCCCT ACCTCGCCCT CGACACCGAG
GAGACCCTGA ACGAGGCCCG CCGCCTGCAC GCGGCGGTCG GGCGCGACAA CCTCATGGTC
AAGGTGCCGG CCACCCCCGC CGGCCTGCCG GCAATCCGCC AGCTCACGGC GGAGGGCATC
TCGGTCAACA TTACGCTGCT GTTCTCGCAG AGTGTCTACG AGGAGGTCGC CAACGCCTTC
ATCGACGGGC TGACCGAGTT CGGCGCCAAG GGCGGCGACG TCTCGAAGGT CGCGAGCGTG
GCGAGCTTCT TCATCAGCCG CATCGACAGC CTCGTCGACA AGAAACTGGA TGAGGTCGGC
GGCTACGAGG ATCTCAAGGG CAAGGTGGCG ATCGCCAACG CCAAGCTCGC CTACCAGCGC
TACAAGCGCA TCTTCTCCGG GCCGAAATGG GAGGCGCTGG AGGCCAAGGG GGCGAAGGCG
CAGCGCCTGC TCTGGGCCTC CACGGGCACC AAGAACAAGG CCTATTCCGA CGTGCTCTAC
GTCGAGGAGC TGATCGGCAA GAACACCGTC AACACCATGC CGCCGGCCAC CATGGACGCG
TTCCGCGACC ACGGCCGGGT GCGCGCGACG CTGGAGGAGA ATATCGGCGA GGCCGAGACC
GTGATGGCAC GGTTGGCCGA GGCCGGCATC GACATCGAGG CGGTGGCGCG CCAGCTCGTC
GAGGAGGGCG TGCAGCTCTT CGTCGATGCC GCCGACGCCC TGCTTGGCGC GGTCGCGGGC
AAGCGGGCGG CGCTGCTCGA CCATCGGCTC GACGCGCAGA CCTTCAAGTT CGACGAGCCG
CTTCAGGCGG CGACCGACAA GGCGGTCGAA TCCTGGCGCG CGAGCGGCGC GATCCGCCGG
CTCTGGGCGC ACGACGCCAC GGTCTGGACT GGCCGTGACG AGGACAAGTG GCTCGGCTGG
CTGCGCATCG TCGAGGACGA ACTGGAGCGG GTCGATCTCT ACGAAAGCTT TGCCGAAGAG
GTCCGCGCCG AGGGCTTCAC CGATGCGGTG GTGCTCGGCA TGGGCGGCTC CAGCCTCGGC
CCGGAGGTGA TCTCCGCCAC CTACGGCCAC CGCGAGGGCT TCCCGAAGCT GCGCATCCTC
GACTCGACCG ACCCGGACGA GGTCCGCGCC GTCGAGGCGG CGGTGAAGCT CGAGACGACC
CTCTTCATCG TCGCCTCGAA GTCCGGCTCG ACGCTCGAGC CCAACGTGTT CCGCGACTAC
TTCCTCGGCC GGATGAAGGA CGTTGTCGGG GCCGACAAGG CCGGTCGGCA TTTCGTCGCC
GTGACCGATC CCGGCTCGGC GATGGAAAAG GCCGCCAGGG ACGACAACTT CCGCAAGATC
TTCCTGGGCG TGCCGCAGAT CGGCGGGCGC TACTCCGTGC TCTCGGCCTT CGGCCTCGTT
CCCGCAGCGG CGGCCGGGGT CGAGATCCGC GAGTTCCTCG ACAGCGCCCG GATGATGGTC
CGCTCCTGCG GGCCGGCGGT GCCGCCCGCC GTGAATCCCG GCGTGCGCCT CGGCGCGGCG
ATGGGTGTCG CGGCCAAGGA TTTCGGGCGC GACAAGATCA CGATCATCGC CTCCCCCGGC
ATCGGCACCT TCGGCACCTG GGCCGAGCAG CTCATCGCCG AGTCGACCGG CAAGGAGGGC
GTCGGCATCA TCCCCGTCGA GGGCGAGCCG GTCGGCGTGC CGGCCGTCTA CGGCGAGGAC
CGGCTGTTCG TGTATCTGCG CCTGACCAGC CAGGCCGATG CGCGCCAGGA CGAGGCGGTG
AAGATCCTGG AGAGCGAGGC GCAGCCGGTG GTGCGCATCG ATCTCGACAA GGTGGAACAG
CTCCCGCAGG AATTCTTCCG CTTCGAGATC GCGACCGCCG TGGCCGGGGC GGTGCTCGGC
ATCAACCCGT TCGACCAGCC CGATGTCGAG GCGAGCAAGA TCGAGACGAA GAAGCTGTTC
GCCAGCGCCG AGGAAACCGG CGCCCTGCCG GCCGAGACGC CGATCTTCGA GGACGAGACG
GTCGCGCTCT ATGCCGATGC GGCCAACGCG GAGGCCCTGC GCCCCGGCGA GGGCTTCGAG
GCGATCGTGG CCGCGCATCT GGCGCGGGTG AAGCCGTGCG ACTACGTCGC CGTGCTCGCC
TATGTGGAGC GCAACGAGGC GCATCACGCG GCGTTGCAGG AAGCCCGGCT GACGGTGCGC
GACGCGCGTC AGGTCGCGAC TTGCCTGGAA TTCGGCCCGC GCTTCCTCCA CTCGACCGGG
CAGGCTTACA AGGGCGGTCC CGCTTCCGGC GTGTTCCTCC AGATCACCGC CGACCCCTCG
GCCGATCTGC CGATCCCCGG GCGCAAGCTC GGGTTCAAGA CGGTGATCGC GGCGCAGGCG
CGGGGCGATT TCGCCGTGCT GTCCGAGCGC AAGCGCCGGG CGCTGAGAAT CCACCTCAAG
GGCGGTGATG TCTCCGGCGG CGTGAAGCGC GTCGCCGCGG CGATCAAGGC AGCGGTCGCC
GGATAA
 
Protein sequence
MNALNALFAE HEQAVWLDFV ARGFIAKGEL QALVEKDDLR GVTSNPAIFE KAIGHSAEYD 
DSLKAVLSQG DARVIDLYEG LAIADIQAAA DVLRRVYDTS DGADGYVSLE VSPYLALDTE
ETLNEARRLH AAVGRDNLMV KVPATPAGLP AIRQLTAEGI SVNITLLFSQ SVYEEVANAF
IDGLTEFGAK GGDVSKVASV ASFFISRIDS LVDKKLDEVG GYEDLKGKVA IANAKLAYQR
YKRIFSGPKW EALEAKGAKA QRLLWASTGT KNKAYSDVLY VEELIGKNTV NTMPPATMDA
FRDHGRVRAT LEENIGEAET VMARLAEAGI DIEAVARQLV EEGVQLFVDA ADALLGAVAG
KRAALLDHRL DAQTFKFDEP LQAATDKAVE SWRASGAIRR LWAHDATVWT GRDEDKWLGW
LRIVEDELER VDLYESFAEE VRAEGFTDAV VLGMGGSSLG PEVISATYGH REGFPKLRIL
DSTDPDEVRA VEAAVKLETT LFIVASKSGS TLEPNVFRDY FLGRMKDVVG ADKAGRHFVA
VTDPGSAMEK AARDDNFRKI FLGVPQIGGR YSVLSAFGLV PAAAAGVEIR EFLDSARMMV
RSCGPAVPPA VNPGVRLGAA MGVAAKDFGR DKITIIASPG IGTFGTWAEQ LIAESTGKEG
VGIIPVEGEP VGVPAVYGED RLFVYLRLTS QADARQDEAV KILESEAQPV VRIDLDKVEQ
LPQEFFRFEI ATAVAGAVLG INPFDQPDVE ASKIETKKLF ASAEETGALP AETPIFEDET
VALYADAANA EALRPGEGFE AIVAAHLARV KPCDYVAVLA YVERNEAHHA ALQEARLTVR
DARQVATCLE FGPRFLHSTG QAYKGGPASG VFLQITADPS ADLPIPGRKL GFKTVIAAQA
RGDFAVLSER KRRALRIHLK GGDVSGGVKR VAAAIKAAVA G