Gene Mext_0036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_0036 
Symbol 
ID5831752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp38822 
End bp41194 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content56% 
IMG OID641365820 
ProductKojibiose phosphorylase 
Protein accessionYP_001637535 
Protein GI163849492 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1554] Trehalose and maltose hydrolases (possible phosphorylases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0720168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCC CGCGCTATGT TCCACCCGAA GATATCTTTC CGCCAAACCC CTGGGCATTT 
GAGGCCGTCC AATACGACGC ACGTCTGGCG CAAGAATTGA CGGGCCAAGC CGAGACGATG
TTTGCCCTGT CCAACGGGTA TCTGGGCATC CGCGGTTCAG TAGAGGAAGG TACTCCGGTT
CAAGAGGCCG GCACGTATCT GAGCGGTTTC TACGAACACC GCCCGATTTC CTACGGTGAG
CATGCCTACG GCTTCCCAAC CGTGGGCCAG AGCATCTTGA ACTGCCCGAG CGGTACGGCC
CTGAAGCTCT TCGTCGAAGA CGAGCCGTTC GTCGTTCCGC AAGCCGAGAT CCTGTCATAT
CGGCGGAGCC TCGATCTGCG AACCGGCACG CTGAACAGAG ATGTTCGCTG GGCTTCGCCG
GCGGGCACGC GCCTGCATAT GCAGACGGTG CGTCTTGTCT CCCTCGCGCA TCGGCACCTC
GCTGCCATCG TGTTCGTCCT CACGGCAGAA GATGCCGACG TCGAGATCGC GATCTCCTCG
GAACTCGAAA ACGCTCCGTC ATCGACGGCC GACATCGCGG ACCCCCGCCT CGCGGCGAGC
CTTGCCGGGC GTGTCCTGCA CCCCACGGGA TTTCAAGCCG AGGGGATGCG CGCCATGTTG
AGTTACCGGA CTGAGAGCTC CGGGTTTCAC CTCGGCTGCG GGATGGACCA CGCGGTCTCC
TCGGAGAGAC CATATTTCAC AGAGCAGGTT TGCAGCGACG ATTTCGCCGC CGTAACGATC
CGCTGTAAGC TTGCGCGAAA CAAGCCGATC GTGATTTATA AATATTTGAG CTATCATTAT
TCAGACAACT CTGCTCCGGC GCGGATCTTG TTCCAAACCG CACTAACCCT GGACCGCGCG
CTCAAGAGTG GATTTCAGGA AATCGTCGAG CGCCAAAGGA GCGATGTCGA ACGATTTTGG
GCGCGAGCCG ATGTTGCGGT TGAGGGTGAC AATCCGAGAA CTCAGCAGAC GATCCGCTGG
AACCTCTTTC AGCTTCTGCA GGCTTCGGAG CGATCGGAGG GGCACGGAAT CGGCGCTAGA
GGCCTGACCG GAAGAACTTA TGAGGGGCAC TATTTCTGGG ACACGGAAAT ATACGTTCTG
CCATTCCTGA TTTATACGAA TCCAGTGATC GCGCGTAGTG TTCTAAAATT TCGCTACGAC
ATGCTGGACA AGGCGCGGGC GCGGGCGCGA GAATTGGGTC ACCGTGGGGC AACTTTTCCC
TGGCGAACCA TCAACGGCGA CGAAGCGTCG GCATATTATG CTGCGGGAAC GGCGCAGTAT
CACATCAATG CTGATATCGC GTACGCCCTG CAAAAATATG TGAATGTTAC CGGCGACAAC
GAATTCCTCT GGAGCTATGG AGCCGAAATT CTCACGGAGA CTGCCCGTCT GTGGTTCGAT
CTCGGCTTCT TCTCCGAGTC GAAAGGAGGA AAATTTTGTA TCAACGGCGT CACTGGACCG
GATGAATATA CCGCGATCGT GAACAATAAC TGTTTCACGA ACCTGATGGC ACGAGAAAAT
TTGCGCTATG CCGCCCAAGT GGTTCGCGAT CTCAAGCGCG TGCATCCCGA TCGATTCGAT
GCGTTGGCCC AACGCACTGG TTTGGACAGT TCCGAACTGG CCGATTGGGA GGACGCATCC
GAAAGGATGT ACCTGCCCTA TGACGAGCGT TTGAAGATCC ATCCTCAGGA TGACGATTTT
CTGGATCTCG AGAAATGGGA TTTTGCGGCA ACTCCAGAAA ATAGATATCC CCTCTTACTG
TACTATCATC CCTTGAATCT CTACAGGTCG CAGGTCATCA AGCAGGCCGA TACTGTGATG
GCCATGTTTT TGCTTAACGA GCACTTTACC CACGAGGAGA AAAGACGAAA TTTCGAATAT
TACGATCCGC TCACGACACA CGATTCATCT TTGTCAGTTT GTATTCAGAG CGTTGTGGCG
AATGAAATTG GCCTGCCGCA CAAAGCAATT GAATATTTTA ATTTTGCCGC GGCAATGGAT
ATGTCGGATA TCGGCGGGAA TATGATGAAC GGCGCTCACG TCGCCGCAAT CGGCGGCACG
TGGCTTGCCC TCGTCTACGG CTTCGCGGGA CTTCGCGACA GCAAGGGGTG CATTTCGTTC
AATCCCGTCC TCCCGAAGGA GTGGTCCCAC CTGAGTCTTG TCCTGACCGT AAGAGGGCAG
AGGTTTCGGA TTGAGGTTGA TCCTAACTCT GTCATTTACA CTCTTCTCAG TGGAGAGCGG
CTGAATTTTT CGCATGTTGG CGAGGATCTC GTTCTTTCCT CCGCGGAGCC GGTGATCACG
CGCCCGACCG GTCAGGGGCG TGCGGACGCT TAA
 
Protein sequence
MLRPRYVPPE DIFPPNPWAF EAVQYDARLA QELTGQAETM FALSNGYLGI RGSVEEGTPV 
QEAGTYLSGF YEHRPISYGE HAYGFPTVGQ SILNCPSGTA LKLFVEDEPF VVPQAEILSY
RRSLDLRTGT LNRDVRWASP AGTRLHMQTV RLVSLAHRHL AAIVFVLTAE DADVEIAISS
ELENAPSSTA DIADPRLAAS LAGRVLHPTG FQAEGMRAML SYRTESSGFH LGCGMDHAVS
SERPYFTEQV CSDDFAAVTI RCKLARNKPI VIYKYLSYHY SDNSAPARIL FQTALTLDRA
LKSGFQEIVE RQRSDVERFW ARADVAVEGD NPRTQQTIRW NLFQLLQASE RSEGHGIGAR
GLTGRTYEGH YFWDTEIYVL PFLIYTNPVI ARSVLKFRYD MLDKARARAR ELGHRGATFP
WRTINGDEAS AYYAAGTAQY HINADIAYAL QKYVNVTGDN EFLWSYGAEI LTETARLWFD
LGFFSESKGG KFCINGVTGP DEYTAIVNNN CFTNLMAREN LRYAAQVVRD LKRVHPDRFD
ALAQRTGLDS SELADWEDAS ERMYLPYDER LKIHPQDDDF LDLEKWDFAA TPENRYPLLL
YYHPLNLYRS QVIKQADTVM AMFLLNEHFT HEEKRRNFEY YDPLTTHDSS LSVCIQSVVA
NEIGLPHKAI EYFNFAAAMD MSDIGGNMMN GAHVAAIGGT WLALVYGFAG LRDSKGCISF
NPVLPKEWSH LSLVLTVRGQ RFRIEVDPNS VIYTLLSGER LNFSHVGEDL VLSSAEPVIT
RPTGQGRADA