Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_0038 |
Symbol | |
ID | 5835558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | - |
Start bp | 43248 |
End bp | 45533 |
Gene Length | 2286 bp |
Protein Length | 761 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641365822 |
Product | Kojibiose phosphorylase |
Protein accession | YP_001637537 |
Protein GI | 163849494 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1554] Trehalose and maltose hydrolases (possible phosphorylases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGAGG TGCTTCGACC GACACAGGAG CCCGGCTGGG TTCTCACGCA CGAAGGCTAC AGCGTGCTTA CGGAGAGCGC GGTCGAATCC CGCTTTGCTC TCGGCAACGG CTTCCTCGGC ATGCGTGCCG CGCGCTCGAC GGGCCGAGGG CCGACCTGGG TGAGCTGGCT CGGCTACATC CGATGGGCCT CGTGGCCGCG CTGCTACGTC GCCGGGCTGT TCGACATGCC CAACACCGAG CCGCCTGTGC CGGCGCTCGT GCCCGTCGCC GACTGGTCGC GGATCCGCCT CATCCTCGAT GGGGAGCCGC TGGTGGTGCG CGAAGGCGAG ATTCTTCATG GCATGCGGCG GCTCGACATG CGGCGCGGCG TGCTTCTCTC CGAATGGACG CATCGGACAC CGGCGCAGGT GACCGCGAAG GGCCACGAGC TGCGCCTCCT GTCGCTGGCG GACCGGTCGG TGGGGCTCCA GCTTCAGCAG ATCGTGCTGG ACCGCGACGA CATCGACGTC CGCCTCGAAG CGAGCTTCGG GCTAGCCGGC GTCGGTATGG AGCCGGTGCG TCTCGAAAAC GACCTCGGCG CGTGGCGCAC CGAGGGGACC GGTAAGGTCG TGGCGATGGC AGGTGCCGCA TCGTTGCATC TTGATGGCGC CTTGGCCGAC TCCGAGCGCC CATTTCCGCT GCGTTGGATC TGGCGTTGGC GCTCGAAGGC TGGCCAGGTG GCGCAATTCG CCCGCCTCGT CGCCGTCGCT CGCGCCGAGC GGTCGGAAGA GGATCCTGCG CCCCGCGCCG CGGCGACGCT CGCGCGCAGC ACATCGGTGG GCTGGCGCGC GATCCTCAAG GCTCATGAAT CCGCATGGGA TGCACACTGG AGCGACAGCG GCATCGTCAT CGACGGCGAC GATGACCTGC AGCGCGCGCT GCGGTTTGCC GTGTACCACC TGACGAGCGC CGCGAACCCG AGCGACGACC GGGTTTCGAT CGGCGCGCGC GCGCTGACCG GCGATGCCTA TTTCGGCCAC GTCTTCTGGG ACACCGAGAT CTATCTTCTG CCGTTCTACA CCGCGGTCTG GCCGGAAGCG GCGCGCGCGC TGCTGATGTA CCGGTTCCAT ACGCTGCCCG GAGCACGGGC CAAGGCGACG CTCGGCGGCT GGCGAGGCGC CCTCTATCCA TGGGAATCGG CCGACACCGG CGATGAGACT ACGCCGGACT CGGTGCTGGG GCCCGACGGG AAGCCGATCG AGATCCTGAC TGGCAAGATG GAGCACCACA TCAGCGCCGA CGTCGCCTAC GCGGTGTGGC AGTACTGGCG TGCCACCGGC GACGACGATT TCTTCCGCGA TGCGGGGGCG GAAATTCTCC TTGAGACGGC GCGTTTCTGG GCGTCCCGAG CCGTCGCCGA AGCGGATGGC CGGCGCCACA TCCGCCATGT GATCGGGCCG GACGAGTACC ATGAGGATGT CGACGACAAC GCCTTCACCA ACGTGATGGC GCGCTGGAAC ATCGGCTGCG CCCTGGAGGC GCTCGACCTG TTGCGCAAGG GTTGGCCGGA CCGTGCCGAG GCGCTTCGAG ACAAGCTCGC GCTCGACGAC AGGGAACTCG ATGACTGGCG GGACGCGGTC GCGCGGATCG TCACCGGCCT CGACCCCGCG ACCGGGCTGT ACGAGCAGTT CGCTGGCTTC CACGGCCTCA AGCAGCTGAA CGTCGCGGAC TATGTCGACC ATGCACTGCC GATCGACGTG GTCATCGGCC GGGAGCAGAC GCAAAGCTCG CAGGTGATCA AGCAAGCCGA CGTCGTCGCG CTGATCGCCT TGTTGCCCCA GGAATTTCCC GGACAGGGAG CGGAGATCAA TTTCCGCCAT TACGAGCCGC GCTGTGCCCA TGGCAGCTCC TTGAGCGCCG CGATGCATGC CCGCGTGGCC GCGCGTCTGG GCGCCTCGGA CACGGCTCTT CGATACATGC GCGAGACCGC GTCTCTCGAC CTCGACCTCG ATCCGAACAG CGCCGGCGGC GTCCGGATCG CCGGGCTCGG CGGGTTGTGG CAGGCGGCGA TCCTGGGCAT CGCCGGCCTG AACTTAGCGG GCGACACGCT GGAGCTCGAT CCCAAGCTGC CGCCTCAGTG GGATACCCTT TCGTTCAAGG TCTGGTGGAG AGGCCGATCC GTCGGGCTCA GCGTCAGCCG CCCTATGCTG GAGGCCAGGC TGATGGACGG AGACGGGATG GACGTCACGG TCGCGGGCGT GACGCAGCAC CTGACACCTG GATCGCCACT GCGATTCGAG CTGTAG
|
Protein sequence | MLEVLRPTQE PGWVLTHEGY SVLTESAVES RFALGNGFLG MRAARSTGRG PTWVSWLGYI RWASWPRCYV AGLFDMPNTE PPVPALVPVA DWSRIRLILD GEPLVVREGE ILHGMRRLDM RRGVLLSEWT HRTPAQVTAK GHELRLLSLA DRSVGLQLQQ IVLDRDDIDV RLEASFGLAG VGMEPVRLEN DLGAWRTEGT GKVVAMAGAA SLHLDGALAD SERPFPLRWI WRWRSKAGQV AQFARLVAVA RAERSEEDPA PRAAATLARS TSVGWRAILK AHESAWDAHW SDSGIVIDGD DDLQRALRFA VYHLTSAANP SDDRVSIGAR ALTGDAYFGH VFWDTEIYLL PFYTAVWPEA ARALLMYRFH TLPGARAKAT LGGWRGALYP WESADTGDET TPDSVLGPDG KPIEILTGKM EHHISADVAY AVWQYWRATG DDDFFRDAGA EILLETARFW ASRAVAEADG RRHIRHVIGP DEYHEDVDDN AFTNVMARWN IGCALEALDL LRKGWPDRAE ALRDKLALDD RELDDWRDAV ARIVTGLDPA TGLYEQFAGF HGLKQLNVAD YVDHALPIDV VIGREQTQSS QVIKQADVVA LIALLPQEFP GQGAEINFRH YEPRCAHGSS LSAAMHARVA ARLGASDTAL RYMRETASLD LDLDPNSAGG VRIAGLGGLW QAAILGIAGL NLAGDTLELD PKLPPQWDTL SFKVWWRGRS VGLSVSRPML EARLMDGDGM DVTVAGVTQH LTPGSPLRFE L
|
| |