Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_4500 |
Symbol | |
ID | 5832021 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 5018815 |
End bp | 5020674 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641370293 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_001641939 |
Protein GI | 163853896 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGGAC GGATCGAGGA TTACGCCCTG ATCGGCGACG GGCGCACCGC GGCCCTAGTC GGGCGCGACG GCAGCATCGA CTGGCTGTGC ATGCCGCGCT TCGATGCCTC GGCCCTGTTC GCGAGCCTTC TCGGAACGGA GGAGCACGGC TTCTGGAAGC TCGCTCCGGC GGCGGAGGGG GCACAGCATT CCTGGCGCTA CCGCGGCGGC TCGCTGGTGC TGGAGACCAC GCACAAGACC CATGAGGGAG AGGTCCGCGT CACCGACTTC ATGCCCGTCG GCGACGGCAG CCACGTGATC CGTCTCGTCG AGGGGGTGCG CGGCCGGATG GCGATGCGGA TGGAACTGGC CGTGCGCTTC GATTACGGCT CGGCGGTGCC CTGGGTGTCG CGCAGCGAAC TCGGGGATCT GCGCGCCATC TCCGGGCCGC ACAAGGTGGT CCTGCGCACC AACGCGCCGA TGCGCGGCTC CGCCCACCAG ACCACGGTCT CGGAATTCAC GGTCTACAAG GGCGACGCAG TCCGCTTCGT GCTGAGCTAC GGCGCCTCGC ACGAAGACGA TCCTGCACCG ATCGAGCCGC GCCGCTGGCT CGACGACACC AACCGTTTCT GGCGCGAGTG GTCGAGCCGC TGCACGCTCC AGACGCCTTG GGACGCCATC CTGCGCCGCT CGCTCCTGAC GCTGAAGGCG CTGATCTACC AACCGACCGG CGGGATCGTC GCCGCGCCCA CCACGTCGCT GCCGGAAGAA CTCGGCGGGG TGCGCAACTG GGACTACCGC TTCTGCTGGC TGCGCGACTC CACCTTCACG CTGCTGGCGC TGATGGATTC AGGCTACATC GAAGAGGCCC GCGCCTGGCG CGACTGGCTG ACCCGGGCGG TGGCCGGCAA CCCGGAGCAG GCGCATATCC TCTACGGCAT TGCCGGCGAG CGGCTGTTGC CGGAGATCGA ACTCGATTGG CTGCCCGGCT ACGAGGGTTC GCGGCCCGTG CGCGTCGGCA ACGCCGCGAT CGCCCAGTTC CAGCTCGACG TCTACGGGGA GTTGTTCGAC GCCCTGTTCC AGGCCCGTGC CCGCGGCATG GGGCAGAACA AGGACGGCCT GCGCGTCGGC CAAGCGATCA TCAAGCACCT CGAGACGGCG TGGCGCCAGC CCGATGAGGG CATCTGGGAG GTGCGCGGCG GGCGGCGCCA CTTCGTCCAT TCCAAGGTGA TGGCCTGGGT GGCCTTCGAC CGGGCGATCC GCAGCGTCGA GCTGATGGGC GACGACGATC CGCACACCGT CGAAGCGCCG GTCGCGCACT GGAAGGCGAT CCGCGACGAG ATCCACGCCG AGGTCTGTGC CAAGGGGTTC GACCCGGAGC TGAACAGCTT CGTCCAATCC TACGGAAGCA AGGCGCTCGA TGCGAGCCTG CTGCTCATCG CGCATATGGG CTTCCTGCCC CAGGACGATC CCCGCGTCGT CGGCACGGTG GCGGCCGTCG AATCGCATCT GATGCGCGAG GGGTTCATCC TGCGCTACGA GACCGAAGGC CAGACCACCG ACGGCCTGCC CGGCAACGAG GGTGCGTTCC TGCCCTGCAG CTTCTGGTAC GCCGACAATC TGATCGGCCT CGGCCGCTGC GACGAGGCTC GCGAGCTGAT CGAGCGCCTG ATCGGTGTCT GCACCGATCT CGGATTGGTC TCCGAAGAGT ACGACGTGCA CGCGAAACGG CTGGTGGGGA ACTTCCCTCA GGCGTTCACG CATGTTGCAC TCGTCAACAC GATCCTCAAT TACAGCCGCG CGACCGGTCC TGCGAAGGAA CGGGGCAGCG GCGCGGACGT ATCCGAGTCG AGGGTAGGCG AATCCATCGC AGCGCAGTAG
|
Protein sequence | MAGRIEDYAL IGDGRTAALV GRDGSIDWLC MPRFDASALF ASLLGTEEHG FWKLAPAAEG AQHSWRYRGG SLVLETTHKT HEGEVRVTDF MPVGDGSHVI RLVEGVRGRM AMRMELAVRF DYGSAVPWVS RSELGDLRAI SGPHKVVLRT NAPMRGSAHQ TTVSEFTVYK GDAVRFVLSY GASHEDDPAP IEPRRWLDDT NRFWREWSSR CTLQTPWDAI LRRSLLTLKA LIYQPTGGIV AAPTTSLPEE LGGVRNWDYR FCWLRDSTFT LLALMDSGYI EEARAWRDWL TRAVAGNPEQ AHILYGIAGE RLLPEIELDW LPGYEGSRPV RVGNAAIAQF QLDVYGELFD ALFQARARGM GQNKDGLRVG QAIIKHLETA WRQPDEGIWE VRGGRRHFVH SKVMAWVAFD RAIRSVELMG DDDPHTVEAP VAHWKAIRDE IHAEVCAKGF DPELNSFVQS YGSKALDASL LLIAHMGFLP QDDPRVVGTV AAVESHLMRE GFILRYETEG QTTDGLPGNE GAFLPCSFWY ADNLIGLGRC DEARELIERL IGVCTDLGLV SEEYDVHAKR LVGNFPQAFT HVALVNTILN YSRATGPAKE RGSGADVSES RVGESIAAQ
|
| |