Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_0105 |
Symbol | |
ID | 6131567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | - |
Start bp | 122895 |
End bp | 124076 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641640445 |
Product | cellulase |
Protein accession | YP_001767124 |
Protein GI | 170738469 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.535219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.625739 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATCCT TCCTGCGCCT CGCGGCGACG ATCCTGATCC TGGGCAGCGC CGCGCTGGGC GGCGCCCGGC CAGGCCTCGC CCAGCAGCCG GGCGCCGCCG AGCCGCCGCC CCGCGCGCGC GACGCCAAGA ACCGTGATGG CGAGAGCCGG ACTGGCGAGA ACCGGACTGG CGAGCCGGCG TCGCTCCACA ACGCCCTCGG CAACGGGGCG GCGTGGCGCG CCTACAAGGC GCGCTTCGTC ACCGACCAGG GCCGCGTCGT CGACACGGCC AACGGCCGCA TCAGCCACAG CGAGGGGCAG GGCTACGGCA TGCTGCTCGC CGTCGCGGCC GGGGACCGCG ACGCCTTCCA GCGGATCTGG GACTGGACCC GCGCCAACCT CATGGTCCGG GACGATTCCC TGCTGGCCTG GCGCTGGGAG CCGGACAAGC GCCCGGGCGT CGCCGACATG AACAACGCCA CCGACGGCGA CCTCCTGGTG GCCTGGGCCC TCATCGAGGC CGCGGACGCC TGGCAGGACG AGGGCTACCG CCTCGCGGCA CGGCGCATCG CCGTCGATAT CGGCCGCCGC ACGGTGCTGT TCCGCTCCGA GGGCGCGGCC CTGCTCCTGC CGGGCATGGC CGGGTTCTCG GCCGAGGACC GGGCGGATGG GCCGGTGATC AACCTGTCCT ACTGGATCTT CCCCGCCCTG GCGCGCCTGC CCGCGGTGGC GCCCGAATTC GACTGGGCCC GGCTCAGCGC GGCCGGCCTC GACCTCGCGC TGCGGGCGCG GTTCGGGGAG GCGGCGCTGC CGGTCGAGTG GACCTCGCTG CGCGGGGGCG AGCCGAAAGC GGCGGCCGGC TTCCCCCCGG TCTTCTCCTA CAACGCGGTC CGGGTGCCGC TCTACCTCGC CATGGCGGGC ATCGCGGAGC GCCGCTACTA CGCCCCCTTC GTGAAGGCCT GGGCCGAGGT CGGCGCCTCG GGCCTGCCGG TGGTCGACAC CGCGAGCAAC GACGTCGTCG GACGGATGCA GGAGCCCGGC TACCTCGCGG TCGCGGCCCT CACCGCCTGC GCGGCCGGCA CGGCCTCGCT GCCGCCCGCC CTTCCCGACC CGACCTCGCC GCAGAACTAC TACCCGGCGA CGCTCCAGCT CCTGGCCCTC GCCGCCGTCA ACATGAGGTA CGCGTCATGC CTTGGGCGCT GA
|
Protein sequence | MRSFLRLAAT ILILGSAALG GARPGLAQQP GAAEPPPRAR DAKNRDGESR TGENRTGEPA SLHNALGNGA AWRAYKARFV TDQGRVVDTA NGRISHSEGQ GYGMLLAVAA GDRDAFQRIW DWTRANLMVR DDSLLAWRWE PDKRPGVADM NNATDGDLLV AWALIEAADA WQDEGYRLAA RRIAVDIGRR TVLFRSEGAA LLLPGMAGFS AEDRADGPVI NLSYWIFPAL ARLPAVAPEF DWARLSAAGL DLALRARFGE AALPVEWTSL RGGEPKAAAG FPPVFSYNAV RVPLYLAMAG IAERRYYAPF VKAWAEVGAS GLPVVDTASN DVVGRMQEPG YLAVAALTAC AAGTASLPPA LPDPTSPQNY YPATLQLLAL AAVNMRYASC LGR
|
| |