Gene Information |
Locus tag | Mext_1369 |
Symbol | |
ID | 5831228 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1525352 |
End bp | 1526611 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641367162 |
Product | cellulase |
Protein accession | YP_001638841 |
Protein GI | 163850798 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
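The coordinate and length fields above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the annotated span includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Mext_1369.
length = gene_length(1525352, 1526611)
print(length, protein_length(length))
```

With the record's values this reproduces the listed gene length (1260 bp) and protein length (419 aa).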
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.140068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0392414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGGCCCC CCCGCACCCC GCGCCGCATC GGATGGGCTT TCGCCTCGGC GCTCGCCCTC GCGGCGGCCG GATCCGCCGC GGCACAGACG ACCCCGCAGC CGGCATCCCC ACAGCCGGCC TCCCCGGAGA CGTCGTCCCA AATGCCTCAG CCCGCGCCTC AGCCCGCGCC CCAGCAGACG GAGGCCACCA CGGTGCCCCC CGCCACGGCC CTGCCGGGCC CGCCGCGCGC CGAGACGTCG GCGCGGACCG ATTCCGGGCC GCTCCTCGCC AACACCCTGG GCGACGACGC GGCATGGCGC GCCTACCGGT CGCGGTTCAT CACCGAGCAG GGCCGGATCG TCGATACCGC CAACGGCCTG ATCAGCCACA GCGAGGGCCA GGGCTACGGC ATGCTGCTCG CGGTCGCGGC CGGCGACCGG TCCACCTTCG AGCGGATCTG GGGCTGGACC CGCGCCAACC TCATGGTGCG CTCCGACGAA TTGCTGGCAT GGCGCTGGGC GCCCGACCAC CGCCCGGCGG TCTCCGACAT GAACAACGCC ACCGACGGCG ACATCCTGGT CGCCTGGGCG CTCACCGAGG CAGCGGAGGC CTGGGGCGAA CCCTCCTACC GCACCGCCGC GCGCCGCATC GCCGTCGAGT TCGGCCGCAA GACCATCCTG TTCAAGGACC CGCACGGACC CGTCCTGCTG CCGGCCGTCT CCGGCTTCTC GGCCCGCGAG CGCGCCGACG GCCCGCTCAT CAACCTGTCC TACTGGGTCT TCCCCGCCTT CCAGCGGCTG CCGATCGTCG CCCCGGAATA CGATTGGGCC TCGCTGATCC GCAGCGGCGT CGATTTCCTG CGCCAGTCCC GCTTCGGGCC GAGCAGCCTG CCGACCGAGT GGATCTCGGC CAAGGACAGC TTACGCCCCG CCGACGGCTT CCCGCCGCTA TTCTCCTACA ACGCGATCCG GGTGCCGCTC TATCTCGCCT GGGCCGGCGT CGGGCGGCCG GAGGATTACG CGCCGTTCAA GACGCTCTGG GGCGGGATCG AGCGCGAGCG CCTGCCGATC GTCGACACCC GCGACGGGCA GCCGGTCGAG TGGCTGAGCG AGCCGGGCTA TATGGCGATC TCCGCGATCA CCGCCTGCGC CGCGGACGGA ACACCCTTTC CGGAAGCCTT AAGGACGGTG CAGGACAATC AGAACTACTA TCCCGCCACC CTCCAACTCC TCTCGCTGAT CGCGGCCCGG ATGCGGTATC CGTCATGCGT GAAATCCTGA
Protein sequence | MRPPRTPRRI GWAFASALAL AAAGSAAAQT TPQPASPQPA SPETSSQMPQ PAPQPAPQQT EATTVPPATA LPGPPRAETS ARTDSGPLLA NTLGDDAAWR AYRSRFITEQ GRIVDTANGL ISHSEGQGYG MLLAVAAGDR STFERIWGWT RANLMVRSDE LLAWRWAPDH RPAVSDMNNA TDGDILVAWA LTEAAEAWGE PSYRTAARRI AVEFGRKTIL FKDPHGPVLL PAVSGFSARE RADGPLINLS YWVFPAFQRL PIVAPEYDWA SLIRSGVDFL RQSRFGPSSL PTEWISAKDS LRPADGFPPL FSYNAIRVPL YLAWAGVGRP EDYAPFKTLW GGIERERLPI VDTRDGQPVE WLSEPGYMAI SAITACAADG TPFPEALRTV QDNQNYYPAT LQLLSLIAAR MRYPSCVKS
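The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, assuming plain Python without a bioinformatics library, of how the GC content and the translation could be recomputed from the gene sequence. The helper names are hypothetical. The codon table is the amino-acid assignment shared by NCBI translation tables 1 and 11; the sketch does not handle alternative start codons (such as GTG or TTG), which is sufficient here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
prefix = clean("ATGCGGCCCC CCCGCACCCC GCGCCGCATC")
print(translate(prefix))
```

Applied to the first three blocks of the gene sequence, this yields the first ten residues of the listed protein sequence, MRPPRTPRRI.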