Gene Information |
Locus tag | Mchl_4803 |
Symbol | |
ID | 7118253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | - |
Start bp | 5098995 |
End bp | 5100152 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643527500 |
Product | Cellulase |
Protein accession | YP_002423502 |
Protein GI | 218532686 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3405] Endoglucanase Y |
TIGRFAM ID | |
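The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1158 bp) and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 5098995
END_BP = 5100152


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1158 385
```

Both values agree with the Gene Length and Protein Length rows of the record.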
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATGTTCG GCCGCCCACG CGCCATTGCA GCCTCGCTGC TCCTAGGCCT GACCCTCGCC CCCTTGCAAG CGATGGCGGA GCCGGCGCCC ATCGAAGCAA AGCCGGCCGC CGGAGCCACC GTCGGGAAGG CGGCCTCCGT GACTGCCGAG ACGCTGCCGA TGCTGAACAG CCTCGGTCAA GCCGGGGCCT GGCGCAGCTA CAAGGCGCGC TTCGTCACCG ATCAGGGCCG CGTAGTCGAC ACGGCCAACG GCCGCATCAG CCATAGCGAG AGCCAGGGCT ACGGCCTGCT GCTCGCCGTA GCGGCCGGCG ATCGCGACAC CTTCCAGCGG ATCTGGAATT GGACGCGGGC CAATCTGATG GTGCGCGACG ACGCGCTGCT GGCCTGGCGC TGGGAGCCGG ACAAGCGGCC CGCCGTAGCC GACATGAACG ACGCGACGGA CGGCGACATC CTCGTGGCCT GGGCGCTGGT CGAGGCGGGC GAGGGCTGGG CGGATGATAG CTACCGCTTG GCGGCCCGCC GGATCGCCGT GGACATCGCC CGCCGGACGG TCCTCTTCCG CACGGAGGGC CCTCCGCTCC TGCTGCCGGC CATGAGCGGG TTCTCGGCGG AGGACCGGCC GGACGGGCCG GTTATCAATC TCTCCTACTG GATCTTCCCG GCCTTCCCGC GACTCGCCGC TGTCGCGCCG GAATTCGACT GGGACCGGCT CGGCGCGACT GGTCGGGACC TCGTCCTGCG GGCCCGCTTC GGCGATGCGA AACTTCCGAC CGAGTGGATC TCGATGCGGG GCGGCCAGCC CCAGCCCGCA TCCGGCTTCC CGCCCCACTT CTCCTACAAC GCCCTGCGGG TGCCCCTCTA CCTCGCCATG GCGGGCATCA GCGAGCGACG CTACTACGAG CCGCTGTTGG CCCTCTGGGG CGAGCCGGAC CCGGCCGGAC TGCCGATCAT TGACACGGAG GGCGGGTCTG TTGCGGGACG CATGGCCGAG CCCGGCTACG CGATCATCCC GGCCCTGGCC GCCTGCGCGG TGACCGGTGC GCCGCTGCCG GCCGGCCTGC AGGACCCGGC CACCGACGAG AACTACTATC CCGCCACCCT CCACCTCCTG GCCTTGACGG CCGCCAACAT GAGGTATCGG CCATGCCTCG GCCGCTGA
Protein sequence | MMFGRPRAIA ASLLLGLTLA PLQAMAEPAP IEAKPAAGAT VGKAASVTAE TLPMLNSLGQ AGAWRSYKAR FVTDQGRVVD TANGRISHSE SQGYGLLLAV AAGDRDTFQR IWNWTRANLM VRDDALLAWR WEPDKRPAVA DMNDATDGDI LVAWALVEAG EGWADDSYRL AARRIAVDIA RRTVLFRTEG PPLLLPAMSG FSAEDRPDGP VINLSYWIFP AFPRLAAVAP EFDWDRLGAT GRDLVLRARF GDAKLPTEWI SMRGGQPQPA SGFPPHFSYN ALRVPLYLAM AGISERRYYE PLLALWGEPD PAGLPIIDTE GGSVAGRMAE PGYAIIPALA ACAVTGAPLP AGLQDPATDE NYYPATLHLL ALTAANMRYR PCLGR
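The relationship between the two sequences can be checked with a short script. This is a sketch under stated assumptions: the gene sequence is taken to be listed 5' to 3' in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs only in the start codons it permits). The helper names `clean`, `translate`, and `gc_percent` are illustrative, not from any library.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses the
# same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the 10-character block spacing used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and last two blocks of the gene sequence above.
print(translate("ATGATGTTCG GCCGCCCACG CGCCATTGCA"))  # prints: MMFGRPRAIA
print(translate("CCATGCCTCG GCCGCTGA"))               # prints: PCLGR
```

The two printed fragments match the first and last residues of the protein sequence in the record. Passing the complete gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.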