Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mkms_5390 |
Symbol | |
ID | 4613074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 5625002 |
End bp | 5625991 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639795085 |
Product | cellulase |
Protein accession | YP_941366 |
Protein GI | 119871414 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.217743 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCAG CTGTTGGTGC AGTCGCGCGG TGGGTCGCGC CGTTCCTGAC GGTCGCGGCC GTCGCGGGTA CGGCCGCCGT CGCCGAACCC GTGAACGTCG ACCCCGCTCC GGCGGTGCGT CTGGTCAGCG ATGCGAACCC GCTGGTCGGC AGGCCCTTCT ATGTCAATCC GGCGTCCAAG GCCATGCGGG CGGTGCAGGG CAACTCGGAC CCGTTGCTGG CTTCGGTCGC CAACACCCCG ACGGCGTACT GGATGGATCA CCTCTCCACC CCGTCGGTCG ACTCGAAGTA CATCGCCGAC GCACAGGCCG CGGGCACCAC ACCGATCCTG GCGCTGTACG GCATCCCCAA CCGCGACTGC GGGAGCTTCG CCGCGGGCGG ATTCGGCTCG GCCGGGGCGT ATCGAGCGTG GATCGACGGC GTGGCCGGAG CCATCGGAGG GGGCCCGGCG GCGGTCGTCC TCGAACCCGA CGCGCTGGCC ATGATCGACT GCCTGTCACC GGGCCAGCAG CAGGAACGCC TCGAGCTGAT CGGCTACGCC GTCGACACCC TGACCCGCAA CCCGGCCACC GCGGTGTACG TGGACGCCGG TCATCCGCGC TGGGTGGCCG CCGATGTGAT GGCCGGCCGG CTGAACCAGG TCGGCGTCGC CAAGGCGCGC GGCTTCAGCC TCAACACCGC CAACTTCTTC ACCACCGAGG AGTCGATCGG CTACGGCCAG GCCGTCTCGG GGATGACGAA CGGATCGCAC TTCGTGATCG ACACGTCGCG CAACGGCGTC GGACCGGTCG ACAGCGATTC GTGGTGCAAC CCTCCCGGCC GCGCGTTGGG CACCCCGCCC ACGACGGCCA CCGGCCACCC GCAGGTCGAC GCCTTCCTGT GGGTCAAGCG TCCCGGTGAG TCCGACGGAT CGTGCGGCGG CGGGGCGCCC AGCGCGGGCA CGTTCGTCGC TCAGTACGCC ATCGATCTGG CCCGCACCGC AGGCTGGTAG
|
Protein sequence | MSSAVGAVAR WVAPFLTVAA VAGTAAVAEP VNVDPAPAVR LVSDANPLVG RPFYVNPASK AMRAVQGNSD PLLASVANTP TAYWMDHLST PSVDSKYIAD AQAAGTTPIL ALYGIPNRDC GSFAAGGFGS AGAYRAWIDG VAGAIGGGPA AVVLEPDALA MIDCLSPGQQ QERLELIGYA VDTLTRNPAT AVYVDAGHPR WVAADVMAGR LNQVGVAKAR GFSLNTANFF TTEESIGYGQ AVSGMTNGSH FVIDTSRNGV GPVDSDSWCN PPGRALGTPP TTATGHPQVD AFLWVKRPGE SDGSCGGGAP SAGTFVAQYA IDLARTAGW
|
| |