Gene Mkms_5390 details


Gene Information

Locus tag: Mkms_5390
Symbol:
ID: 4613074
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 5625002
End bp: 5625991
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 71%
IMG OID: 639795085
Product: cellulase
Protein accession: YP_941366
Protein GI: 119871414
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
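
The coordinate and length fields in this record can be checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported 990 bp) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(5625002, 5625991)
print(length, protein_length(length))  # prints: 990 329
```

Both results agree with the Gene Length and Protein Length fields listed above.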


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.217743
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTCAG CTGTTGGTGC AGTCGCGCGG TGGGTCGCGC CGTTCCTGAC GGTCGCGGCC 
GTCGCGGGTA CGGCCGCCGT CGCCGAACCC GTGAACGTCG ACCCCGCTCC GGCGGTGCGT
CTGGTCAGCG ATGCGAACCC GCTGGTCGGC AGGCCCTTCT ATGTCAATCC GGCGTCCAAG
GCCATGCGGG CGGTGCAGGG CAACTCGGAC CCGTTGCTGG CTTCGGTCGC CAACACCCCG
ACGGCGTACT GGATGGATCA CCTCTCCACC CCGTCGGTCG ACTCGAAGTA CATCGCCGAC
GCACAGGCCG CGGGCACCAC ACCGATCCTG GCGCTGTACG GCATCCCCAA CCGCGACTGC
GGGAGCTTCG CCGCGGGCGG ATTCGGCTCG GCCGGGGCGT ATCGAGCGTG GATCGACGGC
GTGGCCGGAG CCATCGGAGG GGGCCCGGCG GCGGTCGTCC TCGAACCCGA CGCGCTGGCC
ATGATCGACT GCCTGTCACC GGGCCAGCAG CAGGAACGCC TCGAGCTGAT CGGCTACGCC
GTCGACACCC TGACCCGCAA CCCGGCCACC GCGGTGTACG TGGACGCCGG TCATCCGCGC
TGGGTGGCCG CCGATGTGAT GGCCGGCCGG CTGAACCAGG TCGGCGTCGC CAAGGCGCGC
GGCTTCAGCC TCAACACCGC CAACTTCTTC ACCACCGAGG AGTCGATCGG CTACGGCCAG
GCCGTCTCGG GGATGACGAA CGGATCGCAC TTCGTGATCG ACACGTCGCG CAACGGCGTC
GGACCGGTCG ACAGCGATTC GTGGTGCAAC CCTCCCGGCC GCGCGTTGGG CACCCCGCCC
ACGACGGCCA CCGGCCACCC GCAGGTCGAC GCCTTCCTGT GGGTCAAGCG TCCCGGTGAG
TCCGACGGAT CGTGCGGCGG CGGGGCGCCC AGCGCGGGCA CGTTCGTCGC TCAGTACGCC
ATCGATCTGG CCCGCACCGC AGGCTGGTAG
 
Protein sequence
MSSAVGAVAR WVAPFLTVAA VAGTAAVAEP VNVDPAPAVR LVSDANPLVG RPFYVNPASK 
AMRAVQGNSD PLLASVANTP TAYWMDHLST PSVDSKYIAD AQAAGTTPIL ALYGIPNRDC
GSFAAGGFGS AGAYRAWIDG VAGAIGGGPA AVVLEPDALA MIDCLSPGQQ QERLELIGYA
VDTLTRNPAT AVYVDAGHPR WVAADVMAGR LNQVGVAKAR GFSLNTANFF TTEESIGYGQ
AVSGMTNGSH FVIDTSRNGV GPVDSDSWCN PPGRALGTPP TTATGHPQVD AFLWVKRPGE
SDGSCGGGAP SAGTFVAQYA IDLARTAGW
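
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the database's own tooling. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# The amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-base display blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGTCCTCAG CTGTTGGTGC AGTCGCGCGG TGGGTCGCGC CGTTCCTGAC GGTCGCGGCC"
print(translate(first_line))  # prints: MSSAVGAVARWVAPFLTVAA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the reported GC content.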