Gene Mmcs_5301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmcs_5301 
Symbol 
ID4114128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium sp. MCS 
KingdomBacteria 
Replicon accessionNC_008146 
Strand
Start bp5587380 
End bp5588369 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content71% 
IMG OID638034457 
Productcellulase 
Protein accessionYP_642458 
Protein GI108802261 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.104046 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTCAG CTGTTGGTGC AGTCGCGCGG TGGGTCGCGC CGTTCCTGAC GGTCGCGGCC 
GTCGCGGGTA CGGCCGCCGT CGCCGAACCC GTGAACGTCG ACCCCGCTCC GGCGGTGCGT
CTGGTCAGCG ATGCGAACCC GCTGGTCGGC AGGCCCTTCT ATGTCAATCC GGCGTCCAAG
GCCATGCGGG CGGTGCAGGG CAACTCGGAC CCGTTGCTGG CTTCGGTCGC CAACACCCCG
ACGGCGTACT GGATGGATCA CCTCTCCACC CCGTCGGTCG ACTCGAAGTA CATCGCCGAC
GCACAGGCCG CGGGCACCAC ACCGATCCTG GCGCTGTACG GCATCCCCAA CCGCGACTGC
GGGAGCTTCG CCGCGGGCGG ATTCGGCTCG GCCGGGGCGT ATCGAGCGTG GATCGACGGC
GTGGCCGGAG CCATCGGAGG GGGCCCGGCG GCGGTCGTCC TCGAACCCGA CGCGCTGGCC
ATGATCGACT GCCTGTCACC GGGCCAGCAG CAGGAACGCC TCGAGCTGAT CGGCTACGCC
GTCGACACCC TGACCCGCAA CCCGGCCACC GCGGTGTACG TGGACGCCGG TCATCCGCGC
TGGGTGGCCG CCGATGTGAT GGCCGGCCGG CTGAACCAGG TCGGCGTCGC CAAGGCGCGC
GGCTTCAGCC TCAACACCGC CAACTTCTTC ACCACCGAGG AGTCGATCGG CTACGGCCAG
GCCGTCTCGG GGATGACGAA CGGATCGCAC TTCGTGATCG ACACGTCGCG CAACGGCGTC
GGACCGGTCG ACAGCGATTC GTGGTGCAAC CCTCCCGGCC GCGCGTTGGG CACCCCGCCC
ACGACGGCCA CCGGCCACCC GCAGGTCGAC GCCTTCCTGT GGGTCAAGCG TCCCGGTGAG
TCCGACGGAT CGTGCGGCGG CGGGGCGCCC AGCGCGGGCA CGTTCGTCGC TCAGTACGCC
ATCGATCTGG CCCGCACCGC AGGCTGGTAG
 
Protein sequence
MSSAVGAVAR WVAPFLTVAA VAGTAAVAEP VNVDPAPAVR LVSDANPLVG RPFYVNPASK 
AMRAVQGNSD PLLASVANTP TAYWMDHLST PSVDSKYIAD AQAAGTTPIL ALYGIPNRDC
GSFAAGGFGS AGAYRAWIDG VAGAIGGGPA AVVLEPDALA MIDCLSPGQQ QERLELIGYA
VDTLTRNPAT AVYVDAGHPR WVAADVMAGR LNQVGVAKAR GFSLNTANFF TTEESIGYGQ
AVSGMTNGSH FVIDTSRNGV GPVDSDSWCN PPGRALGTPP TTATGHPQVD AFLWVKRPGE
SDGSCGGGAP SAGTFVAQYA IDLARTAGW