Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mvan_5837 |
Symbol | |
ID | 4643419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 6222708 |
End bp | 6223733 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639809313 |
Product | cellulase |
Protein accession | YP_956608 |
Protein GI | 120406779 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.323665 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTTCGGCG GCACCGCTAA GTTTCCTCAG GTGACGTTCT CAGCTGCAGG TGTAATCGCG CGCTGGATCA CCCCGTGTCT GGCCGTGGCG GCGATCGCCG CCACCGGGGT GGTCGCCGAA CCCGGGCCCA TCCCGGCCGC TCCCGTCGTG CGGCTGGCCA GCGAGGCGAA TCCGCTGGTC GGGCTGCCGT TCTATGTGAA CCCGCAGTCG AAGGGCATGC GGGCCGCGCA GGGCAATCCG GATCCGCTGC TGTCCGCCGT GGTGAACACG CCGACCGCGT ACTGGATGGA CCACATCTCC ACCCCTGCCG TGGACGCGAA GTACATTGCT ACCGCGCAGG CCGCGGGCAC CATGCCGGTG CTGGCGCTCT ACGGGATCCC CAACCGCGAC TGCGGCAGCT ACGCCGCGGG CGGATTCGGG TCGGCGGGTG CCTACCGGGC GTGGATCGAC GGCGTCGCCG CCGCGATCGG CCCCGGACGC GCCGCGGTAA TCCTGGAACC CGACGCGTTG GCCATGATCG ACTGCCTGTC ACCGGGTCAG CAGCAGGAAC GGCTGGAGCT GATCGGCTAC GCCGTCGACA CGCTGTCGCG CAACCCGGGC ACCGCGGTGT ACGTCGACGC GGGCCACCCG CGCTGGGTGG CCGCCGACGT GATGGCGGGC CGGCTCAACC AGGTCGGCAT CGACAGGGCA CGCGGCTTCA GCCTCAACAC CGCCAACTTC TTCACCACCG AGGAGTCGAT GGGCTACGGA GGCGCGATCT CCGGGATGAC GGGTGGCAAG CCGTTCGTCA TCGACACGTC GCGCAACGGC GCCGGACCCG TCGAGGGCGA CGACCTGTAC TGGTGCAACC CGAGCGGCCG CGCGCTCGGC GCCCGGCCCA CCACCAACAC CGGCAACCCG ATGGTGGACG CGTTCCTGTG GGTGAAGCGT CCCGGCGAGT CCGACGGCGC CTGCCGCGGC TTCCCGAGTG CGGGCACCTT CATGAGCCAG AACGCCATCG ACCTGGCCCG CAACGCAGGC TGGTAG
|
Protein sequence | MFGGTAKFPQ VTFSAAGVIA RWITPCLAVA AIAATGVVAE PGPIPAAPVV RLASEANPLV GLPFYVNPQS KGMRAAQGNP DPLLSAVVNT PTAYWMDHIS TPAVDAKYIA TAQAAGTMPV LALYGIPNRD CGSYAAGGFG SAGAYRAWID GVAAAIGPGR AAVILEPDAL AMIDCLSPGQ QQERLELIGY AVDTLSRNPG TAVYVDAGHP RWVAADVMAG RLNQVGIDRA RGFSLNTANF FTTEESMGYG GAISGMTGGK PFVIDTSRNG AGPVEGDDLY WCNPSGRALG ARPTTNTGNP MVDAFLWVKR PGESDGACRG FPSAGTFMSQ NAIDLARNAG W
|
| |