Gene Information |
Locus tag | Mvan_3505 |
Symbol | |
ID | 4649321 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 3723963 |
End bp | 3730880 |
Gene Length | 6918 bp |
Protein Length | 2305 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639806982 |
Product | chitinase, cellulase |
Protein accession | YP_954306 |
Protein GI | 120404477 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
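The coordinate and length fields in the record above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions agree with the values in this record). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(3723963, 3730880)
print(length, protein_length_aa(length))  # 6918 2305
```

The strand field does not affect this calculation: for a minus-strand gene the same interval is read in the reverse-complement direction.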
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0820772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGGTTACG GACGATTTGT TGGTCGTGTC GGTGCATTAG CTGTGGCGCT GGGCATCGGG CTCGCGGTCC CCGCCGTGGC GGCCGCCGAC CCGACCGACG ACAGCACCGC AGGCGAGTCG AGCGCGAAGG CTGACCAGTC CGACGCCGCC GGCGTCGGCA GTCAAGACCC GGGGGATACC TCGTCGGAGA CCGACGACGT GGGCTCGGAG GATGCAGACT CCGACGATCC CGACGCCGAG GAATCTGACG ACCTGGCGGA GCACGGTTCG CCCGAGGGAG ACGACGAGGA CACCGCGACT CCGCCGGGAG AAGTAGCGGC CGAGCCGCCT GCAGACGATG CGACGCCGGC TGCCGAACCG AACGATGAGT CGCAGCAGGC CGATCGCCAC CAGGCGCACT CGCCCGTCGA CGAGGAAACC GACGGGGACG CCGACCCGGA AGCCGACGGT CTCGACACCG GCCCCGAGCC CACGGACGCG CCGGCCGGCC ACCGCGGTGC CGACGAGGCC GTCACGCCGG ATGCTGCCCC ACCGGCAGTC GAGCCCGCCA CCCCGACGAT GGCCGTGTCG GTCTCGTCTG CAGCAGCCGA GACCGACACC GCGACCGCGT CCACCGGGCT GGCGAGCATC GTCTCGAAGT TCCTGGGTGT GCTGGGAATC GGTCCGTCCG CGGCCAACGG CGCCGTGCCG TTGCCGGCGT TCGAGTTCGT CACCGCGGTG TTCGGCGCGA TCCGCCGCGA GATGGACCGA CTCTTCGCCA ACCACGGGCC GACCGCCGCG CTCGCGTGGT CCAGCCAGCC GGCGCCCGGC CTGATCACCG GAACGCTGAC CGCCTCCGAT CCGGAAGGTG ACCGACTGGT CTACACGGTG GTGCAGGCGC CGACCAAGGG CACCGTGACC GTCGACGCCT CAGGCAACTT CACCTACACG GTCAACCCGC TGCTGGCTTC GGCCGCCGGC ACCGACAGCT TCGTCATTCA GGTTCGCGAC GCCGGATTCC ACCTGATCTC GGTGAGTCCC ACCAAGACTC GGGTCCCCGT GGCGATCACC GTGGACTCCG GCGGCGTGGT CACCAAGACG ACGTCTGCGC GCAGCGGCGC CACCGCACTC GCGGCCACCA GCGCCGCGAC GGGCACGATC CACCACGTCG TCACATCCGG GCCCGACATC GTCGGATTCG ATCCTGCCAG AGACAAATTG GATCTCGGCG ATGTGTCGGT GCACAACTTC ATCGTCGTCG ACACGGCCGA GGGGGTCGGC TTCCGCAACC CCTGGTCCGG CGAGACCGCT GTCGTCAAGG GTGTGTCTCT GAGCCAGCTC ACCGTCGACA GCTTCACTCC GGTCATCAAC GATCATCTCC GCCAGGACCT TTCCGGCGCG CTCGCCTGGG AGAGTGGTGT CACCCCGCAG CCCGACACCG TCTATGCACG GTCCCACGAA GTCGGACAGA TCGACAGGGT GGCCTTCGAT GCGGCCACCG ATGTGGTCGA CTTCCGCTAT TTCGGCACCC GCGAACAGGT GTACATGTCC GACACCCCCG AGGGCGTGGT GATCTCCAAC GCCGGCACCG GTCAGGCCCT GATCCTGCAA GGTGTCACCA AGAGCCAGCT CACCGCGAGC AATTTCATCT TCCACAATGC CCAGGTGCGC GAAGACCGGC TCAACCAGCA GCTGGGTATC GGAGCGGTGT CGGACTCGCA GATCCTGCCC CAGGGTGTTC CGGTGGCGGG TACCGACAGC TGGCCCACCG CCGCGGGTGA CGGCCAGCCG CCCGCCGGCG AAACCGGCAC CACCACAACG 
ATTTCCTGGC AGTACGGACG TCACGCCACG CTCGACTTCG ACCCGTCCAC GGATAAGCTC GACTTCGGCT GGTTCAAGGC CCACGAGTTC GACGTCACCG AGGTCGCCGG CTCCACCCGC ATCGCGATCA CCGGAAACAA CCAGTCGTAC ACCCTCACCG GTGTGGCCAT CGGCGAGCTG CAGACGAGCA ACATCGTCGC GCTCGACGAC GGTGCGCGGA CCAAGTGGAG CAACCTCATC TACAGTGCGG GGCAGTCTGT TTCGCAGCCG AGCCTGTCGG TCAGCGACGG CAGCGTGTCC GAGGGGAACT CCGGGACGTC GACGGTGAAC TTCACCGTCA CGCTGTCGAA GGCGTCCACC GAGACGGTCA CGGTCAGCTA CAGCACGAGC AACGGCACCG CCACCGCGGC AGGCGGCGAC TATGCCCCGG CGGTCGGCAC GCTCACGTTC GCGCCCGGAC AGACGACCAA GGCCGTCACG GTCACGGTCA ACGGCGACAC CCTCGTCGAA CTCGACGAGC AGTTCACGCT GACGCTGTCC TCCCCGCTGA ACGCGACCAT CGCCGACGGC AGCGGAGTCG GCACCATCCG CAACGACGAC ATCGACCAGG CGCCTGCCAC GCCGCCGACC GTTTCCATCG CCGACCTGTC CGTCACCGAA GGCAACGGCG ATCACAGCCA TTTCATGTTC GTCGCGACCC TGAACAAGGC GTCGACGGAG ACGGTGACCG TCAGCTACGC CACCTCCAAC GGCACCGCGA TCGCCGGCCT CGACTACAGC GCGACCTCGG GCACCATCAC CTTTGCACCC GGCGTGACCT CGCAACTGGT ACACGTCGAC GTCGTGGGGG ATGCGCTCGC CGAGACGAGC GAGACGTTCC TCGTCACGCT GTCGAGCCCG ACAGCCGCCA CCATCGGTGA CGGTTCGGCG ACCGGGACGA TCCTCGACGA CGACACCGTG GTGCCCGGCA CCGGCGGCGT CAACTCCGGT AACCCCGGCG ACGCCCTCTG GGGCGAGGCG TATTTCGCGC CCTACGTCGA CATGGGTGCG TGGCCGGTGC CGGATCTGCT GGCCATCGCC CGGAATTACG GCACGTCGCT GATCACCCTC GGTTTCCTGC AGGCGACGCC GGACGGCAAG CTGGCGTGGG CCGGGCTGTC GGCGCTGACC CCGGACTCCG ACTTCGACCA GGCCAAGGCG ATCAACCAGT CGATCGCGGC GCTGCAGGCC GCAGGCGGCG ACGTGATGAT CTCCCTGGGC GGGGCGTCGG GAACCAGCCT GGCGCAGTGG TACGTCACCC GTGGGCTCAG CGCGCAGGCA CTGGCAACCG CATACGCCGG CATCGTCGAC ACCTACCACC TCAACCGCAT CGACTTCGAC ATCGAAGGTG CCGCGGTGGC CGACCAGGCG TCGATCGCGC TGAACGCCCA GGCGCTGAAA CTGCTGCAAC AGCAGAAGCC CGACCTGGAG ATCTGGTACA CCCTGCCGGT GCTGCCCACC GGCTTGACGG CCGACGGCCT CAACGTCGTG CGCGCCGCGC TGACGACGGG TGTCAAGCTC GACGGCGTCA ACGTGATGGC GATGGACTAC GGCGAGTCGG CTGCGCCCAC CAGCGGGCCG AACGCAAAGA CCATGGGCGC CTACGCCATC CAGGCCGCGG AGTCGACCCA CGCGCAGCTC TCCGCGCTCT ACACCCAGCA TGGGCAGAGC TTCGGCTGGA ACCAGCTCGG CGTCACCCCG ATGATCGGCG TCAACGACGT TCTGACCGAG GTGTTCACCG TCGCCGATGC GCAGGCACTC GAAGACTTCG 
CCCGCGCAAA GGATCTCGGC ATGCTGTCGA TGTGGTCGGT GAACCGCGAC AAACCGGGCA ATCTCGGCCA GGCCACCACC AACACCTCCG GGACCAACGC CCCCGAGGGC AGCTTCAGCA ACGTCTTCAA CGACTACGGA ACTGTCAATC CGGTGTCCGG ACCGCCGCCG GTGATCTCCA TCGCCGACCT CGCGGTGGCC GAAGGCAACG GCGATCACGC CCATTTCATG TTCGTCGTGA CCCTGGACAG GGCGTCCACC GAGGCGGTCA CGGTGCACTA CACCACAGCC AACGGCACCG CGACCGCCGG CGTGGACTAC ACCGCAGCCT CGGGCGTCAT CGAGTTCGCG CCGGGCGTCA CGTCGCGCAC CGTGCACGTC GACATCCTCG GCGACACCCT TGCCGAATCG AGCGAGACGC TCACCGTGAC GCTGTCGAGT CCGACCGGTG CGACCATCGC CGACGGCACC GCCACCGGCA CCATCACCGA CGACGACGGT GGTACCACCC CGACCCCGGT TGACTCGTCG GTGAAGTACG TCGTCAACGA CAACTGGGGA TCCGGCTTCG TTGCGACGGT CATCGTGACC GCGGGGACGT CCGGATTCTC AGGCTGGACA GTGGAATTCG ATTCGCCGGC ACAGATCAGC AACATCTGGA ACGCGGAGAT CGTCAGCCAG GTGGGCAACC ACTACGTGGT GCGCAACGTC TCGTGGAACC CCAAGGTCGT CGCCGGGCAG ACCGTCGAGT TCGGCTTCCA GGCGTCCCCT GGCGGCGCCA CGGCGACGGC CACCGGCTTC GTCGTCAACG GCGTGCCCGC CGGTGGTCAG GGTCCGGCCC CGGTGGTGTT GCCGAAGGTC GCGATCGCCG ACGCCAGTGT CACCGAATCC GACAGCGGCA CAAAGAACAT GGTGTTCGTG GTGACGTTGG ACAAGGCGCC GGCGACGCCG GTGAGCATCG CCTATGCGAC GTCGAACGGC ACCGCGACCG CCGGAAGTGA CTTCACCGCG ACCTCGGGTG TTCTGACCTT CGCCGCCGGT GCGACCACCG CGCAGATCAC GGTCCCGATC CTGGGCGACA CGGTCGTCGA ACAGAACGAG ACGTTCACGC TGACCCTTTC GAATCCGAAC GGGGTGACCA TCGCCGACGC TTCCGCGGTC GGCACCGTCA CCAACGACGA CGTGGCCACC CCCACGCCGG GCAATTCGTC GGCCACGTTG GCGGTCAACG ACAATTGGGG CTCCGGCTTC ACCGCTACCG TCACCGTGAC AGCCGGGTCT GCCGGACTCA ACGGCTGGAC AGTCGAATTC GACACCCCTG CGCAGATCAG CAACATCTGG AACGCCGAGA TCGTGAGCAG GGTCGGCAAC CACTACGTGG TGCGCAACGC CTCCTGGAAC CCCAAGGTCG CGGCGAACCA GACCGTCAGC TTCGGCTTTC AGGCATCTCC AGGAGGTGCT TCGGCGACGG CCACCAACTT CGTCGTCAAC GGACAGGCGT CGGCCCCTGT GCAGCCGACG GTGTCGGTCG CCGATGCCGC AGTCAACGAA GCACACAGCG GCGCAACGCA GATGACGTTC GTGGTGACAC TGTCGAAGGC GTCGACCACG CCGGTAACCG TCACCTACGC GACCGGCAAC GGTACGGCCA CCGCCGGCGT GGACTACACC GCGAAGTCGG GGACGGTGAC CTTCGCCCCG GGAGTGCTCT CGCAGCAGAT CCAGGTCTCC ATCACCGGTG ACACCGCGGT CGAGGCGAAC GAGACCTTCA CGCTGACACT CTCGAACCCC AGCGGCGCGA CCGTGTCCGA 
CGGCTCCGCC ACCGGGACAA TCACCAACGA CGACGTCGCG GTGCCCGGCA ACGTGTCGCT GAGCATCTCG GATGCCTCGG TGACCGAAGG CGTGCCCGGC ACCGGGGTCG CGGCCGGCTG GTTCAAGACC GCCGGCAACC AGATCGTCGA TTCGGCGGGC AACCCGGTAC AGATCGCCGG CGTCAACTGG TTCGGCATGG AGAGTGACAT ATTCACGCCG CACGGCCTGT GGACGCGCAA CTACAAAGAC ATGATGAACC AGATGGCGGC GCTGGACTTC AACACCATCC GGCTCGCGTA CTCCAGCGAG AGCCTGCACA CCACCAAGGC GCCGTCCGGC ATCGACTTCT CGAAGAATCC CGACCTGGTC GGGCTCAGCT CGCTGCAGAT CATGGACAAG ATCGTCGCGT ACGCCGGCGA AATCGGGATG CGTGTCATCC TGGACCACCA TCGCAGCAGT GCAGGAGCGG GCCCGAACGG CAACGGCCTC TGGTACGAGG GCTCCTACAC CGAGGCGGCG TGGATCGCCG ACTGGAAGAT GCTGGCGCAG CGGTATGCCA ACGATCCGAC GGTGATCGGT GCCGACCTGC ACAACGAACC GCACAATGGA ACCTGGGGCG GTGGGGGAGC CACCGACTGG GCCGCCGCTG CCGAGCGGGC GGGCAACGCC GTGCTCTCGG TGAACTCGAA CTGGCTGATC TTCGTGGAGG GCGTGGAAAC CTATCAGGGC AACAACTACT GGTGGGGTGG CAACCTGATG GGTGTCAAGG ACCGCCCGAT CGTGTTGAAC GTGCCCGACC GGGTGGTGTA CTCACCGCAC GACTACCCGA ACTCGGTGTA CAACCAGCCG TGGTTCCAGA CCGCCAACTT CGGTGCGGCG CTGCCGGACA AGTTCGAGCA GATGTGGGGC TACATCTACG AGCAGAACAT CGCACCGATC TACCTCGGCG AGTTCGGCAC CAGGATGACC GACCCGAAGG ACCTCGTCTG GTACGAGGCG ATCACGTCGT ATCTGTCCGG CGACTTCGAC AACAACGGCA CCATCGACAT CGCGGCCGGC ACCGAGGACA TGTCGTGGAC GTTCTGGTCG TGGAACCCGA ATTCCACTGA CACCGGCGGG ATCCTGGCCG ACGACTGGAA CACGGTCAAC ACGAACAAGA TGGCCTACCT GCAGGCCATC CAGTTCGACT TCGACGAAGG CAGCCCCGGT GTGCTGGCGC AGTTCGTGGT GTCGCTGGCC GCCCCGTCGA CCCAGGCCGT GACGGTCCAG TACGCGACGT CGAACGGCAC CGCCACCGGG GGCAGCGACT TCGCTGCCAC CTCGGGCACG CTGACATTCC AGCCCGGGGA GACCAGCAAG ACGATCACCG TGGTGGTGTT CGGGGACACC CTGATGGAAG GCAACGAGAG CTTCGTCGTC ACACTGTCCA GTCCGGCCGG GGCGACGATC GCCGACCCGA CCGGTGCCGG CACCATCGTC GACCGGGTGA TCGTCTAG
|
Protein sequence | MGYGRFVGRV GALAVALGIG LAVPAVAAAD PTDDSTAGES SAKADQSDAA GVGSQDPGDT SSETDDVGSE DADSDDPDAE ESDDLAEHGS PEGDDEDTAT PPGEVAAEPP ADDATPAAEP NDESQQADRH QAHSPVDEET DGDADPEADG LDTGPEPTDA PAGHRGADEA VTPDAAPPAV EPATPTMAVS VSSAAAETDT ATASTGLASI VSKFLGVLGI GPSAANGAVP LPAFEFVTAV FGAIRREMDR LFANHGPTAA LAWSSQPAPG LITGTLTASD PEGDRLVYTV VQAPTKGTVT VDASGNFTYT VNPLLASAAG TDSFVIQVRD AGFHLISVSP TKTRVPVAIT VDSGGVVTKT TSARSGATAL AATSAATGTI HHVVTSGPDI VGFDPARDKL DLGDVSVHNF IVVDTAEGVG FRNPWSGETA VVKGVSLSQL TVDSFTPVIN DHLRQDLSGA LAWESGVTPQ PDTVYARSHE VGQIDRVAFD AATDVVDFRY FGTREQVYMS DTPEGVVISN AGTGQALILQ GVTKSQLTAS NFIFHNAQVR EDRLNQQLGI GAVSDSQILP QGVPVAGTDS WPTAAGDGQP PAGETGTTTT ISWQYGRHAT LDFDPSTDKL DFGWFKAHEF DVTEVAGSTR IAITGNNQSY TLTGVAIGEL QTSNIVALDD GARTKWSNLI YSAGQSVSQP SLSVSDGSVS EGNSGTSTVN FTVTLSKAST ETVTVSYSTS NGTATAAGGD YAPAVGTLTF APGQTTKAVT VTVNGDTLVE LDEQFTLTLS SPLNATIADG SGVGTIRNDD IDQAPATPPT VSIADLSVTE GNGDHSHFMF VATLNKASTE TVTVSYATSN GTAIAGLDYS ATSGTITFAP GVTSQLVHVD VVGDALAETS ETFLVTLSSP TAATIGDGSA TGTILDDDTV VPGTGGVNSG NPGDALWGEA YFAPYVDMGA WPVPDLLAIA RNYGTSLITL GFLQATPDGK LAWAGLSALT PDSDFDQAKA INQSIAALQA AGGDVMISLG GASGTSLAQW YVTRGLSAQA LATAYAGIVD TYHLNRIDFD IEGAAVADQA SIALNAQALK LLQQQKPDLE IWYTLPVLPT GLTADGLNVV RAALTTGVKL DGVNVMAMDY GESAAPTSGP NAKTMGAYAI QAAESTHAQL SALYTQHGQS FGWNQLGVTP MIGVNDVLTE VFTVADAQAL EDFARAKDLG MLSMWSVNRD KPGNLGQATT NTSGTNAPEG SFSNVFNDYG TVNPVSGPPP VISIADLAVA EGNGDHAHFM FVVTLDRAST EAVTVHYTTA NGTATAGVDY TAASGVIEFA PGVTSRTVHV DILGDTLAES SETLTVTLSS PTGATIADGT ATGTITDDDG GTTPTPVDSS VKYVVNDNWG SGFVATVIVT AGTSGFSGWT VEFDSPAQIS NIWNAEIVSQ VGNHYVVRNV SWNPKVVAGQ TVEFGFQASP GGATATATGF VVNGVPAGGQ GPAPVVLPKV AIADASVTES DSGTKNMVFV VTLDKAPATP VSIAYATSNG TATAGSDFTA TSGVLTFAAG ATTAQITVPI LGDTVVEQNE TFTLTLSNPN GVTIADASAV GTVTNDDVAT PTPGNSSATL AVNDNWGSGF TATVTVTAGS AGLNGWTVEF DTPAQISNIW NAEIVSRVGN HYVVRNASWN PKVAANQTVS FGFQASPGGA SATATNFVVN GQASAPVQPT VSVADAAVNE AHSGATQMTF VVTLSKASTT PVTVTYATGN GTATAGVDYT AKSGTVTFAP GVLSQQIQVS ITGDTAVEAN ETFTLTLSNP 
SGATVSDGSA TGTITNDDVA VPGNVSLSIS DASVTEGVPG TGVAAGWFKT AGNQIVDSAG NPVQIAGVNW FGMESDIFTP HGLWTRNYKD MMNQMAALDF NTIRLAYSSE SLHTTKAPSG IDFSKNPDLV GLSSLQIMDK IVAYAGEIGM RVILDHHRSS AGAGPNGNGL WYEGSYTEAA WIADWKMLAQ RYANDPTVIG ADLHNEPHNG TWGGGGATDW AAAAERAGNA VLSVNSNWLI FVEGVETYQG NNYWWGGNLM GVKDRPIVLN VPDRVVYSPH DYPNSVYNQP WFQTANFGAA LPDKFEQMWG YIYEQNIAPI YLGEFGTRMT DPKDLVWYEA ITSYLSGDFD NNGTIDIAAG TEDMSWTFWS WNPNSTDTGG ILADDWNTVN TNKMAYLQAI QFDFDEGSPG VLAQFVVSLA APSTQAVTVQ YATSNGTATG GSDFAATSGT LTFQPGETSK TITVVVFGDT LMEGNESFVV TLSSPAGATI ADPTGAGTIV DRVIV
|
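The gene sequence above is printed in space-separated blocks of ten bases and already reads in the coding direction (it starts with ATG and ends with TAG). The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the reading frame. It assumes the amino acid assignments of NCBI translation table 11, which are the same as the standard code for internal codons; the special handling of alternative start codons is not modelled here. All function names are hypothetical.

```python
# Codon table in NCBI order: first, second, and third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
excerpt = clean_sequence("ATGGGTTACG GACGATTTGT TGGTCGTGTC")
print(translate(excerpt))  # MGYGRFVGRV
```

The translated excerpt matches the first ten residues of the protein sequence listed in the record. Applying the same functions to the full gene sequence would be the natural way to cross-check the reported protein length and GC content.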