Gene Information |
Locus tag | Mvan_5030 |
Symbol | |
ID | 4644640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium vanbaalenii PYR-1 |
Kingdom | Bacteria |
Replicon accession | NC_008726 |
Strand | - |
Start bp | 5382544 |
End bp | 5385201 |
Gene Length | 2658 bp |
Protein Length | 885 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639808501 |
Product | glycoside hydrolase family protein |
Protein accession | YP_955808 |
Protein GI | 120405979 |
COG category | [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase; [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.277971 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.200228 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCTG CCCAGGTCGT AGGTCGTGTC GGCGCGCTGG CCGTCGCGTT GGGGGTCGGG GCGGCCGTGT TCACCGGGTC AGGTGTCGCG TGGGCAACCG AGGACGCCGC GGGCGAGACC ACCGCGGAAT CCGGGGCGAC GACCTCCGAG CCGGCGGACG ACGGGGACGC GGTTGAACCG GCCGACGACG GGGACACGGT CGAGCAGGTC GAGTCCGACG ACGATGACGA AGCGGTCGGG CAGGTCGAGC CCGAGGAGGA AGCAGAGGCC GACGAGGACG AGGCGGTCGT CGAGCCCGAC GAGGACGAAG TCGCCCCCGA CGTGGAGGAG GCTGCCGACG CGGACGGGGC GCCCTCGGAG GACGGCGACG GTGGATTCCA GGTGCCGGCC GCGCCACGTG AGGACGACGC TCCGCCCACC GAACCCGATG AGGACACCGG GAGTGCGGCC GCGGTCGCAA GCTCCGAAAC CGGAAGCTCC GAAGCCGACG GCGTCGAACC CGCGGTCGAC ACCGTCGCGG TGCAGCGCCT GACCGTCGAG CCGACCGCTG CCGTCTCGAC GACCGTCGCC TCACCGCCCG AGGTTCCGTC GTGGCGGCCG TGGCCCACCG CGTTCGACCT GCGCAGCGCG GTGACCTACG TGGTGGATCT CGCCACGAGC TTCGTCGACG CGCTGCTGAG CCCCTTCGCC GCGGGTCCGC CCAGGCCGCC GGCCGATCCG TCGGGCTGGG CGCTGCTGGC CTGGGTGCGG CGGGAATTCT TCAACGGCAC GCCGGCTCCG GTGGAGAATC CGCTGCCCCA CACCCAGAGC CTCACCGCCG ACGGCGGCGT CGTGATCACC GGCAACGTTG GCGTCGTGGA TCCCGACGGC GACGCGCTGA CCTACTCGGT GATCGGCCGC CCGCACAACG GTGGGACCGT CGCGGTCGAC GCCGACGGGA ACTTCGTCTA TCGACCGATG AACGCGATGG CCGCGGTCGG CGGGACGGAC ACGTTCACCG TCGCCGTCAG CGACGAACAT GACGGGCTAC ACGTCCACGG CCTGTTCGGC TGGCTGCAGT TCGTGCCGAT TCTCGGCAAC CTGCTCAACC CCGGCGGTGG CCACGGCCGC ACCGTCACGA TCGCCGTCAC CGTCACACCG GTCGACGGCA TCGACCTTTC GCTGCCCGAT GATTTCCGTT GGGGTGTAGC GCATTCCGGT TTCCAGGCGG AAGGCGGACC CGGGTCGCCG GTGGACACCC GGTCCGATTG GTACCGCTGG GTGCACGACC CGGTCAACCG GCTGCTCGGC CTGGTCAAGG GGGTGCCGGA GGACGGCCCC GGGGCCTACG TCTCCTACGA CGGCGACGCC GCGCTGGCGC GTGACGAGCT CGGCATGAAC ACCTTCCGGA TGGGTATCGA ATGGAGCCGG ATCTTCCCGG AGTCCACAGC GGCAGTGGAT ATCTCCGACG AAGGCGGGGC GCTGAGCCTG GCTGACCTCG AAGCGCTCGA CGAGCTGGCC GACCAGGATG AGGTGGCCCA CTACCGGGCC GTCTTCGCCG CGCTGCGCCG GCGCGGCCTC GACCCGTTCG TCACCGTCAA CCACTTCACC CTGCCGGTGT GGGTGCACGA CCCCATCGTC GCGCGTCCGC TGATCCAGTT GGGGCTGCCG GCCCCCGCGG CGGGCTGGCT GTCGTCGAAC ACCGCCGAGG AGTTCGAGAA GTACGCCGCC TACGTCGCAT GGAAGTACGG CGATCAGGTG GACAACTGGG CGACCCTCAA CGAGCCGTTC CCTCCGGTGC TCACGGAGTT CCTCGCGATC 
CCCTGGGTGG TCCCGAACTG GCCGCCCGGC GTCCTGCGGC CTGACCTGGC GTCGACGTTC GTGGTCAACC AGGCGATCGG CCACGTCGCC GCCTACGACG CGATCCATAC CTGGGACACC ACTGCGGCCG CCGCCGACGG ACCCGCGGCG TTCGTCGGCT TCACCCACAA CATGATCCCC GCGCGGCCCG CCAACCCGGT CAACCCGCTC GACGTGCAGG CCGCCGACGC GTGGAATCAC TTCTACAACA AGTGGTTCCC CAACGCGGTG ATCGACGGTT GGGTGGATGC GAACTTCGAC GGCGTCAAGA CCGCCGACGA GATCCACCCC GAGATGGCCG GAAAGGTCGA CTTCCTCGGG GTGCAGTACT ACGGGTCGCA ACCCATGGTC GGCTTCGGTG TCGCGCCGCT GCCCGGCTTC CCGTTTCTGC GCGGCTTCCC GATCCGGTGC TCGGGCGAGG AGCCGACCTG CAGCGACTTC AACCAGCCCA CCGATCCCGG CGGCTTCCGC GAGGTGCTCG AACTCGCGGG GTCGTACGGA AAACCGCTGT GGGTGACCGA GAACGGGATC GCCGATGCCG ACGACTCGAA ACGGCCCTCC TACATCGTCA ACCACGTCGC GGTGGTGCAG GACCTGGTGG CCCATGGTGC CGATATCCGC GGCTACACCT ACTGGTCGTT CGTCGACAAC CTGGAATGGT CAGAAGGCTA CGAGCTGCAG TTCGGACTGT ACGGCTCCGA TCCTGAAACC CCTGAGCTGG AACGCATTCC GAAGCCGGCG AGCATCGCCG CGCTGAGCGG GATCACCACC GCCAACGGCC TGCCGGTGTC GCTGCTGCAG ACCTACATCC CGAGCTAG
|
Protein sequence | MKSAQVVGRV GALAVALGVG AAVFTGSGVA WATEDAAGET TAESGATTSE PADDGDAVEP ADDGDTVEQV ESDDDDEAVG QVEPEEEAEA DEDEAVVEPD EDEVAPDVEE AADADGAPSE DGDGGFQVPA APREDDAPPT EPDEDTGSAA AVASSETGSS EADGVEPAVD TVAVQRLTVE PTAAVSTTVA SPPEVPSWRP WPTAFDLRSA VTYVVDLATS FVDALLSPFA AGPPRPPADP SGWALLAWVR REFFNGTPAP VENPLPHTQS LTADGGVVIT GNVGVVDPDG DALTYSVIGR PHNGGTVAVD ADGNFVYRPM NAMAAVGGTD TFTVAVSDEH DGLHVHGLFG WLQFVPILGN LLNPGGGHGR TVTIAVTVTP VDGIDLSLPD DFRWGVAHSG FQAEGGPGSP VDTRSDWYRW VHDPVNRLLG LVKGVPEDGP GAYVSYDGDA ALARDELGMN TFRMGIEWSR IFPESTAAVD ISDEGGALSL ADLEALDELA DQDEVAHYRA VFAALRRRGL DPFVTVNHFT LPVWVHDPIV ARPLIQLGLP APAAGWLSSN TAEEFEKYAA YVAWKYGDQV DNWATLNEPF PPVLTEFLAI PWVVPNWPPG VLRPDLASTF VVNQAIGHVA AYDAIHTWDT TAAAADGPAA FVGFTHNMIP ARPANPVNPL DVQAADAWNH FYNKWFPNAV IDGWVDANFD GVKTADEIHP EMAGKVDFLG VQYYGSQPMV GFGVAPLPGF PFLRGFPIRC SGEEPTCSDF NQPTDPGGFR EVLELAGSYG KPLWVTENGI ADADDSKRPS YIVNHVAVVQ DLVAHGADIR GYTYWSFVDN LEWSEGYELQ FGLYGSDPET PELERIPKPA SIAALSGITT ANGLPVSLLQ TYIPS
|
| |
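The coordinate and length fields in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes Start bp and End bp are 1-based inclusive coordinates and that the reported protein length excludes a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Mvan_5030
start_bp, end_bp = 5382544, 5385201
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # compare with the Gene Length field
print(protein_length(length_bp))  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length (2658 bp) and Protein Length (885 aa) fields.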