Gene Mvan_5030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5030 
Symbol 
ID4644640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5382544 
End bp5385201 
Gene Length2658 bp 
Protein Length885 aa 
Translation table11 
GC content69% 
IMG OID639808501 
Productglycoside hydrolase family protein 
Protein accessionYP_955808 
Protein GI120405979 
COG category[R] General function prediction only
[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.277971 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.200228 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTCTG CCCAGGTCGT AGGTCGTGTC GGCGCGCTGG CCGTCGCGTT GGGGGTCGGG 
GCGGCCGTGT TCACCGGGTC AGGTGTCGCG TGGGCAACCG AGGACGCCGC GGGCGAGACC
ACCGCGGAAT CCGGGGCGAC GACCTCCGAG CCGGCGGACG ACGGGGACGC GGTTGAACCG
GCCGACGACG GGGACACGGT CGAGCAGGTC GAGTCCGACG ACGATGACGA AGCGGTCGGG
CAGGTCGAGC CCGAGGAGGA AGCAGAGGCC GACGAGGACG AGGCGGTCGT CGAGCCCGAC
GAGGACGAAG TCGCCCCCGA CGTGGAGGAG GCTGCCGACG CGGACGGGGC GCCCTCGGAG
GACGGCGACG GTGGATTCCA GGTGCCGGCC GCGCCACGTG AGGACGACGC TCCGCCCACC
GAACCCGATG AGGACACCGG GAGTGCGGCC GCGGTCGCAA GCTCCGAAAC CGGAAGCTCC
GAAGCCGACG GCGTCGAACC CGCGGTCGAC ACCGTCGCGG TGCAGCGCCT GACCGTCGAG
CCGACCGCTG CCGTCTCGAC GACCGTCGCC TCACCGCCCG AGGTTCCGTC GTGGCGGCCG
TGGCCCACCG CGTTCGACCT GCGCAGCGCG GTGACCTACG TGGTGGATCT CGCCACGAGC
TTCGTCGACG CGCTGCTGAG CCCCTTCGCC GCGGGTCCGC CCAGGCCGCC GGCCGATCCG
TCGGGCTGGG CGCTGCTGGC CTGGGTGCGG CGGGAATTCT TCAACGGCAC GCCGGCTCCG
GTGGAGAATC CGCTGCCCCA CACCCAGAGC CTCACCGCCG ACGGCGGCGT CGTGATCACC
GGCAACGTTG GCGTCGTGGA TCCCGACGGC GACGCGCTGA CCTACTCGGT GATCGGCCGC
CCGCACAACG GTGGGACCGT CGCGGTCGAC GCCGACGGGA ACTTCGTCTA TCGACCGATG
AACGCGATGG CCGCGGTCGG CGGGACGGAC ACGTTCACCG TCGCCGTCAG CGACGAACAT
GACGGGCTAC ACGTCCACGG CCTGTTCGGC TGGCTGCAGT TCGTGCCGAT TCTCGGCAAC
CTGCTCAACC CCGGCGGTGG CCACGGCCGC ACCGTCACGA TCGCCGTCAC CGTCACACCG
GTCGACGGCA TCGACCTTTC GCTGCCCGAT GATTTCCGTT GGGGTGTAGC GCATTCCGGT
TTCCAGGCGG AAGGCGGACC CGGGTCGCCG GTGGACACCC GGTCCGATTG GTACCGCTGG
GTGCACGACC CGGTCAACCG GCTGCTCGGC CTGGTCAAGG GGGTGCCGGA GGACGGCCCC
GGGGCCTACG TCTCCTACGA CGGCGACGCC GCGCTGGCGC GTGACGAGCT CGGCATGAAC
ACCTTCCGGA TGGGTATCGA ATGGAGCCGG ATCTTCCCGG AGTCCACAGC GGCAGTGGAT
ATCTCCGACG AAGGCGGGGC GCTGAGCCTG GCTGACCTCG AAGCGCTCGA CGAGCTGGCC
GACCAGGATG AGGTGGCCCA CTACCGGGCC GTCTTCGCCG CGCTGCGCCG GCGCGGCCTC
GACCCGTTCG TCACCGTCAA CCACTTCACC CTGCCGGTGT GGGTGCACGA CCCCATCGTC
GCGCGTCCGC TGATCCAGTT GGGGCTGCCG GCCCCCGCGG CGGGCTGGCT GTCGTCGAAC
ACCGCCGAGG AGTTCGAGAA GTACGCCGCC TACGTCGCAT GGAAGTACGG CGATCAGGTG
GACAACTGGG CGACCCTCAA CGAGCCGTTC CCTCCGGTGC TCACGGAGTT CCTCGCGATC
CCCTGGGTGG TCCCGAACTG GCCGCCCGGC GTCCTGCGGC CTGACCTGGC GTCGACGTTC
GTGGTCAACC AGGCGATCGG CCACGTCGCC GCCTACGACG CGATCCATAC CTGGGACACC
ACTGCGGCCG CCGCCGACGG ACCCGCGGCG TTCGTCGGCT TCACCCACAA CATGATCCCC
GCGCGGCCCG CCAACCCGGT CAACCCGCTC GACGTGCAGG CCGCCGACGC GTGGAATCAC
TTCTACAACA AGTGGTTCCC CAACGCGGTG ATCGACGGTT GGGTGGATGC GAACTTCGAC
GGCGTCAAGA CCGCCGACGA GATCCACCCC GAGATGGCCG GAAAGGTCGA CTTCCTCGGG
GTGCAGTACT ACGGGTCGCA ACCCATGGTC GGCTTCGGTG TCGCGCCGCT GCCCGGCTTC
CCGTTTCTGC GCGGCTTCCC GATCCGGTGC TCGGGCGAGG AGCCGACCTG CAGCGACTTC
AACCAGCCCA CCGATCCCGG CGGCTTCCGC GAGGTGCTCG AACTCGCGGG GTCGTACGGA
AAACCGCTGT GGGTGACCGA GAACGGGATC GCCGATGCCG ACGACTCGAA ACGGCCCTCC
TACATCGTCA ACCACGTCGC GGTGGTGCAG GACCTGGTGG CCCATGGTGC CGATATCCGC
GGCTACACCT ACTGGTCGTT CGTCGACAAC CTGGAATGGT CAGAAGGCTA CGAGCTGCAG
TTCGGACTGT ACGGCTCCGA TCCTGAAACC CCTGAGCTGG AACGCATTCC GAAGCCGGCG
AGCATCGCCG CGCTGAGCGG GATCACCACC GCCAACGGCC TGCCGGTGTC GCTGCTGCAG
ACCTACATCC CGAGCTAG
 
Protein sequence
MKSAQVVGRV GALAVALGVG AAVFTGSGVA WATEDAAGET TAESGATTSE PADDGDAVEP 
ADDGDTVEQV ESDDDDEAVG QVEPEEEAEA DEDEAVVEPD EDEVAPDVEE AADADGAPSE
DGDGGFQVPA APREDDAPPT EPDEDTGSAA AVASSETGSS EADGVEPAVD TVAVQRLTVE
PTAAVSTTVA SPPEVPSWRP WPTAFDLRSA VTYVVDLATS FVDALLSPFA AGPPRPPADP
SGWALLAWVR REFFNGTPAP VENPLPHTQS LTADGGVVIT GNVGVVDPDG DALTYSVIGR
PHNGGTVAVD ADGNFVYRPM NAMAAVGGTD TFTVAVSDEH DGLHVHGLFG WLQFVPILGN
LLNPGGGHGR TVTIAVTVTP VDGIDLSLPD DFRWGVAHSG FQAEGGPGSP VDTRSDWYRW
VHDPVNRLLG LVKGVPEDGP GAYVSYDGDA ALARDELGMN TFRMGIEWSR IFPESTAAVD
ISDEGGALSL ADLEALDELA DQDEVAHYRA VFAALRRRGL DPFVTVNHFT LPVWVHDPIV
ARPLIQLGLP APAAGWLSSN TAEEFEKYAA YVAWKYGDQV DNWATLNEPF PPVLTEFLAI
PWVVPNWPPG VLRPDLASTF VVNQAIGHVA AYDAIHTWDT TAAAADGPAA FVGFTHNMIP
ARPANPVNPL DVQAADAWNH FYNKWFPNAV IDGWVDANFD GVKTADEIHP EMAGKVDFLG
VQYYGSQPMV GFGVAPLPGF PFLRGFPIRC SGEEPTCSDF NQPTDPGGFR EVLELAGSYG
KPLWVTENGI ADADDSKRPS YIVNHVAVVQ DLVAHGADIR GYTYWSFVDN LEWSEGYELQ
FGLYGSDPET PELERIPKPA SIAALSGITT ANGLPVSLLQ TYIPS