Gene Mvan_2784 details

Gene Information

Locus tag: Mvan_2784
Symbol:
ID: 4645313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 2950723
End bp: 2952453
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 69%
IMG OID: 639806265
Product: malto-oligosyltrehalose trehalohydrolase
Protein accession: YP_953597
Protein GI: 120403768
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID: [TIGR02402] malto-oligosyltrehalose trehalohydrolase
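
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end must not precede start")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 2950723, 2952453
GENE_LENGTH_BP, PROTEIN_LENGTH_AA = 1731, 576

# Both checks pass under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the record is internally consistent: the interval spans 1731 bp, which is 577 codons, or 576 residues plus the stop codon.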


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.496519
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.888452
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAGC ACGAGTTCGC GGTGTGGGCC CCGCGGCCTG AGCGGGTACG CCTGGACGTC 
GACGGGACAC TGCACCCGAT GACGCGCGGC GACGACGACT GGTGGCGCGC GACCGTCGAC
GCCCCCGCCG ACGCCCGCTA CGGGTTCGTG CTCGACGACG ACCCCAAGGT GCTTCCCGAT
CCCCGTTCGC CGCGGCAGCC CGACGGCGTG CACGAGCGCT CCCAGCTGTG GCGGCCGCAG
GCCGACCCCT GGACCGACAG CGGGTGGCAG GGCCGCTCCG TCGAAGGCGC GGTGGTCTAC
GAACTGCACG TGGGCACCTT CACCCCCGGC GGAACGTTCG ACATGGTGGT CGACAAGCTG
GACCACCTCG TCGGCCTCGG TGTCGATTTC GTCGAGGTGA TGCCCGTCAA CGCATTCGGC
GGCACCCACG GCTGGGGCTA CGACGGCGTG CTGTGGTACG CCGTACACGA ACCCTACGGC
GGTCCGGACG GGCTGATCCG GCTCGTCGAC GCCTGCCATG CCCGTGGCCT CGGCGTGCTC
ATCGACGCGG TGTTCAACCA TCTGGGACCG TCCGGCAACT ACCTGCCCAA GTTCGGGCCC
TACCTGTCCA GCGGCTCCAA CCCCTGGGGT GAGTCGATCA ACATCGGCGA CGCCGGCGCC
GACGAGGTGC GCCGCTACAT CCTCGACTGT GCGCTGCGAT GGATGCGGGA CTTCCACGCC
GACGGCCTGC GCCTGGACGC GGTGCACGCC CTCGTCGACA CCACCGCGAT CCACATCCTG
GAAGAACTCT CTGCCGAAAC CGACGCGCTG GCCGACGAAC TCGGACGCCC GCTGTCGCTG
ATCGCCGAGA GCGACATGAA CGACCCGAGG CTGATCACCC CGCGCGAGCA CGGCGGCCTC
GGCATGGCCG CCCAGTGGGA CGACGACATC CACCACGCGA TCCACACCGC GGTATCCGGT
GAACGGCAGG GCTACTACGG CGATTTCGGG ACACTGGAGA CATTGTCAGA GACGTTGCGA
CACGGGTATT TTCACGCCGG CAGCTATTCG TCCTTCCGCC GCCGCAGACA CGGCCGCCCG
CTCGACACCG CGGCGGTACC GGCTACCCGG CTGCTGGCCT ACACCCTGAC CCATGACCAG
GTGGGCAACC GGGCCGTCGG TGACCGGCCG TCACAGAACC TGACGACCGG CCAACTGGCC
GTCAAGGCGG CACTGGCCCT CGGATCGCCC TACACGGCAA TGCTCTTCAT GGGTGAGGAG
TGGGCGTCCT CGTCGCCGTT CCAGTTCTTC AGCTCCCATC CCGAACCCGA ACTGGCCAGG
GCCACCGCCG AAGGCCGCAA GCGGGAGTTC GCCGAGCACG GTTGGGATGC CGACGAGATC
CCCGATCCGC AGGATCCGGA GACGTTCGAG CGTTCGAAGC TGCAGTGGGA TGAGGTCGGC
GTCGGGGACC ATGCCCGACT GCTGGAGTTC TACCGGAGCC TCATCGCCCT GCGGCACACC
GAGCCGGACA TGGCCGATCC GTGGCTGGAC CACCTGAAGA TCGACTTCGA CGAGCATGCG
CGCTGGTTCG TGATGCACCG CGGCGCGCTG GCGATCGCCT GCAACCTCGG CGCCGATCCC
GTCGATGTCC CGGTCACCGG CGACGTGGTG CTGGCATGGG ACGAACCCAC GGTCGGCGTC
GACACCACCC GCGTGGGCGG GCATTCTGTC GCAATCCTGC GGGCTGCCTA G
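
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the GC content reported in the Gene Information section could be recomputed from the listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence listing, copied verbatim.
first_line = "ATGCCTGAGC ACGAGTTCGC GGTGTGGGCC CCGCGGCCTG AGCGGGTACG CCTGGACGTC"

print(len(clean_sequence(first_line)))   # number of bases on that line
print(f"{gc_fraction(first_line):.0%}")  # GC content of that line only
```

Passing the full listing instead of `first_line` would give the gene-wide value, which can then be compared with the GC content field.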
 
Protein sequence
MPEHEFAVWA PRPERVRLDV DGTLHPMTRG DDDWWRATVD APADARYGFV LDDDPKVLPD 
PRSPRQPDGV HERSQLWRPQ ADPWTDSGWQ GRSVEGAVVY ELHVGTFTPG GTFDMVVDKL
DHLVGLGVDF VEVMPVNAFG GTHGWGYDGV LWYAVHEPYG GPDGLIRLVD ACHARGLGVL
IDAVFNHLGP SGNYLPKFGP YLSSGSNPWG ESINIGDAGA DEVRRYILDC ALRWMRDFHA
DGLRLDAVHA LVDTTAIHIL EELSAETDAL ADELGRPLSL IAESDMNDPR LITPREHGGL
GMAAQWDDDI HHAIHTAVSG ERQGYYGDFG TLETLSETLR HGYFHAGSYS SFRRRRHGRP
LDTAAVPATR LLAYTLTHDQ VGNRAVGDRP SQNLTTGQLA VKAALALGSP YTAMLFMGEE
WASSSPFQFF SSHPEPELAR ATAEGRKREF AEHGWDADEI PDPQDPETFE RSKLQWDEVG
VGDHARLLEF YRSLIALRHT EPDMADPWLD HLKIDFDEHA RWFVMHRGAL AIACNLGADP
VDVPVTGDVV LAWDEPTVGV DTTRVGGHSV AILRAA
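
The protein sequence can be spot-checked against the gene sequence by translating codons. The sketch below builds the codon table from the amino-acid string NCBI publishes for translation table 11, which is assumed here to match the table named in the record. It ignores the alternative start codons of table 11, which does not matter for this gene because it begins with ATG. The function name `translate` is hypothetical.

```python
# Codon table built from the NCBI amino-acid string for translation table 11
# (codons ordered with bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence, copied verbatim.
# The final line starts in frame because the 28 preceding lines hold 60 bases each.
first_codons = "ATGCCTGAGC ACGAGTTCGC GGTGTGGGCC"
last_line = "GACACCACCC GCGTGGGCGG GCATTCTGTC GCAATCCTGC GGGCTGCCTA G"

print(translate(first_codons))  # compare with the start of the protein sequence
print(translate(last_line))     # compare with the end of the protein sequence
```

Both fragments should reproduce the corresponding ends of the protein listing once its spaces are removed.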