Gene Mvan_1686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_1686 
Symbol 
ID4645679 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp1790946 
End bp1792730 
Gene Length1785 bp 
Protein Length594 aa 
Translation table11 
GC content65% 
IMG OID639805180 
Productheparinase II/III family protein 
Protein accessionYP_952520 
Protein GI120402691 
COG category[S] Function unknown 
COG ID[COG5360] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTTG GACGCGTCTT TCGCACGACA CGGCATCTGA CGGCGGAGCA ATGGCTCTAT 
CGTTTCATCT GCCGGGGCAA GTTTGTGGCG ATGGAAAAGA TGCCCGGAGC GGCGGCGCGG
CGGTTCGGAA AGGCCGCCCA AGCAGTACCG CAGCCCGATG CAGCGGCACC GGAACTGGCC
GCGGTCAGCG CGCATGTGCG TCAGCTCCAG CGTGCGGTGC ATGGAACATG GGCCGACGAC
GTCCCCGAGG GGCGATTCAC CCTGCTCAAC CAGACGATCG ACTTCGGCGG GCTGGATCAG
GTTGAGTGGC GGCGGGAGCT CGGCGAGAAA AATAATCGCC TGTGGCGCAT GAACCTTTCA
TACATGGGCT ATCTGGTACC GCTGTTCGAG CATGATGCGC GCTCCGCGCT GCCCGTCGCC
CAGTCTCTCT TAGCCAGCAT GTCGGCACAG AATCCCTGGT CATCGAGAGG CATTTTCCGG
GACGTGTGGC ACCCCTATTC GGTGTCCCAT CGGGTCATCA ATCTGCTGGC CTGCTTGCGG
CTGCTTCACG AACAGGCTCC CGAACTGACC TCCGAGGCGG GCGCTTTGAC GGACGAGATC
CGACTCGGAG GCGCTTTCAT TCTGGGCAAT CTGGAACGCG ATCTCCAATA CAATCACCTG
CTCAAAAACT ATGTCTGCCT TGCGGCCATC GCCTCGGCCG CACCGGGCAA GGGCTTCGCG
CGCGCTGTGC TGGCCGGCAC TAGGGCATCC ATCGAGCAGC AGTTCCTGCC CGACGGCGGG
CAGGCGGAGC GCGCGCCGAT GTATCACATC CTTTCGCTCC TCGATCTGCG TATCCTTCGT
GACAGCGGCG CCCTCGCGCC TGACACGCAG CCGCTGGTCG AGAAAGCAGC TGTCGCAAGC
GAAATCGCGG TTGCGGCTAT GGTGCATCCT GACGGCGAGG TCGCCTTGTT CAACGACAGT
TGGCTGGGTG AAGGACCGCC GGCCGTCGAG ACCGTCCCCG GTCTGGCTTT ACGGCCGGAA
CGACCGTCAC GCCACGTACT GCCCGATGCC GGCTATGTCC GGCTCGCCTG TGGCGGCGAC
AGCGTAGTGA TGGACTTTGG CCCGTGCGGG CCGGACGACA ATCCCGGCCA CGCCCATGCG
GACTTTCTCT CGCTGGAGCT TTCGGTGTCG GGTCGGCGCC TGCTGGTCGA CACCGGTGTG
CCGACTTATT CGGAAGGCGA ACAGCGTGAC ATGTCGCGTT CTGCCGCTGC GCATAACGGC
CCGATCCGGA CCGGCCTCGA GCCGATTGAG TTCTGGGAAT CCTTCCGGGT CGGCCATCGC
GGCTATGCCC ATGCACTTCC GGTCGCAGAC GAGATGAACT TCGCTGCCTG GCACGACGGC
TACATCGGCC ACGGCACGGC GGTGGCGCGG GCGATCCGAC TGCTGCCAGG GCGCGGTCTG
CTGGTCTGCG ATGTGTGGGT GGGCGCGCCC GCCGGCACGG CCATGACTCA TTTCCTGCTG
CCCGGCGAAT GGCGAATCCA AGGCCATGTC GCGTACGTCG ACGATGTGGC CGCTCGGTTT
CAGGCGATAG GAGGTACGCT CGCGGCGTTC GAGCCGGCGG AGCACTGGCT GCGTTTCGGC
GAGCCCCGCA CGGCACACCG GACGACACTT GCGCCTGCAT CGGCGCGGGA CCTGCAAGCC
GCGAGTCTGT GGATCGGCTG GGGTGAGCCT GCTGAAGCGC CGCTCGACGA GGAAGAACGG
CTGCGTGAAG CCTTGGTCAG AACTTTTACC GAAACCCTCG AGTGA
 
Protein sequence
MDVGRVFRTT RHLTAEQWLY RFICRGKFVA MEKMPGAAAR RFGKAAQAVP QPDAAAPELA 
AVSAHVRQLQ RAVHGTWADD VPEGRFTLLN QTIDFGGLDQ VEWRRELGEK NNRLWRMNLS
YMGYLVPLFE HDARSALPVA QSLLASMSAQ NPWSSRGIFR DVWHPYSVSH RVINLLACLR
LLHEQAPELT SEAGALTDEI RLGGAFILGN LERDLQYNHL LKNYVCLAAI ASAAPGKGFA
RAVLAGTRAS IEQQFLPDGG QAERAPMYHI LSLLDLRILR DSGALAPDTQ PLVEKAAVAS
EIAVAAMVHP DGEVALFNDS WLGEGPPAVE TVPGLALRPE RPSRHVLPDA GYVRLACGGD
SVVMDFGPCG PDDNPGHAHA DFLSLELSVS GRRLLVDTGV PTYSEGEQRD MSRSAAAHNG
PIRTGLEPIE FWESFRVGHR GYAHALPVAD EMNFAAWHDG YIGHGTAVAR AIRLLPGRGL
LVCDVWVGAP AGTAMTHFLL PGEWRIQGHV AYVDDVAARF QAIGGTLAAF EPAEHWLRFG
EPRTAHRTTL APASARDLQA ASLWIGWGEP AEAPLDEEER LREALVRTFT ETLE