Gene Mvan_1685 details

Gene Information

Locus tag: Mvan_1685
Symbol:
ID: 4645678
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 1788770
End bp: 1790944
Gene Length: 2175 bp
Protein Length: 724 aa
Translation table: 11
GC content: 62%
IMG OID: 639805179
Product: alcohol dehydrogenase
Protein accession: YP_952519
Protein GI: 120402690
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
        [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
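
The coordinate and length fields in this record can be cross-checked with a short calculation. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1788770
end_bp = 1790944

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 2175 0 724
```

Under these assumptions the result agrees with the listed gene length (2175 bp) and protein length (724 aa).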


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAAG TTGTTCAGAA CGCGCGTGGC GGGAAGCTGT CCCTGAAATC GGTCCCGGCA 
CCGGGAATTG CCCGCAACAG CGTGCTCATC AAGACGCATT GCTCGCTGAT CTCGGCGGGT
ACAGAACGTC AGATGGTCGG CTTTGCCCAA TCGAGCTTGG TCGCCAAGGC ACGGGCACGC
CCCGACCTTG TTCGTAAGGT TGTCGACAAG GTCCGCCGCG ACGGGCCTAT GGCGACCCTC
AAGTCGGTGA CGGCGCGGCT CGACGAAGCA CTGCCGCTGG GGTACTCGAT CGCCGGTGAA
GTCGTCGAGG TAGGCGCAGG GCTGGAGGGT CTCTACCAAA TCGGACAGCT GGTCGCAGGT
GCGGGCATTG GTTTGGCCAA TCATTCGGAA TACAACCTTG TGCCGGCGAA TCTGGTCGTT
CCCGTTCCTG CGGATGTGAC GCCTGAGGAA GCCTGTTTTT CGACTCTCGG TTCCATCGCA
CTGCATTCCG TCCGCTTGAT CGAGCCGCAG CTCGGCGACG TGGTCGCGGT TCTGGGCGCA
GGTCTGGTCG GACAACTTTG CGCGCAGTTC GCCCGCCTCG CCGGCGCGCG CGTCATCGTG
CTCGATTTCA ATACCGAACG CCTCGCACTC GCCAAATCGC TCGGTGCGGA GGCAACCATC
CAGCTCGGCG TGGGTAACCC GGCCGAAGCG GTGCTCGACC TGACACGCGG CATCGGCTCC
GACAGCGTCC TGATTGCGGC TGCCACACCG ACGAGTGAGC CGTTCGAAAC AGCAGCAGAA
ATAGCGCGTG ACCGAGCGAC TGTCTGCCTT GTGGGTATTA CGGGCACGGA ATTTCCATAT
CGACCCTACA TGCAGAAAGA ACTCAGAGTG CTGGTCTCAC GCTCGTATGG CCCGGGGCGT
TACGACCGCG ATTACGAGAA CAAACACCAG AAATATCCGC TGGGCTACGT CCGCTGGACT
GAACACGAGA ATCTTCGTGA GGTAGCGCGG CTCATGTCGC CGTCGCTGCC GAATCGGCTC
AACGTGCAGC CGCTGATCTC GCATCGCTTT GCCTTCGATG ATGCGGAAAA CGCCTACGGC
TTGGTGATGG AGGGCAAGGA GCCGCACTTG GGCGTGGTGC TGGAATACGG TTCGAGCGCG
CCCGCCGACA ACCGGCGCGA TTTGGCCCTG AAAGGCAAGG GCAGGGCATC CACTGGCACG
GCCATCGGGG CCGTCGGAGC CGGTAACTTC GGTAAGACAA TGATCCTGCC GGCCCTCCGG
GCGGATACCC GTATTCAGCT CAAGACACTT GTCACGAGTC GCGGTGTCTC AGCCGAAGGC
ACGGGCAGCA AATTCGGCTT CGCCACTGCA TCCACAGAGA TGGCATCGGT GTTGAGCGAC
CCGGACATCG CAGGGGTCGT TATCACCACG CCTCATTCCA CTCACGCCAC GATGGTTCGC
AAGGTGTTGG AGGCCGGCAA GAGCGTCTTC GTTGAGAAGC CGCTCGCACT GAGCCATGAG
GAGTTGGAAA GCGTTGTGGA GGCACGCAAC GCCTCGGACG GTTTCGTAAC GGTGGGCTTC
AATCGCCGCT TTGCGCCCTA CGTGCAGGAC ATGAAGAATT TCATGAAGAC CAAAGCGGGG
CGCGCGGTGG TCAACATCCG CGTCAACGCC GGCCAGTTGC CACCGGATAG TTGGCAGCGC
GATGCCGAGG AAGGGCAAGG ACGCATCCTA GGCGAACTAT GCCACTTCAT CGATCTGGCC
ATGGATCTGG TCGATGCGCC GCTGGCTTCG GTGTCCGCCA CGGCAGCAGA GGCAGCGCGC
GCGCTCTGTG AGGATCTCAG CGTCGCGTTG CGCTTCGCGG ACGGCAGCCT CGCCAATATC
GTCTATACGG CGCTCGGCGA CACCAGCTTC AGCAAGGAGC TGGTCGAGGT CTATAAGGGC
GCGGCGGTCT GCCAGATCGA GAACTTCCGC GAATTCACAA CGGTCGTCGA TGGCAAGAGT
TCGACCAAGA AGACGATGGC GCAGGACAAG GGGCATAACG CTCAGATCAA AGCGTGGGTC
GGCGGTGTGC TCTCCGGCAT ACCGCCGGTT GACGAGCAGT CGCTCATTGA TTCGAGCCTG
GCGACCATCC TGGTGCTCGA CTCGCTCCGC CTTGGCCGCC CGGTCGAGTT CGGGGAAGCC
CCGTCCGGCG CCTGA
 
Protein sequence
MKQVVQNARG GKLSLKSVPA PGIARNSVLI KTHCSLISAG TERQMVGFAQ SSLVAKARAR 
PDLVRKVVDK VRRDGPMATL KSVTARLDEA LPLGYSIAGE VVEVGAGLEG LYQIGQLVAG
AGIGLANHSE YNLVPANLVV PVPADVTPEE ACFSTLGSIA LHSVRLIEPQ LGDVVAVLGA
GLVGQLCAQF ARLAGARVIV LDFNTERLAL AKSLGAEATI QLGVGNPAEA VLDLTRGIGS
DSVLIAAATP TSEPFETAAE IARDRATVCL VGITGTEFPY RPYMQKELRV LVSRSYGPGR
YDRDYENKHQ KYPLGYVRWT EHENLREVAR LMSPSLPNRL NVQPLISHRF AFDDAENAYG
LVMEGKEPHL GVVLEYGSSA PADNRRDLAL KGKGRASTGT AIGAVGAGNF GKTMILPALR
ADTRIQLKTL VTSRGVSAEG TGSKFGFATA STEMASVLSD PDIAGVVITT PHSTHATMVR
KVLEAGKSVF VEKPLALSHE ELESVVEARN ASDGFVTVGF NRRFAPYVQD MKNFMKTKAG
RAVVNIRVNA GQLPPDSWQR DAEEGQGRIL GELCHFIDLA MDLVDAPLAS VSATAAEAAR
ALCEDLSVAL RFADGSLANI VYTALGDTSF SKELVEVYKG AAVCQIENFR EFTTVVDGKS
STKKTMAQDK GHNAQIKAWV GGVLSGIPPV DEQSLIDSSL ATILVLDSLR LGRPVEFGEA
PSGA
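
The relationship between the gene sequence and the protein sequence in this record can be illustrated by translating a few codons. This is a hedged sketch, not the database's own method: it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene begins with ATG). The function name `translate` and the two example fragments, copied from the first and last lines of the gene sequence, are illustrative choices.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is removed first, because the record prints the sequence
    in space-separated blocks of ten bases.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
first_block = "ATGAAGCAAG TTGTTCAGAA CGCGCGTGGC"
last_block = "CCGTCCGGCG CCTGA"

print(translate(first_block))  # MKQVVQNARG
print(translate(last_block))   # PSGA
```

Both results match the corresponding start and end of the protein sequence listed in this record.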