Gene Mvan_5563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5563 
Symbol 
ID4647132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5943040 
End bp5945280 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content69% 
IMG OID639809035 
Productglycerol dehydratase 
Protein accessionYP_956334 
Protein GI120406505 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4909] Propanediol dehydratase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.811495 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAGACG AGTTGGGCCG GTTTCGGGTA CTGAACTCGA AACCGGTCAA CCTCGACGGG 
TTCAGCGTCC CCGACGCCGG GCTCGGGCTG GTCGCGATGA GCAGTCCCCA TGACCCCGCA
CCCTCTCTGA AGATCCGGGG GGGTGAGGTA GTCGAACTCG ACGGCAAAGG CGCCGGCGAG
TTCGACGTCA TCGACGAGTT CATCGCCCGC TACGGCATCG ACCTCACGGT CGCCGAAGAG
GCGATGGCCC TCGGTGACGA GACGCTCGCG CGCATGGTGG TCGACATCAA CGTGCCGAGG
GCGGAGGTGG TGCGGTTGAT CGGCGGCACC ACCCCCGCCA AGCTGGCCCG CGTCGTGGCG
CTGCTGTCCC CGGTGGAGAT GCAGATGGCG ATGGCCAAGA TGCGGGCCCG CCGCACCCCG
AGCAACCAGG CACACGTCAC CAACCAACTC GACGACCCGT TGCTGATTGC GGCGGACGCC
GCATCGGCGG TGGCCTACGG CTTCCGCGAG GTGGAGACGA CGGTGCCGGT GCTCGGTGAC
GCACCGTCGA ATGCGGTGGC GCTGTTGATC GGCAGTCAGG TCGGCTCCCC CGGTGCGATG
GCCCAGTGCT CGATCGAAGA GGCGCTGGAA TTGCGGCTGG GGCTACGCGG GCTCACCAGC
TACGCCGAGA CGATCTCGAT CTACGGCACC GAGCAGGTGT TCGTCGACGG TGACGACACC
CCGTTCAGCA AGGCGATCCT GACGTCGGCG TACGCGTCAC GCGGGCTCAA GATGCGGGTC
ACCAGCGGCG GCGGAGCCGA GGTTCTGATG GGCGCCGCCG AGAAGTGCTC GATCCTGTAC
CTGGAGTCGC GTTGCGTGTC GCTGGCCCGT GCGCTCGGGT CCCAGGGCGT GCAGAACGGC
GGCATCGACG GTGTCGGGGT GGTCGCGTCG GTGCCCGAGG GCATGAAGGA ACTGCTCGCC
GAGAACCTGA TGGTGATGAT GCGCGACCTG GAATCGTGTG CGGGCAACGA CAACCTGATC
TCCGAGTCGG ACATCCGGCG CAGTGCGCAC ACCCTGCCGG TGCTGCTGGC CGGGGCCGAC
TTCGTCTTCT CCGGCTTCGG TTCGATCCCG CGCTACGACA ACGCGTTCGC ACTGTCGAAC
TTCAACTCCG ACGACATGGA CGACTTCCTG GTGCTGCAGC GGGACTGGGG TGCCGACGGC
GGTCTGCGCA CCGTGTCGCC GGAGCATCTG GAGGCCGTGC GTCGCCGCGC GGCCAAGGCC
GTCCAGGCGG TGTACCGCGA TCTCGGGCTG GCCGACTACG AGGATGCGCG CGTCGAGGAG
GTGGTGGCCG CCAACGGGTC CCGCGACCTG CCCGCCGGCC ACCCGAAGAT GGTGGCCGAA
GCGGCGGCGT CGATCGAGGC GAGACAGCTG ACGGTGTTCG ACGTCATCGC GTCGCTGCAC
CGCACCGGTT TCACCGACGA GGCCGAGGCG ATCACCACCC TGACGCGGGA ACGACTGCGC
GGTGACCAAC TGCAGACCTC GGCGATCTTC GACGAGAAGT TCCGGGTGCT GTCCAAGCTC
ACCGATCCCA ACGACTACAC GGGTCCGGCA ACGGGTTACG CCCTCACCGA CCGGCGGCGG
GCCGAGATCG ACGCGATCCG GCAGGCCCGC AGCAGCGCCG AGTTGACTGC CGACCAGGAG
TCGTACCGCG GACATGTCCT GGTCACCGAC GTCGAACCGG CGCAGCAGGG CAGCGATCCG
CGCGAGGTCT GTATCGGGCT GTCGCCCGCC TGGGGGCGCA GCGTGTGGCT GACGCTGTGC
GGGCTGACCA TCGGGGAGGT GCTACGTCAG ATCTCGGCGG GCCTGGAGGA GGAGGGCTGC
ATTGCCCGCC CGGTGCGGGT GCGCTCCACC ATCGATGTCG GGCTGATCGG TCTGACCGCC
GCGCGGCTGT CGGGCTCCGG CATCGGAATC GGGTTGCAGG GCAAAGGGAC CGCGTTGATC
CACCGCCGGG ACCTGGCGCC GCTGGCCAAC CTGGAACTGT TCAGCGTGGC GCCGCTGCTG
ACGGCCAAGA TGTACCGCGA GCTCGGCAAG AACGCCGCGC GGCACGCCAA GGGGATGGCG
CCGGTGCCGA TCTTCACCGG CGGCACCGAC GAGTCGATCT CCGCGCGCTA CCACGCCAGA
GCGGTCGCGC TGGTAGCGCT GGAGCGCGAG TCCTGCGAAC CGGGTCAGCC GCCGGTCACG
GTGAAGGTGG AATGGCCATG A
 
Protein sequence
MADELGRFRV LNSKPVNLDG FSVPDAGLGL VAMSSPHDPA PSLKIRGGEV VELDGKGAGE 
FDVIDEFIAR YGIDLTVAEE AMALGDETLA RMVVDINVPR AEVVRLIGGT TPAKLARVVA
LLSPVEMQMA MAKMRARRTP SNQAHVTNQL DDPLLIAADA ASAVAYGFRE VETTVPVLGD
APSNAVALLI GSQVGSPGAM AQCSIEEALE LRLGLRGLTS YAETISIYGT EQVFVDGDDT
PFSKAILTSA YASRGLKMRV TSGGGAEVLM GAAEKCSILY LESRCVSLAR ALGSQGVQNG
GIDGVGVVAS VPEGMKELLA ENLMVMMRDL ESCAGNDNLI SESDIRRSAH TLPVLLAGAD
FVFSGFGSIP RYDNAFALSN FNSDDMDDFL VLQRDWGADG GLRTVSPEHL EAVRRRAAKA
VQAVYRDLGL ADYEDARVEE VVAANGSRDL PAGHPKMVAE AAASIEARQL TVFDVIASLH
RTGFTDEAEA ITTLTRERLR GDQLQTSAIF DEKFRVLSKL TDPNDYTGPA TGYALTDRRR
AEIDAIRQAR SSAELTADQE SYRGHVLVTD VEPAQQGSDP REVCIGLSPA WGRSVWLTLC
GLTIGEVLRQ ISAGLEEEGC IARPVRVRST IDVGLIGLTA ARLSGSGIGI GLQGKGTALI
HRRDLAPLAN LELFSVAPLL TAKMYRELGK NAARHAKGMA PVPIFTGGTD ESISARYHAR
AVALVALERE SCEPGQPPVT VKVEWP