Gene Mvan_4019 details

Gene Information

Locus tag: Mvan_4019
Symbol:
ID: 4647475
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium vanbaalenii PYR-1
Kingdom: Bacteria
Replicon accession: NC_008726
Strand:
Start bp: 4299219
End bp: 4301168
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 68%
IMG OID: 639807481
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_954802
Protein GI: 120404973
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3962] Acetolactate synthase
TIGRFAM ID:
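
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4299219
end_bp = 4301168
gene_length_bp = 1950
protein_length_aa = 649

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.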


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.14756
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTTTCCA CCGCGCCGAA GTCGGCCGAG AAGTTCGCCG ACAACGAGCC GACCGTCCGG 
CTCACCGTCG CCCAGGCGAC CGTGCGGTTC CTGGCCAACC AGTACGTCGA ACACGACGGC
GAGCGCCACA AGTTCTTCGC CGGATGTCTG GGCATCTTCG GCCACGGCAA CGTCGCGGGC
ATCGGCCAGG CGCTGCTGCA GGACGACCTG GAGGCCTATG AGGCCGGCAC GGAACCCGGC
CTGCGGTACG TGCTGGGCCG CAACGAACAG GCCATGGTGC ACACCGCGGT CGCCTACGCC
CGGCAGAAGG ACCGCACCCA GGCGTGGGCG GTGACCGCCA GCGTCGGACC CGGCGCGACG
AACATGCTGA CCGGCGCGGC GCTGGCGACC ATCAACCGGC TGCCGGTGTT GCTGCTTCCC
TCAGACACAT TCGCGACCCG GGTGAGTTCA CCGGTGCTGC AGGAGCTGGA GTTGCCGTCG
TCCGGCGACG TCACCGTCAA TGATGCGTTC AAACCGCTGT CGCGGTACTT CGACCGGGTG
TGGCGACCCG AGCAGCTGCC GGCAGCCCTG CTCGGGGCCA TGCGGGTGCT GACCGATCCC
GTCGAGACCG GCGCAGCGAC GGTGGCGCTC CCCCAGGACG TGCAGGCCGA GGCATTCGAC
TGGCCGGTCT CGTTGTTCGC CGAGCGGACC TGGCACATCG CCCGGCCGCT GCCCGAGCGG
GCGGTGATCG CCAAGGCCGC CGAGGTCATC CGGTCGGCGA CCAAGCCGCT GATCGTGGCC
GGCGGTGGTG TCATCTACTC CGGCGCCAGT GAGGCTCTGG CCGCGCTGGC CTCGAGCACC
GGAATCCCGG TGTGCGAGAG CCAGGCCGGG AAGGGTTCGC TGCTGCACGA CGACCCGCGC
TCGGTCGGGG CGGTCGGTTC GACCGGAACG ACGGCCGCCA ACGCGCTGGC CGCCGAGGCC
GATGTCGTGA TCGGGATCGG CACCCGCTAC AGCGATTTTA CGTCGGCCTC GCGTACCGCG
TTCAACAACC CGGACGTCCG GTTCGTCAAC ATCAATGTGG CATCCCTGGA TTCGGTGAAG
CACGGCGGCA TCAGCGTCGT CTCCGACGCC AGGGAGGCGC TGGAGGTGCT CACCGGACTG
CTCGACGACT ACTCGGTCAG CGACGACTAC GCCACGCGCG TAAACGAACT GATGTCCGAG
TGGAACGACA CCGTGTCGGA GTCCTACCGC ACCCAGGACG GCGCCGCATT GAACCAGAAC
CAGGTCATCG GCCTGGTCAA CACACTGTCG GACCCCCGCG ACGTCGTGGT GTGCGCGGCC
GGCTCGATGC CGGGAGACCT GCACAAGCTG TGGCGGATGC GCGACCGCAA GGGCTACCAC
GTCGAGTACG GGTTCTCCTG CATGGGCTAC GAGATCGCCG GCGGCATCGG CGTGCGGATG
GCCGCCCCGG ACCGCGACGT GTTCGTCATG GTCGGCGACG GCTCCTACCT GATGATGGCC
ACCGAACTCG TCACCGCCGT CCAGGAAGGC GTCAAGATCA TCCCGGTGCT GGTGCAGAAC
CACGGTTTCG CTTCGATCGG AGGGCTGTCG GATTCCGTGG GCTCCCAACG GTTCGGCACC
AGCTACCGGT ACCGCAGCGA CGACGGTCGG CTCGACGGCG ACAAGCTGCC CGTCGATCTC
GCGGCCAATG CGGCCAGCCT GGGCGCCGAC GTTATTCGCG TCACGACCGC GGCCGAGTTC
ACCGACGCCG TCAAGGTCGC CAAGGCCGGT GACCGTACCA CGGTCATCCA CGTCGAGACC
GACCCGATGA TCTTCGCGCC GGACAGCCAC TCCTGGTGGG ACGTCCCGGT CAGCCAGGTA
TCCGCACTGG AGTCCACCCG GCAGGCCTAT CAGCGGTACG CGGACTGGAA GAAAGTCCAA
CGGCCGCTGA TCAATCCGTC CGACCGCTGA
 
Protein sequence
MVSTAPKSAE KFADNEPTVR LTVAQATVRF LANQYVEHDG ERHKFFAGCL GIFGHGNVAG 
IGQALLQDDL EAYEAGTEPG LRYVLGRNEQ AMVHTAVAYA RQKDRTQAWA VTASVGPGAT
NMLTGAALAT INRLPVLLLP SDTFATRVSS PVLQELELPS SGDVTVNDAF KPLSRYFDRV
WRPEQLPAAL LGAMRVLTDP VETGAATVAL PQDVQAEAFD WPVSLFAERT WHIARPLPER
AVIAKAAEVI RSATKPLIVA GGGVIYSGAS EALAALASST GIPVCESQAG KGSLLHDDPR
SVGAVGSTGT TAANALAAEA DVVIGIGTRY SDFTSASRTA FNNPDVRFVN INVASLDSVK
HGGISVVSDA REALEVLTGL LDDYSVSDDY ATRVNELMSE WNDTVSESYR TQDGAALNQN
QVIGLVNTLS DPRDVVVCAA GSMPGDLHKL WRMRDRKGYH VEYGFSCMGY EIAGGIGVRM
AAPDRDVFVM VGDGSYLMMA TELVTAVQEG VKIIPVLVQN HGFASIGGLS DSVGSQRFGT
SYRYRSDDGR LDGDKLPVDL AANAASLGAD VIRVTTAAEF TDAVKVAKAG DRTTVIHVET
DPMIFAPDSH SWWDVPVSQV SALESTRQAY QRYADWKKVQ RPLINPSDR
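
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the codon assignments of the standard genetic code (which translation table 11 shares) and treats a small, deliberately incomplete set of alternative start codons, including the GTG that opens this gene, as methionine at the first position. The names `CODON_TABLE`, `START_CODONS`, and `translate` are hypothetical helpers, not part of the record or of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Simplification: only a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An alternative start codon at the first position is read as Met.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "GTGGTTTCCA CCGCGCCGAA GTCGGCCGAG"
print(translate(first_blocks))  # MVSTAPKSAE
```

The result matches the first ten residues of the protein sequence listed above.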