Gene Mvan_3752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_3752 
SymbolaceE 
ID4646818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp3990551 
End bp3993325 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content67% 
IMG OID639807217 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_954540 
Protein GI120404711 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0660504 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCCAGG ACCTGGCCCA AAACTCCGGC AGCACAGCCG AACCCGACCG GGTTCGCGTC 
ATCAGAGAAG GCGTCGCCTC GTATCTGCCG GACATCGACC CCGAAGAGAC CGGCGAGTGG
CTGGAGTCGT TCGATGATCT GCTCGAGCGT TCCGGCCCGG CGCGGGCCCG CTATCTCATG
TTGCGGCTGC TGGAACGGTC CCGGGAGAAG CGGGTGGCCA TCCCGGCGCT GACGTCCACC
GACTACGTCA ACACGATCCC GACCGAACTG GAGCCGTGGT TCCCCGGCGA CGAGGACGTC
GAGCGCCGCT ACCGCGCGTG GATCCGCTGG AACGCCGCGA TCATGGTGCA CCGCGCGCAG
CGGCCGGGAG TCGGTGTGGG TGGCCATATC TCGACCTACG CGTCCTCGGC GGCGCTCTAC
GAGGTGGGCT TCAACCACTT CTTCCGCGGC AAGAGCCACC CCGGCGGCGG CGACCAGATC
TTCATCCAGG GCCACGCCTC CCCCGGCATC TACGCCCGCG CCTACCTCGA AGGCCGGCTG
AGCGCCGACC AACTCGACGG TTTCCGTCAG GAGCACAGCC ATCCCGGCGG CGGGCTGCCG
TCGTATCCGC ACCCGCGGCT GATGCCCGAT TTCTGGGAGT TCCCGACGGT GTCGATGGGC
CTGGGCCCGA TGAACGCGAT CTATCAGGCG CGGTTCAACC ATTACCTGCA CGACCGGGGC
ATCAAGGACA CCTCCAAACA GCACGTGTGG GCGTTCCTCG GCGACGGTGA GATGGACGAG
CCGGAAAGCC GCGGGCTGAT CCAGGTGGCC GCCAACGAGG CCCTGGACAA CCTGACCTTC
GTCGTCAACT GCAACCTGCA GCGCCTCGAC GGCCCGGTGC GCGGCAACGG CAAGATCATC
CAGGAGCTCG AGTCGTTCTT CCGCGGCGCC GGCTGGAACG TCATCAAGGT GGTGTGGGGC
CGCGAGTGGG ACGCGCTGCT GCACGCCGAC CGGGACGGCG CTCTGGTGAA CCTGATGAAC
ACCACCCCGG ACGGGGATTA CCAGACCTAC AAGGCCAACG ACGGCGCCTA CGTCCGCGAC
CATTTCTTCG GCCGCGATCC GCGCACCAAG GCCCTCGTCG AGAACATGAC CGACCGGGAG
ATCTGGAACC TCAAGCGCGG CGGGCACGAC TACCGCAAGG TGTATGCGGC GTACCACGCG
GCGCTGGAGC ACAAGGGCCA GCCGACGGTG ATCCTGGCCA AGACCATCAA GGGCTACACG
CTGGGCAAGC ACTTCGAGGG CCGCAACGCT ACCCACCAGA TGAAAAAGCT TGCGCTGCAA
GACCTCAAGG ACTTCCGCGA CGCGCAGCGC ATCCCGGTCA GCGACGAGCA GCTGGAAGCC
GATCCGTACC TGCCGCCGTA CTACCATCCG GGTCCCGAGG CCCCGGAGAT CCGCTACATG
CTCGACCGCC GGCGCGCCCT TGGCGGGTTC CTGCCCGAGC GTCGCACCAA GTCCAGAGCG
CTGACCCTGC CGGGGCGCGA CGTCTACAAG GCGCTCAAGA AGGGGTCGGG CAAGCAGGAG
GTCGCCACCA CCATGGCGAC CGTGCGCACC TTCAAGGAAC TGTTGCGCGA CAAGAACATC
GGATCGCGCA TTGTTCCCAT CATCCCCGAC GAGGCCCGCA CGTTCGGCAT GGACTCGTGG
TTCCCCAGCC TCAAGATCTA CAACCGCAAC GGCCAGCTCT ACACCGCGGT GGACGCCGAG
TTGATGCTCG CCTACAAGGA GAGCGAAATC GGCCAGATCC TGCACGAAGG CATCAACGAG
GCGGGGTCGA CGGCGTCGTT CACCGCGGTG GGCACCTCGT ATGCCACCCA CAACGAGCCG
ATGATCCCGA TCTACATCTT CTACTCGATG TTCGGGTTCC AGCGGACCGG AGACGGACTG
TGGGCGGCCG CCGACCAGAT GGCGCGCGGC TTCGTCCTCG GCGCCACCGC CGGGCGCACC
ACACTCGTCG GGGAAGGGCT GCAGCACGCC GACGGCCACT CGCTGCTGCT GGCGTCCACC
AACCCCGCGG TGGTGGCCTA CGACCCGGCG TTCGCCTACG AGATCGCCTA CATCATCGAA
AGCGGCCTGC ACCGCATGTA CGGCGAGAAC CCGGAGAACG TCTACTTCTA CATGACGATC
TACAACGAGC CCTACGTCCA GCCCGCCGAA CCCGACGGCC TCGACGTCGA GGGCCTGCTG
CGCGGCATGT ACCGGTACCA GGCCGCCCCC GAGAAGCGCA CCAACGCCGC CCAGATCCTG
GTGTCCGGGG TGGGCATGCC GTCGGCGCTC AAGGCCGCCG AGCTGCTGGC GCAGGAGTGG
GACGTCGCCG CCGACGTCTG GTCGGTGACC AGCTGGGGCG AGCTCAACCG CGACGGCGTC
GCCATCGAGA GGGAGAAGCT GCGCCATCCC GAGCTCCCCG CCGCCACGCC CTACGTCACC
AAGATGCTGG CAGAGGCCGC CGGACCCGTC GTCGCGGTGT CGGACTGGAT GCGCGCCGTG
CCTGAGCAGA TCCGGCCCTG GGTGCCCGGC ACGTACATCA CGCTCGGCAC CGACGGCTTC
GGCTTCTCCG ACACCCGCCC GGCCGCCCGG CGCTACTACA ACACCGACGC CGAGTCGGTG
GTGGTGGCCG TGCTCGAAGG GCTGGCACGC GACGGCAACA TCGACATCTC GGTGGCCGTG
GAGGCGGCGA AGAAGTACGA GATCGACGAT GTGATGGCGG CACCGGAGCA GACGTCGGAT
CCCGGAGTCG CGTAG
 
Protein sequence
MRQDLAQNSG STAEPDRVRV IREGVASYLP DIDPEETGEW LESFDDLLER SGPARARYLM 
LRLLERSREK RVAIPALTST DYVNTIPTEL EPWFPGDEDV ERRYRAWIRW NAAIMVHRAQ
RPGVGVGGHI STYASSAALY EVGFNHFFRG KSHPGGGDQI FIQGHASPGI YARAYLEGRL
SADQLDGFRQ EHSHPGGGLP SYPHPRLMPD FWEFPTVSMG LGPMNAIYQA RFNHYLHDRG
IKDTSKQHVW AFLGDGEMDE PESRGLIQVA ANEALDNLTF VVNCNLQRLD GPVRGNGKII
QELESFFRGA GWNVIKVVWG REWDALLHAD RDGALVNLMN TTPDGDYQTY KANDGAYVRD
HFFGRDPRTK ALVENMTDRE IWNLKRGGHD YRKVYAAYHA ALEHKGQPTV ILAKTIKGYT
LGKHFEGRNA THQMKKLALQ DLKDFRDAQR IPVSDEQLEA DPYLPPYYHP GPEAPEIRYM
LDRRRALGGF LPERRTKSRA LTLPGRDVYK ALKKGSGKQE VATTMATVRT FKELLRDKNI
GSRIVPIIPD EARTFGMDSW FPSLKIYNRN GQLYTAVDAE LMLAYKESEI GQILHEGINE
AGSTASFTAV GTSYATHNEP MIPIYIFYSM FGFQRTGDGL WAAADQMARG FVLGATAGRT
TLVGEGLQHA DGHSLLLAST NPAVVAYDPA FAYEIAYIIE SGLHRMYGEN PENVYFYMTI
YNEPYVQPAE PDGLDVEGLL RGMYRYQAAP EKRTNAAQIL VSGVGMPSAL KAAELLAQEW
DVAADVWSVT SWGELNRDGV AIEREKLRHP ELPAATPYVT KMLAEAAGPV VAVSDWMRAV
PEQIRPWVPG TYITLGTDGF GFSDTRPAAR RYYNTDAESV VVAVLEGLAR DGNIDISVAV
EAAKKYEIDD VMAAPEQTSD PGVA