Gene Namu_2054 details

Gene Information

Locus tag: Namu_2054
Symbol: aceE
ID: 8447663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2266398
End bp: 2269163
Gene Length: 2766 bp
Protein Length: 921 aa
Translation table: 11
GC content: 68%
IMG OID: 645041177
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_003201423
Protein GI: 258652267
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
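
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2266398
end_bp = 2269163
gene_length_bp = 2766
protein_length_aa = 921

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is the stop codon and is not counted
# in the protein length.
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions the three checks agree with the record: 2766 bp is 922 codons, which is 921 residues plus one stop codon.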


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.419391
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0127021
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACGACAG TGAACGAACC GGCGCCGAAG ATGACGCTGA TCAAGGACGG CATCGCCGCC 
CAATTGCACG ACATCGACCC GGAAGAGACG AGCGAGTGGC TCGCCTCCTT CGACGCCATG
CTCGAGGCGG GCGGCAGCCA GCGGGCCCGG TACCTGATGC TGCGGATGCT GGACCGGGCC
AAGCAGCAGC ACATCGCGCT GCCGGCGCTG ACCACCACGG ACTACATCAA CACCATCCCG
ACCGAGTCGG AGCCGTTCTT CCCCGGTGAT GAGGCGATCG AGCGCCGCTA CCGCCGGTTC
ATCCGCTGGA ACGCCGCGAT GCTGGTGCAC CGCGCGCAGC GGCCCGGCAT CGGCGTCGGC
GGCCACATCT CCACCTACGC CTCCTCGGCG ACGTTGTACG AGGTCGGCTT CAACCACTTC
TGGCGCGGCA AGGACCACCC CGGCGGCGGC GACCAGGTCT TCTTCCAGGG CCACGCCTCC
CCCGGCATGT ACGCCCGCGC CTTCCTCGAG GGCCGGCTGT CCGAGAACGA CCTGGACGGC
TTCCGCCAGG AGAAGTCGCA CCCGGGTGGC GGCATTCCGT CGTACCCGCA CCCGCGGCTG
ATGCCGGACT TCTGGGAATT CCCGACCGTG TCCATGGGCC TGGGCCCGAT GAACGCGATC
CAGCAGGCCC GGGTCAACCG CTTCCTGCAC CACCGCGGCA TCAAGGACAC CTCCGACCAG
CACGTCTGGG CGTTCCTGGG CGACGGCGAG CTCGACGAGG TCGAGTCCCG CGGTCTGATC
CACATCGCCG CGATCGACGG CCTGGACAAC CTGACCTACG TCATCAACTG CAACCTGCAG
CGCCTGGACG GCCCGGTCCG CGGCAACGGC AAGATCGTGC AGGAGCTGGA GGCCTTCTTC
CGGGGCGCCG GTTGGAACGT CATCAAGGTC ATCTGGGGCC GCGAGTGGGA CCGCCTGCTC
GAGAAGGACA AGGACGGCGC CCTGGTCCAC CTGATGAACA CCACCGCCGA CGGCGACTTC
CAGACCTACC GGGCCAACGA CGGCGCGTAC ATCCGGGAGC ACTTCTTCGG CCGCGACCCG
CGGACCAAGC AGATGGTCAC CGACCTGTCC GACGAGGACA TCTGGCGGCT GCGGCGCGGC
GGCCACGACT ACCGCAAGGT CTACGCGGCC TACAACGCGG CGACCCAGCA CACCGGGCAG
CCGACCGTCA TCCTGGTCAA GACGATCAAG GGCTTCGGGC TCGGCCCGTC GTTCCAGGGC
CGCAACGCGA CCCACCAGAT GAAGAAGATG ACCTCGCAGA ACCTGCACGA GTTCCGCGAC
AGCCTGCAGC TGCCCATCCC GGACAGCCAG CTCGAGGACG TCTACCGGCC GCCGTACTTC
CACCCCGGAC AGGACTCCGA AGAGATCCAG TACATGCTCG ATCGCCGCAA GCGCCTCGGC
GGCTTCGTGC CCGAGCGTCG GGTCGCCGCC AAGCCGCTGG TGCTGCCGGG CGACAAGGTC
TACGACGTGC TGAAGAAGGG CTCCGGCAAG CAGGAGATCG CCACCACCAT GGCGTTCGTC
CGGCTCGTCC GCGACCTGTT CAAGGACCCG GAGATCGGCA ACCGCGTGGT GCCGATCATC
CCGGACGAGG CCCGCACCTT CGGCATGGAC TCGTTCTTCC CGACGCAGAA GATCTACAAC
CCCTCGGGTC AGCTTTACAC CGCGGTCGAC GCCCAGCTGA TGCTGGCCTA CCGCGAGTCC
GAGCAGGGCA TGATCCTGCA CGAGGGCATC GACGAGGCCG GTTCGGTGGC CACGCTGACC
GCGGTGTCCA CCGCGTACGC CACCCACGGC GAGCCGATGA TCCCGATGTA CATCTTCTAT
TCGATGTTCG GGTTCCAGCG CACGGGCGAC GGCATGTGGG CCATCGGCGA CCAGATGGGC
CGCGGCTTCG TGCTCGGCGC CACCGCCGGC CGGACCACGC TGACCGGTGA GGGCCTGCAG
CACGGTGACG GGCACTCGCA CCTGCTGGCC GCCACGCAGC CGCACTTCGT CTCCTACGAC
CCGGCGTACG GCTACGAGAT CGCGCACATC GTCAAGGACG GCCTGCGGCG GATGTACGGC
GGGAGCGAGG AGTTCCCGCA CGGCGAGAAC ATCATGTACT ACATCACCCT GTACAACGAG
CCGTACCAGC AGCCCAAGCA GCCGGACGAC CTCGATGTCG ACGGCCTGCT CAAGGGCATC
TACAAGCTCT CGCCGGCCGC CGAGACCGAG GGCAAGGCGC GCGCGCAGCT CCTGGCCTCC
GGCGTCGGGG TGCGCTGGGC CCTGGAAGCC CAGCAGCTGC TGGCCCAGGA CTGGGGGGTG
GCCGCCGACG TCTGGTCGGT CACCAGCTGG ACCGAGCTGA GCCGGGACGC CGAGCGGGTC
GAACGGGCCC GCTTGCTGGA TCCGGCCGCC GAGGTCGGCG TGCCGTACAT CTCCAAGGTG
CTGGCCGAGA CCGAGGGACC GGTCATTGCC ACCAGCGATT GGCAGCGGGC GATTCAGAAC
CTGATCGCAC CGTGGGTGCC GGGCGATTTC GTCGCTCTCG GCGCCGACGG ATTCGGATTC
TCCGACACCC GCGCGGCGGC TCGGCGGCAT TTCCTCATCG ACGGTCCGTC GATGGTGGTG
GCCACCCTGT CGGCCCTGGA ACGGCGCGGT CAGTACCGGG CCGGAGCAGC TGCCGAAGCC
GCGGAGAAGT ACGAACTGCA CGACGTCCGG GCCGGACGTT CGGGCAGCAC CGGTGGCGAC
TCCTGA
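
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of that cleanup; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which removes both the block spacing and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, as displayed.
first_line = (
    "TTGACGACAG TGAACGAACC GGCGCCGAAG "
    "ATGACGCTGA TCAAGGACGG CATCGCCGCC "
)

seq = clean_sequence(first_line)
print(len(seq), "bases in the sample")
print(f"GC fraction of the sample: {gc_fraction(seq):.2f}")
```

Running the same two functions over the complete listing should give a length and GC percentage that can be compared with the Gene Length and GC content fields.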
 
Protein sequence
MTTVNEPAPK MTLIKDGIAA QLHDIDPEET SEWLASFDAM LEAGGSQRAR YLMLRMLDRA 
KQQHIALPAL TTTDYINTIP TESEPFFPGD EAIERRYRRF IRWNAAMLVH RAQRPGIGVG
GHISTYASSA TLYEVGFNHF WRGKDHPGGG DQVFFQGHAS PGMYARAFLE GRLSENDLDG
FRQEKSHPGG GIPSYPHPRL MPDFWEFPTV SMGLGPMNAI QQARVNRFLH HRGIKDTSDQ
HVWAFLGDGE LDEVESRGLI HIAAIDGLDN LTYVINCNLQ RLDGPVRGNG KIVQELEAFF
RGAGWNVIKV IWGREWDRLL EKDKDGALVH LMNTTADGDF QTYRANDGAY IREHFFGRDP
RTKQMVTDLS DEDIWRLRRG GHDYRKVYAA YNAATQHTGQ PTVILVKTIK GFGLGPSFQG
RNATHQMKKM TSQNLHEFRD SLQLPIPDSQ LEDVYRPPYF HPGQDSEEIQ YMLDRRKRLG
GFVPERRVAA KPLVLPGDKV YDVLKKGSGK QEIATTMAFV RLVRDLFKDP EIGNRVVPII
PDEARTFGMD SFFPTQKIYN PSGQLYTAVD AQLMLAYRES EQGMILHEGI DEAGSVATLT
AVSTAYATHG EPMIPMYIFY SMFGFQRTGD GMWAIGDQMG RGFVLGATAG RTTLTGEGLQ
HGDGHSHLLA ATQPHFVSYD PAYGYEIAHI VKDGLRRMYG GSEEFPHGEN IMYYITLYNE
PYQQPKQPDD LDVDGLLKGI YKLSPAAETE GKARAQLLAS GVGVRWALEA QQLLAQDWGV
AADVWSVTSW TELSRDAERV ERARLLDPAA EVGVPYISKV LAETEGPVIA TSDWQRAIQN
LIAPWVPGDF VALGADGFGF SDTRAAARRH FLIDGPSMVV ATLSALERRG QYRAGAAAEA
AEKYELHDVR AGRSGSTGGD S
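
The gene sequence begins with TTG while the protein begins with M. That is consistent with translation table 11, in which TTG is one of the permitted alternative initiation codons and is read as methionine at the start of a CDS. The sketch below checks only the two ends of a CDS. The codon sets are the table 11 start and stop codons as I understand the NCBI definition, and `check_cds_ends` is a hypothetical helper, not part of any database tooling.

```python
# Start and stop codons for NCBI translation table 11
# (bacterial, archaeal and plant plastid code). Stated from memory
# of the NCBI table; verify against the official listing before relying on it.
TABLE11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE11_STOPS = {"TAA", "TAG", "TGA"}


def check_cds_ends(first_codon: str, last_codon: str, protein: str) -> dict:
    """Report whether the CDS ends are plausible under table 11."""
    first_codon = first_codon.upper()
    last_codon = last_codon.upper()
    return {
        "start_ok": first_codon in TABLE11_STARTS,
        "stop_ok": last_codon in TABLE11_STOPS,
        # Any valid initiation codon is translated as M at position 1.
        "protein_starts_with_M": protein.startswith("M"),
    }


# First and last codons of the gene sequence in this record,
# and the first ten residues of the protein sequence.
result = check_cds_ends("TTG", "TGA", "MTTVNEPAPK")
print(result)
```

The check is deliberately shallow: it does not translate the internal codons, so it cannot detect frameshifts or internal stops.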