Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2054 |
Symbol | aceE |
ID | 8447663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2266398 |
End bp | 2269163 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 645041177 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_003201423 |
Protein GI | 258652267 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.419391 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0127021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACGACAG TGAACGAACC GGCGCCGAAG ATGACGCTGA TCAAGGACGG CATCGCCGCC CAATTGCACG ACATCGACCC GGAAGAGACG AGCGAGTGGC TCGCCTCCTT CGACGCCATG CTCGAGGCGG GCGGCAGCCA GCGGGCCCGG TACCTGATGC TGCGGATGCT GGACCGGGCC AAGCAGCAGC ACATCGCGCT GCCGGCGCTG ACCACCACGG ACTACATCAA CACCATCCCG ACCGAGTCGG AGCCGTTCTT CCCCGGTGAT GAGGCGATCG AGCGCCGCTA CCGCCGGTTC ATCCGCTGGA ACGCCGCGAT GCTGGTGCAC CGCGCGCAGC GGCCCGGCAT CGGCGTCGGC GGCCACATCT CCACCTACGC CTCCTCGGCG ACGTTGTACG AGGTCGGCTT CAACCACTTC TGGCGCGGCA AGGACCACCC CGGCGGCGGC GACCAGGTCT TCTTCCAGGG CCACGCCTCC CCCGGCATGT ACGCCCGCGC CTTCCTCGAG GGCCGGCTGT CCGAGAACGA CCTGGACGGC TTCCGCCAGG AGAAGTCGCA CCCGGGTGGC GGCATTCCGT CGTACCCGCA CCCGCGGCTG ATGCCGGACT TCTGGGAATT CCCGACCGTG TCCATGGGCC TGGGCCCGAT GAACGCGATC CAGCAGGCCC GGGTCAACCG CTTCCTGCAC CACCGCGGCA TCAAGGACAC CTCCGACCAG CACGTCTGGG CGTTCCTGGG CGACGGCGAG CTCGACGAGG TCGAGTCCCG CGGTCTGATC CACATCGCCG CGATCGACGG CCTGGACAAC CTGACCTACG TCATCAACTG CAACCTGCAG CGCCTGGACG GCCCGGTCCG CGGCAACGGC AAGATCGTGC AGGAGCTGGA GGCCTTCTTC CGGGGCGCCG GTTGGAACGT CATCAAGGTC ATCTGGGGCC GCGAGTGGGA CCGCCTGCTC GAGAAGGACA AGGACGGCGC CCTGGTCCAC CTGATGAACA CCACCGCCGA CGGCGACTTC CAGACCTACC GGGCCAACGA CGGCGCGTAC ATCCGGGAGC ACTTCTTCGG CCGCGACCCG CGGACCAAGC AGATGGTCAC CGACCTGTCC GACGAGGACA TCTGGCGGCT GCGGCGCGGC GGCCACGACT ACCGCAAGGT CTACGCGGCC TACAACGCGG CGACCCAGCA CACCGGGCAG CCGACCGTCA TCCTGGTCAA GACGATCAAG GGCTTCGGGC TCGGCCCGTC GTTCCAGGGC CGCAACGCGA CCCACCAGAT GAAGAAGATG ACCTCGCAGA ACCTGCACGA GTTCCGCGAC AGCCTGCAGC TGCCCATCCC GGACAGCCAG CTCGAGGACG TCTACCGGCC GCCGTACTTC CACCCCGGAC AGGACTCCGA AGAGATCCAG TACATGCTCG ATCGCCGCAA GCGCCTCGGC GGCTTCGTGC CCGAGCGTCG GGTCGCCGCC AAGCCGCTGG TGCTGCCGGG CGACAAGGTC TACGACGTGC TGAAGAAGGG CTCCGGCAAG CAGGAGATCG CCACCACCAT GGCGTTCGTC CGGCTCGTCC GCGACCTGTT CAAGGACCCG GAGATCGGCA ACCGCGTGGT GCCGATCATC CCGGACGAGG CCCGCACCTT CGGCATGGAC TCGTTCTTCC CGACGCAGAA GATCTACAAC CCCTCGGGTC AGCTTTACAC CGCGGTCGAC GCCCAGCTGA TGCTGGCCTA CCGCGAGTCC GAGCAGGGCA TGATCCTGCA CGAGGGCATC GACGAGGCCG GTTCGGTGGC CACGCTGACC GCGGTGTCCA CCGCGTACGC CACCCACGGC GAGCCGATGA TCCCGATGTA CATCTTCTAT TCGATGTTCG GGTTCCAGCG CACGGGCGAC GGCATGTGGG CCATCGGCGA CCAGATGGGC CGCGGCTTCG TGCTCGGCGC CACCGCCGGC CGGACCACGC TGACCGGTGA GGGCCTGCAG CACGGTGACG GGCACTCGCA CCTGCTGGCC GCCACGCAGC CGCACTTCGT CTCCTACGAC CCGGCGTACG GCTACGAGAT CGCGCACATC GTCAAGGACG GCCTGCGGCG GATGTACGGC GGGAGCGAGG AGTTCCCGCA CGGCGAGAAC ATCATGTACT ACATCACCCT GTACAACGAG CCGTACCAGC AGCCCAAGCA GCCGGACGAC CTCGATGTCG ACGGCCTGCT CAAGGGCATC TACAAGCTCT CGCCGGCCGC CGAGACCGAG GGCAAGGCGC GCGCGCAGCT CCTGGCCTCC GGCGTCGGGG TGCGCTGGGC CCTGGAAGCC CAGCAGCTGC TGGCCCAGGA CTGGGGGGTG GCCGCCGACG TCTGGTCGGT CACCAGCTGG ACCGAGCTGA GCCGGGACGC CGAGCGGGTC GAACGGGCCC GCTTGCTGGA TCCGGCCGCC GAGGTCGGCG TGCCGTACAT CTCCAAGGTG CTGGCCGAGA CCGAGGGACC GGTCATTGCC ACCAGCGATT GGCAGCGGGC GATTCAGAAC CTGATCGCAC CGTGGGTGCC GGGCGATTTC GTCGCTCTCG GCGCCGACGG ATTCGGATTC TCCGACACCC GCGCGGCGGC TCGGCGGCAT TTCCTCATCG ACGGTCCGTC GATGGTGGTG GCCACCCTGT CGGCCCTGGA ACGGCGCGGT CAGTACCGGG CCGGAGCAGC TGCCGAAGCC GCGGAGAAGT ACGAACTGCA CGACGTCCGG GCCGGACGTT CGGGCAGCAC CGGTGGCGAC TCCTGA
|
Protein sequence | MTTVNEPAPK MTLIKDGIAA QLHDIDPEET SEWLASFDAM LEAGGSQRAR YLMLRMLDRA KQQHIALPAL TTTDYINTIP TESEPFFPGD EAIERRYRRF IRWNAAMLVH RAQRPGIGVG GHISTYASSA TLYEVGFNHF WRGKDHPGGG DQVFFQGHAS PGMYARAFLE GRLSENDLDG FRQEKSHPGG GIPSYPHPRL MPDFWEFPTV SMGLGPMNAI QQARVNRFLH HRGIKDTSDQ HVWAFLGDGE LDEVESRGLI HIAAIDGLDN LTYVINCNLQ RLDGPVRGNG KIVQELEAFF RGAGWNVIKV IWGREWDRLL EKDKDGALVH LMNTTADGDF QTYRANDGAY IREHFFGRDP RTKQMVTDLS DEDIWRLRRG GHDYRKVYAA YNAATQHTGQ PTVILVKTIK GFGLGPSFQG RNATHQMKKM TSQNLHEFRD SLQLPIPDSQ LEDVYRPPYF HPGQDSEEIQ YMLDRRKRLG GFVPERRVAA KPLVLPGDKV YDVLKKGSGK QEIATTMAFV RLVRDLFKDP EIGNRVVPII PDEARTFGMD SFFPTQKIYN PSGQLYTAVD AQLMLAYRES EQGMILHEGI DEAGSVATLT AVSTAYATHG EPMIPMYIFY SMFGFQRTGD GMWAIGDQMG RGFVLGATAG RTTLTGEGLQ HGDGHSHLLA ATQPHFVSYD PAYGYEIAHI VKDGLRRMYG GSEEFPHGEN IMYYITLYNE PYQQPKQPDD LDVDGLLKGI YKLSPAAETE GKARAQLLAS GVGVRWALEA QQLLAQDWGV AADVWSVTSW TELSRDAERV ERARLLDPAA EVGVPYISKV LAETEGPVIA TSDWQRAIQN LIAPWVPGDF VALGADGFGF SDTRAAARRH FLIDGPSMVV ATLSALERRG QYRAGAAAEA AEKYELHDVR AGRSGSTGGD S
|
| |