Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1993 |
Symbol | aceE |
ID | 4598308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 2130565 |
End bp | 2133363 |
Gene Length | 2799 bp |
Protein Length | 932 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639776596 |
Product | pyruvate dehydrogenase subunit E1 |
Protein accession | YP_923190 |
Protein GI | 119716225 |
COG category | [C] Energy production and conversion |
COG ID | [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component |
TIGRFAM ID | [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.883092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGAAG ACCCAACCCC TGCTTCTGGC CCCACCCCTG GCACGGACAC CAAGCGGAGC GGCGCGATCC CGACCGTCAT CCACGAAGGA CTGCCGACCC AGCTGCCCGA CACCGACCCG GACGAGACCA CCGACTGGAT CGACTCCTTC GACTCACTCG TCGGGGAACG CGGCCGCGAA CGTGCCCGCT ACGTCATGCT GCGCCTCCTC GAACGTGCGC GGGAGATGCA GGTGGGCGTG CCCGCCCTGC GGAGCACCGA CTACATCAAC ACCATCCCGC CCGAGCGTGA GCCGTGGTTC CCCGGGGACG AGGAGACCGA GCGTCGGATC CGCGCGTTCA TCCGCTGGAA CGCCGCGGTC ATGGTGTCGA GCGCCAACCG CAAGGGCCTC GAGGTCGGTG GTCACATCGC CACCTACCAG TCCTCCGCGA GCCTCTACGA GGTCGGCTTC AACCACTTCT TCCGCGGCAA GGACCACCCC GGTGGCGGCG ACCAGGTCTT CATCCAGGGC CACGCCTCCC CCGGCATCTA CGCTCGCGCG TTCCTCGAGG GCCGGCTGAC CGAGACCCAG CTCTCCCGGT TCCGCCAGGA GGTCCAGCAC GGCCCGCACG CCGGCCTCTC GTCGTACCCC CACCCGCGCC TGATGCCGGA GTTCTGGGAG TTCCCGACGG TGTCGATGGG GCTGACCTCG CTCAACTCGA TCTACCAGGC ACGGTTCAAC CGCTACCTGC ACAACCGCGG CATCAAGGAC ACCGCCCAGC AGCGCGTGTG GGCGTTCCTC GGCGACGGCG AGATGGGCGA GCCGGAGTCG CTCGGCGCGA TCCGGGTCGC CGCCCGCGAG GAGCTGGACA ACCTGGTCTG GGTCGTGAAC TGCAACCTGC AGCAGCTCGA CGGACCGGTG ACCGGCAACG GCAAGATCAT CCAGGAGCTC GAGGCCAACT TCCGGGGCGC CGGCTGGAAC GTGATCAAGG TCGTGTGGGG CCGCGAGTGG GACCAGCTGC TGGCCCGCGA CGTCGACGGC GTCCTGGTCA ACCGGATGAA CTCCACGCCG GACGGCGCGT TCCAGACCTA CTCGGTCGAG TCCGGCGAGT ACGTCCGCGA GAGCTTCTTC GGGGCCGACC CGCGGCTGCG CAAGATGGTC GAGCACATGA GCGACGACCA GATCCGCAAG CTGCCGCGCG GTGGCCACGA CTACCGCAAG GTGTACGCCG CCTTCGACGC GGCCACCAAG CACGTCGGCC AGCCGACCGT GATCCTGGCC AAGACCGTCA AGGGCTGGAC GATCGACGCG CTGGAGGGCC GCAACGCGAC CCACCAGATG AAGAAGCTGA CCCAGGACGA CCTGAAGAAG TTCCGGGACC GGCTCTACCT CCCGATCTCC GACCGCGACC TGGAGCGCAC CTACGAGGAG ACCGGCGCCG CACCGTTCTT CCACCCCGGC ATGGAGTCCC CCGAGATCGA GTACATGCTC GAGCGTCGCC GCCAGCTCGG CGGGTCGATC CCCCAGCGGG TCCAGCGCGC CAAGCCGCTC CAGCTGCCGG GCGACGCGAT GTACGCCGAC CTCAAGCAGG GCTCGGGCAA GCACGCCATC GCCTCCACGA TGGCGCTGGT GCGGCTGCTC AAGGACTGGA TGAAGGACCC CGAGATCGGC AAGCGGATCG TGCCGATCGC CCCGGACGAG TACCGCACGT TCGGCATGGA CTCGATGTTC CCGAGCGCGA AGGTGTACAA CCCGGGCGGC CAGCAGTACG AGTCGGTCGA CCGGAAACTG TTGCTCTCCT ACAAGGAGTC CGCCCAGGGG CAGCTGCTCC ACGAGGGCAT CTCGGAGGCC GGTGGCGTCG CGTCCGCGAC CGCCGCGGGC TCGGCGTACT CCACGCACGG CGAGCACATG ATCCCGTTCT TCATCTTCTA CTCGATGTTC GGGTTCCAGC GCACCGGCGA CTCGATCTGG GCGATGAGCG ACCAGCTCGC CCGCGGCTTC CTGATCGGCG CCACCGCCGG CCGGACCACG CTGACCGGCG AGGGCCTGCA GCACGCCGAC GGCCACTCGC CGCTGCTCGC GGCGTCCAAC CCGGCGGTCG TGCACTACGA CCCGGCGTTC GCCTACGAGA TCAGCCACGT GATGCGCTCC GGCCTGGAGC GGATGTACGG CCCGGACGCC GAGGACGTGA TCTTCTACAT CACCGTCTAC AACGAGCCGG TGCAGCAGCC GGCCGAGCCC GAGGACGTCG ACGTCGAGGG CATCCTCAAG GGCATCCACC ACGTCTCCTC CGCGGACGGC GAGGGACCGC GGGCACAGCT GCTCGCCTCC GGTGTCGGGT TCCCGTGGAT CAAGGAGGCC CAGCAGATCC TGGCCGACGA GTGGGGCGTG CGCGCCGACA CCTGGTCGGT CACCTCGTGG AACGAGCTGG CCCGGGACGG GGCGGCCGCC GAGGAGTGGA ACCTGCTGCA CCCGGGCGAG ACCCCGCGCA CGGCGTACGT CACGGACAAG CTGGCCGGCG CGTCCGGTCC GGTCGTGGCG GTCTCGGACT ACATGCGCGC GGTGCCGCTG CAGATCGCCC GCTGGGTCCC GGCCGACTAC CGCGTGCTCG GCGCCGACGG CTACGGCTTC GCCGACACCC GGCCCGCCGC CCGCCGGTTC TTCCACATCG ACGCCCAGTC GGTGGTCGTG CAGACCCTGC AGGCCCTCGC CGACGCCGGC CAGATCGACC GCTCGAAGGT CGAGGAGGCG TTCGCGAAGT ACCGCATCGA CGACCCCACC GCGGTCGCCG GCGTCAAGCA GGAAGGTGGC GACGCCTGA
|
Protein sequence | MTEDPTPASG PTPGTDTKRS GAIPTVIHEG LPTQLPDTDP DETTDWIDSF DSLVGERGRE RARYVMLRLL ERAREMQVGV PALRSTDYIN TIPPEREPWF PGDEETERRI RAFIRWNAAV MVSSANRKGL EVGGHIATYQ SSASLYEVGF NHFFRGKDHP GGGDQVFIQG HASPGIYARA FLEGRLTETQ LSRFRQEVQH GPHAGLSSYP HPRLMPEFWE FPTVSMGLTS LNSIYQARFN RYLHNRGIKD TAQQRVWAFL GDGEMGEPES LGAIRVAARE ELDNLVWVVN CNLQQLDGPV TGNGKIIQEL EANFRGAGWN VIKVVWGREW DQLLARDVDG VLVNRMNSTP DGAFQTYSVE SGEYVRESFF GADPRLRKMV EHMSDDQIRK LPRGGHDYRK VYAAFDAATK HVGQPTVILA KTVKGWTIDA LEGRNATHQM KKLTQDDLKK FRDRLYLPIS DRDLERTYEE TGAAPFFHPG MESPEIEYML ERRRQLGGSI PQRVQRAKPL QLPGDAMYAD LKQGSGKHAI ASTMALVRLL KDWMKDPEIG KRIVPIAPDE YRTFGMDSMF PSAKVYNPGG QQYESVDRKL LLSYKESAQG QLLHEGISEA GGVASATAAG SAYSTHGEHM IPFFIFYSMF GFQRTGDSIW AMSDQLARGF LIGATAGRTT LTGEGLQHAD GHSPLLAASN PAVVHYDPAF AYEISHVMRS GLERMYGPDA EDVIFYITVY NEPVQQPAEP EDVDVEGILK GIHHVSSADG EGPRAQLLAS GVGFPWIKEA QQILADEWGV RADTWSVTSW NELARDGAAA EEWNLLHPGE TPRTAYVTDK LAGASGPVVA VSDYMRAVPL QIARWVPADY RVLGADGYGF ADTRPAARRF FHIDAQSVVV QTLQALADAG QIDRSKVEEA FAKYRIDDPT AVAGVKQEGG DA
|
| |