Gene Noca_1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1993 
SymbolaceE 
ID4598308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2130565 
End bp2133363 
Gene Length2799 bp 
Protein Length932 aa 
Translation table11 
GC content70% 
IMG OID639776596 
Productpyruvate dehydrogenase subunit E1 
Protein accessionYP_923190 
Protein GI119716225 
COG category[C] Energy production and conversion 
COG ID[COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component 
TIGRFAM ID[TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.883092 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACCGAAG ACCCAACCCC TGCTTCTGGC CCCACCCCTG GCACGGACAC CAAGCGGAGC 
GGCGCGATCC CGACCGTCAT CCACGAAGGA CTGCCGACCC AGCTGCCCGA CACCGACCCG
GACGAGACCA CCGACTGGAT CGACTCCTTC GACTCACTCG TCGGGGAACG CGGCCGCGAA
CGTGCCCGCT ACGTCATGCT GCGCCTCCTC GAACGTGCGC GGGAGATGCA GGTGGGCGTG
CCCGCCCTGC GGAGCACCGA CTACATCAAC ACCATCCCGC CCGAGCGTGA GCCGTGGTTC
CCCGGGGACG AGGAGACCGA GCGTCGGATC CGCGCGTTCA TCCGCTGGAA CGCCGCGGTC
ATGGTGTCGA GCGCCAACCG CAAGGGCCTC GAGGTCGGTG GTCACATCGC CACCTACCAG
TCCTCCGCGA GCCTCTACGA GGTCGGCTTC AACCACTTCT TCCGCGGCAA GGACCACCCC
GGTGGCGGCG ACCAGGTCTT CATCCAGGGC CACGCCTCCC CCGGCATCTA CGCTCGCGCG
TTCCTCGAGG GCCGGCTGAC CGAGACCCAG CTCTCCCGGT TCCGCCAGGA GGTCCAGCAC
GGCCCGCACG CCGGCCTCTC GTCGTACCCC CACCCGCGCC TGATGCCGGA GTTCTGGGAG
TTCCCGACGG TGTCGATGGG GCTGACCTCG CTCAACTCGA TCTACCAGGC ACGGTTCAAC
CGCTACCTGC ACAACCGCGG CATCAAGGAC ACCGCCCAGC AGCGCGTGTG GGCGTTCCTC
GGCGACGGCG AGATGGGCGA GCCGGAGTCG CTCGGCGCGA TCCGGGTCGC CGCCCGCGAG
GAGCTGGACA ACCTGGTCTG GGTCGTGAAC TGCAACCTGC AGCAGCTCGA CGGACCGGTG
ACCGGCAACG GCAAGATCAT CCAGGAGCTC GAGGCCAACT TCCGGGGCGC CGGCTGGAAC
GTGATCAAGG TCGTGTGGGG CCGCGAGTGG GACCAGCTGC TGGCCCGCGA CGTCGACGGC
GTCCTGGTCA ACCGGATGAA CTCCACGCCG GACGGCGCGT TCCAGACCTA CTCGGTCGAG
TCCGGCGAGT ACGTCCGCGA GAGCTTCTTC GGGGCCGACC CGCGGCTGCG CAAGATGGTC
GAGCACATGA GCGACGACCA GATCCGCAAG CTGCCGCGCG GTGGCCACGA CTACCGCAAG
GTGTACGCCG CCTTCGACGC GGCCACCAAG CACGTCGGCC AGCCGACCGT GATCCTGGCC
AAGACCGTCA AGGGCTGGAC GATCGACGCG CTGGAGGGCC GCAACGCGAC CCACCAGATG
AAGAAGCTGA CCCAGGACGA CCTGAAGAAG TTCCGGGACC GGCTCTACCT CCCGATCTCC
GACCGCGACC TGGAGCGCAC CTACGAGGAG ACCGGCGCCG CACCGTTCTT CCACCCCGGC
ATGGAGTCCC CCGAGATCGA GTACATGCTC GAGCGTCGCC GCCAGCTCGG CGGGTCGATC
CCCCAGCGGG TCCAGCGCGC CAAGCCGCTC CAGCTGCCGG GCGACGCGAT GTACGCCGAC
CTCAAGCAGG GCTCGGGCAA GCACGCCATC GCCTCCACGA TGGCGCTGGT GCGGCTGCTC
AAGGACTGGA TGAAGGACCC CGAGATCGGC AAGCGGATCG TGCCGATCGC CCCGGACGAG
TACCGCACGT TCGGCATGGA CTCGATGTTC CCGAGCGCGA AGGTGTACAA CCCGGGCGGC
CAGCAGTACG AGTCGGTCGA CCGGAAACTG TTGCTCTCCT ACAAGGAGTC CGCCCAGGGG
CAGCTGCTCC ACGAGGGCAT CTCGGAGGCC GGTGGCGTCG CGTCCGCGAC CGCCGCGGGC
TCGGCGTACT CCACGCACGG CGAGCACATG ATCCCGTTCT TCATCTTCTA CTCGATGTTC
GGGTTCCAGC GCACCGGCGA CTCGATCTGG GCGATGAGCG ACCAGCTCGC CCGCGGCTTC
CTGATCGGCG CCACCGCCGG CCGGACCACG CTGACCGGCG AGGGCCTGCA GCACGCCGAC
GGCCACTCGC CGCTGCTCGC GGCGTCCAAC CCGGCGGTCG TGCACTACGA CCCGGCGTTC
GCCTACGAGA TCAGCCACGT GATGCGCTCC GGCCTGGAGC GGATGTACGG CCCGGACGCC
GAGGACGTGA TCTTCTACAT CACCGTCTAC AACGAGCCGG TGCAGCAGCC GGCCGAGCCC
GAGGACGTCG ACGTCGAGGG CATCCTCAAG GGCATCCACC ACGTCTCCTC CGCGGACGGC
GAGGGACCGC GGGCACAGCT GCTCGCCTCC GGTGTCGGGT TCCCGTGGAT CAAGGAGGCC
CAGCAGATCC TGGCCGACGA GTGGGGCGTG CGCGCCGACA CCTGGTCGGT CACCTCGTGG
AACGAGCTGG CCCGGGACGG GGCGGCCGCC GAGGAGTGGA ACCTGCTGCA CCCGGGCGAG
ACCCCGCGCA CGGCGTACGT CACGGACAAG CTGGCCGGCG CGTCCGGTCC GGTCGTGGCG
GTCTCGGACT ACATGCGCGC GGTGCCGCTG CAGATCGCCC GCTGGGTCCC GGCCGACTAC
CGCGTGCTCG GCGCCGACGG CTACGGCTTC GCCGACACCC GGCCCGCCGC CCGCCGGTTC
TTCCACATCG ACGCCCAGTC GGTGGTCGTG CAGACCCTGC AGGCCCTCGC CGACGCCGGC
CAGATCGACC GCTCGAAGGT CGAGGAGGCG TTCGCGAAGT ACCGCATCGA CGACCCCACC
GCGGTCGCCG GCGTCAAGCA GGAAGGTGGC GACGCCTGA
 
Protein sequence
MTEDPTPASG PTPGTDTKRS GAIPTVIHEG LPTQLPDTDP DETTDWIDSF DSLVGERGRE 
RARYVMLRLL ERAREMQVGV PALRSTDYIN TIPPEREPWF PGDEETERRI RAFIRWNAAV
MVSSANRKGL EVGGHIATYQ SSASLYEVGF NHFFRGKDHP GGGDQVFIQG HASPGIYARA
FLEGRLTETQ LSRFRQEVQH GPHAGLSSYP HPRLMPEFWE FPTVSMGLTS LNSIYQARFN
RYLHNRGIKD TAQQRVWAFL GDGEMGEPES LGAIRVAARE ELDNLVWVVN CNLQQLDGPV
TGNGKIIQEL EANFRGAGWN VIKVVWGREW DQLLARDVDG VLVNRMNSTP DGAFQTYSVE
SGEYVRESFF GADPRLRKMV EHMSDDQIRK LPRGGHDYRK VYAAFDAATK HVGQPTVILA
KTVKGWTIDA LEGRNATHQM KKLTQDDLKK FRDRLYLPIS DRDLERTYEE TGAAPFFHPG
MESPEIEYML ERRRQLGGSI PQRVQRAKPL QLPGDAMYAD LKQGSGKHAI ASTMALVRLL
KDWMKDPEIG KRIVPIAPDE YRTFGMDSMF PSAKVYNPGG QQYESVDRKL LLSYKESAQG
QLLHEGISEA GGVASATAAG SAYSTHGEHM IPFFIFYSMF GFQRTGDSIW AMSDQLARGF
LIGATAGRTT LTGEGLQHAD GHSPLLAASN PAVVHYDPAF AYEISHVMRS GLERMYGPDA
EDVIFYITVY NEPVQQPAEP EDVDVEGILK GIHHVSSADG EGPRAQLLAS GVGFPWIKEA
QQILADEWGV RADTWSVTSW NELARDGAAA EEWNLLHPGE TPRTAYVTDK LAGASGPVVA
VSDYMRAVPL QIARWVPADY RVLGADGYGF ADTRPAARRF FHIDAQSVVV QTLQALADAG
QIDRSKVEEA FAKYRIDDPT AVAGVKQEGG DA