Gene Dgeo_1887 details

Gene Information

Locus tag: Dgeo_1887
Symbol: aceE
ID: 4059013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 1983610
End bp: 1986318
Gene Length: 2709 bp
Protein Length: 902 aa
Translation table: 11
GC content: 65%
IMG OID: 641230915
Product: pyruvate dehydrogenase subunit E1
Protein accession: YP_605351
Protein GI: 94985987
COG category: [C] Energy production and conversion
COG ID: [COG2609] Pyruvate dehydrogenase complex, dehydrogenase (E1) component
TIGRFAM ID: [TIGR00759] pyruvate dehydrogenase E1 component, homodimeric type
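
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1983610
end_bp = 1986318

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in one stop codon,
# which does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (2709 bp) and Protein Length (902 aa).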


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCACCGC GCGCCGCCCT GTCCCCACAG GAGCGTGAAC AGCTCAATTC TGTGGAAACG 
CAGGAGTGGC TCGACTCGCT CGCCTACGTT CTGGCAGACG CAGGCGACGA CCGCGCCGCG
CAGCTGTTGG AAGAGCTGGA CCACTACGCC TACTTCCACG GCGCCCCCAT CCTCTTTAAG
CAGAACACGC CCTACATCAA CACGATCGAC GTAGAGGCGC AGCCCGAGTA TCCCGGCAAC
CTGGAGCTGG AGCGCAAGAT TCGCAACGCG GTGCGCTGGA ACGCCGTCGT GATGGTGCTG
CGGGCCAACA AGCGGGCCGA AGGCATCGGC GGGCACCTCG CGACCTACGC GAGCAGTGCG
GAGCTGTACG AGGTGGGCTT TAACCACTTT TTCCGGGGGC ACGGCGCGGG GGTGAACCGC
GACCTCATCT TCTTCCAGGG TCACGCCAGT CCCGGCATCT ATGCCCGCTC CTTCCTGGAG
GGCCGCATCA GCGAGGCGCA GATGAACAAC TTCCGCCGGG AACTCAGCCC CGATGGTCCC
GGCCTATCGA GTTACCCGCA TCCCTGGCTG ATGCCGCACT ACTGGGAGTT TCCGACCGTC
AGCATGGGTC TCGGGCCCAT CCAGGCGATC TACCAGGCGC GGTACATCCG GTACCTCGAA
AACCGCGGTC TCAAGGCGAA GGGCAACGCG AAGGTCTGGG CCTTTTTGGG GGACGGCGAG
ATGGACGAGC CGCAGTCGGT GGGTGCGCTG CGCTTTGCCG CCTACGAGAA CCTGGACAAT
CTCGTCTTTG TGCTCAACGC GAACCTGCAG CGCCTCGACG GCCCGGTGCG CGCCAACTCC
AAGGTGATCC AGGAGTTCGA GGCCCTGTTC CGCGGGGCGG GCTGGAACGT GATCAAGGTC
GTGTGGGACT CCAAGTGGGA CGAGCTGCTC GCCAAGGACT ACAACGGCGC GATCGTCAAG
CGCTTCGAGG CGCTCGTGGA CGGCGAGTCG CAGCGCTACG CGGCCTTTGG TGGCAAGGAG
CTGCGCGAGA AGTTCTTCAA CACGCCCGAA CTTCAGCAGC TGATCGAAGG CTGGAGCGAC
GCCGACCTTG AACTGCTCAA CCGTGGCGGT CACGATGTCA AGAAGGTCTT TGCTGCCTAC
GACGCCGCCG TCAAGCACCG GGGCCAGCCC ACCGTCATCA TCGCCCGCAC CGTGAAGGGC
TACGGCCTGG GCGAGACAGC GCAGGCGCGC AACGTGGCCC ACCAGGTCAA GAAGCTGGAC
TTCCACGCGC TGAAGAACCT GCGCGACCTG CTTGAGCTGC CGCTGACTGA CGAGCAGGTC
GAACACCTGG AGTACTACAA CCCCGGCCCC GACAGCCCCG AGATTCGGTA CATGCTCGAG
CGCCGCGCGG CGCTGGGCGG CTTCGTGCCC GAGCGCCGGG TGGATTACCC GCGCCCCAGC
GTCCCCACCG GCGAGTTCTA CGAGGAATTT GCCGCCGGCA GCAAGGGCCG CGCCGTCAGC
ACCACGATGG CCGCCGTGCA GATCTTGAGC AAGCTGCTGC GCGACCCCGA GGTCGGCAAG
TACATCGTGC CGATTGTGCC CGATGAGGCG CGCACCTTTG GGATGGACGC CCTGGTGCCA
CGCATCGGCA TCTACTCGCC GCGTGGCCAG ACCTACACCC CAGTCGACTC CGGCAGCCTG
ATGGTCTACA AGGAGAGCAC CGACGGCCAG ATGCTGGAAG AGGGCATCAC CGAAGACGGG
GCGATGTCGT CCTGGATTGC GGCCGCAACC GCCTACGCCA ACCACGGCGT TCCGACCATC
CCCTTCTACG TCTTCTACTC GATGTTTGGC ATGCAGCGCA TCGGCGACCT GGTGTGGGCT
GCGGCCGACC AGCGTGCGCG CGGCTTCCTT TTCGGCGCGA CGGCGGGCCG CACCACGCTG
GCGGGTGAGG GATTGCAGCA CCAGGACGGC AACAGCCTGC TGCAGGCCTA TGTGGTGCCC
ACCCTCAAGG TGTATGACCC AGCCTTTGCC TACGAACTCG CGGTGATTGT CGAACACGGC
ATCCAGCGGA TGTACGTGGA CAACATCGAC GAGTTTTATT ACGTCACCAT CGACAACGAG
AACGAGGTGC AGCCGCCCAT GCCGGAGGAC GGCCGCAGCC ACGACGAGAT TCGCCAGGGC
ATCATCCGGG GACTGTACCG CTTTCAGCAG AGCGGCAACA AAAAGGCCAA GCTGCGCGCT
CAGCTTCTCG CCAGCGGCCC CGCGATGGGC GCAGCGCTCG AAGCGGTGCA GAAGCTGGAA
GCCTACGGCG TGGCCGCCGA CGTGTGGAGC GTGACGAGTT ACAAGGAACT CCACCAGGAC
GCCCTGCTGA CCCAGCGTTA CAACATGCTG CACCCCACCG CAGAACCGCG CGTCTCCTAC
GTGGCCTCTC AGCTCAGCCA GGAGAACGCT CCCGGCGTGC TGGTTTCAGT GAGTGATTAC
GTGAAACTGG GCGCCGACGG CCTGAACGGA CACCTCGATC GCAAGCTCTG GGTGCTGGGC
ACCGACGGCT TTGGCCGCTC GGAAGACCGC TCCGAACTGC GTGACTTCTT CGAGGTGGAC
GCCCGCTACG TGACCCTCGC CACCCTCTAC GCCCTCCAGC GCGAAGGCAA GCTCAAGGGA
GACGTGGTGG CGCGGGCCAT TTCCGAGCTT GGCATCGACC CGGAACGCGA GGCGCCAGTT
TTGCGTTAA
 
Protein sequence
MPPRAALSPQ EREQLNSVET QEWLDSLAYV LADAGDDRAA QLLEELDHYA YFHGAPILFK 
QNTPYINTID VEAQPEYPGN LELERKIRNA VRWNAVVMVL RANKRAEGIG GHLATYASSA
ELYEVGFNHF FRGHGAGVNR DLIFFQGHAS PGIYARSFLE GRISEAQMNN FRRELSPDGP
GLSSYPHPWL MPHYWEFPTV SMGLGPIQAI YQARYIRYLE NRGLKAKGNA KVWAFLGDGE
MDEPQSVGAL RFAAYENLDN LVFVLNANLQ RLDGPVRANS KVIQEFEALF RGAGWNVIKV
VWDSKWDELL AKDYNGAIVK RFEALVDGES QRYAAFGGKE LREKFFNTPE LQQLIEGWSD
ADLELLNRGG HDVKKVFAAY DAAVKHRGQP TVIIARTVKG YGLGETAQAR NVAHQVKKLD
FHALKNLRDL LELPLTDEQV EHLEYYNPGP DSPEIRYMLE RRAALGGFVP ERRVDYPRPS
VPTGEFYEEF AAGSKGRAVS TTMAAVQILS KLLRDPEVGK YIVPIVPDEA RTFGMDALVP
RIGIYSPRGQ TYTPVDSGSL MVYKESTDGQ MLEEGITEDG AMSSWIAAAT AYANHGVPTI
PFYVFYSMFG MQRIGDLVWA AADQRARGFL FGATAGRTTL AGEGLQHQDG NSLLQAYVVP
TLKVYDPAFA YELAVIVEHG IQRMYVDNID EFYYVTIDNE NEVQPPMPED GRSHDEIRQG
IIRGLYRFQQ SGNKKAKLRA QLLASGPAMG AALEAVQKLE AYGVAADVWS VTSYKELHQD
ALLTQRYNML HPTAEPRVSY VASQLSQENA PGVLVSVSDY VKLGADGLNG HLDRKLWVLG
TDGFGRSEDR SELRDFFEVD ARYVTLATLY ALQREGKLKG DVVARAISEL GIDPEREAPV
LR
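
Both sequences are printed in space-separated blocks of ten characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to strip that formatting, split a nucleotide sequence into codons, and compute a GC fraction. The helper names (`clean_sequence`, `split_codons`, `gc_fraction`) are hypothetical, and the inputs are short excerpts copied from the start and end of the gene sequence, not the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def split_codons(seq):
    """Split a nucleotide sequence into consecutive 3-base codons."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence (30 bases, 10 codons).
head_codons = split_codons(clean_sequence("GTGCCACCGC GCGCCGCCCT GTCCCCACAG"))

# Final line of the gene sequence (9 bases, 3 codons).
tail = clean_sequence("TTGCGTTAA")
tail_codons = split_codons(tail)

print(head_codons[0])     # first codon of the gene
print(tail_codons)        # last three codons of the gene
print(gc_fraction(tail))  # GC fraction of this short excerpt only
```

The first codon is GTG, an alternative start codon under translation table 11, which is consistent with the protein sequence beginning with M. The last three codons are TTG, CGT and TAA, consistent with the protein ending in LR followed by a stop.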