Gene Dgeo_2341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2341 
Symbol 
ID4057194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2460963 
End bp2462513 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content65% 
IMG OID641231390 
Productbranched-chain alpha-keto acid dehydrogenase subunit E2 
Protein accessionYP_605802 
Protein GI94986438 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.968697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGAAG TGCTGCTGCC CGAGCTTGCC GAAAGCGTCG TTGAGGGCGA AATCCTGAAG 
TGGCTGGTAC AGGAGGGCGA GACGGTCGCC CTGGAACAGC CGCTCTGTGA AGTGATGACC
GACAAGGTCA CCGTCGAACT CCCCAGCCCC TATGCCGGAG TGCTGCAAAA GCGGCTGGCA
CAGGAAGGGG ACGTGGTGGC TGTGCATGCG CCCATTGCCC TGATTGCCGA AGCGGGCGAG
GCCAGCGGCC GGAAAGGGGA GAGCACGCCG GAAGCAGCGG CCAGCACGGC TCCCAGCGCC
ATTCAGGCCA TCCAGGAGAC GGCCGAGAAC CCAGCCACTA CGGGCGCTCA ACTGCCCCCC
CAAGCTGCCG AGGAACGTGA GCAGGTTGGG GGCAGCATCG TGGAGGCCGG ACACGTGGCG
GCCAAGAGCG ACGATGACGC CAGCCTCTTC AAGGCCTTTG CCTCCGAAGA GCCGGTGAGG
GTGCAGGGGC TGGGCAGCCA GAGGAGCGGC GTGGCTACCC TGACCCGACC GGCACCCACG
AGCGCCGCAG CCCGCCAGGA GGGCCGTGTG CTGGCAGTCC CAGCTGCGCG CCAGCTCGCC
CGCGAACTCG GGGTGGATCT TGCACAGGTG CCGGGCAGTG GCCCCAATGG GCGGGTCCGA
GTGCAGGATG TGACGGCTTA TCTCCAACAG CAGGACGGTA GGGCGGCAAA TGTTCCAGCT
CCGGCAGCCC AGGCACCACT CACGCCCGTG CCTCAATCGG CAACCACCCC GGCAGCAACC
ACACGGGGCA CCGGTGGAAT GCCGGTTCCC CCCGTGCAGT ACCGTACCCC CAAGGGCTAT
GAGCATCTGG AAGACCGGGT GCCCCTGCGC GGGATGCGCC GCGCGATCTC GAACCAGATG
CAGGCCAGCC ACCTCTACAC CGTCCGCACC CTGACCGTGG ATGAGGTGAA TCTGTCCAAA
CTCGTGGCTT TCCGCAGCCG CGTCAAGGAT GAAGCTCAGG CAGCGGGCGT AAAACTCAGC
TACCTGCCCT TCATCTTCAA GGCGGTGGCA GTTGCCCTGC GCAAGTACCC CAGCCTGAAT
TCCTCCTTCG ACGAGGCGAC AGGCGAGATC GTGCTGAAGC GCTACTTCAA TATCGGCATG
GCGGTTGCCA CCGATGCAGG CCTGACCGTG CCGGTGCTGC GCGACATGAA CCGCAAGAGC
ATCTTCGAGC TGGCGCGGGA GGTAAGTGAC CTGGCCGCCC GCGCGCAGGC TGGTAAGCTC
ACGCCCGACG AACTCGCGGG CAGTACCTTC TCCGTCACCA ACATTGGCTC GATCGGGGCG
CTGTTCTCCT TCCCCATCAT CAACGTGCCG GACGCCGCGA TCCTGGGGGT GCACTCCATT
CAGAAACGGC CTATTGTGAA TGAACGGGAT GAGATCGTCG CGGCACACAT GATGTACCTC
TCGCTCAGCT TTGACCACCG TCTGGTGGAC GGTGCCGAGG CGGCGCGCTT CTGCAAGGAA
GTGATCCGTC TGCTCGAAAA CCCTGACCGC CTGATGTTGG AAGCGATGTA G
 
Protein sequence
MKEVLLPELA ESVVEGEILK WLVQEGETVA LEQPLCEVMT DKVTVELPSP YAGVLQKRLA 
QEGDVVAVHA PIALIAEAGE ASGRKGESTP EAAASTAPSA IQAIQETAEN PATTGAQLPP
QAAEEREQVG GSIVEAGHVA AKSDDDASLF KAFASEEPVR VQGLGSQRSG VATLTRPAPT
SAAARQEGRV LAVPAARQLA RELGVDLAQV PGSGPNGRVR VQDVTAYLQQ QDGRAANVPA
PAAQAPLTPV PQSATTPAAT TRGTGGMPVP PVQYRTPKGY EHLEDRVPLR GMRRAISNQM
QASHLYTVRT LTVDEVNLSK LVAFRSRVKD EAQAAGVKLS YLPFIFKAVA VALRKYPSLN
SSFDEATGEI VLKRYFNIGM AVATDAGLTV PVLRDMNRKS IFELAREVSD LAARAQAGKL
TPDELAGSTF SVTNIGSIGA LFSFPIINVP DAAILGVHSI QKRPIVNERD EIVAAHMMYL
SLSFDHRLVD GAEAARFCKE VIRLLENPDR LMLEAM