Gene Dgeo_2339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2339 
Symbol 
ID4057192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp2459673 
End bp2460695 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content67% 
IMG OID641231388 
Producttransketolase, central region 
Protein accessionYP_605800 
Protein GI94986436 
COG category[C] Energy production and conversion 
COG ID[COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAA CCCAATCCAA ACCGCCGAGC ACGAGCAGCG CAGGGGAGAC GCGCACCCTC 
ACACTGATCC AGGCGATCAA CGAGGCGATG CAGGAGGAGC TGGCCCGCGA CGAGCGTGTG
GTCGTCTTCG GGGAAGACGT CGGCGCGCGT GGTGGCGTGT TCCTGGCCAC AGCGGGCTTG
CAGGAACAGT TCGGCAAGAA GCGCGTCTTC GATACGCCGC TCTCAGAGGC GAGCATCGTG
GGCGCGGCGG TCGGCATGGC GGTGCGGGGC CTGCGCCCCA TCGCGGAAAT CCAGTTTGCC
GACTACATGG GACCGGGCTT CGACCAGATC ATCAGCCAGG CCGCCAAGAT CCGCTACCGC
AGCGGCGGGC AGTTCACGGC GCCCCTGGTC ATCCGCACCC CCTCAGGGGG CGGCGTGAAG
GGCGGACACC ACCACTCCCA GAGTCCTGAG AGCTACTTCA CCCACACGCC CGGCCTCAAG
GTCGTGATGC CCAGTACCCC CTACGACGCC AAAGGCCTGC TTAAGAGCGC CGTGCGCGGG
GGTGACCCGG TGATCTACTT CGAGCCCAAG CGCCTCTACC GCGCGGCAAA GGGTGAGGTG
CCGACTCAGG ACTACACGGT GGAACTCGGC AAGGGTGCCG TGCGCCGCGA GGGCAGCGAC
CTCACGATCA TCGGCTACGG CGGCGTGATG CCAGACGCCG AAAAGGCCGC GCAGGCGCTC
GCGACCGAGG GGGTGCAGGC TGAGGTCATT GACCTGCGCT CGCTGGTGCC CTGGGACCGC
GACCTCGTGC TCACCAGTGT GGCCAAGACC GGCCGCGCCG TGCTGGTGAG CGAGGCCCCG
CGCATCTCCA ACTTTATGGG CGAGGTGGCC TACGTGATCC AGGAGCAGCT CTTCGACCAG
CTCCTCGCAC CGGTGATGCA GGTGGCCGGC TTCGACACAC CGTACCCCTA CGTGCAGGAC
AAGGTGTATC TCCCCGGCGC CAACCGCATT GCCGCAGCGT GCGTGCGGGC ACTGAACTAC
TGA
 
Protein sequence
MTATQSKPPS TSSAGETRTL TLIQAINEAM QEELARDERV VVFGEDVGAR GGVFLATAGL 
QEQFGKKRVF DTPLSEASIV GAAVGMAVRG LRPIAEIQFA DYMGPGFDQI ISQAAKIRYR
SGGQFTAPLV IRTPSGGGVK GGHHHSQSPE SYFTHTPGLK VVMPSTPYDA KGLLKSAVRG
GDPVIYFEPK RLYRAAKGEV PTQDYTVELG KGAVRREGSD LTIIGYGGVM PDAEKAAQAL
ATEGVQAEVI DLRSLVPWDR DLVLTSVAKT GRAVLVSEAP RISNFMGEVA YVIQEQLFDQ
LLAPVMQVAG FDTPYPYVQD KVYLPGANRI AAACVRALNY