Gene Dgeo_2416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_2416 
Symbol 
ID4073644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008010 
Strand
Start bp70652 
End bp72205 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content67% 
IMG OID641228537 
Product5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 
Protein accessionYP_593924 
Protein GI94971884 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR02299] 5-carboxymethyl-2-hydroxymuconate semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.562889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGA TCTCTCATCC CAACCATGAG CTGGCCCGGC AACTGCGGGA AAGCAGGCTG 
AAGGGCGGCC TGAAGCACTT CATCGGCGGC GAGTGGGTGG ACTCGCTGAG CGGCGAGACC
TTCGAGACGC ACACCCCCAC CGACAATTCG GTGCTGGCGA CGGTGGCGAG CGGCGACGCG
GCGGACATCG ACCGGGCCGC CCGCGCGGCC TCCGAAGCCT TCCAGACTTG GCGGGAGGTG
AGCGGGATGG AACGCCGGAG GCTCCTGCAC CGCGTGGCCG ACCTGATCGA GAAGCGTTCG
CAGGAAATCG CCCTGCTGGA GAGCGTGGAT ACTGGCCAAG CCATCCGCTT CATGAAGTCG
GCGGCGGCGC GGGGCGCGGA GAACTTCCGT TTCTACGCGG ACCGCGCGCC CGGTGCGGCC
GACGGGCAGA GCCTCCCCGC GCCCGGGTTC CTGAACTACA CGTTGCGCCA GCCCATCGGC
CCAGTCGGGG TGATCACGCC CTGGAACACG CCCTTCATGC TGTCCACCTG GAAGATCGCC
CCGGCCCTCG CCGCAGGCTG CACCGTCGTC CACAAGCCCG CCGAATGGAG TCCGGTCACC
GCCACGCTGC TCGCGGAGAT CATGGACGAG GCGGGCATTC CCAAGGGCGT GGTGAACCTC
GTTCACGGCT TCGGAGAGAC GGCAGGTAAG GCATTGACCG AGCATCCGCT CATCCAGGCG
ATCGCCTTTG TGGGCGAGAC GACCACCGGC AGCCACATCA TGCGGCAGGG CGCGGACACG
CTGAAGCGCG TGCATTTCGA ACTGGGCGGC AAGAACCCGG TCGTGGTGTT CGACGACGCG
GACCTCGACC GGGCACTCGA CGCCGTCGTC TTTATGATCT ACAGCCTGAA CGGCGAGCGC
TGCACCTCTT CCAGCCGCGT GCTCATCCAG GAAGGCATCT ACGACGAGTT CACCGCCCGC
ATTGCCGAGC GCGCGCGGAA CATCCGCGTC GGCGATCCGC TCGACCCGAC CACCGAGATT
GGCCCGCTGG TCCATCCCCG CCATCTCGAG AAGGTGATGG GCTACTTCGA CAGGGCGCGT
GAAGAAGGCG CGACCGTCGC GGCGGGCGGC GAGCGCGTGG GCGAAGGGGG CAACTACGTC
GCCGCCACCC TGTTCACTGG GGCCAGAAAC GACATGCGAA TTGCCCAAGA GGAAATCTTC
GGCCCAGTCC TGACCGCTAT TCCCTTCCGC GACGAGGCCG AAGCCCTTCA GCTCGCCAAC
GATGTGAAGT ACGGCCTCGC CGGGTACCTG TGGACGAATG ACCTCACCCG CGCGCACCGC
TTCGCGCAGG CTCTCGAGGC CGGGATGGTC TGGGTGAACA GTGAAAACGT GCGCCACCTG
CCGACCCCCT TCGGGGGCAT GAAGGCCAGC GGCATTGGCC GCGACGGCGG CGACTACTCC
TTTGATTTCT ACATGGAGAC AAAAAACATC GCGATTTCGC TGGGAACGCA CAGGGCGGCG
CAGTTGGGGG TAGGGCAGCC GGTGAAGGTG GACCGGCGGG AGGTGGAGGG ATGA
 
Protein sequence
MTTISHPNHE LARQLRESRL KGGLKHFIGG EWVDSLSGET FETHTPTDNS VLATVASGDA 
ADIDRAARAA SEAFQTWREV SGMERRRLLH RVADLIEKRS QEIALLESVD TGQAIRFMKS
AAARGAENFR FYADRAPGAA DGQSLPAPGF LNYTLRQPIG PVGVITPWNT PFMLSTWKIA
PALAAGCTVV HKPAEWSPVT ATLLAEIMDE AGIPKGVVNL VHGFGETAGK ALTEHPLIQA
IAFVGETTTG SHIMRQGADT LKRVHFELGG KNPVVVFDDA DLDRALDAVV FMIYSLNGER
CTSSSRVLIQ EGIYDEFTAR IAERARNIRV GDPLDPTTEI GPLVHPRHLE KVMGYFDRAR
EEGATVAAGG ERVGEGGNYV AATLFTGARN DMRIAQEEIF GPVLTAIPFR DEAEALQLAN
DVKYGLAGYL WTNDLTRAHR FAQALEAGMV WVNSENVRHL PTPFGGMKAS GIGRDGGDYS
FDFYMETKNI AISLGTHRAA QLGVGQPVKV DRREVEG