Gene Dret_0009 details


Gene Information

Locus tag: Dret_0009
Symbol:
ID: 8417811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfohalobium retbaense DSM 5692
Kingdom: Bacteria
Replicon accession: NC_013223
Strand:
Start bp: 8770
End bp: 10440
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 62%
IMG OID: 645036572
Product: dihydroxy-acid dehydratase
Protein accession: YP_003196889
Protein GI: 258404147
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
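
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(8770, 10440)
print(length)                  # 1671
print(protein_length(length))  # 556
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.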


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.961342
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCAGCA GGAAAATGAC CCAAGGTTTG GAGCGTGCCC CACATCGCTC ACTGCTTTAT 
GCGACCGGTA TGACCCGGGA AGAAATGGAC CGTCCGCTGA TCGGGGTGGT CAACGCCGCG
AATGATATTG TTCCGGGGCA TATCCATCTC GGGACCATCA GCCAGGCCGT GAAAGACGGC
GTGCGCATGG GCGGCGGGAC ACCGCTTTCG TTTCCGGCCA TCGGGGTTTG CGACGGGCTG
GCCATGGGGC ATGAAGGCAT GCGCATGAGT CTGCCGAGCC GGGAGATCAT CGCCGATTCC
ATCGAACTCA TGGCCACCGC TCATCCGTTT GACGGTCTGG TGCTCATCCC CAACTGCGAC
AAGATCGTCC CGGCTATGCT CATGGCCATG CTGCGGTTGA ATATCCCGGC GATCCTGGTC
AGTGGCGGGC CGATGCTGGC TGGGAAGTCT AAGGGGCAAG CGACGGATCT GATCAAGGTT
TTTGAAGGAG TTGGGCAGGT CAAGCGCGGC ACCATGCCCT CCGAAGAACT CGACGAGTTG
GAGCAATCGG CTTGCCCCGG TTGCGGCTCC TGTTCAGGGA TGTTTACTGC CAATTCCATG
AATTGTCTGG CTGAAGCCAT CGGCTTGGCC CTGCCCGGCA ACGGGACCAT TCCAGCAGTG
GCCGCCGGCC GAGTCCGTCT GGCCAAGGCC GCTGGGCAGC AGGTGCTCCA TCTGGTTGAA
AAGCAGATCA CACCACGCTC CATTGTCACG GCCGAGAGTG TGGCCAATGC AGTGACCGTG
GACATGGCTC TGGGGTGCTC GACCAATACG GTGCTCCATC TGCCGGCGAT TTTTCGGGAG
GCCAAGCTCG AACTCGGTCT GGACATCTTC GACGCCATCA GCAGCAAGAC CCCGAACCTC
TGCCGGTTGT CGCCTGCCGG TCCGGATCAT ATCGAAGACC TCGATCAGGT CGGCGGAATT
CCGGCGGTCA TGCAGGAACT CGCTTCCGGT GGGCTGTTGA ACACGGGTGT CGCCACTGTG
ACCGGTCGCA CCCTGCAAGA GAATCTGGCC TCTGTGCAGC GCCCCGGGCA CCAGGAGGTC
GTCAGATCGC TGGACAACCC GTATTCCGAG CGGGGAGGGA TCGCCATTTT GCGCGGCAAC
ATCGCGCCGG ACGGGGCGGT AGTCAAACAG TCCGCAGTCC ATCCGGACAT GATGGTCCGC
TCAGGACCGG CGAGAGTCTT TGACAGTGAG GAAGACGCGG TGGAGGCCAT TTTGGGCGAT
GCGATCTCGG CTGGCGATGT CATCGTCATC CGGTACGAGG GACCGAAAGG CGGCCCTGGG
ATGCGAGAGA TGCTTTCCCC CACCTCGGCT ATTGCCGGCA TGGGGTTGGA CGCTGATGTG
GGCTTGATCA CTGACGGACG TTTCAGCGGG GGCACACGCG GCGCGGCCAT CGGTCATGTC
TCTCCCGAAG CGGCTGAGGG CGGGGTTATT GGCCTGATCG AAGAAGGGGA TACCATCCAT
ATCAATATCC CGGAACGGCG GCTGCAACTC GAGGTCGAGG CAAGCGAACT TCAGCGCCGT
CGCGAGGTCT GGCAGCCGGT GCATAAAGAA GTCCAGTCCC CGGTTTTGCG CCGGTATCGG
AAGTTGGCCA CCTCCGCAGC CCAGGGAGCG GTGTACCGCG ACGACGAGTA A
 
Protein sequence
MRSRKMTQGL ERAPHRSLLY ATGMTREEMD RPLIGVVNAA NDIVPGHIHL GTISQAVKDG 
VRMGGGTPLS FPAIGVCDGL AMGHEGMRMS LPSREIIADS IELMATAHPF DGLVLIPNCD
KIVPAMLMAM LRLNIPAILV SGGPMLAGKS KGQATDLIKV FEGVGQVKRG TMPSEELDEL
EQSACPGCGS CSGMFTANSM NCLAEAIGLA LPGNGTIPAV AAGRVRLAKA AGQQVLHLVE
KQITPRSIVT AESVANAVTV DMALGCSTNT VLHLPAIFRE AKLELGLDIF DAISSKTPNL
CRLSPAGPDH IEDLDQVGGI PAVMQELASG GLLNTGVATV TGRTLQENLA SVQRPGHQEV
VRSLDNPYSE RGGIAILRGN IAPDGAVVKQ SAVHPDMMVR SGPARVFDSE EDAVEAILGD
AISAGDVIVI RYEGPKGGPG MREMLSPTSA IAGMGLDADV GLITDGRFSG GTRGAAIGHV
SPEAAEGGVI GLIEEGDTIH INIPERRLQL EVEASELQRR REVWQPVHKE VQSPVLRRYR
KLATSAAQGA VYRDDE
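
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a listing could be cleaned up and how the gene sequence could be checked against the protein sequence. The helper names are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares for sense and stop codons. Alternative start codons are not handled, and only unambiguous bases (A, C, G, T) are assumed.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; a trailing partial codon is ignored."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        b1, b2, b3 = dna[i:i + 3]
        index = 16 * BASES.index(b1) + 4 * BASES.index(b2) + BASES.index(b3)
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; returns 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 18 bases and last 12 bases of the gene sequence listed above.
print(translate(clean_sequence("ATGCGCAGCA GGAAAATG")))  # MRSRKM
print(translate(clean_sequence("GACGACGAGTAA")))         # DDE*
```

The two translated fragments match the start (MRSRKM) and end (DDE) of the protein sequence listed above. Passing the full cleaned gene sequence to gc_fraction would give a value to compare with the GC content field.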