Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dret_0009 |
Symbol | |
ID | 8417811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfohalobium retbaense DSM 5692 |
Kingdom | Bacteria |
Replicon accession | NC_013223 |
Strand | + |
Start bp | 8770 |
End bp | 10440 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 645036572 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003196889 |
Protein GI | 258404147 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.961342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGCA GGAAAATGAC CCAAGGTTTG GAGCGTGCCC CACATCGCTC ACTGCTTTAT GCGACCGGTA TGACCCGGGA AGAAATGGAC CGTCCGCTGA TCGGGGTGGT CAACGCCGCG AATGATATTG TTCCGGGGCA TATCCATCTC GGGACCATCA GCCAGGCCGT GAAAGACGGC GTGCGCATGG GCGGCGGGAC ACCGCTTTCG TTTCCGGCCA TCGGGGTTTG CGACGGGCTG GCCATGGGGC ATGAAGGCAT GCGCATGAGT CTGCCGAGCC GGGAGATCAT CGCCGATTCC ATCGAACTCA TGGCCACCGC TCATCCGTTT GACGGTCTGG TGCTCATCCC CAACTGCGAC AAGATCGTCC CGGCTATGCT CATGGCCATG CTGCGGTTGA ATATCCCGGC GATCCTGGTC AGTGGCGGGC CGATGCTGGC TGGGAAGTCT AAGGGGCAAG CGACGGATCT GATCAAGGTT TTTGAAGGAG TTGGGCAGGT CAAGCGCGGC ACCATGCCCT CCGAAGAACT CGACGAGTTG GAGCAATCGG CTTGCCCCGG TTGCGGCTCC TGTTCAGGGA TGTTTACTGC CAATTCCATG AATTGTCTGG CTGAAGCCAT CGGCTTGGCC CTGCCCGGCA ACGGGACCAT TCCAGCAGTG GCCGCCGGCC GAGTCCGTCT GGCCAAGGCC GCTGGGCAGC AGGTGCTCCA TCTGGTTGAA AAGCAGATCA CACCACGCTC CATTGTCACG GCCGAGAGTG TGGCCAATGC AGTGACCGTG GACATGGCTC TGGGGTGCTC GACCAATACG GTGCTCCATC TGCCGGCGAT TTTTCGGGAG GCCAAGCTCG AACTCGGTCT GGACATCTTC GACGCCATCA GCAGCAAGAC CCCGAACCTC TGCCGGTTGT CGCCTGCCGG TCCGGATCAT ATCGAAGACC TCGATCAGGT CGGCGGAATT CCGGCGGTCA TGCAGGAACT CGCTTCCGGT GGGCTGTTGA ACACGGGTGT CGCCACTGTG ACCGGTCGCA CCCTGCAAGA GAATCTGGCC TCTGTGCAGC GCCCCGGGCA CCAGGAGGTC GTCAGATCGC TGGACAACCC GTATTCCGAG CGGGGAGGGA TCGCCATTTT GCGCGGCAAC ATCGCGCCGG ACGGGGCGGT AGTCAAACAG TCCGCAGTCC ATCCGGACAT GATGGTCCGC TCAGGACCGG CGAGAGTCTT TGACAGTGAG GAAGACGCGG TGGAGGCCAT TTTGGGCGAT GCGATCTCGG CTGGCGATGT CATCGTCATC CGGTACGAGG GACCGAAAGG CGGCCCTGGG ATGCGAGAGA TGCTTTCCCC CACCTCGGCT ATTGCCGGCA TGGGGTTGGA CGCTGATGTG GGCTTGATCA CTGACGGACG TTTCAGCGGG GGCACACGCG GCGCGGCCAT CGGTCATGTC TCTCCCGAAG CGGCTGAGGG CGGGGTTATT GGCCTGATCG AAGAAGGGGA TACCATCCAT ATCAATATCC CGGAACGGCG GCTGCAACTC GAGGTCGAGG CAAGCGAACT TCAGCGCCGT CGCGAGGTCT GGCAGCCGGT GCATAAAGAA GTCCAGTCCC CGGTTTTGCG CCGGTATCGG AAGTTGGCCA CCTCCGCAGC CCAGGGAGCG GTGTACCGCG ACGACGAGTA A
|
Protein sequence | MRSRKMTQGL ERAPHRSLLY ATGMTREEMD RPLIGVVNAA NDIVPGHIHL GTISQAVKDG VRMGGGTPLS FPAIGVCDGL AMGHEGMRMS LPSREIIADS IELMATAHPF DGLVLIPNCD KIVPAMLMAM LRLNIPAILV SGGPMLAGKS KGQATDLIKV FEGVGQVKRG TMPSEELDEL EQSACPGCGS CSGMFTANSM NCLAEAIGLA LPGNGTIPAV AAGRVRLAKA AGQQVLHLVE KQITPRSIVT AESVANAVTV DMALGCSTNT VLHLPAIFRE AKLELGLDIF DAISSKTPNL CRLSPAGPDH IEDLDQVGGI PAVMQELASG GLLNTGVATV TGRTLQENLA SVQRPGHQEV VRSLDNPYSE RGGIAILRGN IAPDGAVVKQ SAVHPDMMVR SGPARVFDSE EDAVEAILGD AISAGDVIVI RYEGPKGGPG MREMLSPTSA IAGMGLDADV GLITDGRFSG GTRGAAIGHV SPEAAEGGVI GLIEEGDTIH INIPERRLQL EVEASELQRR REVWQPVHKE VQSPVLRRYR KLATSAAQGA VYRDDE
|
| |