Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1142 |
Symbol | |
ID | 4027707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1304155 |
End bp | 1305900 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966319 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_573197 |
Protein GI | 92113269 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCATA AAAAGCGCCC GCTACGCTCG GCCGAATGGT TCGGCAACGA TGACAAGAAC GGCTTCATGT ACCGCAGCTG GATGAAGAAC CAGGGCATTC CCGATCATGA GTTTCGCGGC AAGCCGATCA TCGGTATCTG CAACACCTTC TCCGAGCTGA CACCGTGCAA CGCCCATTTC CGCAAGCTGG CCGAGCACGT CAAGAAGGGG GTGCTCGAAG CCGGCGGCTA TCCGGTGGAG TTTCCGGTCT TCTCCAATGG CGAGTCCAAT CTGCGTCCCA CCGCGATGTT CACGCGCAAC CTGGCGAGCA TGGATGTCGA GGAAGCCATC CGCGGAAACC CGCTCGACGC CGTGGTGTTG CTGGTGGGGT GCGACAAGAC CACGCCGGCT CTGCTGATGG GCGCGGCAAG CTGCGATATC CCCACCATCG TGGTCACCGG CGGTCCGATG CTCAACGGCA AGCACAAGGG GCGTGACATC GGCTCGGGCA CGGTCGTCTG GCAGCTCTCC GAGGAGGTCA AGGCCGGCAA GATATCGCTG CATGACTTCA TGGCCGCCGA GGCCGGCATG TCGCGCTCCG CCGGCACCTG CAACACCATG GGGACGGCCT CGACCATGGC CTGCATGGCC GAGTCGCTGG GCACCTCGCT GCCGCACAAT GCCGCGATTC CTGCCGTCGA CTCGCGCCGT TACGTACTGG CGCACCTGTC CGGCAATCGC ATCGTCGAGA TGGTCGACGA GGACCTCACC CTGTCCAAGG TGCTCACCAA GTCCGCATTC GAGAATGCCA TTCGCACCAA TGCCGCCATC GGCGGTTCGA CCAACGCGGT GATTCACCTC CAGGCCATCG CGGGGCGCAT GGGGGTCGAT CTCACGCTCG ACGACTGGAC CCGGGTGGGA CGCGGCACCC CGACCATCGT CGATCTGCAG CCCTCCGGCC GTTATCTGAT GGAGGAGTTC TATTACGCCG GCGGCCTGCC CGCCGTGCTG CGCCGTCTCG GCGAGGCCGA CCGGCTGCCC CACAAGGACG CGCTGACCGT CAACGGCAAG ACGCTGTGGG AAAACGTCCA GGACGCGCCG CTCTACAACG ATGCCGTGAT TCTGCCGCTG GATGCGCCGC TTCGCGAGGA TGGCGGAATG TGCGTGATGC GCGGCAATCT CGCGCCCAAC GGGGCGGTGC TCAAGCCCTC TGCCGCGACC CCGGCGCTGA TGCAGCACCG TGGGCGTGCG GTCGTCTTCG AGAACTTCGA CGATTACAAG GCGCGTATCA ACGACCCGGA TCTCGACGTC ACCGCCGACG ACATTCTGGT GATGAAGAAC TGCGGGCCGC GCGGTTACCA CGGCATGGCC GAAGTCGGCA ACATGGGGCT GCCCGCCAAG CTGCTCGAGC AAGGCGTCAC CGACATGGTG CGGATCTCCG ATGCGCGCAT GAGCGGCACG GCGTATGGCA CCGTGGTACT CCACGTGGCG CCGGAAGCCG CGGCGGGCGG GCCGCTGGCC GCGGTTCGCA ACGGCGACTG GATCGCGCTC GACGCCTATT CCGGCAAGTT GCACCTGGAG GTCGACGACG CCGAGATCGC CTCACGGCTG GCCGAGGCGG ACCCCACCGC CGAGTCGACG CGCATCGCCA GCACCGGCGG ATATCGCCAG CTCTACATCG AGCACGTGCT GCAGGCCGAC CAGGGCTGCG ATTTCGACTT CCTGGTGGGA TGTCGCGGCG CGGAAGTCCC CCGTCACTCG CACTGA
|
Protein sequence | MTHKKRPLRS AEWFGNDDKN GFMYRSWMKN QGIPDHEFRG KPIIGICNTF SELTPCNAHF RKLAEHVKKG VLEAGGYPVE FPVFSNGESN LRPTAMFTRN LASMDVEEAI RGNPLDAVVL LVGCDKTTPA LLMGAASCDI PTIVVTGGPM LNGKHKGRDI GSGTVVWQLS EEVKAGKISL HDFMAAEAGM SRSAGTCNTM GTASTMACMA ESLGTSLPHN AAIPAVDSRR YVLAHLSGNR IVEMVDEDLT LSKVLTKSAF ENAIRTNAAI GGSTNAVIHL QAIAGRMGVD LTLDDWTRVG RGTPTIVDLQ PSGRYLMEEF YYAGGLPAVL RRLGEADRLP HKDALTVNGK TLWENVQDAP LYNDAVILPL DAPLREDGGM CVMRGNLAPN GAVLKPSAAT PALMQHRGRA VVFENFDDYK ARINDPDLDV TADDILVMKN CGPRGYHGMA EVGNMGLPAK LLEQGVTDMV RISDARMSGT AYGTVVLHVA PEAAAGGPLA AVRNGDWIAL DAYSGKLHLE VDDAEIASRL AEADPTAEST RIASTGGYRQ LYIEHVLQAD QGCDFDFLVG CRGAEVPRHS H
|
| |