Gene Csal_1142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1142 
Symbol 
ID4027707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1304155 
End bp1305900 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content66% 
IMG OID637966319 
Productdihydroxy-acid dehydratase 
Protein accessionYP_573197 
Protein GI92113269 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCATA AAAAGCGCCC GCTACGCTCG GCCGAATGGT TCGGCAACGA TGACAAGAAC 
GGCTTCATGT ACCGCAGCTG GATGAAGAAC CAGGGCATTC CCGATCATGA GTTTCGCGGC
AAGCCGATCA TCGGTATCTG CAACACCTTC TCCGAGCTGA CACCGTGCAA CGCCCATTTC
CGCAAGCTGG CCGAGCACGT CAAGAAGGGG GTGCTCGAAG CCGGCGGCTA TCCGGTGGAG
TTTCCGGTCT TCTCCAATGG CGAGTCCAAT CTGCGTCCCA CCGCGATGTT CACGCGCAAC
CTGGCGAGCA TGGATGTCGA GGAAGCCATC CGCGGAAACC CGCTCGACGC CGTGGTGTTG
CTGGTGGGGT GCGACAAGAC CACGCCGGCT CTGCTGATGG GCGCGGCAAG CTGCGATATC
CCCACCATCG TGGTCACCGG CGGTCCGATG CTCAACGGCA AGCACAAGGG GCGTGACATC
GGCTCGGGCA CGGTCGTCTG GCAGCTCTCC GAGGAGGTCA AGGCCGGCAA GATATCGCTG
CATGACTTCA TGGCCGCCGA GGCCGGCATG TCGCGCTCCG CCGGCACCTG CAACACCATG
GGGACGGCCT CGACCATGGC CTGCATGGCC GAGTCGCTGG GCACCTCGCT GCCGCACAAT
GCCGCGATTC CTGCCGTCGA CTCGCGCCGT TACGTACTGG CGCACCTGTC CGGCAATCGC
ATCGTCGAGA TGGTCGACGA GGACCTCACC CTGTCCAAGG TGCTCACCAA GTCCGCATTC
GAGAATGCCA TTCGCACCAA TGCCGCCATC GGCGGTTCGA CCAACGCGGT GATTCACCTC
CAGGCCATCG CGGGGCGCAT GGGGGTCGAT CTCACGCTCG ACGACTGGAC CCGGGTGGGA
CGCGGCACCC CGACCATCGT CGATCTGCAG CCCTCCGGCC GTTATCTGAT GGAGGAGTTC
TATTACGCCG GCGGCCTGCC CGCCGTGCTG CGCCGTCTCG GCGAGGCCGA CCGGCTGCCC
CACAAGGACG CGCTGACCGT CAACGGCAAG ACGCTGTGGG AAAACGTCCA GGACGCGCCG
CTCTACAACG ATGCCGTGAT TCTGCCGCTG GATGCGCCGC TTCGCGAGGA TGGCGGAATG
TGCGTGATGC GCGGCAATCT CGCGCCCAAC GGGGCGGTGC TCAAGCCCTC TGCCGCGACC
CCGGCGCTGA TGCAGCACCG TGGGCGTGCG GTCGTCTTCG AGAACTTCGA CGATTACAAG
GCGCGTATCA ACGACCCGGA TCTCGACGTC ACCGCCGACG ACATTCTGGT GATGAAGAAC
TGCGGGCCGC GCGGTTACCA CGGCATGGCC GAAGTCGGCA ACATGGGGCT GCCCGCCAAG
CTGCTCGAGC AAGGCGTCAC CGACATGGTG CGGATCTCCG ATGCGCGCAT GAGCGGCACG
GCGTATGGCA CCGTGGTACT CCACGTGGCG CCGGAAGCCG CGGCGGGCGG GCCGCTGGCC
GCGGTTCGCA ACGGCGACTG GATCGCGCTC GACGCCTATT CCGGCAAGTT GCACCTGGAG
GTCGACGACG CCGAGATCGC CTCACGGCTG GCCGAGGCGG ACCCCACCGC CGAGTCGACG
CGCATCGCCA GCACCGGCGG ATATCGCCAG CTCTACATCG AGCACGTGCT GCAGGCCGAC
CAGGGCTGCG ATTTCGACTT CCTGGTGGGA TGTCGCGGCG CGGAAGTCCC CCGTCACTCG
CACTGA
 
Protein sequence
MTHKKRPLRS AEWFGNDDKN GFMYRSWMKN QGIPDHEFRG KPIIGICNTF SELTPCNAHF 
RKLAEHVKKG VLEAGGYPVE FPVFSNGESN LRPTAMFTRN LASMDVEEAI RGNPLDAVVL
LVGCDKTTPA LLMGAASCDI PTIVVTGGPM LNGKHKGRDI GSGTVVWQLS EEVKAGKISL
HDFMAAEAGM SRSAGTCNTM GTASTMACMA ESLGTSLPHN AAIPAVDSRR YVLAHLSGNR
IVEMVDEDLT LSKVLTKSAF ENAIRTNAAI GGSTNAVIHL QAIAGRMGVD LTLDDWTRVG
RGTPTIVDLQ PSGRYLMEEF YYAGGLPAVL RRLGEADRLP HKDALTVNGK TLWENVQDAP
LYNDAVILPL DAPLREDGGM CVMRGNLAPN GAVLKPSAAT PALMQHRGRA VVFENFDDYK
ARINDPDLDV TADDILVMKN CGPRGYHGMA EVGNMGLPAK LLEQGVTDMV RISDARMSGT
AYGTVVLHVA PEAAAGGPLA AVRNGDWIAL DAYSGKLHLE VDDAEIASRL AEADPTAEST
RIASTGGYRQ LYIEHVLQAD QGCDFDFLVG CRGAEVPRHS H