Gene Csal_0936 details


Gene Information

Locus tag: Csal_0936
Symbol:
ID: 4026790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 1049797
End bp: 1051668
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 66%
IMG OID: 637966113
Product: phosphogluconate dehydratase
Protein accession: YP_572992
Protein GI: 92113064
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR01196] 6-phosphogluconate dehydratase
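
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1049797
end_bp = 1051668

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1872 623
```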


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTTAA ATCAGACCGT CGCCAGCGTC ACCCAACGGA TCGAGGAGCG TTCGCGCCCG 
CGCCGCGCGC TCTACGAACA GCACATGCAG GAACAGCAGC GGCGCGGCGT GCACCGGGCC
GAGCTCTCGT GCGGCAACCT CGCCCACGGT TTCGCGGCTT GCGGCAGCGG CGACAAGGAC
CGCCTCAAGC TGATGAACAG CGCCAACCTG GCGATCGTGT CGTCATATAA CGACATGCTC
TCGGCGCATC AGCCTTTCGA GACCTTTCCC GCCACCATCA AGGAAGCCGC CCGTCAGATG
GGCTCCACGG CCCAGTTCGC CGGCGGGGTC CCGGCCATGT GCGATGGCGT GACCCAGGGG
CAGCCGGGCA TGGAGCTGTC GCTCTTCTCG CGCGACGTGA TCGCCATGTC CACCGCCGTG
GCCTTGTCGC ACAACATGTT CGATGCCGCC CTCTTCCTGG GCATCTGCGA CAAGATCGTG
CCCGGACTGT TCATCGGCGC GGCGCGTTTC GGCCACCTGC CTGCCGTCTT CGTGCCGGGC
GGCCCCATGA AGAGCGGTTT GCCCAACGAC GAGAAATCCC GGGTCCGCCA GCTCTACGCC
GAAGGCAAGG TCGGTCGCGA GGAGCTGCTG GAGGCAGAGT CCCAGTCCTA CCACAGCCCC
GGCACCTGCA CCTTCTATGG CACCGCCAAC TCGAATCAGC TGATGATGGA GATGATGGGG
CTGCACCTGC CCGGCGCCTC GTTCGTCAAT CCCGGCACGC CTCTGCGCGA AGCGCTGACC
CGCTATGCCA CCGAGCAGGC CATTCGTCAT GCCGAAAGTA CCGGCAACTA CCGGCCCTTC
TACAAGCAGA TCGACGCCCG CGCCATCGTC AACGCCATGG TCGGCCTCCT GGTCTCCGGC
GGTTCCACCA ACCACACCAT GCACCTGGTC GCCATGGCCG GCGCCGCCGG CATCACCATC
ACCTGGGACG ATTTCGCCGA GCTCTCGAGC GTGGTCCCCA GCCTGACGCG CATCTATCCC
AACGGTAAGG CCGACATCAA TCACTTCCAG GCCGCGGGCG GCATGAGTTT CCTGTTCCGC
GAACTCATCG ACGCCGGCTT GATCCATGCC GACATTCCCA CCGTGTTCGG TACCGACATG
CACGCCTATA CCCAGGAACC CTTCCTCGAG GATGGGGAGC TGGTCTGGCG CGAAGGCCCC
GGCGCGAGCC TCGACGAGGA CGTCCTGCGG CCGGTGGCCA ATCCTTTCGC GGCCACTGGC
GGCCTGGCCG TGCTCGACGG CAACCTGGGA CGCGGCGTGA TCAAGATCTC GGCGGTCAAG
CCGGAGAACC GCGTGGTCGA AGCGCCGATA CGTCTGTTCG ATGACCAGAA TCAGCTCAAG
GCGGCGTTCG AGGCCGGCGA ACTGGACCGC GACGTGGTCG TGGTGGTGCG CTTCCAGGGG
CCTCGGGCCA ATGGCATGCC CGAGCTTCAC AAGCTCACGC CCTACCTGGG TGTGCTGCAG
GACCGTGGTT TCAAGGTCGC GCTGGTGACC GACGGACGCA TGTCGGGCGC TTCCGGCAAG
GTGCCCGCCG CCATTCACGT GACGCCGGAA GCCGTCGACG GCGGCCCGCT GGCCAAGTTG
CGCGACGGCG ACGTGATTCG TCTCGACCCC GATAACGGAG AGCTTCGTGC CCTGGTCGGC
GACGCCGAGT GGCAGGCACG CGACAATGCC ACGGCCGACC TCGACCATCA CCATCACGGC
CTGGGGCGTG AACTGTTCGC CGGATTCCGC GCCCTGGTGG GCGGTGCCGA AGACGGCGCC
TCGGTATTCG GCGGTTTCGA CGCGCAAGCG CTGGAGCCCA CCCAGGCCTC GCGGGTGGAG
CAGGACGCAT GA
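
The gene sequence is listed in blocks of 10 bases, so the spacing has to be removed before the sequence can be measured. The following is a minimal sketch of one way to do that and to compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first row of the listing is used here for brevity.

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listing, copied as displayed.
first_row = "ATGTCCTTAA ATCAGACCGT CGCCAGCGTC ACCCAACGGA TCGAGGAGCG TTCGCGCCCG"
seq = clean_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full listing to `clean_sequence` should give a string whose length matches the Gene Length field.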
 
Protein sequence
MSLNQTVASV TQRIEERSRP RRALYEQHMQ EQQRRGVHRA ELSCGNLAHG FAACGSGDKD 
RLKLMNSANL AIVSSYNDML SAHQPFETFP ATIKEAARQM GSTAQFAGGV PAMCDGVTQG
QPGMELSLFS RDVIAMSTAV ALSHNMFDAA LFLGICDKIV PGLFIGAARF GHLPAVFVPG
GPMKSGLPND EKSRVRQLYA EGKVGREELL EAESQSYHSP GTCTFYGTAN SNQLMMEMMG
LHLPGASFVN PGTPLREALT RYATEQAIRH AESTGNYRPF YKQIDARAIV NAMVGLLVSG
GSTNHTMHLV AMAGAAGITI TWDDFAELSS VVPSLTRIYP NGKADINHFQ AAGGMSFLFR
ELIDAGLIHA DIPTVFGTDM HAYTQEPFLE DGELVWREGP GASLDEDVLR PVANPFAATG
GLAVLDGNLG RGVIKISAVK PENRVVEAPI RLFDDQNQLK AAFEAGELDR DVVVVVRFQG
PRANGMPELH KLTPYLGVLQ DRGFKVALVT DGRMSGASGK VPAAIHVTPE AVDGGPLAKL
RDGDVIRLDP DNGELRALVG DAEWQARDNA TADLDHHHHG LGRELFAGFR ALVGGAEDGA
SVFGGFDAQA LEPTQASRVE QDA
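
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence, as displayed.
print(translate("ATGTCCTTAA ATCAGACCGT CGCCAGCGTC"))  # MSLNQTVASV
print(translate("CAGGACGCAT GA"))                     # QDA
```

Both fragments agree with the start and end of the protein sequence shown above.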