Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0936 |
Symbol | |
ID | 4026790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1049797 |
End bp | 1051668 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637966113 |
Product | phosphogluconate dehydratase |
Protein accession | YP_572992 |
Protein GI | 92113064 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR01196] 6-phosphogluconate dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTTAA ATCAGACCGT CGCCAGCGTC ACCCAACGGA TCGAGGAGCG TTCGCGCCCG CGCCGCGCGC TCTACGAACA GCACATGCAG GAACAGCAGC GGCGCGGCGT GCACCGGGCC GAGCTCTCGT GCGGCAACCT CGCCCACGGT TTCGCGGCTT GCGGCAGCGG CGACAAGGAC CGCCTCAAGC TGATGAACAG CGCCAACCTG GCGATCGTGT CGTCATATAA CGACATGCTC TCGGCGCATC AGCCTTTCGA GACCTTTCCC GCCACCATCA AGGAAGCCGC CCGTCAGATG GGCTCCACGG CCCAGTTCGC CGGCGGGGTC CCGGCCATGT GCGATGGCGT GACCCAGGGG CAGCCGGGCA TGGAGCTGTC GCTCTTCTCG CGCGACGTGA TCGCCATGTC CACCGCCGTG GCCTTGTCGC ACAACATGTT CGATGCCGCC CTCTTCCTGG GCATCTGCGA CAAGATCGTG CCCGGACTGT TCATCGGCGC GGCGCGTTTC GGCCACCTGC CTGCCGTCTT CGTGCCGGGC GGCCCCATGA AGAGCGGTTT GCCCAACGAC GAGAAATCCC GGGTCCGCCA GCTCTACGCC GAAGGCAAGG TCGGTCGCGA GGAGCTGCTG GAGGCAGAGT CCCAGTCCTA CCACAGCCCC GGCACCTGCA CCTTCTATGG CACCGCCAAC TCGAATCAGC TGATGATGGA GATGATGGGG CTGCACCTGC CCGGCGCCTC GTTCGTCAAT CCCGGCACGC CTCTGCGCGA AGCGCTGACC CGCTATGCCA CCGAGCAGGC CATTCGTCAT GCCGAAAGTA CCGGCAACTA CCGGCCCTTC TACAAGCAGA TCGACGCCCG CGCCATCGTC AACGCCATGG TCGGCCTCCT GGTCTCCGGC GGTTCCACCA ACCACACCAT GCACCTGGTC GCCATGGCCG GCGCCGCCGG CATCACCATC ACCTGGGACG ATTTCGCCGA GCTCTCGAGC GTGGTCCCCA GCCTGACGCG CATCTATCCC AACGGTAAGG CCGACATCAA TCACTTCCAG GCCGCGGGCG GCATGAGTTT CCTGTTCCGC GAACTCATCG ACGCCGGCTT GATCCATGCC GACATTCCCA CCGTGTTCGG TACCGACATG CACGCCTATA CCCAGGAACC CTTCCTCGAG GATGGGGAGC TGGTCTGGCG CGAAGGCCCC GGCGCGAGCC TCGACGAGGA CGTCCTGCGG CCGGTGGCCA ATCCTTTCGC GGCCACTGGC GGCCTGGCCG TGCTCGACGG CAACCTGGGA CGCGGCGTGA TCAAGATCTC GGCGGTCAAG CCGGAGAACC GCGTGGTCGA AGCGCCGATA CGTCTGTTCG ATGACCAGAA TCAGCTCAAG GCGGCGTTCG AGGCCGGCGA ACTGGACCGC GACGTGGTCG TGGTGGTGCG CTTCCAGGGG CCTCGGGCCA ATGGCATGCC CGAGCTTCAC AAGCTCACGC CCTACCTGGG TGTGCTGCAG GACCGTGGTT TCAAGGTCGC GCTGGTGACC GACGGACGCA TGTCGGGCGC TTCCGGCAAG GTGCCCGCCG CCATTCACGT GACGCCGGAA GCCGTCGACG GCGGCCCGCT GGCCAAGTTG CGCGACGGCG ACGTGATTCG TCTCGACCCC GATAACGGAG AGCTTCGTGC CCTGGTCGGC GACGCCGAGT GGCAGGCACG CGACAATGCC ACGGCCGACC TCGACCATCA CCATCACGGC CTGGGGCGTG AACTGTTCGC CGGATTCCGC GCCCTGGTGG GCGGTGCCGA AGACGGCGCC TCGGTATTCG GCGGTTTCGA CGCGCAAGCG CTGGAGCCCA CCCAGGCCTC GCGGGTGGAG CAGGACGCAT GA
|
Protein sequence | MSLNQTVASV TQRIEERSRP RRALYEQHMQ EQQRRGVHRA ELSCGNLAHG FAACGSGDKD RLKLMNSANL AIVSSYNDML SAHQPFETFP ATIKEAARQM GSTAQFAGGV PAMCDGVTQG QPGMELSLFS RDVIAMSTAV ALSHNMFDAA LFLGICDKIV PGLFIGAARF GHLPAVFVPG GPMKSGLPND EKSRVRQLYA EGKVGREELL EAESQSYHSP GTCTFYGTAN SNQLMMEMMG LHLPGASFVN PGTPLREALT RYATEQAIRH AESTGNYRPF YKQIDARAIV NAMVGLLVSG GSTNHTMHLV AMAGAAGITI TWDDFAELSS VVPSLTRIYP NGKADINHFQ AAGGMSFLFR ELIDAGLIHA DIPTVFGTDM HAYTQEPFLE DGELVWREGP GASLDEDVLR PVANPFAATG GLAVLDGNLG RGVIKISAVK PENRVVEAPI RLFDDQNQLK AAFEAGELDR DVVVVVRFQG PRANGMPELH KLTPYLGVLQ DRGFKVALVT DGRMSGASGK VPAAIHVTPE AVDGGPLAKL RDGDVIRLDP DNGELRALVG DAEWQARDNA TADLDHHHHG LGRELFAGFR ALVGGAEDGA SVFGGFDAQA LEPTQASRVE QDA
|
| |