Gene Information |
Locus tag | Csal_0781 |
Symbol | |
ID | 4026084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 874886 |
End bp | 876307 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637965947 |
Product | L-sorbosone dehydrogenase |
Protein accession | YP_572837 |
Protein GI | 92112909 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
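The coordinate and length fields above agree with one another, and a short check makes the arithmetic explicit. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon while the protein length does not. The function names are illustrative and are not part of any PanDaTox or IMG tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumption)."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(874886, 876307)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both results match the Gene Length and Protein Length rows of this record.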
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTAA CATCCAAGCC CACTGGCATG CGTTACGTCG CGCCCCTTTC GCTTCTGCTG CTCGCCGGCG GCGCCCAGGC CGCCGACTCG GGCCAGGAGT ACGGTCCTGA CCCGGAACTG CCCGAACCGC AGCGAGGCCT TCTGCCCAAC ATGACGGTGC CCGAGCAGGC CCCCTGGGGC GACGCGAAAC CCACGGTACC CGATGGCTAC ACGATCACCG CCATCGCCAC CGACCTGAAG GTCCCACGTC AGACGTTGGT GCTTCCGAAC GGCGATATCC TGGTGGCGGA AGGCAAAGGC GGCGGCAAGG CGCCGAAATC GCGGCCTAAG GATTTCATCG CCGGGCTGAT TCAGTCGCAA GGCACCACCT CGGTCAAGGG CGGCGACCGG CTGACCTTGC TGCGCGACGG CGACGATGAT GGCAAATACG AGGAACGAAC GGTCTTCGCC GAGAACCTCA ATGCGCCCTA CGGCCTCGCC CTGGTGGATG ACGATCTCTA CGTGGCCAAT CAAGATTCAC TCGTTCGCTT CGACTACGAA ACGGGCCAGA CCGAAGCCAG CGGCCCACCG GAACTGGTCA CGCCGCTGCC GTCGGAGATC AATCATCACT GGACCAAGGC CCTGACCGCC AGCGCGGACG GCGACTATCT CTACGTCGGC ATCGGTTCCA ACAGCAACAT CACCGAACGC GGTATGGCGG CGGAAGTGAA CCGTGCGGAA ATCTGGGAGA TCGACCCGGA AACCGGGGCA CATCGCGCCT ATGCCACCGG CGTGCGCAAC CCCACCGCGC TGACCATTCA GCCGGAGACC GACCGGCTCT GGGCCGTCGC CAACGAGCGT GACGAACTCG GCCCCAATCT GGTGCCCGAC TATCTCACCT CGATCCAGGA GGGCGGTTTC TACGGCTGGC CGTACAGCTA CTGGGGGCAG CATGTCGACC CGCGCGTTCG TCCGCAGAAC CCCGAGAAGG TGGAAGCGGC GATCGCCCCC GACTACAGCC TCGGCTCGCA CCACGCGCCG CTCGGCGTCG ACTTCTCCAA CCCGGCCGTG GGCGGAGAAT TCGCCAATGG CGTCTTCGTC GGCGAGCACG GCAGTTGGAA CCGTGCCGAT CCCGTGGGGT ACAAGGTGGT CTTCATCCCG TTCGAGAACG GCCGCCCCGC CGGCGACCCG GTCGACTTCG TCTCCGGCTT CCTGACCGAC GACGGCAAGA CCCGCGGCCG CCCCGTCGGC GTCACCGTCG CGCCGGATGG CTCGGTGATC GTCGCCGACG ACATGACCAA TGCGATCTGG CGGGTGACGC GAGATGACGA CCAGGCACCG TCAGCGGAGT CCGCCACCGA GACATCGGGC TCCAGTGAGG AAACGGCATC TTCTGAAGGC GAGTCCGAGA TGCCCGAGGA CGGATACGCC GGGGATGGAT GA
|
Protein sequence | MKLTSKPTGM RYVAPLSLLL LAGGAQAADS GQEYGPDPEL PEPQRGLLPN MTVPEQAPWG DAKPTVPDGY TITAIATDLK VPRQTLVLPN GDILVAEGKG GGKAPKSRPK DFIAGLIQSQ GTTSVKGGDR LTLLRDGDDD GKYEERTVFA ENLNAPYGLA LVDDDLYVAN QDSLVRFDYE TGQTEASGPP ELVTPLPSEI NHHWTKALTA SADGDYLYVG IGSNSNITER GMAAEVNRAE IWEIDPETGA HRAYATGVRN PTALTIQPET DRLWAVANER DELGPNLVPD YLTSIQEGGF YGWPYSYWGQ HVDPRVRPQN PEKVEAAIAP DYSLGSHHAP LGVDFSNPAV GGEFANGVFV GEHGSWNRAD PVGYKVVFIP FENGRPAGDP VDFVSGFLTD DGKTRGRPVG VTVAPDGSVI VADDMTNAIW RVTRDDDQAP SAESATETSG SSEETASSEG ESEMPEDGYA GDG
|
| |
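The sequences above are printed in blocks of ten characters, so any downstream check has to strip the spaces first. The sketch below is one way to do that, to compute a GC fraction, and to translate a nucleotide string. It rests on these assumptions: the helper names are hypothetical, and the codon map is the standard assignment written in the usual TCAG order, which translation table 11 shares with table 1 for elongation codons (table 11 differs in which codons may act as starts). The displayed gene sequence begins with ATG and ends with TGA, so it is treated here as already being in coding orientation even though the record lists the minus strand.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGAAGCTAA CATCCAAGCC CACTGGCATG"))  # MKLTSKPTGM
print(translate("GGGGATGGAT GA"))                     # GDG*
```

The two printed fragments match the first ten and last three residues of the protein sequence in this record, followed by the stop codon. Passing the full gene sequence string to `gc_fraction` would give a value to compare against the GC content row.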