Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0961 |
Symbol | |
ID | 4026739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1080049 |
End bp | 1081227 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637966138 |
Product | glucose/sorbosone dehydrogenases-like protein |
Protein accession | YP_573017 |
Protein GI | 92113089 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCTGC GAACCGCATT CCACCGGCTC GGCCGAGGCT CACTGGCATG CCTCGGTCTC GGGCTCGGCG CCATCATGAG TGCTTTCGGC CAGGCGCAGT CGCTGACACC GACATCCGTG GCGGGCAGCG AACAGCGCGC GCAAGTGCCC GAGGGCGCCA CTCTGACCCA TCTGGTCTCG CTGAGTGCAC CGCGCCTGAT CAGCATCGGT CCCGACAATA AAATGTTCAT CGGCACCCGG GCGGGACGCC TGTATCGCCT CAAGCCGCCG TACACGAGCG TCGATGCGGA AGTGGCGCTG TCCGGCTATC CGCATAGTGC CGTAGTGCAT CGAGGATACC TGTATGTCGC CACGACCGAC CGGCTGCTGC GCGCGTCCTA TGCGCCGGAT CGGCGTCTCG AGGACGTGGC GTTCGAAGAA GTTGCCGCTC TGCCCGGCGG TGGCGGGCAC AGCTCGCGCA CCTTGTCCGC GGGGCCCGAT GGACGGCTCT ACGTGGCACT CGGCATTCAA GGCAATTGTT CGCCGCAGCG CGTCTCGGAG GACGTTGCCT TCGAGGATCG CCGAGGCGGT GTCATGGCGC TCGACCCGAC ACGGTCGACG CCCGAATGGA CGCCCTATGT CGGCGGACTG CGCAATCCCA TCGGCATGGC CTGGAGCCCC GAAGGGGTGC TGTATCTCAA CAATAACGGC CCCGACCATT GGGGCTACGA GGCGCCCCCC GAATTGCTGG TGCGCGCCGA GGCCGGCAGC TTTCACGGCA TGCCGTGGTA TCAGTGGAAC GACGGTGCCT GGCAGCGCGA TGACTGCATC GACTCGACGC CCCCCACACC CGGCGACGCG CTCGCACCGC CGGTCGCTAC CTTCGCCGCC CGCAGCGCCC CCATGGGATT GACGTTCTTG CCCGCCGACA ACCCCTGGAA GCTGGACATG ATCAGCGCCA TCCATGGCAG CTGGGCCACC CAGGCCGCCG GCGGCCCCGC TTCTCGGCGA CCGCCCAAGA TCGTCGGCAT ACGGCTCGCG ACCGACGGCC GCCAAGGCAC GGTGACCGAC CTGGTGAGCG GTTTCCAGGC GGAGAATGGC GAGCGCTGGG CCCGCCCCAC CGGGGTCGCC TACCACGATG GAGCGCTGTA TTTCACCTCC GACAGCGGCG AGGCCGGCCT GTATCGTCTG ACGCCATGA
|
Protein sequence | MPLRTAFHRL GRGSLACLGL GLGAIMSAFG QAQSLTPTSV AGSEQRAQVP EGATLTHLVS LSAPRLISIG PDNKMFIGTR AGRLYRLKPP YTSVDAEVAL SGYPHSAVVH RGYLYVATTD RLLRASYAPD RRLEDVAFEE VAALPGGGGH SSRTLSAGPD GRLYVALGIQ GNCSPQRVSE DVAFEDRRGG VMALDPTRST PEWTPYVGGL RNPIGMAWSP EGVLYLNNNG PDHWGYEAPP ELLVRAEAGS FHGMPWYQWN DGAWQRDDCI DSTPPTPGDA LAPPVATFAA RSAPMGLTFL PADNPWKLDM ISAIHGSWAT QAAGGPASRR PPKIVGIRLA TDGRQGTVTD LVSGFQAENG ERWARPTGVA YHDGALYFTS DSGEAGLYRL TP
|
| |