Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1736 |
Symbol | |
ID | 4028951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1979624 |
End bp | 1980673 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637966924 |
Product | alcohol dehydrogenase GroES-like protein |
Protein accession | YP_573787 |
Protein GI | 92113859 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACGC TTGTCTGTAC CCAACCCCGA CACATGGAAC TGCGCGACGT GCCCGAGCCG CAGTGCGCGC CCGGCGAGGC CATGCTCAGG ATTCGTCGCG TGGGTATTTG CGGTACCGAC ATCCACGCCT ATGGCGGCAA TCAGCCCTAC TTCACCTATC CCCGCGTGCT GGGCCACGAG CTGTCCGGCG ATATCGTCGG CGTCGGGGAA GGCGTCGACG AGTCGCTCGT CGGTCACAGC GCCTACGTGA TCCCCTATCT GCATTGCGGT GAATGCCGCG CCTGCCGTCA GGGCAAGACC AATTGCTGCC AGCACATGCA GGTCATCGGC GTGCATCGCG ATGGCGGCAT GGCGGAATAC CTCAGCGTGC CCGTGGATCA CCTGGTCACC TCGAGCACCC TCGACGCCGA ACAGCTGGCG CTGGTGGAGT GTCTTTCCAT CGGCGCGCAC GCCGTGCGCC GCAGTGCCAT CGAAGCCGAC GAGCTGGCCG TCGTGGTCGG CGCCGGACCG ATCGGCATCG GCATCGTGCA GATCGCCCAG AGCCGTGGCG CGCGGGTGCT GGTGGTGGAC ACCAACCGCG AGCGCCTGGC CTTCTGCCGC GACACCCTGG GCGTGGAGGA CGTACTGCAT GCACTGGACG ACGACGTGGA GGCCGCCATC GCGCGCCATC ACGACGGCGC CCTGGCCGAC GTGGTGTTCG ACGCCACCGG CAATCCCGCC GCCATGAACC GCGGTTTCGA TTTCGCCGGC CATGGCGGAC GCTACGTGCT GGTGAGCGTG GTCAAGGCCG ACATCACCTT CAACGACCCC GACTTCCACA AGCGCGAGCT CAGCCTGCTG GGCAGCCGCA ACGCCACGCG GGAAGACTTC GAGACGGTGG CCCGTCTGAT GGAGGAAGGC CGCCTGCAGT CGACGGCGAT GATCACCCAC CGCGGCCCGC ACCACGAGCT GCCGACGCTG ATGCCGCAGT GGTGCGACCC GGCCACCGGC GTCATCAAGG CGATGGTGAC GTTCGATGAC GCCAGCGCCC AGGGAGGTCG TCCGGCATGA
|
Protein sequence | MQTLVCTQPR HMELRDVPEP QCAPGEAMLR IRRVGICGTD IHAYGGNQPY FTYPRVLGHE LSGDIVGVGE GVDESLVGHS AYVIPYLHCG ECRACRQGKT NCCQHMQVIG VHRDGGMAEY LSVPVDHLVT SSTLDAEQLA LVECLSIGAH AVRRSAIEAD ELAVVVGAGP IGIGIVQIAQ SRGARVLVVD TNRERLAFCR DTLGVEDVLH ALDDDVEAAI ARHHDGALAD VVFDATGNPA AMNRGFDFAG HGGRYVLVSV VKADITFNDP DFHKRELSLL GSRNATREDF ETVARLMEEG RLQSTAMITH RGPHHELPTL MPQWCDPATG VIKAMVTFDD ASAQGGRPA
|
| |