Gene Information |
Locus tag | Csal_3116 |
Symbol | |
ID | 4028757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3470824 |
End bp | 3471573 |
Gene Length | 750 bp |
Protein Length | 249 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637968330 |
Product | HAD family hydrolase |
Protein accession | YP_575159 |
Protein GI | 92115231 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
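The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship for this record. It assumes 1-based, inclusive coordinates (the usual convention for such records, though not stated here) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3470824, 3471573)
print(length, protein_length(length))  # 750 249
```

Both values agree with the Gene Length (750 bp) and Protein Length (249 aa) rows of the record.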
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.553412 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGATCA CCGCCCTCAC TTTCGATCTC GACGACACCC TCTGGGACAA CCGCCCCATT CTCGAACGCG CCGAGGCCGA ACATTACCAG TGGCTCAGCG AGGCCATCGC CGCCGCCCAG ACGTCACCGC AGACATCCTT CGGGGACTGC TATCCCCTGA GCGCTTACCA GCAGCATCGC GCCGACGTGG CGCGTCGTCA TCCGCTCAAG CGCGGTGATT TCACGTGGAT TCGCGAACGT GCGCTGTTCG AGCTTGTAGA GGCCTACGGG CTACCCCGAC TCCAGGCCCG CCTCTGGGCC GCGCATGCCA TCGCCCACTT CCTCGACCTG CGTCACGACC TCACGCCCTA CCCGGACGTC GTGCCCCTGC TCGACGCCCT GCGGCAGCGC TATCGTCTCG CCGCAATCAC CAACGGCAAC GCCGACCTCA AACGGCTGGC GCTGGCCGAA CACTTTCCGG TGATGATCGC GGCGGGAGAA CTGCACGCTC CCAAGCCCGA CCCGCGTGCC TTTCTCGCGG CGCTGGCACG CCTCGGCGCC ACGCCATCGC GCGCCCTGCA TGTCGGAGAC TCCTGGCGGG AAGACGTGCT GCCGGCGCAG CGCCTGGGCA TGCAGGTGGC CTGGGTCGAT GCCAAGGACG AGGGCCCCCG CGCGCTGCCG CCCGGGGTCC ACCGTCTCGC CCATGTACGC GAGCTGCCGG CCCTGCTCGA CCGCCTGACC ACGCAGGACA GCGCGCGCCA GGGGGGCTGA
|
Protein sequence | MAITALTFDL DDTLWDNRPI LERAEAEHYQ WLSEAIAAAQ TSPQTSFGDC YPLSAYQQHR ADVARRHPLK RGDFTWIRER ALFELVEAYG LPRLQARLWA AHAIAHFLDL RHDLTPYPDV VPLLDALRQR YRLAAITNGN ADLKRLALAE HFPVMIAAGE LHAPKPDPRA FLAALARLGA TPSRALHVGD SWREDVLPAQ RLGMQVAWVD AKDEGPRALP PGVHRLAHVR ELPALLDRLT TQDSARQGG
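The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below shows one way to reproduce it, assuming that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative. The function strips the spaces that separate the 10-base blocks in the record.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Codons containing ambiguous bases (e.g. N)
    raise KeyError in this sketch; a trailing partial codon is dropped.
    """
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 24 bases of the gene sequence above.
print(translate("ATGGCGATCA CCGCCCTCAC TTTC"))  # MAITALTF
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) gives a quick consistency check on the record.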
|
| |