Gene Information |
Locus tag | Csal_2324 |
Symbol | |
ID | 4026283 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2613278 |
End bp | 2613949 |
Gene Length | 672 bp |
Protein Length | 223 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637967529 |
Product | phosphoglycolate phosphatase |
Protein accession | YP_574373 |
Protein GI | 92114445 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E; [TIGR01662] HAD-superfamily hydrolase, subfamily IIIA |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.289687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCCCA ATCCGCCCCT GCGCATTCTC GACGGTATCC GCCTGGTCGC GTTCGACCTC GATGGCACGC TGGTGGATTC GGTCCCCGAC CTGGCCGCTG CCGTCGATGC CGCCCTGCGT TCGCTGGGAC TCGCGGGCGT CGACGAAGCG TCGGTGCGCG ACTGGGTGGG CAATGGCTCC CGCAAGCTGG TGGAGCGCGC CCTGGAAGCG CTCGACGCGC AAGACACGGA CCCGGAGGCG GCTCACGAGG CGTTTCTGCA TCATTATCGC CTGGCGCCTT GTCGCGCGAC GCGCCTGTAC CCCGGTGTGC GCGAGGCGCT CGAAGGGTTG CGCGCACGCG GTCTGACGCT GGTGCTGATC ACCAACAAGC CGGCGGCGTT CATCGCCCCG ATCCTCGAAA CGCTGGGACT GAGCGACTTT TTCGACCTGA CGCTGGGCGG CGATTCGCTG GCGGCGAAGA AGCCAGATCC GGCGCCCTTG CTGCACGTGG CATCCCGTTT CGGGGTGACG CCGAGCGTCT GTCTGATGGT GGGGGACTCC CGGCATGACA TCGAGGCCGG GCGGGGTGCC GGATTCCGCA CCCTGGCGGT GCCCTACGGC TACAATCACG GCGATCCGGT GGCGGCGAGC GCGCCGGATG CCATGGTTGA ATCGCTGGGG GAACTCGTTT AG
|
Protein sequence | MSPNPPLRIL DGIRLVAFDL DGTLVDSVPD LAAAVDAALR SLGLAGVDEA SVRDWVGNGS RKLVERALEA LDAQDTDPEA AHEAFLHHYR LAPCRATRLY PGVREALEGL RARGLTLVLI TNKPAAFIAP ILETLGLSDF FDLTLGGDSL AAKKPDPAPL LHVASRFGVT PSVCLMVGDS RHDIEAGRGA GFRTLAVPYG YNHGDPVAAS APDAMVESLG ELV
|
| |
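The coordinate, length, and GC content fields in this record can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. Both assumptions are consistent with the values listed here (672 bp, 223 aa) but are not stated by the record itself.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C, ignoring spaces and any character other than A, C, G, T."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2613278, 2613949)
print(length)                  # 672
print(protein_length(length))  # 223
```

Passing the Gene sequence field (spaces included) to `gc_percent` gives a value that can be compared against the listed GC content.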