Gene Information |
Locus tag | Sare_4662 |
Symbol | |
ID | 5704841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5283281 |
End bp | 5284000 |
Gene Length | 720 bp |
Protein Length | 239 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641274060 |
Product | HAD family hydrolase |
Protein accession | YP_001539406 |
Protein GI | 159040153 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
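The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS span includes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5283281
end_bp = 5284000
reported_gene_length = 720      # "Gene Length" field, in bp
reported_protein_length = 239   # "Protein Length" field, in aa

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with a stop codon, so the
# protein has one residue per codon except the last.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions both derived values agree with the reported fields.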
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGGA CAGTGGTGTT CGACGCCGAC GAAACCCTGC TCGACCTGCG CCCAGCGGTG ACCGGCGGGC TGCTGGCCGT CCTCGACGAG ATGCGCCGGT TGACCCCGGC GGCGGCCACG GTGTCACTCG CTGACCTGGA GTACGACTGG TCCGTGGCGT CCAGCGCGGA CTCTCCGCGG CCGACACCGG AGATCCGCCG GGTGGCGCTT GCCCGCTCGC TGGCCCGCGT CGGGCTCGAC ACCCACCTGG ACGACCTGTT GGCTCTCTAC TTCGTCCGAC GCTTCGCGCT GACCCGACCC TTCCCGGATG TGCTGCCCGC GCTCGCGGCA CTGGCGGACA GGTGGGTGGT GGGCTTCGCC ACCAACGGCA ACAGCCGCGC GGAGCGGTGC GGGCTCGCGG GGAGGTTCGC CTTCGAGGTG TACGCGGGCG ACAACGGGTT GCCGAGAAAG CCGGCGCCGG AGTTCTACGC GACGGTGGTC CGGGCGGCCG GGGTGCCAGC CGAGCAGGTG GTGCACGTCG GCGATTCGAT CGCCCACGAC GTGGTCGGGC CACAGGCGGC CGGGTTGCGC GGGGTGTGGC TCAACCGTGA CGGTCGGTCC TGCCCGCCAG GGGTGCAGCC AGACGCGGAA CTGTCCACCT TGACGGATCT GCCCGCCGTC TTGACGGCTC TTGCGGACGA AGGCGATCTT CGGGCGCAAT CGGATAGGTC CCTGGTGTAA
|
Protein sequence | MLRTVVFDAD ETLLDLRPAV TGGLLAVLDE MRRLTPAAAT VSLADLEYDW SVASSADSPR PTPEIRRVAL ARSLARVGLD THLDDLLALY FVRRFALTRP FPDVLPALAA LADRWVVGFA TNGNSRAERC GLAGRFAFEV YAGDNGLPRK PAPEFYATVV RAAGVPAEQV VHVGDSIAHD VVGPQAAGLR GVWLNRDGRS CPPGVQPDAE LSTLTDLPAV LTALADEGDL RAQSDRSLV
|
| |
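The gene and protein sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping and compute a GC percentage before further analysis. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example string is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an ungapped DNA sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three ten-base blocks of the gene sequence, as displayed in the record.
example_blocks = "ATGCTGAGGA CAGTGGTGTT CGACGCCGAC"

example_seq = clean_sequence(example_blocks)
print(len(example_seq), "bases")
print(f"GC content of this fragment: {gc_percent(example_seq):.1f}%")
```

Running the same helpers on the full gene sequence would allow a comparison with the reported Gene Length and GC content fields.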