Gene Information |
Locus tag | Strop_0720 |
Symbol | |
ID | 5057161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 804415 |
End bp | 805293 |
Gene Length | 879 bp |
Protein Length | 292 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640472987 |
Product | HAD family hydrolase |
Protein accession | YP_001157575 |
Protein GI | 145593278 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED; [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.070073 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAGGA TCCAGGCGGT GCTCTTCGAC TTCTTCGGCA CGCTGACCCG CGCCGTACAG CGTGGTAGCG CCCACCGCAC GATGGCCGAG CTGCTCGGCT GCCCGCCCGA GGTATTCGTC AAGGTCCTCG ACCGCACCTA CTATCAGCGC GCCACCGGCG CCCTGGGCAC CGCCGAGGCG ACCCTGCGCT GGGTATGTGA GCAGGCCGGG GTTCGACCCT CCACCGCGGC GCTCCGGTCG GCGGTGATCG CCCGATTCCG CGCCATCCGC GCTGACACCC GCCTTCGCAC CGAGGCAGTA CCCACCCTCG CCGCGCTGCG CCAGCGCGGG CTCCGAATCG GGTTGGTCAG TGACTGCACC CATGAACTGC CCGCCTTCCT GCCGCAGTTG CCGATCGATC CGCTCCTCGA TGTCCGAGTC CTCTCGGTCC AGTTCGGGCG CTGCAAGCCC GACCCGGAGC TGTACCGGGC CGCCTGCCGG CAGCTGGGCC TGACGCCCGC CGCCTGCCTG TACGTGGGGG ACGGGGGGAG TCAGGAACTG ACCGGGGCGG AGCGGGCCGG GCTGCACGCG GTACGCCTGG CGGCCCCGGA CCTCGCCGGC CATCTGACCT TCAACCCGGA CGCCGGCTGG ACCGGACCGG AGCTGACCTC CCTGGCCGGG GTGGTTGACG TGATCGACCG GGCCGATGCC GGGCCGCCGC GGATGACCGA CCCCGATCCG CACCACCAGC CGGCTGAGGT CAGCGCCGTC GGACGCGGTG CAGCCGGAAC GGCTTACGGG CTGCCCGAAC CACCGCCGCC GGGCGAGGCG GCTGGGCCAG CGACGTGGTG GGGAGTGCCG AGCCGGCCAG CGGATCGCCG GCGGGAACGA GGCGGTTGA
|
Protein sequence | MPRIQAVLFD FFGTLTRAVQ RGSAHRTMAE LLGCPPEVFV KVLDRTYYQR ATGALGTAEA TLRWVCEQAG VRPSTAALRS AVIARFRAIR ADTRLRTEAV PTLAALRQRG LRIGLVSDCT HELPAFLPQL PIDPLLDVRV LSVQFGRCKP DPELYRAACR QLGLTPAACL YVGDGGSQEL TGAERAGLHA VRLAAPDLAG HLTFNPDAGW TGPELTSLAG VVDVIDRADA GPPRMTDPDP HHQPAEVSAV GRGAAGTAYG LPEPPPPGEA AGPATWWGVP SRPADRRRER GG
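The fields in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the source database: the function names are hypothetical, and it assumes the Gene sequence row is already given in coding orientation (it starts with ATG and ends with TGA, even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code; it differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
# Illustrative helpers for sanity-checking a gene record.
# All function names are hypothetical and chosen for this sketch only.

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spacing used in the record (blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-orientation sequence, stopping at the first stop codon.

    Assumes the first codon is ATG; alternative table 11 start codons
    (for example GTG or TTG) are NOT remapped to M in this sketch.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # Coordinates from the record: Start bp 804415, End bp 805293.
    length = feature_length(804415, 805293)
    print(length)            # gene length in bp
    print(length // 3 - 1)   # codons minus the stop codon = protein length in aa
    # First block of the Gene sequence row, as a small translation check.
    print(translate("ATGCCCAGGA TCCAGGCGGT GCTCTTCGAC"))
```

With the coordinates above, 805293 - 804415 + 1 gives 879 bp, that is 293 codons, or 292 amino acids plus the stop codon, which agrees with the Gene Length and Protein Length fields. Passing the full Gene sequence row to `translate` and `gc_fraction` would allow the same kind of check against the Protein sequence and GC content rows.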