Gene Strop_0720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_0720 
Symbol 
ID5057161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp804415 
End bp805293 
Gene Length879 bp 
Protein Length292 aa 
Translation table11 
GC content73% 
IMG OID640472987 
ProductHAD family hydrolase 
Protein accessionYP_001157575 
Protein GI145593278 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.070073 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGGA TCCAGGCGGT GCTCTTCGAC TTCTTCGGCA CGCTGACCCG CGCCGTACAG 
CGTGGTAGCG CCCACCGCAC GATGGCCGAG CTGCTCGGCT GCCCGCCCGA GGTATTCGTC
AAGGTCCTCG ACCGCACCTA CTATCAGCGC GCCACCGGCG CCCTGGGCAC CGCCGAGGCG
ACCCTGCGCT GGGTATGTGA GCAGGCCGGG GTTCGACCCT CCACCGCGGC GCTCCGGTCG
GCGGTGATCG CCCGATTCCG CGCCATCCGC GCTGACACCC GCCTTCGCAC CGAGGCAGTA
CCCACCCTCG CCGCGCTGCG CCAGCGCGGG CTCCGAATCG GGTTGGTCAG TGACTGCACC
CATGAACTGC CCGCCTTCCT GCCGCAGTTG CCGATCGATC CGCTCCTCGA TGTCCGAGTC
CTCTCGGTCC AGTTCGGGCG CTGCAAGCCC GACCCGGAGC TGTACCGGGC CGCCTGCCGG
CAGCTGGGCC TGACGCCCGC CGCCTGCCTG TACGTGGGGG ACGGGGGGAG TCAGGAACTG
ACCGGGGCGG AGCGGGCCGG GCTGCACGCG GTACGCCTGG CGGCCCCGGA CCTCGCCGGC
CATCTGACCT TCAACCCGGA CGCCGGCTGG ACCGGACCGG AGCTGACCTC CCTGGCCGGG
GTGGTTGACG TGATCGACCG GGCCGATGCC GGGCCGCCGC GGATGACCGA CCCCGATCCG
CACCACCAGC CGGCTGAGGT CAGCGCCGTC GGACGCGGTG CAGCCGGAAC GGCTTACGGG
CTGCCCGAAC CACCGCCGCC GGGCGAGGCG GCTGGGCCAG CGACGTGGTG GGGAGTGCCG
AGCCGGCCAG CGGATCGCCG GCGGGAACGA GGCGGTTGA
 
Protein sequence
MPRIQAVLFD FFGTLTRAVQ RGSAHRTMAE LLGCPPEVFV KVLDRTYYQR ATGALGTAEA 
TLRWVCEQAG VRPSTAALRS AVIARFRAIR ADTRLRTEAV PTLAALRQRG LRIGLVSDCT
HELPAFLPQL PIDPLLDVRV LSVQFGRCKP DPELYRAACR QLGLTPAACL YVGDGGSQEL
TGAERAGLHA VRLAAPDLAG HLTFNPDAGW TGPELTSLAG VVDVIDRADA GPPRMTDPDP
HHQPAEVSAV GRGAAGTAYG LPEPPPPGEA AGPATWWGVP SRPADRRRER GG