Gene Strop_4215 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4215 
Symbol 
ID5060699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4779797 
End bp4780477 
Gene Length681 bp 
Protein Length226 aa 
Translation table11 
GC content69% 
IMG OID640476476 
ProductHAD family hydrolase 
Protein accessionYP_001161021 
Protein GI145596724 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01428] 2-haloalkanoic acid dehalogenase, type II
[TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.842339 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGAGG CGGTGTTGTG GGACGTCGAC GACACCCTCT TCGACTTCTC TGGCTCGGAC 
CGGCGGGCGT TGTCGCAGCA CTTCCAGGCG GAGGGTCTAT CCGCCTCAGC GACGTCGTTC
GAGCGTTGGC AGCGGGCCAC CGAGGCTGCC TACGGTCGGT TGGCTGCTGG GGAGCTTACG
TACGAGGAGT GTCAGCGACA GCGGGTCAGT GGGTTCCTGG GGCGGACGCC GGACGACGTG
GAGGCGGACG CGTGGCTCCG GCGCTACACC GCGCTGTTCG AGCACGCGTG GACGGCGTTC
CCGGACGTCA CGGCGGCCCT TGCGGCGCTG CCGCACCGGC ACGGGATCCT GTCCAACTCC
AGCACCACGC ATCAGGAGCG CAGACTGACC GCCCTGGGAT TGCGGCACCA CTTCGAGGTG
CTGCTGTGTT CGGACCGACT TGCCCGGGCG AAGCCGGATC CCTCGGCGTT CCTGGCCGGC
TGCGGATCCC TCGGGTTGCC GCCGGAAGCG GTCGCCTACG TGGGTGACCA GTTGGACACG
GACGCAGTGG CCGCTCGCGA CGCGGGACTG GTCGGAATCT GGCTGGATCG CGCACGATCC
ACAGTGCCGG TGCCGGTGGG TGTGCACCGG ATCCCGAGCC TCGCTGATCT CCCCCGGCTG
CTTGCCTCGC TGAGTGGATG A
 
Protein sequence
MIEAVLWDVD DTLFDFSGSD RRALSQHFQA EGLSASATSF ERWQRATEAA YGRLAAGELT 
YEECQRQRVS GFLGRTPDDV EADAWLRRYT ALFEHAWTAF PDVTAALAAL PHRHGILSNS
STTHQERRLT ALGLRHHFEV LLCSDRLARA KPDPSAFLAG CGSLGLPPEA VAYVGDQLDT
DAVAARDAGL VGIWLDRARS TVPVPVGVHR IPSLADLPRL LASLSG