Gene BBta_1654 details

Gene Information

Locus tag: BBta_1654
Symbol:
ID: 5154014
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 1728158
End bp: 1728913
Gene Length: 756 bp
Protein Length: 251 aa
Translation table: 11
GC content: 63%
IMG OID: 640556612
Product: 2-haloacid dehalogenase
Protein accession: YP_001237770
Protein GI: 148253185
COG category: [R] General function prediction only
COG ID: [COG1011] Predicted hydrolase (HAD superfamily)
TIGRFAM ID: [TIGR01428] 2-haloalkanoic acid dehalogenase, type II
            [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
            [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
            [TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E
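
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly, and that the unspliced CDS ends in a single stop codon. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: one codon per residue,
    minus the terminal stop codon (assumed to be included in the CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record.
START_BP = 1728158
END_BP = 1728913

gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)
```

Under those assumptions the computed values agree with the listed gene length (756 bp) and protein length (251 aa).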


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGCGCCG TGCCATTCAA AGCTGTGGTC TTTGACGCCT ATGGCACGCT GTACGACATC 
CAGTCGGTGG CGGCCGTCAC CGAACAGGCC TTCCCCGGCC ATGGCGACAT CATCACCCAG
ATCTGGCGCA TCAAGCAGCT CGAATACACG TGGCTGCGCT CGCTGATGCA GCGCTACGAG
GATTTTGCCG TCGTCACCCG CGACTCGCTG ATCTACACGC TGCGCATTCT CGGGCTTCCC
GCTGATGCAG CCACGCTGGA CCGCATCATC GCCAAGTACC TCGATCTGGA TCCCTATCCC
GACGCACGGG CCGCGCTTGC AGCGATGAAG CCTCACCGGC TTGCGACCCT CTCCAACGGC
AGCCCCGCGA TGCTCGGCGC GCTGGTCAAG GCCAGCGGTT TCGACACGCT GCTCGACGCC
GTCATCAGCG TCGATGCGCG CCAGATCTTC AAGCCGAGCC CAGAGGCCTA TGCGCTGATT
GAGCAGACGC TCGGCCTGGC GCCGTCCGAT GTGCTGTTCG TGTCGTCCAA TCCGTGGGAC
GTGTGCGGCG CCAAGGCCTT TGGATTGAGC GTCGCCTGGA TCGAGCGTGT CACGCCGCAG
GCCATGGCTG CAGCCTGCAT CAAGACCGAG TTGGTGGCGC CGCTGACCCT GTTCAAGGCG
CTGCGCACCC AGATGGACGA GTTGGGTTTT GCGCCCGATT ATCGTATTGC TGCGCTGTCC
GACCTGCCTG AGATCGCAAG CCAGCATGAG CCTTGA
 
Protein sequence
MSAVPFKAVV FDAYGTLYDI QSVAAVTEQA FPGHGDIITQ IWRIKQLEYT WLRSLMQRYE 
DFAVVTRDSL IYTLRILGLP ADAATLDRII AKYLDLDPYP DARAALAAMK PHRLATLSNG
SPAMLGALVK ASGFDTLLDA VISVDARQIF KPSPEAYALI EQTLGLAPSD VLFVSSNPWD
VCGAKAFGLS VAWIERVTPQ AMAAACIKTE LVAPLTLFKA LRTQMDELGF APDYRIAALS
DLPEIASQHE P
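
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 assigns amino acids the same way as the standard code and that a GTG or TTG initiator is read as methionine. The `START_CODONS` set is a deliberately partial list of table 11 initiators, and `translate_cds` is a hypothetical helper, not part of any database tooling.

```python
from itertools import product

# Codons enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a partial set of table 11 initiators, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, first_is_start: bool = True) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and first_is_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
head = "GTGAGCGCCG TGCCATTCAA AGCTGTGGTC TTTGACGCCT ATGGCACGCT GTACGACATC"
tail = "GACCTGCCTG AGATCGCAAG CCAGCATGAG CCTTGA"

print(translate_cds(head))                        # MSAVPFKAVVFDAYGTLYDI
print(translate_cds(tail, first_is_start=False))  # DLPEIASQHEP
```

The two fragments reproduce the first 20 and last 11 residues of the protein sequence listed above, including the leading M encoded by the GTG start codon.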