Gene Rpal_4679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4679 
Symbol 
ID6412365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5039698 
End bp5040438 
Gene Length741 bp 
Protein Length246 aa 
Translation table11 
GC content66% 
IMG OID642714558 
Producthaloacid dehalogenase, type II 
Protein accessionYP_001993645 
Protein GI192293040 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01428] 2-haloalkanoic acid dehalogenase, type II
[TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATTG CCGCCGTCGT GTTCGATGCC TATGGCACGC TGTACGACAT CCAATCGGTT 
GCGACTGTCA CCGAGCGGGA GTTTCCGGGC TACGGCGAGG TGATCACGCA GATCTGGCGG
ATCAAGCAGC TCGAATACAC TTGGCTGCGC TCGCAGATGG GGACCTACGA AGACTTCGCC
GTCGTCACCC GTGATTCGCT CGCCTACACG CTCGACTGTC TCGGCATCGA GGCTGGCGGC
GGCGCGTTCG AGCGGATCTT CGCCAAATAT CTCGATCTCA CGCTCTATCC CGAAGCGCTG
GCGGCGCTGG AGGCACTCGC GTCCTGCAAG CGAGCGATCC TGTCCAACGG CAGCCCAGAT
ATGCTTGGCG CCCTCACCCG CAACACCGGC CTCGACCGCG TGCTCGACGA CGTGATCAGT
GTCGACGCCG CCAAGGTGTT CAAGCCGCAT CCGCGCGCCT ATGCGCTGGC CGAGGCACGG
CTCGGCGTGG CGCCGCGTGA GATGTTGTTC GTGTCTTCGA ATCCCTGGGA CGTGGCGGGC
GCGAAAGCGT TCGGCTTCAA CGTCGCCTGG ATCGAGCGCG TCAGCCGCGA GGCAATGGCG
CGCGAACTGC GACGGCCAGG GCCGCTGCCA CCGCAGACGT TGTTCAAGGC GCTGCGCACC
CAGATGGACG TGCTCGGCTT CGAGCCCGAC CACCGCATCG GCTCGCTGAC GGCGCTGGTG
GAGATCGTCG CCGGCCGCTG A
 
Protein sequence
MPIAAVVFDA YGTLYDIQSV ATVTEREFPG YGEVITQIWR IKQLEYTWLR SQMGTYEDFA 
VVTRDSLAYT LDCLGIEAGG GAFERIFAKY LDLTLYPEAL AALEALASCK RAILSNGSPD
MLGALTRNTG LDRVLDDVIS VDAAKVFKPH PRAYALAEAR LGVAPREMLF VSSNPWDVAG
AKAFGFNVAW IERVSREAMA RELRRPGPLP PQTLFKALRT QMDVLGFEPD HRIGSLTALV
EIVAGR