Gene Hhal_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2083 
Symbol 
ID4710087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2286876 
End bp2287556 
Gene Length681 bp 
Protein Length226 aa 
Translation table11 
GC content71% 
IMG OID639856557 
Productphosphoglycolate phosphatase 
Protein accessionYP_001003649 
Protein GI121998862 
COG category[R] General function prediction only 
COG ID[COG0546] Predicted phosphatases 
TIGRFAM ID[TIGR01449] 2-phosphoglycolate phosphatase, prokaryotic
[TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.609164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTCT CGCTAACCCG CGCGATTCTC TTCGATCTGG ACGGGACGCT GGTGGACAGC 
GCCCCGGATC TGACCGTCGC CATCAACCAG GTGCTCGCCG AGCGGGACCA CGCCCCGGTG
ACCGAGGAGC AGGTCCGCGG TTGGGTGGGC AATGGTGCCC GTCGGCTGGT GGCCCGCGCC
CTCACCGGCG CGGACGACGG CAACCCGCCA GAAGAGGAGC TGGACGCCGC GCTCGAGCGC
TTCTTCGAGT GCTACGGCGA GGCCGTCTAC GTACACAGCC GCCCCTACCC GGAGGCCGTC
GAGACCCTGC AGGCCCTGGC CCAGGCCGGC ATGCGTCTGG CCGTGGTCAC CAACAAGCCG
CGGCGCTTCG CCGAGCCGAT CCTCCAGGGC ATGGGGGTGA CGGATGCCAT CGACGTGGTC
GTTGGCGGCG AGTGCACCGA GGCCCGCAAG CCCGACCCGG AGCCGCTCCG GCTGGCCATG
GAGCGCCTGG GGGCGGCGTC GCGGACCGTG CTGATGGTCG GTGACTCGCG AACCGACGTG
GAGGCGGCGC GCAACGCCGG TATTCCGGTG GTGTGTGTGC CCTACGGCTA CCGCCGCGGG
GTGGCGCTGG AAGACCTGGG CGCCGACGCC ATCGTGGATG ATCTCAGCGG CGTGGTCGCG
CTGCTACGCG AGGCGGCCTG A
 
Protein sequence
MDLSLTRAIL FDLDGTLVDS APDLTVAINQ VLAERDHAPV TEEQVRGWVG NGARRLVARA 
LTGADDGNPP EEELDAALER FFECYGEAVY VHSRPYPEAV ETLQALAQAG MRLAVVTNKP
RRFAEPILQG MGVTDAIDVV VGGECTEARK PDPEPLRLAM ERLGAASRTV LMVGDSRTDV
EAARNAGIPV VCVPYGYRRG VALEDLGADA IVDDLSGVVA LLREAA