Gene Haur_1592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1592 
Symbol 
ID5733479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1845412 
End bp1846212 
Gene Length801 bp 
Protein Length266 aa 
Translation table11 
GC content55% 
IMG OID641278731 
ProductHAD family hydrolase 
Protein accessionYP_001544363 
Protein GI159898116 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0647] Predicted sugar phosphatases of the HAD superfamily 
TIGRFAM ID[TIGR01452] phosphoglycolate/pyridoxal phosphate phosphatase family
[TIGR01458] HAD-superfamily subfamily IIA hydrolase, TIGR01458
[TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCAATC TCAAGCAGCT CAAGCTTGTT TTGCTCGATA TGGATGGGGT GCTGCATCGC 
GGCGGGGAGA TTTTACCAGG TGCGGCAGAG TTGACGACTG TGCTTGATCG CTTAGGCTTA
GGCTATGCCT GTTTGACCAA TAATTCATCG CAATTGCCTG CCACCTTCGC CCGTCATCTG
CAAGATTTAG GGGTTGCGAT TGCGCCTGAG CATGTGATTA CCTCGTCAAC TGCTACGGCT
ACGCTGTTGC GCACGCGCTA CCCGCAAGGC ACGCGCTTGC TGGCAATCGG CATGGATGGG
ATTCAGTCGT CGTTATTTGC TGATCGCTAT TTTGTATCAG CCGAAACCGA TGTAGCAGCA
GTGGTGGTTG GGGTTGATTT TAACCTGACC TATGCCCGCT TGAAAACTGC AACCTTGGCG
TTACGCGCAG GCGCAGCGTT TATTGCTACC AACAGCGACC GTACATTTCC TGCACCTGAA
GGCTTGATTC CTGGGGCTGG CTCGATTGTA GCAGCCTTGG CAGCCGCTAG CGATTGCACG
CCCGAAGTGA TTGGCAAACC TGAACCAGCC ATGTTCGAAG CGGCCTTGCA GTTGTTTGGA
GTAACCGCCG AACAAACCTT GATGGTCGGC GATCGGTTGG ATACCGATAT TGCAGGAGCG
CAGCGGGTTG GCATTGCCAC GGCCTTTGTG GGCAGCGGCG TACATAGCAT GCAACAAGCC
CAAGCCTGGC AACCAGCAAT CGATTTGGTG GCTGATGATT TGGCAGGCAT TTTGGCCTTG
CTCAGGGCTG GGCGGGAGTA G
 
Protein sequence
MLNLKQLKLV LLDMDGVLHR GGEILPGAAE LTTVLDRLGL GYACLTNNSS QLPATFARHL 
QDLGVAIAPE HVITSSTATA TLLRTRYPQG TRLLAIGMDG IQSSLFADRY FVSAETDVAA
VVVGVDFNLT YARLKTATLA LRAGAAFIAT NSDRTFPAPE GLIPGAGSIV AALAAASDCT
PEVIGKPEPA MFEAALQLFG VTAEQTLMVG DRLDTDIAGA QRVGIATAFV GSGVHSMQQA
QAWQPAIDLV ADDLAGILAL LRAGRE