Gene Hlac_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_1247 
Symbol 
ID7399515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp1257875 
End bp1259662 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content71% 
IMG OID643708311 
Productdihydroxy-acid dehydratase 
Protein accessionYP_002565909 
Protein GI222479672 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.853909 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAC AGCAGCCGCG GTCGGAGGGA GACGGCTCCC GAGGCCGCGA CGACGCGGAC 
CGGTTCGCGG GCGAGAAGGA CGAGAACCTG CGGAGCAGAG ACGTGACGGA GGGGGCGGAC
AAGGCCCCGC ACCGGGCGAT GTTCCGGGCG ATGGGGTTCG ACGACGAGGA CCTCTCCTCG
CCCATCATCG GCGTGCCGAA CCCGGCGGCC GACATCACGC CGTGTAACGT CCACCTCGAC
GACGTGGCGG ACGCCGCGAT CGAGGGGATC GACGCGGCGG GCGGGATGCC GATCGAGTTC
GGGACGATCA CCATCTCCGA CGCCATCTCG ATGGGGACCG AGGGGATGAA GGCGAGCCTC
ATCTCCCGCG AGGTGATCGC CGACTCCGTC GAGCTGGTCT CCTTCGGCGA GCGGATGGAC
GCGCTGGTGA CGGTGGCGGG CTGTGACAAG AACCTACCCG GCATGCTGAT GGCCGCGATC
CGCACCGACC TCCCGTCGGT GTTCCTCTAC GGCGGCTCGA TCATGCCCGG CCAGCACGAC
GGCCGCGACG TGACCATCGT GCAGGTGTTC GAGGGGGTCG GCGCCTACGC CGAAGGCGAC
ATGAGCGGCG AGGAGCTCGA CGATCTGGAG CGGCACGCCT GCCCCGGCGC GGGCTCCTGT
GGCGGGATGT TCACCGCCAA CACGATGGCC TCTATCGCCG AGGCGCTCGG GATGGCCCCG
CTCGGCTCCG CCTCCGCGCC CGCCGAGAAC CGGGAGCGCT ACGAGGTCGC TGAGCGCGCC
GGCGAACTCG CCGTGGACTG CATCGAGAAC GACCGCCGTC CCTCCGACAT CCTCTCGCGG
GAGTCGTTCG AGAACGCGAT CGCGCTCCAG ACCGCCATCG GCGGCTCCAC AAACGGTGTC
CTCCACCTCC TCGCGCTGGC CGCGGAGGCC GACGTGGACC TCTCGATCGA GGACTTCGAC
GAGATCTCGC GGCGCACGCC GAAGATCGCG AACCTCCAGC CCGGCGGGAG CCGCGTCATG
AACGACCTCC ACGAGATCGG CGGCGTCCCC GTTGTACTCC GACGCCTCTT GGAGGCCGAC
CTGCTCCACG GCGACGCGAT GACCGTCACC GGACGCACCC TCGCCGAGGA GCTGGCGGAG
TTGGAGGACC GCGGCGCGCT CCCGGACGAC GACGAGATCG AGGCGGACTT CCTCTACACC
GTCGACGACC CCAAGCAGGC GGAGGGCGCC ATCAAGATCC TCGACGGCAA CCTCGCGCCC
GAGGGCGCCG TCCTGAAGGT GACCGGCGAC GACGCCTTCT ACCACGAGGG GCCGGCGCGG
ATCTTCGAGA ACGAGGAGGA CGCGATGGAG TACGTTCAGT CGGGCGCGAT CGACTCCGGC
GACGTGATCG TGATCCGCAA CGAGGGGCCG ACCGGCGGCC CGGGAATGCG CGAGATGCTC
GGCGTCACCG CCGCCGTCGT CGGCGCGGGC CACGAGGAGG ACGTGGCGCT GCTCACGGAC
GGCCGCTTCT CGGGCGGGAC CCGCGGCCCG ATGATCGGCC ACATCGCGCC CGAGGCGGCC
GACGGCGGCC CGATCGGGCT CATCGAGGAC GGCGACCACG TCACCGTCGA CATCCCGGAG
CGCGACCTCA CGGTCGACCT CTCCGAGGAG GAACTGGCCG AGCGCCGCGA GGAGTGGGAG
GCGCCGGCGC CCCAGTACGA GGGCGGCGTC CTCGCGAAGT ACGCGCGCGA CTTCGCCTCC
GCCTCCGACG GCGCGGTGAC GAACCCGCGG CTCACGCGGG ATTTATAA
 
Protein sequence
MSEQQPRSEG DGSRGRDDAD RFAGEKDENL RSRDVTEGAD KAPHRAMFRA MGFDDEDLSS 
PIIGVPNPAA DITPCNVHLD DVADAAIEGI DAAGGMPIEF GTITISDAIS MGTEGMKASL
ISREVIADSV ELVSFGERMD ALVTVAGCDK NLPGMLMAAI RTDLPSVFLY GGSIMPGQHD
GRDVTIVQVF EGVGAYAEGD MSGEELDDLE RHACPGAGSC GGMFTANTMA SIAEALGMAP
LGSASAPAEN RERYEVAERA GELAVDCIEN DRRPSDILSR ESFENAIALQ TAIGGSTNGV
LHLLALAAEA DVDLSIEDFD EISRRTPKIA NLQPGGSRVM NDLHEIGGVP VVLRRLLEAD
LLHGDAMTVT GRTLAEELAE LEDRGALPDD DEIEADFLYT VDDPKQAEGA IKILDGNLAP
EGAVLKVTGD DAFYHEGPAR IFENEEDAME YVQSGAIDSG DVIVIRNEGP TGGPGMREML
GVTAAVVGAG HEEDVALLTD GRFSGGTRGP MIGHIAPEAA DGGPIGLIED GDHVTVDIPE
RDLTVDLSEE ELAERREEWE APAPQYEGGV LAKYARDFAS ASDGAVTNPR LTRDL