Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_1247 |
Symbol | |
ID | 7399515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1257875 |
End bp | 1259662 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643708311 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_002565909 |
Protein GI | 222479672 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.853909 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAC AGCAGCCGCG GTCGGAGGGA GACGGCTCCC GAGGCCGCGA CGACGCGGAC CGGTTCGCGG GCGAGAAGGA CGAGAACCTG CGGAGCAGAG ACGTGACGGA GGGGGCGGAC AAGGCCCCGC ACCGGGCGAT GTTCCGGGCG ATGGGGTTCG ACGACGAGGA CCTCTCCTCG CCCATCATCG GCGTGCCGAA CCCGGCGGCC GACATCACGC CGTGTAACGT CCACCTCGAC GACGTGGCGG ACGCCGCGAT CGAGGGGATC GACGCGGCGG GCGGGATGCC GATCGAGTTC GGGACGATCA CCATCTCCGA CGCCATCTCG ATGGGGACCG AGGGGATGAA GGCGAGCCTC ATCTCCCGCG AGGTGATCGC CGACTCCGTC GAGCTGGTCT CCTTCGGCGA GCGGATGGAC GCGCTGGTGA CGGTGGCGGG CTGTGACAAG AACCTACCCG GCATGCTGAT GGCCGCGATC CGCACCGACC TCCCGTCGGT GTTCCTCTAC GGCGGCTCGA TCATGCCCGG CCAGCACGAC GGCCGCGACG TGACCATCGT GCAGGTGTTC GAGGGGGTCG GCGCCTACGC CGAAGGCGAC ATGAGCGGCG AGGAGCTCGA CGATCTGGAG CGGCACGCCT GCCCCGGCGC GGGCTCCTGT GGCGGGATGT TCACCGCCAA CACGATGGCC TCTATCGCCG AGGCGCTCGG GATGGCCCCG CTCGGCTCCG CCTCCGCGCC CGCCGAGAAC CGGGAGCGCT ACGAGGTCGC TGAGCGCGCC GGCGAACTCG CCGTGGACTG CATCGAGAAC GACCGCCGTC CCTCCGACAT CCTCTCGCGG GAGTCGTTCG AGAACGCGAT CGCGCTCCAG ACCGCCATCG GCGGCTCCAC AAACGGTGTC CTCCACCTCC TCGCGCTGGC CGCGGAGGCC GACGTGGACC TCTCGATCGA GGACTTCGAC GAGATCTCGC GGCGCACGCC GAAGATCGCG AACCTCCAGC CCGGCGGGAG CCGCGTCATG AACGACCTCC ACGAGATCGG CGGCGTCCCC GTTGTACTCC GACGCCTCTT GGAGGCCGAC CTGCTCCACG GCGACGCGAT GACCGTCACC GGACGCACCC TCGCCGAGGA GCTGGCGGAG TTGGAGGACC GCGGCGCGCT CCCGGACGAC GACGAGATCG AGGCGGACTT CCTCTACACC GTCGACGACC CCAAGCAGGC GGAGGGCGCC ATCAAGATCC TCGACGGCAA CCTCGCGCCC GAGGGCGCCG TCCTGAAGGT GACCGGCGAC GACGCCTTCT ACCACGAGGG GCCGGCGCGG ATCTTCGAGA ACGAGGAGGA CGCGATGGAG TACGTTCAGT CGGGCGCGAT CGACTCCGGC GACGTGATCG TGATCCGCAA CGAGGGGCCG ACCGGCGGCC CGGGAATGCG CGAGATGCTC GGCGTCACCG CCGCCGTCGT CGGCGCGGGC CACGAGGAGG ACGTGGCGCT GCTCACGGAC GGCCGCTTCT CGGGCGGGAC CCGCGGCCCG ATGATCGGCC ACATCGCGCC CGAGGCGGCC GACGGCGGCC CGATCGGGCT CATCGAGGAC GGCGACCACG TCACCGTCGA CATCCCGGAG CGCGACCTCA CGGTCGACCT CTCCGAGGAG GAACTGGCCG AGCGCCGCGA GGAGTGGGAG GCGCCGGCGC CCCAGTACGA GGGCGGCGTC CTCGCGAAGT ACGCGCGCGA CTTCGCCTCC GCCTCCGACG GCGCGGTGAC GAACCCGCGG CTCACGCGGG ATTTATAA
|
Protein sequence | MSEQQPRSEG DGSRGRDDAD RFAGEKDENL RSRDVTEGAD KAPHRAMFRA MGFDDEDLSS PIIGVPNPAA DITPCNVHLD DVADAAIEGI DAAGGMPIEF GTITISDAIS MGTEGMKASL ISREVIADSV ELVSFGERMD ALVTVAGCDK NLPGMLMAAI RTDLPSVFLY GGSIMPGQHD GRDVTIVQVF EGVGAYAEGD MSGEELDDLE RHACPGAGSC GGMFTANTMA SIAEALGMAP LGSASAPAEN RERYEVAERA GELAVDCIEN DRRPSDILSR ESFENAIALQ TAIGGSTNGV LHLLALAAEA DVDLSIEDFD EISRRTPKIA NLQPGGSRVM NDLHEIGGVP VVLRRLLEAD LLHGDAMTVT GRTLAEELAE LEDRGALPDD DEIEADFLYT VDDPKQAEGA IKILDGNLAP EGAVLKVTGD DAFYHEGPAR IFENEEDAME YVQSGAIDSG DVIVIRNEGP TGGPGMREML GVTAAVVGAG HEEDVALLTD GRFSGGTRGP MIGHIAPEAA DGGPIGLIED GDHVTVDIPE RDLTVDLSEE ELAERREEWE APAPQYEGGV LAKYARDFAS ASDGAVTNPR LTRDL
|
| |