Gene Information |
Locus tag | Hhal_0996 |
Symbol | |
ID | 4709568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1068145 |
End bp | 1069221 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639855467 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001002574 |
Protein GI | 121997787 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
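The coordinate fields above are mutually consistent, which can be checked with a small sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Hhal_0996.
length = gene_length_bp(1068145, 1069221)
print(length)                     # 1077
print(protein_length_aa(length))  # 358
```

Both results match the Gene Length and Protein Length rows of the record.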
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGAGA GCGATCACGT CAACGACGTC AATGTGGCCA CCGGTGAACG CCTTCCGTCT CCGGCCGAGA TCAAGTCCGA GGTCCCGCTG ACGGATGCCG CCCGGCAGAC CGTGCTGGAC GGGCGGCAGG TCCTGCGGGA CATCCTCGAC GGCAAGGATC AGCGCATCTT CGCCGTGGTC GGGCCGTGCT CCATCCACGA CCCGGAGGCG GCGCTCGACT ACGCGCGGCG GCTCAAGGCG CTGCACGATG AGCTCAGCGA TCACATCTAC TTGGTGATGC GGGTTTACTT CGAGAAACCG CGCACCACCA CGGGCTGGAA GGGGCTGATC AACGACCCGG ACATGGACGA CTCCTTCCGG ATCGATAAGG GCCTGCGCAT GGGCCGCGAG CTGCTCCGCG AGATCGCCGC CATGGGGCTG CCCACGGCGA CTGAGGCCCT CGACCCCTAC GCACCGCAAT ACTACGGCGA CCTGGTTTCG TGGACCGCGA TCGGCGCGCG TACCACCGAG TCCCAGACCC ACCGCGAGAT GGCCAGCGGG CTGTCCACGC CGGTGGGCTT CAAGAACGCC ACTGACGGCA GCCAGACGGT GGCGATCAAC GCCCTGCAAT CGGCGGCGTC CCCCCACAGT TTCCTGGGCA TCGACCAGGA GGGCCGCATC ACCGTCATCC GTACCCGGGG CAACCAGTAC GGCCACGTCG TGCTGCGGGG CGGGGCGCAG CCCAACTACG ACTCGGTGAG TATCCGGCTG TGCGAGCAGG CGCTGGAGAA GGCCGGCATG CCGCTGCGCG TGGTGGTCGA CTGCAGCCAC TCCAACTCCA ACAAGGATCC CGGGCTACAG TCGATGGTGC TGGAGGACGT GATCCGTCAG CTCCGCGAGG GCAATCGCTC CATCGTCGGG GTGATGCTGG AGAGCAACAT CGGTTGGGGC AGTCAGAAGC TCGGTGCCGA TCCGGGTGCC CTGGACTACG GCATCTCCAT CACCGATGCT TGTATCGACT GGGAGACCAC CGAACAGGTT CTCCGGGACG CCGCGGGGCA GCTGCGCGGC AGCCTGCGCG AGCGCGAGCT GCTGTAG
|
Protein sequence | MQESDHVNDV NVATGERLPS PAEIKSEVPL TDAARQTVLD GRQVLRDILD GKDQRIFAVV GPCSIHDPEA ALDYARRLKA LHDELSDHIY LVMRVYFEKP RTTTGWKGLI NDPDMDDSFR IDKGLRMGRE LLREIAAMGL PTATEALDPY APQYYGDLVS WTAIGARTTE SQTHREMASG LSTPVGFKNA TDGSQTVAIN ALQSAASPHS FLGIDQEGRI TVIRTRGNQY GHVVLRGGAQ PNYDSVSIRL CEQALEKAGM PLRVVVDCSH SNSNKDPGLQ SMVLEDVIRQ LREGNRSIVG VMLESNIGWG SQKLGADPGA LDYGISITDA CIDWETTEQV LRDAAGQLRG SLRERELL
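The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes that the gene sequence is shown in coding orientation (it starts with ATG and ends with TAG, although the gene lies on the minus strand of the replicon) and relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, differing only in the permitted start codons. Only the first 30 bases of the record are used here; the helper names are illustrative.

```python
from itertools import product

# Codon assignments in the conventional TCAG order. Translation table 11
# shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
gene_prefix = "ATGCAAGAGA GCGATCACGT CAACGACGTC"
print(translate(gene_prefix))  # MQESDHVNDV
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives the value that the GC content row reports in rounded form.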
|
| |