Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_10350 |
Symbol | |
ID | 7314623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 1122890 |
End bp | 1123909 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643611474 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002508786 |
Protein GI | 220931878 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase [TIGR01362] 3-deoxy-8-phosphooctulonate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000000384679 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGTGG TTATGAAGGA TAATGCCAGT AAAAATGATA TTGAAAAGGT AGTAAAGAGA ATAGAAGAGC TGGGATATAA AACACATATT TCACGGGGTA CTGAAATAAC ACTTATTGGA ATAATAGGAG AATTAAACCG TGAGGAGCTA ATTGATTCTC TGGGGGCCTA TCCCGGAATT GAAAGGTTGG TTCCTATTCA GGAGCCCTAT AAACTGGCCG GGAAATCCTT TAATGACTCC AGATCCAGGA TTAAAATTGG TGAAGATGTC GTTATTGGTG GTAAAGAAGT TGTAATGATG GCTGGTCCCT GTGCAGTTGA AAGTGAACAG CAGATTATTA ATACAGCCCG GGCAGTAAAA AAGGCAGGTG CTAAAATTCT GAGGGGTGGG GCCTTTAAAC CAAGAACCTC ACCCTACAGT TTTCAGGGTT TACATGAAAA GGGATTAAAA TACCTTAAAA AAGCAGCCGA AGAGACTGGT TTAAAGGTAA TAACAGAAGT AATGGACCCC AGGGATGTTG AATTAGTGGC CAGATATGCT GATATCTTTC AAATCGGGGC CAGGAATATG CAAAACTTTT TCCTGTTAAA GGAAGTTGGA AAAACAGATA AACCGGTTAT GTTGAAGCGT GGTATGAATG CTACTTATAA GGAATTTTTA ATGGCAGCAG AGTATATTAT GTCAGAAGGA AACCATGATG TCATATTATG CGAAAGGGGT ATTAGAACTT TTGAAACATA TACCCGTAAT ACCCTGGATC TGGTTAGTGT TCCTGTTTTA AATAAACTAA GCCACTTACC TGTTGTCATT GACCCCAGTC ATGGAACAGG TCAATGGGAC CTGGTAGGCC CGGCAGCAAG AGGGGCAGTA GCTATAGGGG CAGATGGACT TATTATAGAA GTTCATCCTG AGCCGATTAA TGCCTTAAGT GATGGACAGC AATCCCTTAA ATTTGATAAA TTTGAAGAAC TGGTAGATGA TCTGAAAAAG ATTGCCAGGG CAATAGGTCG TGACCTATAA
|
Protein sequence | MIVVMKDNAS KNDIEKVVKR IEELGYKTHI SRGTEITLIG IIGELNREEL IDSLGAYPGI ERLVPIQEPY KLAGKSFNDS RSRIKIGEDV VIGGKEVVMM AGPCAVESEQ QIINTARAVK KAGAKILRGG AFKPRTSPYS FQGLHEKGLK YLKKAAEETG LKVITEVMDP RDVELVARYA DIFQIGARNM QNFFLLKEVG KTDKPVMLKR GMNATYKEFL MAAEYIMSEG NHDVILCERG IRTFETYTRN TLDLVSVPVL NKLSHLPVVI DPSHGTGQWD LVGPAARGAV AIGADGLIIE VHPEPINALS DGQQSLKFDK FEELVDDLKK IARAIGRDL
|
| |