Gene Information |
Locus tag | Haur_4179 |
Symbol | |
ID | 5736041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5329582 |
End bp | 5330607 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281334 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001546939 |
Protein GI | 159900692 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
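The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The record does not state either assumption, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 5329582
end_bp = 5330607
gene_length_bp = 1026
protein_length_aa = 341


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 5330607 - 5329582 + 1 = 1026,
# and 1026 / 3 - 1 = 341.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```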
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0656088 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGTTG TAATGAAGGC CCACGCTGAC CTCGCCGACC GTGACGCTGT ATTGGCTCGC TTAGCCGAAA ATAAGCTCAA AGGCCATTTA TCGGAAGGCG AGGAACGAAT TGTGATTGGA GTTGTTGGTG CGAAAATTCC AGCGGGCTTG GAAGAACAAT TGCAATCGAT GAGCGGCGTG CAAACAACCT TGCGAATTAC GCGCCCCTAT AAATTAGCTG GCCGCGAGTT TCAACAACAC AATACCGTCA TTCGGATTGG CGATTTGGAA ATTGGCGGCG GTACTCCAGT CGTGATGGCT GGGCCATGTT CGGTCGAAAG TGCTGAGCAA TTATTAAGTA CAGCACACGC GGTCAAAGCA GCCGGAGCCA ATATCCTCCG TGGTGGCGCA TTCAAGCCAC GCACTTCACC CTACGCTTTC CGTGGTTTGG GCGAAGAAGG CCTGAAAATT TTGGCCCAAG CTCGCGAAGA AACAGGTTTA CCAATCATCA CCGAAGCCCT GAATACCCGT GATGTTGAAT TAGTCGCCCG CTACACTGAT ATCATCCAAC TTGGCGCTCG CAATATGCAA AATTTCGCCC TCTTGGAAGA AGCAGGCCAA ACGGGTAAGC CGATCATGGT CAAGCGTGGC CCTTCAGCAA CAGTCGAAGA ATGGTTGTTG GCGGCTGAAT ATATTCTCGC GACTGGTAAT CGCAACGTTA TTTTGTGCGA ACGCGGTATT CGCACCTACG AAACTGCCAC CCGTAACACC CTCGATTTGA ATGCTGTGGC AGTTGCCAAG CGTCGGACTC ACTTGCCGGT GATTGCCGAC CCCAGCCATG GCACTGGCAA ATGGTACTTG GTGCAGCCAA TGGCCTTGGC CGGCTTGGCC GCTGGCGCTG ATGGTCTGAT GATCGAAGTT CACCACGATC CCGACCGTGC TTCTTCCGAT GGCCCTCAAT CACTCAACCA CCACAATTTT GCCCAATTGA TGCAACAAGT TCGTCGCCTG ATCGCAGCTT TGGAGCCAGA ATTAGCAGTT GCCTAG
|
Protein sequence | MIVVMKAHAD LADRDAVLAR LAENKLKGHL SEGEERIVIG VVGAKIPAGL EEQLQSMSGV QTTLRITRPY KLAGREFQQH NTVIRIGDLE IGGGTPVVMA GPCSVESAEQ LLSTAHAVKA AGANILRGGA FKPRTSPYAF RGLGEEGLKI LAQAREETGL PIITEALNTR DVELVARYTD IIQLGARNMQ NFALLEEAGQ TGKPIMVKRG PSATVEEWLL AAEYILATGN RNVILCERGI RTYETATRNT LDLNAVAVAK RRTHLPVIAD PSHGTGKWYL VQPMALAGLA AGADGLMIEV HHDPDRASSD GPQSLNHHNF AQLMQQVRRL IAALEPELAV A
|
| |
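The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The sketch below is a minimal illustration, not the database's own method. It assumes the gene sequence is already given in coding orientation (it begins with ATG and ends with TAG even though the strand is listed as "-"). It uses the standard amino acid assignments, which translation table 11 shares with table 1 for all 64 codons. Helper names such as `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGATCGTTG TAATGAAGGC CCACGCTGAC"
print(translate(prefix))  # MIVVMKAHAD
```

The printed prefix matches the first block of the protein sequence in the record. Passing the full gene sequence string to `gc_percent` and `translate` would allow the same comparison against the GC content and protein sequence fields.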