Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3977 |
Symbol | |
ID | 5735838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5076437 |
End bp | 5077312 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641281127 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001546737 |
Protein GI | 159900490 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGATCACTC TGGATTGTAA GCTCGCCGCG CGGACGCCCG AACGGCAACG CACGACGGTA GATGTTGGCG GCGTTCGCAT CGGCGCGGAG GAAATTGTCG TCATCGCCGG CCCTTGCAGC GTGGAAAACG AGGAGCAAAT TCTGGCGACG GCGCAGCATG TGCGCGCCGC GGGGGCGCAC ATGCTGCGCG GCGGCGCCTA CAAGCCACGC ACGTCGCCAT ACGCGTTCCG CGGCCTGGGC GAGGAAGGGC TTCAACTGTT GGCCAAAGCC CGCGAAGCAA CCGACCTGCC GGTGGTCACT GAAGTGATGA CGCCGGCCGA TGTGGGATTG GTCGCCGAGT ACGCCGATAT GCTTCAGATT GGCGCGCGGA ATATGCAGAA CTTCCATCTA TTGGAGGCCG TCGGCCGCAT CCAGAAGCCG GTGTTGCTGA AACGCGGCAT GTCGGGGACG ATCCAGGAGT GGCTGCTTGC GGCGGAATAT ATCTTGAACT GTGGTAACCC CAATGTCGTC CTGTGCGAGC GGGGCATCCG CACCTTCGAA CCCAGCCTGC GCAACACGCT CGACCTCGGC GCCTTGGCCT TCGCCAAGGA ACTGAGCCAT CTCCCGGTTA TCGCCGACCC CAGCCATGGC GTGGGGCGTC GCAGCCTGGT CGGCCCGCTT GCCCTGGCCA GCCTCGCCGC CGGGGCCGAC GGGCTGATCT TGGAGGTCCA CCCACATCCC GAGCGCTCGG TTTCGGACTC CCAGCAGACC GTGGATTGCG CCGAATTCGC CGACATTATG CTGCGAGCGG ACGCGGTGGC CTCTGCCGTG GGGCGCAGAC TGCACCTGCC AGCGATCACG CCCGCCTACG CCGAGGCGGC GGATGTTCCT GCCTGA
|
Protein sequence | MITLDCKLAA RTPERQRTTV DVGGVRIGAE EIVVIAGPCS VENEEQILAT AQHVRAAGAH MLRGGAYKPR TSPYAFRGLG EEGLQLLAKA REATDLPVVT EVMTPADVGL VAEYADMLQI GARNMQNFHL LEAVGRIQKP VLLKRGMSGT IQEWLLAAEY ILNCGNPNVV LCERGIRTFE PSLRNTLDLG ALAFAKELSH LPVIADPSHG VGRRSLVGPL ALASLAAGAD GLILEVHPHP ERSVSDSQQT VDCAEFADIM LRADAVASAV GRRLHLPAIT PAYAEAADVP A
|
| |