Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1801 |
Symbol | |
ID | 5733703 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2090174 |
End bp | 2091031 |
Gene Length | 858 bp |
Protein Length | 285 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278944 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001544572 |
Protein GI | 159898325 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAATG CAAACGCATC GACCAAGCCC GCGCCGTATC GACTTGCTGC TCGTACCGAT GCTCACCAAG CCACAATTGT GGATCTTGGT TCGGTCAAAA TTGGCGGTGG CCAACCAGTT GTCATGGCTG GCCCATGCTC AGTCGAATCA GAAAGCCAAT TGCTCAACAC GGCCTATGCA GTGGCCGAAG CTGGGGCTCA TATGCTGCGC GGCGGGGCTT TCAAACCACG CACATCACCC TATGCCTTCC GTGGCTTGGG CGAAGCAGGC TTGAAGATTT TGGCGAAAGC TCGCGCCGAA ACTGGCTTGC CAATTATTAC CGAAGCGCTC AACACCGCTG ATCTAGATTT GGTGGCTGAA TACACCGATG TGATTCAAAT TGGCGCACGC AATATGCAAA ATTTTGCCTT GCTCGAAGCT GCTGGTCGCA CTGGCCGCCC AGTTATGGTC AAACGTGGCC CAGCTGGCAC GATTGAAGAA TGGCTATTGG CTGCCGAATA TGTGTTGGCA ACCGGCAATC CCAATGTGAT TTTGTGTGAG CGTGGCATTC GCACCTTTGA AAATGCTACC CGCAACACCC TCGATCTGAA TGCTGTGGCA ATGGCCAAAC ATCGCAGCCA TCTGCCGGTG ATCGTTGACC CCAGCCATGG CACTGGCAAA TGGTACTTGG TTGCGCCCTT GGCCTTGGCA GGCTTGGCGG TTGGCGGCGA TGGCTTGATG ATCGAAGTTC ACCATGATCC TGATCATGCT AGCTCCGATG GCCCTCAATC GTTGAACCAC GAACATTTTG CTGATTTGAT GGAAAAAATC AATCAACAAT CCGCCCAGCC AGTTGCGCGG GAAGTGGGTT TGAACTAA
|
Protein sequence | MSNANASTKP APYRLAARTD AHQATIVDLG SVKIGGGQPV VMAGPCSVES ESQLLNTAYA VAEAGAHMLR GGAFKPRTSP YAFRGLGEAG LKILAKARAE TGLPIITEAL NTADLDLVAE YTDVIQIGAR NMQNFALLEA AGRTGRPVMV KRGPAGTIEE WLLAAEYVLA TGNPNVILCE RGIRTFENAT RNTLDLNAVA MAKHRSHLPV IVDPSHGTGK WYLVAPLALA GLAVGGDGLM IEVHHDPDHA SSDGPQSLNH EHFADLMEKI NQQSAQPVAR EVGLN
|
| |