Gene Information |
Locus tag | Afer_0792 |
Symbol | |
ID | 8322854 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidimicrobium ferrooxidans DSM 10331 |
Kingdom | Bacteria |
Replicon accession | NC_013124 |
Strand | + |
Start bp | 801844 |
End bp | 803199 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644951927 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_003109413 |
Protein GI | 256371589 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
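
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the record does not state either convention. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table; names are hypothetical.
START_BP = 801844
END_BP = 803199


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1356 451
```

Both values match the Gene Length (1356 bp) and Protein Length (451 aa) rows of the table.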
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.616479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCGC TTGACGACGC CGGTTGGCAG CCGTGGGACT GGCAGCAGCG TGTGGCCGCG CAGCAGCCCG AGTGGCCCGA TCCCGAGGCG TTGGACGCGG TCATCAAGGA GTTGGCTCAG CGCCCAGCCC TCGTGGTCGC CAAGGACGTC GACCGGCTAC GAGCCGCACT GGCGCGTGCG GCCCGTGGAC GTGCCTTCGT GCTCCAAGCC GGTGACTGCG CGGAGAGCTT CCACGATCAC TCCGCGAGTT CGCTGCGCGC CAAGCTGAAG ATCATCTTGC AGATGGCCGT GGTCCTGACC TACTCGTCGG GCGTGTCGGT GGTCAAGATC GGGCGTATCG CCGGCCAGTT CGCCAAGCCG AGGTCGGCGC CCGTCGAGGT CGTCGACGGG GCGACCTTGC CGTCGTTTCG CGGGCACATC GTCCATGATG ACGCCCCGAC ACTGGACGCG CGTCGTCCCA ACCCCGAACG CCTACTCTGG GCCTACGATC AGTCCCGAGC GACGGTGAGC GTGTTGCGAG CCCTCACCGA GGGGGGCTTC GCGGATCTCT CCGGGGCGCA TCGGTGGAAC CTCGACTTCG TCGCCTCGTC GCCGGAAGGG CAGCGCTACC AGGCGATCGC CGATGGGGTC GATCGGGCAC TTCGCTTCAT GGCAGGCTGC GGGATCGATC TCGAGCGCGA GGCCGTTCTG CACCAAGTGA ACGTGTGGAC CTCGCACGAG GCGCTCTTAC TGCCCTACGA GGCGGCGCTC ACCCGGCGCG ATCCTGCCTC GCAGCGCTAC TACGACCTCT CGGCCCACAT GGTCTGGGTG GGCGAGCGCA CGCGCCAGCT CGACGGCGCA CACCTGCGTT TTGCATCGGG GATCGCCAAT CCGGTCGGAC TGAAGGTCGG CCCCACGATG GAGCCCGACA CCCTCGTCGA GGCCTGCCGG ATCCTCGATC CTGATCGGAC GCCGGGTCGG CTGGTGCTCA TCTCACGCAT GGGTCACGAC GCCGTGCGTG ATCGCCTCGG AGGTCTCGTC GAGGCCGTAC GTGAGGCGGG CTATCCGGTG GTGTGGCTGT GCGATCCGAT GCACGGCAAC ACCTTCGTCT CCCAGTCAGG CTACAAGACG CGCCGCTTCG AGGACGTGAT GGACGAGATC GCCGGTTTCT TCGAGGTCCA TCGACGGCTT GGTACCCACG CAGGTGGGAT CCACCTCGAG CTCACCGGAG AGGACGTGAC GGAGTGTCTT GGCGGCTCGG AGGCGGTGCT CGAGTCGGAA CTGTGTCGTG CCTACGACAC CATCTGCGAT CCTCGGTTGA ACGCGCGCCA ATCACTCGAC TTGGCGTTTC GCGTCGCTGA ACTCCTGATC CGCTGA
Protein sequence | MNALDDAGWQ PWDWQQRVAA QQPEWPDPEA LDAVIKELAQ RPALVVAKDV DRLRAALARA ARGRAFVLQA GDCAESFHDH SASSLRAKLK IILQMAVVLT YSSGVSVVKI GRIAGQFAKP RSAPVEVVDG ATLPSFRGHI VHDDAPTLDA RRPNPERLLW AYDQSRATVS VLRALTEGGF ADLSGAHRWN LDFVASSPEG QRYQAIADGV DRALRFMAGC GIDLEREAVL HQVNVWTSHE ALLLPYEAAL TRRDPASQRY YDLSAHMVWV GERTRQLDGA HLRFASGIAN PVGLKVGPTM EPDTLVEACR ILDPDRTPGR LVLISRMGHD AVRDRLGGLV EAVREAGYPV VWLCDPMHGN TFVSQSGYKT RRFEDVMDEI AGFFEVHRRL GTHAGGIHLE LTGEDVTECL GGSEAVLESE LCRAYDTICD PRLNARQSLD LAFRVAELLI R
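
The sequences above are printed in space-separated blocks of 10, so they need cleaning before any calculation. The sketch below is a minimal illustration, not the database's own pipeline: it strips the whitespace, computes a GC percentage, and translates the DNA codon by codon. The helper names are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code for internal codons; alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (also valid for table 11
# internal codons). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned CDS, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
prefix = clean("ATGAACGCGC TTGACGACGC CGGTTGGCAG")
print(translate(prefix))  # prints: MNALDDAGWQ
```

The translated prefix agrees with the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.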