Gene Information |
Locus tag | Cagg_1819 |
Symbol | |
ID | 7267731 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 2231114 |
End bp | 2232169 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643566657 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002463152 |
Protein GI | 219848719 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
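The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the last
    codon is a stop codon, which is not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Cagg_1819).
length = gene_length(2231114, 2232169)
print(length, protein_length(length))
```

With the values from this record, the results agree with the listed gene length (1056 bp) and protein length (351 aa).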
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0334818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGA TACCATTAGC CAATCTACAC GTTCGTGAAC TATCACCGCT TCAACCACCA CGTGTCCTAA AAGCCGAACT TCCGATCAGC CCGGCTGCGG CCCATACGGT TGCCGAAGCC CGGGCCGCGA TTCGGCGGAT TTTACGTGGT GAAGATCACC GCCGGATCAT GGTGGTTGGG CCATGCTCGA TCCATGATCC CGAAGCCGCA CTTGATTATG CTCGTCGCCT CCAAGCGCTG CAACGCCCGC TTGGCGATCA GTTGCTGATC GTGATGCGCA CCTATCTCGA AAAGCCGCGC ACTACCGTCG GCTGGCGTGG CTTGATCAAT GACCCCCATC TCGACGGCTC GTTCGATATG GCCGCCGGTC TGCGTATTGC CCGTCAATTG CTTCTGGCGA TCAACGAACT CGGCGTGCCG GTCGCGACCG AGATGCTTGA TCCGATTAGT CCGCAATACC TCGACGACCA AATCAGCCTC GCGACGATCG GCGCCCGCAC GAGCGAAGCG CAAACGCACC GAGCGTTGGC CAGTGGGGTT TCGATGCCGG TTGGTTTTAA GAATGGCACT GATGGCGGTA TTCAGATCGC TGTCAATGCC TGTGTTTCGG CAGCGGCACC ACACAGCTTT CTCGGTATCG ATGAAGATGG GCGCAGTGCA GTGGTACGTA CCACCGGTAA TCCTGATAGT TTCGTTATTT TGCGTGGTGG CCGCTACGGC CCCAATTATC ATCTCGAGTA TATCGTGCAG GCAACGCGGT TGATGCGCGA AGCCGAACGA ACTCCGGCAG TGATGGTCGA TTGCAGTCAC GCCAACTCCG GTGGCGATTT TCGCCGTCAA GAAGCGGTTT GGCAAACGGT ACTCGGCTAT ATGGTCGAAG AAGAGTTGCC GATCATCGGG ATGATGTTGG AGAGTAATTT GTTTGAAGGG AAGCAACCAC TTCTTGCCGA CCGCAGCCTG CTGAAGTACG GTGTTTCACT GACCGACGGT TGTGTTGGGT GGGACACAAC CGAACGCTTG TTACACGAGG CCCATCTGGC GCTGAGCAGG CGCTAA
|
Protein sequence | MNQIPLANLH VRELSPLQPP RVLKAELPIS PAAAHTVAEA RAAIRRILRG EDHRRIMVVG PCSIHDPEAA LDYARRLQAL QRPLGDQLLI VMRTYLEKPR TTVGWRGLIN DPHLDGSFDM AAGLRIARQL LLAINELGVP VATEMLDPIS PQYLDDQISL ATIGARTSEA QTHRALASGV SMPVGFKNGT DGGIQIAVNA CVSAAAPHSF LGIDEDGRSA VVRTTGNPDS FVILRGGRYG PNYHLEYIVQ ATRLMREAER TPAVMVDCSH ANSGGDFRRQ EAVWQTVLGY MVEEELPIIG MMLESNLFEG KQPLLADRSL LKYGVSLTDG CVGWDTTERL LHEAHLALSR R
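The sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. The helper names are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled specially here, which is acceptable for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS up to, but not including, the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    seq = clean_sequence(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
print(translate("ATGAACCAGA TACCATTAGC CAATCTACAC"))
```

Passing the full gene sequence to `gc_percent` and `translate` would allow the listed GC content and protein sequence to be compared against the nucleotide data.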