Gene Information |
Locus tag | Clim_0968 |
Symbol | |
ID | 6355417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 1061134 |
End bp | 1062195 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642668592 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001943023 |
Protein GI | 189346494 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
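The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Clim_0968.
length = gene_length(1061134, 1062195)
print(length)                  # 1062
print(protein_length(length))  # 353
```

Both results match the Gene Length (1062 bp) and Protein Length (353 aa) fields reported in the record.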
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.110398 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACAGT TACATGATTT GAGAGTTTCA CGCATCAAGC GCCTGTCATC TCCAGGGGCG CTCAAGGATA AATTGCCGGT TAATGATCGT ATTGCATCAA CCGTCAGCTC GGGTCGTCGT GAAGTAGAGA ATATATTGAA CGGTACGGAC AATCGCCTGC TTGTAATCGT CGGCCCCTGC TCGATCCATA ATGTGGATGC CGCCCTCGTT TATGCAGAAA AGCTTTCCGG AATGAGAAGC GAGCTCAGGA GTGAGCTTTG CATCCTGATG CGGGTCTATT TTGAAAAGCC GAGGACAACG GTGGGGTGGA AAGGATTTAT CAACGATCCT CATCTCGACG ATTCATACGA TATAGAACAT GGCCTCTATT ATGCCCGCAA GCTGCTGATC GATATCAATG CGCTTGGCCT TCCGGCGGCG ACGGAGTTTC TCGATCCCAT TACTCCTCAG TATGTTGCCG ATGTGGTGAG CTGGGCGGCT ATAGGCGCCA GAACCATAGA ATCACAGACG CACCGGCAGA TGGCCAGCGG TCTCTCAATG CCTGTCGGGT TTAAAAATTC GACCGACGGA CGCATCAATG TCGCCGTCGA TGCGATTCGC TCGGCAATGC ATCCGCACAG TTTCCTGGGA ATCGATCGTG AGGGTCACAG CAGTGTCATC ACTACGAAAG GAAATCCTTA TGGTCATCTC GTGCTCAGAG GCGGCATGAC GCCGAATTAC GACGCGCAAA GTATTGCTGC TGCGGAACAA CTGCTTGAGA AAGCCGGACT TTCCCAGACC CTTCTGGTGG ATTGCAGTCA TGCCAATTCC GGCAAGAAAC ACGCCCAGCA GCTTAAAGTC TGGGAAAATA TTCTTGAACA GAAAGCCCGC GGCAACAGAA GTATCGCCGG GGTCATGATC GAAAGCAATC TCTGTTCAGG AAACCAGCCC TTTCCCGAAG ACCCGGAAAA ACTCAGATAT GGCGTTTCAA TAACCGACGA ATGTATCTCT TGGGAGGAGA CCGAACGGAT GCTCCGTCAG GGCGCTGACG TTATCGCAAA ACTGATGTCA AAAGAAGCAT AA
|
Protein sequence | MEQLHDLRVS RIKRLSSPGA LKDKLPVNDR IASTVSSGRR EVENILNGTD NRLLVIVGPC SIHNVDAALV YAEKLSGMRS ELRSELCILM RVYFEKPRTT VGWKGFINDP HLDDSYDIEH GLYYARKLLI DINALGLPAA TEFLDPITPQ YVADVVSWAA IGARTIESQT HRQMASGLSM PVGFKNSTDG RINVAVDAIR SAMHPHSFLG IDREGHSSVI TTKGNPYGHL VLRGGMTPNY DAQSIAAAEQ LLEKAGLSQT LLVDCSHANS GKKHAQQLKV WENILEQKAR GNRSIAGVMI ESNLCSGNQP FPEDPEKLRY GVSITDECIS WEETERMLRQ GADVIAKLMS KEA
|
| |
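The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be normalized before computing simple statistics such as GC fraction follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence, so it does not reproduce the GC content figure for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence in the record.
prefix = clean_sequence("ATGGAACAGT TACATGATTT")
print(len(prefix))               # 20
print(prefix.startswith("ATG"))  # True
print(gc_fraction(prefix))       # 0.3
```

Applying `clean_sequence` to the full Gene sequence field and checking that the result is 1062 characters long is a quick way to confirm that no bases were lost when the record was copied.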