Gene Clim_0968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0968 
Symbol 
ID6355417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp1061134 
End bp1062195 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content51% 
IMG OID642668592 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001943023 
Protein GI189346494 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.110398 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGT TACATGATTT GAGAGTTTCA CGCATCAAGC GCCTGTCATC TCCAGGGGCG 
CTCAAGGATA AATTGCCGGT TAATGATCGT ATTGCATCAA CCGTCAGCTC GGGTCGTCGT
GAAGTAGAGA ATATATTGAA CGGTACGGAC AATCGCCTGC TTGTAATCGT CGGCCCCTGC
TCGATCCATA ATGTGGATGC CGCCCTCGTT TATGCAGAAA AGCTTTCCGG AATGAGAAGC
GAGCTCAGGA GTGAGCTTTG CATCCTGATG CGGGTCTATT TTGAAAAGCC GAGGACAACG
GTGGGGTGGA AAGGATTTAT CAACGATCCT CATCTCGACG ATTCATACGA TATAGAACAT
GGCCTCTATT ATGCCCGCAA GCTGCTGATC GATATCAATG CGCTTGGCCT TCCGGCGGCG
ACGGAGTTTC TCGATCCCAT TACTCCTCAG TATGTTGCCG ATGTGGTGAG CTGGGCGGCT
ATAGGCGCCA GAACCATAGA ATCACAGACG CACCGGCAGA TGGCCAGCGG TCTCTCAATG
CCTGTCGGGT TTAAAAATTC GACCGACGGA CGCATCAATG TCGCCGTCGA TGCGATTCGC
TCGGCAATGC ATCCGCACAG TTTCCTGGGA ATCGATCGTG AGGGTCACAG CAGTGTCATC
ACTACGAAAG GAAATCCTTA TGGTCATCTC GTGCTCAGAG GCGGCATGAC GCCGAATTAC
GACGCGCAAA GTATTGCTGC TGCGGAACAA CTGCTTGAGA AAGCCGGACT TTCCCAGACC
CTTCTGGTGG ATTGCAGTCA TGCCAATTCC GGCAAGAAAC ACGCCCAGCA GCTTAAAGTC
TGGGAAAATA TTCTTGAACA GAAAGCCCGC GGCAACAGAA GTATCGCCGG GGTCATGATC
GAAAGCAATC TCTGTTCAGG AAACCAGCCC TTTCCCGAAG ACCCGGAAAA ACTCAGATAT
GGCGTTTCAA TAACCGACGA ATGTATCTCT TGGGAGGAGA CCGAACGGAT GCTCCGTCAG
GGCGCTGACG TTATCGCAAA ACTGATGTCA AAAGAAGCAT AA
 
Protein sequence
MEQLHDLRVS RIKRLSSPGA LKDKLPVNDR IASTVSSGRR EVENILNGTD NRLLVIVGPC 
SIHNVDAALV YAEKLSGMRS ELRSELCILM RVYFEKPRTT VGWKGFINDP HLDDSYDIEH
GLYYARKLLI DINALGLPAA TEFLDPITPQ YVADVVSWAA IGARTIESQT HRQMASGLSM
PVGFKNSTDG RINVAVDAIR SAMHPHSFLG IDREGHSSVI TTKGNPYGHL VLRGGMTPNY
DAQSIAAAEQ LLEKAGLSQT LLVDCSHANS GKKHAQQLKV WENILEQKAR GNRSIAGVMI
ESNLCSGNQP FPEDPEKLRY GVSITDECIS WEETERMLRQ GADVIAKLMS KEA