Gene Clim_0898 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0898 
Symbol 
ID6354135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp984417 
End bp985421 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content51% 
IMG OID642668525 
Productchlorophyll synthesis pathway, BchC 
Protein accessionYP_001942956 
Protein GI189346427 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.714457 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCAA AAGCCATTGT TTTCAGCGGA GTCCGACAGA TTTCTCTCTG CGAAGTCACC 
CTGAAACAGC TCTCATCAAC CGATGTGCTT GTTGAAACCT ACTGGTCGTC AATAAGCACC
GGTACGGAAA AAATGGCCTA TAACGGCCTT ATCCCTTCTC CTCCGTTCAT CTTCCCGTTC
ATTCCGGGCT ATGAAACCGT TGGCAGGATC ATCGAGGCGG GAGATCATGT CAACCAGAGT
TTGATCGGAA AGTTTGCCTA TGTAGCCGGC TCGTTCGGGT ACATAGACGT CAACGCAGCA
TTCGGAGGAG CTTCGCAGTA TATCGTATGT CCCGTTGACA GCATTACGCT GCTTGACTCC
ATTGCGAATC CGCAATGTGG TATCGCACTG CCTCTGGGCG CAACTGCATT GCATATTATC
GATCTTGCCG CAGTTGAGAA CAGAAAGGTG CTGATACTCG GTCAGGGCGC CGTCGGGATT
CTGGCATCGG AACTGGCCCG TCACATGGGA GCCCGGCTGA TTGCCGTTAC GGAGCCCTAT
CAGAACAGGC TCAGGTTTTC TTCTGCCGAT CTGAAAGTCA ACCCTGACAA CGAAGATGTT
TCGGCAGCCC TTGCCGGCCA TGAGTTCGAT GTACTGATTG ACAGTACCGG TATCATGAGC
GCAATCGATA CAGGACTCCG GTTTCTCAAA TTTCATGGCG TCGTGATCTT CGGCGGCTAC
TACCAGCGTA TGAATATCGA TTATTCACAG GCGTTCCAGA AAGAGTTATC GTTCATTGCG
GCAAAGCAGT GGGCGCACGG CGATCTTGAA AGGGTTCGCG ATCTCATCGC GGCCAAAAAA
CTCAATGCCG AAAAAATTTT CACCCACCAG TGCCAGGTCG ATGATAACAT TACGTCGGTC
TACATGCAGG CGTTCGGCGA TCCGGACTGC CTGAAAATGA TTCTGCACTG GAAAACCGAT
GCTGAAGAAC AGGAACGGGC CTGCTATCTG ACCAGCGAAA CCTGA
 
Protein sequence
MKSKAIVFSG VRQISLCEVT LKQLSSTDVL VETYWSSIST GTEKMAYNGL IPSPPFIFPF 
IPGYETVGRI IEAGDHVNQS LIGKFAYVAG SFGYIDVNAA FGGASQYIVC PVDSITLLDS
IANPQCGIAL PLGATALHII DLAAVENRKV LILGQGAVGI LASELARHMG ARLIAVTEPY
QNRLRFSSAD LKVNPDNEDV SAALAGHEFD VLIDSTGIMS AIDTGLRFLK FHGVVIFGGY
YQRMNIDYSQ AFQKELSFIA AKQWAHGDLE RVRDLIAAKK LNAEKIFTHQ CQVDDNITSV
YMQAFGDPDC LKMILHWKTD AEEQERACYL TSET