Gene Caul_4436 details


Gene Information

Locus tag: Caul_4436
Symbol:
ID: 5901897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4803410
End bp: 4804390
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 71%
IMG OID: 641564954
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_001686054
Protein GI: 167648391
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
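
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it) and that the final codon of the unspliced CDS is a stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between sequence blocks is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc / len(cleaned)


# Values taken from the record above.
length = gene_length_bp(4803410, 4804390)
print(length)                     # 981
print(protein_length_aa(length))  # 326
```

Passing the gene sequence from the Sequence section to `gc_percent` would give a value to compare against the listed GC content. The function strips the spaces and line breaks used in the 10-base block layout.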


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGAC AATTCGGAGC CAAATCCACC ACCGACGAGG TCCTGGCCGG CGTCGATCTC 
TCCCGCAAGC GCGTGCTGGT CACCGGCGTC TCGGCGGGCC TTGGCGTCGA GACCGCCCGG
GCCCTGGCGG CGCGGGGCGC CCACGTGGTC GGCGCCGCGC GGGATCTCGC CAAGGCGCAG
GGCGCGACGG GCGTGGTGCG CGAAGCCGCG GCCGCCGCCG GCGGCTCGCT GGAGCTGGTG
GCGCTGGACC TGGCCGACCT CGCCAGCGTG CGCGCCTGCG CCGACGCCCT GGTCGCCGAC
GCCAAGGACG GGGAGCAGGC CTTCGACCTG GTCATCGCCA ACGCCGGGGT GATGGCCCCG
CCGTTCGGCA AGACCGTCGA TGGCTTCGAG ACCCAGTTCG GCACCAACCA CCTGGGCCAC
TTCGTGCTGA TCAACCGCAT CGCCAGCCTG CTGAAGCCCG GCTCGCGTGT GGTGTCCCTG
GCCTCGTCGG GCCACCGCTT CTCGGACGTG AATCTTGAGG ATCCGAACTT CGAGACCACC
GAGTACGTCC CGTTCGAGGC CTATGGCCGC TCCAAGACCG CCAACATCCT GTTCGCCGTC
GAGTTCGACC GCCGCCACAA GGATCGCGGC GTGCGGGCCG CGGCCGTCCA TCCGGGCGGC
ATCCAGACCG AACTGGCCCG GCACCTGGAT CCGGCCTTCA TCCAGAACTG GATCGATCAG
CTGAACGCCC AGGCCGAGGC CGCCGGTCAG CCGCCCATCG AGTGGAAGAC CATCCCGCAG
GGCGCGGCCA CCAGCGTCTG GGCTGGGGTC GTTGCGCCCG CCTCGCTGGT CGCCGGCCGC
TATTGCGAGG ACTGCCACGT GGCCGAGCTG GTTGACGACG CCTCGCAGAT CCGCGCCGGC
GTGCGGTCCT ATGCCCTCGA CCCGGCCCGC GCCCAGGCGT TGTGGGCGCT GAGCGAGCGG
ATGGTCGGCG AGACGTTCTA A
 
Protein sequence
MSGQFGAKST TDEVLAGVDL SRKRVLVTGV SAGLGVETAR ALAARGAHVV GAARDLAKAQ 
GATGVVREAA AAAGGSLELV ALDLADLASV RACADALVAD AKDGEQAFDL VIANAGVMAP
PFGKTVDGFE TQFGTNHLGH FVLINRIASL LKPGSRVVSL ASSGHRFSDV NLEDPNFETT
EYVPFEAYGR SKTANILFAV EFDRRHKDRG VRAAAVHPGG IQTELARHLD PAFIQNWIDQ
LNAQAEAAGQ PPIEWKTIPQ GAATSVWAGV VAPASLVAGR YCEDCHVAEL VDDASQIRAG
VRSYALDPAR AQALWALSER MVGETF