Gene Caul_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0538 
Symbol 
ID5897993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp586824 
End bp587876 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content65% 
IMG OID641561021 
Productaldo/keto reductase 
Protein accessionYP_001682170 
Protein GI167644507 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGTTACA AACTCTTTGG CGAACACACG GGGCTGCGGG TCTCCGAGCT CGTGCTCGGG 
ACCGCGAATT TCGGCACGCG ATGGGGACAT GGCGCCGATG CGGATGAGGC CCGCCGCATC
TTCGACGCCT ACGCCGATGC CGGCGGCAAT TTCATCGACA CGGCCAACGG CTATCAGGAC
GGCCAGTCCG AGGAGTTTTT GGGCGACTTG TTGGCGGGAC GGCGCGACGA CTTTGTGCTC
GCCACGAAGT ACACCGTGAA GACGGATGCC AACTCCGGCA TCCTCGTCAC TGGCAATAGC
CGTCAAGCGA TGGTCTCGTC CGTCGAGGCA AGCTTGAAGC GACTCAAGAC CGACCGGATC
GACCTCTACT GGGCTCATGT CTCGGACGGC GTCACACCGC TTGAGGAGAT CGTGCGAGGC
TTCGACGACC TCGTTCGGAC CGGCAAGATC CTCTATGCCG GCCTTTCAAA TTTTCCCGCG
TGGCGGGTCG CTCGGGCCGC GACCATCGCC GCGCTGCGTG GCGCCGTCCC GATCGCGGGC
CTTCAGGTCG AGCACAGCCT CGTCGCTCGC ACGGCCGAGC AGGAGCTCCT CTCGGCCGGG
CGTGCACTTG GTCTCGGCGT CGTGGCCTTC TCGCCGCTCG GCGGCGGTAT GCTCACCGGA
AAATACCGCA AGCCAAATGG CGAGAAGGGA CGCGAGGAGG GTCTTGCCGG GGCCGGCTTC
CAGCCCGAGA ACTCCCCGCA GCGCACCGCC ACCCTCGACA CCTTGATCGC GGTCGCAGAA
GAGGCCGGCG CGACGCCTAG CGAGATCGCC ATCGCCTGGG TCGCCGCCAA GGGCTCGCTT
CCGATCATCG GGCCACGCAC GCTTGCTCAG CTTGAAAACA ACCTTGCTTC AGCAAAGGTG
ACGCTGTCGC CCGAGCACGT TGCGCGCCTG GACGCGGTAA GCGCGCCCCC ACCGGTTTAC
CCCTACACGG TACTCAATGA TCCACGGATC AGGGATATCA TCACGAGCGG CAAGTTCGGG
CAGATCGACG CCCCCGCCGA GTCCGTCGCA TGA
 
Protein sequence
MRYKLFGEHT GLRVSELVLG TANFGTRWGH GADADEARRI FDAYADAGGN FIDTANGYQD 
GQSEEFLGDL LAGRRDDFVL ATKYTVKTDA NSGILVTGNS RQAMVSSVEA SLKRLKTDRI
DLYWAHVSDG VTPLEEIVRG FDDLVRTGKI LYAGLSNFPA WRVARAATIA ALRGAVPIAG
LQVEHSLVAR TAEQELLSAG RALGLGVVAF SPLGGGMLTG KYRKPNGEKG REEGLAGAGF
QPENSPQRTA TLDTLIAVAE EAGATPSEIA IAWVAAKGSL PIIGPRTLAQ LENNLASAKV
TLSPEHVARL DAVSAPPPVY PYTVLNDPRI RDIITSGKFG QIDAPAESVA