Gene Caul_0526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0526 
Symbol 
ID5897981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp575205 
End bp576230 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID641561009 
Productaldo/keto reductase 
Protein accessionYP_001682158 
Protein GI167644495 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.51564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACTTA GGAAACTTGG CGCGTCGGGA TTTCTGGTCC CGCAGTTGAG CCTGGGGACG 
GGCATGTTCG TACCGAGCGA GGTCTTTTCG CAAGGAAACG TGGACTTGCC GCTCGCGACG
CGGCTCATCG ACATCAGCAT CGAGCACGGC GCCAACATGT TCGATTCCGG CCACACCTAC
TGGAATGGTC ATTCAGAGAT CCTGCTTGGC GAAGCGCTCA AGGGGCGCCG CCACAAGGCG
ATCATTTCGA CCAAGGCGGG CCACCCCCCG CAGGATGGCG GGTCCAACGA TATTGGCGCC
TCGCGCTATC ACCTGACCCA GGCGATCGAC CAATCCCTCA AGCGTCTGGG AACCGACTAT
ATCGATGTGT TCCAACTGCA CACCTTCGAC GCCTTGACGC CGCCCGAAGA GACGCTGGCC
ACTCTCGATA CGTTCGTACG CGCCGGCAAG ATCCGCTATA TCGGCGTTTC CAACACGCCA
GGCTGGGCGC TGATGAAGTC TCTCGCCGTC GCCGAGCGCG CGTCCCTGCC GCGCTATGTG
GTTCATCAGG TCTATTATTC GCTGATCGGG CGCGACTATG AATGGGAACT GATGCCGCTT
GGCGCGGACC AAGGGGTTTC CGCCGCGGTC TGGAGTCCCC TGGGGTGGGG GCGGTTGACC
GGCCGCCTCA AGCGCGGGCA GCCCGCGCCG GCCGACAGCC GATTGATCTT GAGCGAACAC
ATCGCGCCTC AAGCCGACGA GCAGACGCTA CATGACGTGC TTGATGTTCT GCGGGAACTG
GCCGAGGAGA CTGGCAAGCT CATCCCGCAG ATCGCGATCA ACTGGCTCCT CCAGCGCCCC
ACGGTCGCCA CGGTGATCAT GGGCGCCCGA ACGGAAGAGC AGTTGCTGCA GAACCTCGGC
GCGGCGGGCT GGTCCCTGGC CCCCGAGCAA ATCAAGCGCC TTGATGCGGT CAGCCGCCGC
CGCCCGTCCT ATCCCACGGA CTTTTATCTT ACCGCCGATC GCCACCGCAA TCCGCCCTCG
GTCTAG
 
Protein sequence
MELRKLGASG FLVPQLSLGT GMFVPSEVFS QGNVDLPLAT RLIDISIEHG ANMFDSGHTY 
WNGHSEILLG EALKGRRHKA IISTKAGHPP QDGGSNDIGA SRYHLTQAID QSLKRLGTDY
IDVFQLHTFD ALTPPEETLA TLDTFVRAGK IRYIGVSNTP GWALMKSLAV AERASLPRYV
VHQVYYSLIG RDYEWELMPL GADQGVSAAV WSPLGWGRLT GRLKRGQPAP ADSRLILSEH
IAPQADEQTL HDVLDVLREL AEETGKLIPQ IAINWLLQRP TVATVIMGAR TEEQLLQNLG
AAGWSLAPEQ IKRLDAVSRR RPSYPTDFYL TADRHRNPPS V