Gene Caul_2375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2375 
Symbol 
ID5899830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2580567 
End bp2582084 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content67% 
IMG OID641562866 
Productaldehyde dehydrogenase 
Protein accessionYP_001684000 
Protein GI167646337 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCAGG ATCTGGCGAC CGAAACCCGC GCCCTGCTCG CCGATCTCGG CGTCGATCCT 
GCGCGACTGG GGGGCGGATC CCTGACCGTC CGCTCGCCGA TCACGGGCGA CATCCTGGCC
CAGGTCCGCG AGACCAGTGT CGCCGAGGTC GGCTATGAGA TCGCCCGGGC CGAGCAGGCC
TTCCAGATCT GGCGGCGGGT CCCGGCGCCC CGCCGCGGCG AGTTCGTGCG CCTGCTGGGT
GAGGAACTGC GCCGCAGCAA GGAAGCCCTC GGCCAACTGG TGTCGATCGA GGTCGGCAAG
GTTCTGTCCG AGGGCCTGGG CGAGGTCCAG GAGATGATCG ACATCTGCGA TTTCGCCGTT
GGGCTGTCGC GCCAGCTCCA GGGACTGTGC CTTCCGTCCG AGCGCCGCGA TCACCGCATC
ACCGAACAGT GGCATCCAAT CGGCCCGGTC GGGGTGATCT CCGCTTTCAA CTTCCCGGTG
GCGGTGTGGA GCTGGAACGC CGCGCTGGCC TTCATTTGCG GCGACAGCGT GATTTGGAAG
CCGTCCGAGA AGGCGCCGCT GACGGCGCTC GCGGTCAGCG CCCTGGCGGC GCGGGCCTGC
AAGGCCTTTG GCGACGAGGC GCCCGATGGG CTGGCGACGT TGATCATTGG CGGTCGCGAG
GCCGGTCGGA CGCTCGTGGA TGATCCGCGC GTGCCGGTGA TCTCGGCGAC CGGGTCGACG
CGGATGGGCC AGACCGTCGG CGAACGTGTC GCACGCCGGT TTGGCAAGGC GATCCTTGAG
CTCGGCGGGA ACAACGCTTC GATCGTCACG CCGTCCGCCG ATCTCGATCT GACGCTTCGC
GCGGTCGCAT TCGCCGCCAT GGGGACCGCC GGCCAACGCT GTACGACGCT CCGCCGACTG
CTGGTGCACG ACACCGTCTA TGATGCGCTT GTCCCAAGAC TCGCCGCCGT CTACGGCAAG
ATCGCAGTGG GTGATCCCCG CGAAGACGGC AATCTCGTCG GTCCGCTCAT CGACGCCGAG
GCCTTCACCG CCATGGAACG CGCACTAGAC GCCGCGCGCA CGGCGGGCGG TCGCGTTCAC
GGCGGCGGTC GCGTCGATGT CAACGGCGAG AACTCCTTCT ACGCCCGACC TGCCCTGATC
GAGATGAGCC AACATGCCGA GTGCGTCCGT GCGGAGACGT TCGCGCCGAT CCTCTATGTC
TTCCGCTACG AAACACTCGA AGAGGCGATC GCGCTTCAGA ACGATGTGCC GCAGGGTCTG
TCCTCTTCGA TCTTCGCCAC GGACATGCGC GAGGTCGAGC AGTTCCTCTC GGCCACCGGC
TCCGATTGCG GCATCGCCAA CGTCAATATG GGGACGTCGG GCGCCGAGAT TGGCGGTGCT
TTCGGTGGCG AGAAGGAGAC GGGCGGCGGA CGCGAAAGCG GTTCGGACAG CTGGAAGGCC
TACATGCGCC GTCAGACCAA TGCGATCAAC TATGGCCGCA CGCTGCCGTT GGCCCAGGGC
GTCAGGTTCG ACGTCTGA
 
Protein sequence
MTQDLATETR ALLADLGVDP ARLGGGSLTV RSPITGDILA QVRETSVAEV GYEIARAEQA 
FQIWRRVPAP RRGEFVRLLG EELRRSKEAL GQLVSIEVGK VLSEGLGEVQ EMIDICDFAV
GLSRQLQGLC LPSERRDHRI TEQWHPIGPV GVISAFNFPV AVWSWNAALA FICGDSVIWK
PSEKAPLTAL AVSALAARAC KAFGDEAPDG LATLIIGGRE AGRTLVDDPR VPVISATGST
RMGQTVGERV ARRFGKAILE LGGNNASIVT PSADLDLTLR AVAFAAMGTA GQRCTTLRRL
LVHDTVYDAL VPRLAAVYGK IAVGDPREDG NLVGPLIDAE AFTAMERALD AARTAGGRVH
GGGRVDVNGE NSFYARPALI EMSQHAECVR AETFAPILYV FRYETLEEAI ALQNDVPQGL
SSSIFATDMR EVEQFLSATG SDCGIANVNM GTSGAEIGGA FGGEKETGGG RESGSDSWKA
YMRRQTNAIN YGRTLPLAQG VRFDV