Gene Caul_3955 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3955 
Symbol 
ID5901417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4283766 
End bp4285253 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content70% 
IMG OID641564476 
Productaldehyde dehydrogenase 
Protein accessionYP_001685578 
Protein GI167647915 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.768123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.683477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCCTGC TGTCTCCCAT CGACGCGCTG AAGACGTTGC CGCTGCCCGG CCGGGCGGTG 
ATCGACGGCG CGTTGGTCGA GGCGGCTTCG GGGGCGACGT TCCACAACGT CTCGCCGCGC
GACGGGGCGG TGATCAACCA GGTCGCCGCC TGCCAGGCCG AGGACGTCGA CCGCGCCGTG
GCCAGCGCCC GCGCAGCCTT TGAGGATGGC CGGTGGCGCG ACCAGGGTCC GCGCGACAAG
AAGCGGGTGC TGTTCAGGCT GGCCGAGCTG ATGGAGCGCG ACGCCGAACA GCTGGCCCTG
CTGGAGAGCC TGGACACCGG CAAGCCGATC CGCGACGCCC GGGCGGTCGA CGTGCCCCTG
TCGATCGGCA CAACCCGCTG GTACGCCGAG GCCCTGGACA AGATTTATGG CGAGGTGGGC
GCCTCGCCGA TCGATCGCCT GAGCTGGGCC ACGCATGAGC CGCTGGGGGT GATCGGCGCC
ATCGTGCCGT GGAACTTCCC CCTGCACATG GCGATGTGGA AGGCGGCCCC GGCCCTGGCC
ATGGGCAACA GCGTCGTCCT CAAGCCCGCC GAGCAGTCGC CGCTGACCGC CCTGAAGCTG
GGCGAACTGG CGCTGGAGGC TGGCCTGCCG CCCGGCGTGC TGAACGTGGT TCCGGGCCTG
GGCGCCACGG CCGGCGAGGC GCTCGCCCTG TCGATGGACG TCGACATGAT CGCCTTCACC
GGCTCGGGTC CGGTGGGCCG GCGGCTGATG GAATATTCGG CGCGGAGCAA CCTCAAGCGC
GTGTCGCTGG AGCTGGGCGG CAAGTCGCCC CAGATCGTCT TCGCCGACTG CCCGGACCTG
GACGCCGCCG CCCAGGCCGC CGCCTGGGGC GTGTTCTATA ACCAGGGCGA GGTCTGCACC
GCCGCGTCCC GCTTGCTGGT CGAGGCCTCG ATCAAGGACG CCTTCCTCGA GAAGGTGATC
GCGGTGGCCA AGACCATGGT CCCCGGCGAC CCGCTGGATC CCGACACCGT GTTCGGGGCC
ATGGTCAGCG AGCGGCAGAT GAACACCGCC CTGGACTACA TCGCCACCGC CGACAGCCAG
GGCGCCCGCC GGCTGCTGGG CGGCAAGCAA GTACGTCGGG AGACGGGCGG CTTCTATGTC
GAGCCCACCA TCTTCGATCG GCTGGAGCCC GACCACACCC TGGCCCGCGA AGAGGTGTTC
GGGCCGGTGC TAGGCGTGCA GACCTTCAAG ACCCAGGACG AGGCGATCGC CCTGGCCAAC
GACACCGTCT ACGGCCTGGC CGCCGGCCTG TGGACCAGCG ACATCAACCG CGCCCTCACC
GCCGCGCGGC GGCTGAAGGC CGGCCTGGTG TGGATTAACG GCTGGGACGC CTGCGACATC
ACCATGCCGT TCGGCGGCTT CAAGCAGTCG GGTTTCGGTC GCGACCGCAG CCTTCACGCG
TTGCACAAAT ATGCCGACCT GAAGTCGGTG TCCGTGACGC TGAGGTGA
 
Protein sequence
MTLLSPIDAL KTLPLPGRAV IDGALVEAAS GATFHNVSPR DGAVINQVAA CQAEDVDRAV 
ASARAAFEDG RWRDQGPRDK KRVLFRLAEL MERDAEQLAL LESLDTGKPI RDARAVDVPL
SIGTTRWYAE ALDKIYGEVG ASPIDRLSWA THEPLGVIGA IVPWNFPLHM AMWKAAPALA
MGNSVVLKPA EQSPLTALKL GELALEAGLP PGVLNVVPGL GATAGEALAL SMDVDMIAFT
GSGPVGRRLM EYSARSNLKR VSLELGGKSP QIVFADCPDL DAAAQAAAWG VFYNQGEVCT
AASRLLVEAS IKDAFLEKVI AVAKTMVPGD PLDPDTVFGA MVSERQMNTA LDYIATADSQ
GARRLLGGKQ VRRETGGFYV EPTIFDRLEP DHTLAREEVF GPVLGVQTFK TQDEAIALAN
DTVYGLAAGL WTSDINRALT AARRLKAGLV WINGWDACDI TMPFGGFKQS GFGRDRSLHA
LHKYADLKSV SVTLR