Gene Caul_4341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4341 
Symbol 
ID5901802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4719436 
End bp4720875 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content68% 
IMG OID641564859 
Productaldehyde dehydrogenase 
Protein accessionYP_001685959 
Protein GI167648296 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.977746 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACT ACCTGAAGTT CTACATCGAC GGCCAGTGGG TGCAGCCCAA GGGCACGCAA 
ACGGTCGACG TGATCAATCC GGCCACCGAG GCCGTCGCCG GCCGGGTGAC CCTGGGCACG
GCCGAGGACG TCGACCTGGC CGTCCGCGCC GCCCGCAAGG CCTTCGCCAC CTACTCCCTG
ACCAGCCGCG AAGAGCGCAT CGACCTGCTC GAGCGGATCA TCGCCGAGTA CCAGAAGCGC
TTCGAGGACA TGGCCAAGGC CATCACCGAG GAGATGGGCG CCCCGGCGTG GCTGGCCCAG
CGCGCACAGG CCGCCATGGG CATCGCCCAC GTCCAGACCG CGCTTCAGGT TCTCAAGGAC
TACAAGTTCG AGGAAGATCG CGGCACGACC CGTCTGGTCA AGGAGCCGAT CGGCGTCTGC
GCCTTCATCA CCCCGTGGAA CTGGCCGGTC AACCAGATCG CCTGCAAGGT CGCGCCGGCC
CTGGCCGTCG GCTGCACCAT GGTGCTCAAG CCCTCGGAAG TGGCCCCGTT CTCAGCCTGG
ATCTGGACCG AGATCCTGGC CGCCGCCGGC GTGCCAGCCG GGGTGTTCAA CCTGGTCAAC
GGCGACGGCC CGACCGTGGG CGCGGCGCTG AGCTCGCATC CGGAAGTCGA CATGGTGTCG
TTCACCGGCT CGACCCGGGC CGGCATCGAG GTGGCCAAGA ACGCCGCCCC GACCGTCAAG
CGCGTGCACC AGGAACTGGG CGGCAAGAGC CCCAACATCA TCCTCGACGA CGCCGACTAC
CAGAAGGCGG TCGGCGGCGG CGTGGCCTCG GTGATGATGA ACTCGGGCCA GTCGTGCAAC
GCCCCGACCC GGATGCTGGT GCCGCAAAAG CGCATGGACG AGGTGATCGC CATCGCCAAG
GCCGCCGCCG ACGCGCACAC CGTGGGCGAC CCCAACGGCA ATTCCAAGCT GGGTCCGGTC
GTGTCGGAAG TGCAGTGGAA CAAGATCCAG GGCCTGATCC AGAAGGGCGT CGACGAGGGC
GCGACCCTGG TCTCCGGCGG TCCCGGCAAG CCCGAGGGCC TGGACGTCGG CTACTATGTG
AAGCCGACGG TGTTCGCCAA TGTCACCCCG GACATGACCA TTGCCAAGGA GGAGATCTTT
GGCCCGGTGC TGGCCATCCT CGGCTATGAT GGCGTCGACC AGGCGGTCGA GATCGGCAAC
GACACCGAAT ACGGCCTGGC CGCCTACGTG TCCGGCAACG ACGAGTCCCA GGTGCGAGCC
GTGGCGTCCA AGCTTCGGGC CGGCCAGGTG ATCCTCAACG GCGCCGGTCC CGACCTGATG
GCCCCGTTCG GCGGCTACAA GATGAGCGGC AACGGCCGCG AATGGGGCGA CCACGCGTTC
GGCGAGTTCC TCGAGACCAA GGCGATCCTG GGCTACGGCG CCAAGATCGC GGCGGAGTAA
 
Protein sequence
MRDYLKFYID GQWVQPKGTQ TVDVINPATE AVAGRVTLGT AEDVDLAVRA ARKAFATYSL 
TSREERIDLL ERIIAEYQKR FEDMAKAITE EMGAPAWLAQ RAQAAMGIAH VQTALQVLKD
YKFEEDRGTT RLVKEPIGVC AFITPWNWPV NQIACKVAPA LAVGCTMVLK PSEVAPFSAW
IWTEILAAAG VPAGVFNLVN GDGPTVGAAL SSHPEVDMVS FTGSTRAGIE VAKNAAPTVK
RVHQELGGKS PNIILDDADY QKAVGGGVAS VMMNSGQSCN APTRMLVPQK RMDEVIAIAK
AAADAHTVGD PNGNSKLGPV VSEVQWNKIQ GLIQKGVDEG ATLVSGGPGK PEGLDVGYYV
KPTVFANVTP DMTIAKEEIF GPVLAILGYD GVDQAVEIGN DTEYGLAAYV SGNDESQVRA
VASKLRAGQV ILNGAGPDLM APFGGYKMSG NGREWGDHAF GEFLETKAIL GYGAKIAAE