Gene Caul_3624 details


Gene Information

Locus tag: Caul_3624
Symbol:
ID: 5901079
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3913780
End bp: 3915357
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 72%
IMG OID: 641564135
Product: aldehyde dehydrogenase
Protein accession: YP_001685249
Protein GI: 167647586
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
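
The length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3913780, 3915357)
print(length, "bp")                      # gene length from the coordinates
print(protein_length_aa(length), "aa")   # protein length from the gene length
```

With the coordinates from this record, the two results agree with the Gene Length (1578 bp) and Protein Length (525 aa) fields.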


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.186491
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCTTA CCGGTGAACT TCTGATCGAC GGCGGCGCTC GCCCCGGAAC CGGCGGCGCG 
ATCCACGCGG TCGATCCCAG CACGGGCGGC GCGCTGGACG TGACCTTCGG CGGCGCCGCC
AAGGCCGACC TGGACGAAGC CTGCGCCCTG GCGGCGGCGG CCTTCGGTCC GTACCGCGCC
ACCTCGCCCG AGCAACGCGC CGTGTTCCTG GAAACCGTGG CCGCCAAGAT CGAGGCGATC
GGCGACGACC TGATCGTCCG CGCCATGGCC GAGACCGGCC TGCCTCGCGC CCGCCTCGAA
GGCGAGCGTG GTCGCACCAT GGGCCAGCTG AAGATGTTCG CCGGCGTGCT GCGCGACGGC
GGCTGGCTGG AAGCCCGCAT CGATCCGGCC CAACCGGACC GCAAGCCGAT GGCCCGTCCC
GACCTGCGCC TGCGCAACGT GCCGCTGGGC CCGGTGGCCG TGTTCGGCGC CAGCAACTTC
CCGCTGGCCT TCTCGGTGGC CGGCGGCGAC ACCGCCTCGG CCCTGGCCGC CGGTTGTCCG
GTCGTGGTAA AGGCTCACCC GGCGCACCCC GGCACGTCCG AGCTGGTCGG CCGCGCCGTC
CAGGCGGCCG TCAAGGAATG CGGCCTGCCG TCGGGCGTCT TCGCCCTGCT GCACGACGCC
GGCATCGAGA TCTCGCAAGG CCTGGTCGCC GACCACCGCA TCAAGGCCGC CGGCTTCACC
GGGTCGCGCC GCGCCGGCCT GGCCCTGCTG GCCATCGCCC AGGGCCGCCC CGAGCCGATC
CCGTTCTATG CCGAGATGAG CAGCATCAAC CCGGTCCTGC TCTTGCCAAA CGCCCTGAAG
GCGCGCGGCC CGGCCATCGC CCCGGACTTC GTCGCCGCCT TGACGCAAGG CGCCGGCCAG
TTCTGCACCA ATCCGGGCCT GATCCTGGGC ATCGACGGCC CCGAGCTGGA CGCCTTCCTG
GCGGCCACCG CCGCAGTCAT CGCCGAGGCT CCCGCCGGCC AGATGCTGAC CCCCGGGATT
TGCAAGGCCT TCGCCGGCGG GGTGAAGGCG CTGGCCGAGA CCGCCGGCGT CACCGAAGTC
GCCCACGGCC TGGAAGGCGG TCACGGCCAG GGCCGCGCGT CGCTGTTCAG CGTCGATGCG
GCGACCTTCC TGGCCACCCC GCACCTGCAG GACGAGGTGT TCGGCGCCGC CTCGCTGGTC
GTTCACGCCA AGGACCTGGC CCAACTGATC CAGGTGATCG CCGCCCTGGA AGGCCAGCTG
ACCATCGCCG TGCACATGGA TGACGCCGAC ACGGATCTCG CCCGCGCCCT GCTGCCGGCC
CTGGAGCTGA AAGCCGGCCG CGTGCTGATC AACGGCTTCG GCACCGGCGT CGAGGTCGGC
CATGCCATGG TCCACGGCGG CCCGTTCCCC TCGACATCGG ACAGCCGCTC GACCTCGGTC
GGCTCGCTGG CGATCTTCCG CTTCCTGCGC CCGGTCTCGT ACCAGAACCT GCCGGCGGCG
CTGCTGCCGG CGGAACTGCA GGACGCCAAC CCGCTGGGGA TTGCGCGTCG CGTGGATGGG
AAGGTTCAGT TGGCCTAG
 
Protein sequence
MTLTGELLID GGARPGTGGA IHAVDPSTGG ALDVTFGGAA KADLDEACAL AAAAFGPYRA 
TSPEQRAVFL ETVAAKIEAI GDDLIVRAMA ETGLPRARLE GERGRTMGQL KMFAGVLRDG
GWLEARIDPA QPDRKPMARP DLRLRNVPLG PVAVFGASNF PLAFSVAGGD TASALAAGCP
VVVKAHPAHP GTSELVGRAV QAAVKECGLP SGVFALLHDA GIEISQGLVA DHRIKAAGFT
GSRRAGLALL AIAQGRPEPI PFYAEMSSIN PVLLLPNALK ARGPAIAPDF VAALTQGAGQ
FCTNPGLILG IDGPELDAFL AATAAVIAEA PAGQMLTPGI CKAFAGGVKA LAETAGVTEV
AHGLEGGHGQ GRASLFSVDA ATFLATPHLQ DEVFGAASLV VHAKDLAQLI QVIAALEGQL
TIAVHMDDAD TDLARALLPA LELKAGRVLI NGFGTGVEVG HAMVHGGPFP STSDSRSTSV
GSLAIFRFLR PVSYQNLPAA LLPAELQDAN PLGIARRVDG KVQLA