Gene Caul_0443 details

Gene Information

Locus tag: Caul_0443
Symbol:
ID: 5897900
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 486739
End bp: 488172
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 68%
IMG OID: 641560929
Product: aldehyde dehydrogenase
Protein accession: YP_001682078
Protein GI: 167644415
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
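
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions made for illustration, not statements taken from the record, and the variable names are hypothetical.

```python
# Values copied from the gene information fields above.
start_bp = 486739
end_bp = 488172
reported_gene_length = 1434      # bp
reported_protein_length = 477    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under those assumptions the computed values agree with the reported gene and protein lengths.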


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.451373
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCAAC GACTGCATTT CATCGGGGCC GGCGTCGCGC CGCCCGCCTC CGGCCTCTAT 
ATGGACGTGT TCGACCCCGC CACTGGGCGC AAGATCGCCG AGGTCGCGGC CGGCTCGGCC
AGCGATGTCG ATCGCGCCGT CACCGCAGCC TCCGACGCCT TTGGCGCCTG GCGCGATCTT
CGTCCGATCG AGCGTGGCCG CATCTTGACT GAGATCGCGC GTCTGATGCG CGAACGGGCC
GCCGAGTTCA TCGCTTTGGA GGCGGCCGAA ACCGGAAAGC CCGCTTGGCA GACGCCCATC
GAGGTGGAGC GCGCCGCCCA GTATTTCGAA TTCTTCGGCG GGCTGGTGAA CGTCGGCCAC
GGAGAAATCC TCAATCTGGG CTCGGACTAC CACTGCTACA CGCGGCGCGA ACCCTATGGG
GTGGTCGGGG TCATCCTGCC GTGGAACGCG CCGCTCAATC AGGCGGCCCG GGCCATCGCC
CCGGCCATCG CGGTCGGCAA CACCGTGGTG GCCAAGCCTT CGGAGGAAAC CCCCGGCAGC
GTCTTGTTGC TCGCCCGCCT CGCGGTCGAG GCCTGCGGCT TGCCGCCAGG GGTTCTCAAT
GTCGTGCAGG GCCGAGGCCA GGAGGCCGGC CGGCCGCTGA TCGAGCATCC CCAGGTGCGC
AAGGTCGCCT TCACCGGCAG CCTTCGCGCC GGTCAGGAGA TCGGCCGCAT CGCCGCCGAG
CGCATACTGC CCTTGACCTT GGAGCTGGGG GGCAAGTCGG CCAATCTTGT CTTTGACGAC
GCCGATTTCG ACGCCGCGGT CGCCGGAGCG GTTCGCGCCT TCGCGCTCAA TGCCGGGCAG
ATCTGTCTGG CGGGGACCCG CCTGCTGGTT CAGCGCTCGA TCTATGAGCG CTTCGTCGCC
GCCGTCGTCG CGGCGGTGGG CGCGCTGAAG GTCGGCCCCG AAGGCGAGGC CTTTGTTGGC
CCCCTGACCA CGGCGGCCCA GTTCGACAAG GTGCAGGCCT ATTTCGCGAT CGCCGCCGAG
GAGGGCGCGG TCTTGGAGAC GGGCGGCCGG GCGCTCGCCG ACGATAGGCC ACAGGACGGC
TGGTTCGTCT ATCCCACCGT CTACAGCGGC GTGACGACGG ACATGCGCAT CGCGCGCGAG
GAGATCTTCG GCCCGGTCCT GGTGGTCATG CCGTTCGAGG ACGAGGCCCA AGCCGTCCAG
ATCGCCAATG GCACCGATTT TGGTCTGGCC GCCGGTCTTT GGACCCGGGA CCTAGGGCGG
GCCCATCGGG TCTCCGCCCT GCTGGAGGCC GGGCAAATCT ACGTCAATGA ATACCATTCC
GGCGGCATCG AGACACCGAT GGGCGGCTAC AAGAGCAGCG GCTATGGGCG CGAGAAGGGC
GTGGAGGCGC TGGCCCACTA CACCCAGCTC AAGTGCGTCA CGATCCGCCT GTGA
 
Protein sequence
MEQRLHFIGA GVAPPASGLY MDVFDPATGR KIAEVAAGSA SDVDRAVTAA SDAFGAWRDL 
RPIERGRILT EIARLMRERA AEFIALEAAE TGKPAWQTPI EVERAAQYFE FFGGLVNVGH
GEILNLGSDY HCYTRREPYG VVGVILPWNA PLNQAARAIA PAIAVGNTVV AKPSEETPGS
VLLLARLAVE ACGLPPGVLN VVQGRGQEAG RPLIEHPQVR KVAFTGSLRA GQEIGRIAAE
RILPLTLELG GKSANLVFDD ADFDAAVAGA VRAFALNAGQ ICLAGTRLLV QRSIYERFVA
AVVAAVGALK VGPEGEAFVG PLTTAAQFDK VQAYFAIAAE EGAVLETGGR ALADDRPQDG
WFVYPTVYSG VTTDMRIARE EIFGPVLVVM PFEDEAQAVQ IANGTDFGLA AGLWTRDLGR
AHRVSALLEA GQIYVNEYHS GGIETPMGGY KSSGYGREKG VEALAHYTQL KCVTIRL
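
The record reports a GC content of 68% for this gene. A rough way to recompute such a figure from a sequence printed in space-separated 10-base blocks is sketched below. The helper name `gc_percent` is hypothetical, and the sketch assumes the input contains only A, C, G, and T plus whitespace. It is shown on the first two blocks of the gene sequence rather than the full gene.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (spaces, line breaks) used for display formatting is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence above.
first_blocks = "ATGGAGCAAC GACTGCATTT"
print(gc_percent(first_blocks))
```

Passing the full gene sequence, pasted as one string, to the same function gives a value that can be compared with the reported GC content.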