Gene Caul_0438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0438 
Symbol 
ID5897712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp481038 
End bp482441 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content67% 
IMG OID641560924 
Productaldehyde dehydrogenase 
Protein accessionYP_001682073 
Protein GI167644410 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACCGT TTCATCTGAT CATCGACGGG CGCCGGGTTG CGGGGGATGG CCTGATCGAC 
GTCGTCAATC CCGCCACCGA AGAGGTCCTG GTCGCCGCGC CCCGCGCGTC TCGCGCCCAG
CTTGAGCAAG CCGTGGCCGC GGCCAGGACG GCGTTTCCGG CCTGGGCGGC GACGCCCATC
GCCGAGCGCC GCGCCGCGCT GCTGCGGCTG GCCGACGCGG TCAGCGCGCA GGTCGATGAT
CTGGCGCGCT GGCTGAGCCT CGAGCAGGGC AAGCCGTTGG CGCACGCCCG CTTCGAGATC
GAGAGCTTTG TCGGCGCCCT GCGCGCCCTG CCGGATAACC CGTTCCCCCC AAAGATCATC
GAAGACAGCG CCCGTCGCAC GGTGGAGCTG CATCGCCGTC CCTTGGGCGT GGTGGCCGCC
ATCGTGCCCT GGAATTTCCC GATCTCCTTG CTGGGCTTCA AGCTGCCGCT CGCCTTGCTG
GCGGGCAACA CCATGGTGAT CAAGCCCGCT CCCACCACGC CGCTGACCAC CCTGAAGATC
GGCGAGCTGT GCCTGAAGAC CCTGCCGCCT GGCGTGGTCA ATGTGGTGGT CGACGCCAAT
GATCTGGGGG CGGAGCTCAC GCGACACCCC GACATTCGCA AGATCTCCTT CACCGGCTCC
ACCGAGACCG GCCGCAAGAT CATGGCGGCG GCCTCGGACA CCTTGAAGCG GCTGACCCTG
GAGCTTGGGG GCAATGATCC GGCCATCGTG CTCGATGACG TCGATCCGCT CGTCGTGGCC
CCGCGTATCT TCGGCGGCGC GTTCATGAAC AGTGGGCAGG TGTGCGCGGC GATCAAGCGT
CTCTACGTTC ATGACAGCGT CTACGACGCC CTGTGCGACG CGCTCGTCGG CCTCGCCAAC
GCCGCCATCG TCGGCGACGG CCTGAGCGAA GGCGTGCAAT TTGGACCGCT TCAGAACCGG
GCCCAGTTCG ACAAGGTCAA CGCCCTGATC GACGAAGCCG GCAAGATCGG CACGGTGATC
GCCGGCGGGG CGGCCAGCGG CGGCAAGGGC TACTTCATCC GCCCCACCCT GGTGCGTGAC
ATCACCGACG GCGCACGGCT CGTCGACGAG GAGCAGTTTG GTCCCGTTCT TCCCATCATT
CGCTACACCG ACCTCGACGA GGTCATCGCC CGCGCCAATG CCTCTCCCTT TGGCCTGGGC
GCGTCGATTT GGTCGTCGGA TCCGCAAAGA GCCGCACGGC TAGCGCCGCG GATCGAGGCG
GGCACGGTTT GGATCAACCA GCACCCCGAT TTTGGCCCTC ACATTCCGTT CGGCGGCGCC
AAGCAATCGG GCGTGGGGGT CGAGATGGGC GAGGAGGGGC TCAACGAGTT CACCCAGCTG
CAGGTGGTGA ACCTCGCGCA CTAG
 
Protein sequence
MSPFHLIIDG RRVAGDGLID VVNPATEEVL VAAPRASRAQ LEQAVAAART AFPAWAATPI 
AERRAALLRL ADAVSAQVDD LARWLSLEQG KPLAHARFEI ESFVGALRAL PDNPFPPKII
EDSARRTVEL HRRPLGVVAA IVPWNFPISL LGFKLPLALL AGNTMVIKPA PTTPLTTLKI
GELCLKTLPP GVVNVVVDAN DLGAELTRHP DIRKISFTGS TETGRKIMAA ASDTLKRLTL
ELGGNDPAIV LDDVDPLVVA PRIFGGAFMN SGQVCAAIKR LYVHDSVYDA LCDALVGLAN
AAIVGDGLSE GVQFGPLQNR AQFDKVNALI DEAGKIGTVI AGGAASGGKG YFIRPTLVRD
ITDGARLVDE EQFGPVLPII RYTDLDEVIA RANASPFGLG ASIWSSDPQR AARLAPRIEA
GTVWINQHPD FGPHIPFGGA KQSGVGVEMG EEGLNEFTQL QVVNLAH