Gene Caul_5442 details


Gene Information

Locus tag: Caul_5442
Symbol:
ID: 5897159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010333
Strand:
Start bp: 154583
End bp: 156043
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 59%
IMG OID: 641550729
Product: aldehyde dehydrogenase
Protein accession: YP_001672215
Protein GI: 167621707
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
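
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 154583
END_BP = 156043
GENE_LENGTH_BP = 1461
PROTEIN_LENGTH_AA = 486


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the values listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 156043 - 154583 + 1 gives 1461 bp, and 1461 / 3 - 1 gives 486 aa, matching the listed Gene Length and Protein Length.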


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0274338
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTTT ATGCAATCAC TGCGACCACT GAAGCACCGG TTCATCCCTG CGGCGAGGGG 
GCGATCCGAT CCCTGTTCGC GTCGCAGCGC CGATCAGCCT TGGAGAACAG GACGAAATTC
ACGCTGAAGG CGCGACTGGC TATGTTGTCG CGACTAAAGG CGACGATGAA GAGCCGGGAA
GACGAGATTA TCCGAGCTCT CTGCACGGAT TTCAGGAAGC CTGAATCCGA GGTGCGCCTG
ACCGAACTGT TCCCGGTCTA TCAGGAGATA TCGCATGCCC GGCGCCACCT CCGATCCTGG
CTGAGACCGC ACCGGGTTCA CGACTCTTTG GGGATGTTCG GAATCGCTGC GGAGGTCCGC
TATCAGGCCA AGGGCGTCTG CCTGATAATT TCCCCGTGGA ACTATCCGGT CAATCTCAGT
TTCGGGCCAC TGGTGTCCGC GCTGGCAGCC GGAAACACCG TCATCATCAA GCCTTCCGAA
CTGACGCCGG CGACGTCCGC CCTGGTCAGG GACATCGTCG AGCAGACCTT CCCCCGGGAT
CTCGTCGCCG TCTGCGAAGG CGACGCCGAG GTTTCGCAGG CCCTGCTGGA TCTACCCTTC
GACCACATCT TCTTCACGGG CAGTCCCCAG GTCGGCAAGA TCGTGATGGC GGCCGCAGCG
AAACATTTAA CATCCGTGAC GCTTGAACTC GGGGGCAAGT CCCCGACCAT CGTCGATTCG
ACCGCGAATA TCGAGCAAGC CGCCTGCAAG ATCGTCTGGG GCAAGTTCGC CAATAACGGC
CAGACCTGCA TTGCTCCGGA TCATGTCTAT GTTGCTCGCG ACCAGGCCTC GGCGCTGGTC
GATGCGCTGC GGCATGAGAT CAGGCGGGTC TACGGGCAGA CGGACGGCGA GCAGAAAGCC
GGGCCGGACT ATTGCCGGAT CGTGAACCGG CGGCATTTCG ATCGTCTGAC CGCCCTGGCC
GACGACGCCA CATCGCGCGG TGCGACCCTC CTGGAAGGTG GGGCGCGAGA TTCAGACCAG
AACTATTTCG CGCCGACCAT ACTCGGCGGA ACGACGCCGC AGATGGCGAT TTCCCAGGAA
GAAATATTCG GTCCGCTTCT TCCGATCATC GAATATGACG ACATCAGCGT CGTCATCGAC
GCGATCAACG CGGGCCCAAA GCCGCTTGCC ATGTATGTCT TCAGCAACGA CGCCGCCGCC
CGCGAGGATA TCATCCTTAG GACGAGTTCC GGTGGTGTCT GCGTCAACAA CAATGTCGTC
CAATTCTTGC ATCCAAACCT GCCGTTTGGC GGAGTCAACA ACAGCGGCAT TGGCGCTGCA
CACGGTTTCT ATGGCTTTAA AGCCTTCTCC CATGAACGTG CGATTCTAAG AGACAAATTC
TCCGTCCTGC GTCTTCTTTT CCCGCCGTAC ACCCCGACCG TAAAGAAACT CATCAATCTA
ATCGTCCGTC TTTTGGGTTG A
 
Protein sequence
MNFYAITATT EAPVHPCGEG AIRSLFASQR RSALENRTKF TLKARLAMLS RLKATMKSRE 
DEIIRALCTD FRKPESEVRL TELFPVYQEI SHARRHLRSW LRPHRVHDSL GMFGIAAEVR
YQAKGVCLII SPWNYPVNLS FGPLVSALAA GNTVIIKPSE LTPATSALVR DIVEQTFPRD
LVAVCEGDAE VSQALLDLPF DHIFFTGSPQ VGKIVMAAAA KHLTSVTLEL GGKSPTIVDS
TANIEQAACK IVWGKFANNG QTCIAPDHVY VARDQASALV DALRHEIRRV YGQTDGEQKA
GPDYCRIVNR RHFDRLTALA DDATSRGATL LEGGARDSDQ NYFAPTILGG TTPQMAISQE
EIFGPLLPII EYDDISVVID AINAGPKPLA MYVFSNDAAA REDIILRTSS GGVCVNNNVV
QFLHPNLPFG GVNNSGIGAA HGFYGFKAFS HERAILRDKF SVLRLLFPPY TPTVKKLINL
IVRLLG