Gene Caul_2340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2340 
Symbol 
ID5899795 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2538297 
End bp2539817 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content65% 
IMG OID641562831 
Productaldehyde dehydrogenase 
Protein accessionYP_001683965 
Protein GI167646302 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGAGC AAGCCATCAA GACCGTCAAA TCCACGCCGC TCTTCCGCGA GCGCTACGAC 
AACTGGATTG GTGGACGATG GGTCGCGCCG CTCGGCGGCG CCTCATTCAC CGACCATAGC
CCAATCAATG GCCGCGCCAT CGCCACCTTC GCATCGTCCG GGCCTGAAGA CGTGGAACTC
GCCCTCGACG CCGCTCACGG CGCCAAGGCG GCCTGGGCCC GGGTCTCGCC CTCGGATCGC
GCCCGAATGC TCAACCGCAT CGCCGACCGG CTGGAGGACA ATCTCGAATT CTTCGCCCTC
GCCGAGACGA TCGACAACGG CAAGCCCATC CGCGAGACGC GCGCCGCCGA CGTTCCCTTG
GCGATCGACC ACTTCCGCTA TTTCGCCGGC TGCATCCGAG CGGAGGAAGG CGCCATCTCG
ACCATCGACG ACAACACCAT CGCTTACCAT TTCCGCGAAC CGCTCGGCGT GGTCGGCCAG
ATCATCCCGT GGAATTTCCC GCTGCTGATG GCGGCGTGGA AAATCGCCCC GGCGCTGGCG
GCGGGCAACT GCACCGTGAT CAAGCCTGCC TCTCAGACGC CGCTGACCCT GCTGTTGTTC
GCCGAACTGA CGGCTGACAT CCTGCCGCCG GGCGTCCTCA ACGTCCTGAC GGGTCCCGGC
GGGACCGTCG GTCGCGCGAT CGCGGAGAAT CCGCGGATCG CCAAGGTGTC CTTCACCGGG
GAGACGACGA CCGGGCGCCA GATCATGCAC TACGCCGCCG ACCATCTGAT CCCCCAGACG
ATGGAACTCG GTGGCAAGTC GGCCAACATC TTCCTGCCCG ACGTGATGGA TCAGGACGAT
CGCTTCCTCG ACAAGGCCCT GGAGGGGTTT GCGCTGTTCG CCTTCAACAA GGGCGAGGTC
TGCACCTGCC CATCACGCGC GCTGGTGCAT GAATCGATCT TCGACAGGTT CATGGAAAGG
GCGCTCGGCC GCGTCGCGGC CATCCGTCAG GGCGACCCGC TGGATCCGGT CACTCAGGTC
GGCGCTCAGG CCTCCGAGGA CCAACTGCGC AAGATCCTCG GCTATGTCGA GATCGGCAAG
GCCGAGGGCG CCGAGTGCCT GATCGGCGGA GAGCGAGCGC TTCCGGGCGG CGAGCTGAAC
CAGGGCTATT TCGTCCAACC GACCGTGTTC GTCGGTGAGA ACCGCATGCG GATCTTCCAG
GAAGAGATTT TCGGCCCGGT GCTTTCGGTG ACGCGGTTCA GCAGCGTCGA AGAGGCGATC
GATATCGCCA ATGACACGCC CTACGGCCTT GGCGGCGGTG TCTGGTCGCG CAACGGCAAC
AACGCCTACC GCGTCGGCCG CGCTCTGCAG GCTGGGCGGG TCTGGACGAA CTGCTACCAC
GTTTACCCGG CCCACGCCGC GTTCGGCGGC TACAAGGCCT CGGGCTTTGG CCGTGAGAAC
CATAAGATGA TGCTGGATCA CTACCAGCAA ACCAAGAACC TCCTCGTCTC CTACGACGAG
GCGCCCCTGG GTCTGTTCTG A
 
Protein sequence
MFEQAIKTVK STPLFRERYD NWIGGRWVAP LGGASFTDHS PINGRAIATF ASSGPEDVEL 
ALDAAHGAKA AWARVSPSDR ARMLNRIADR LEDNLEFFAL AETIDNGKPI RETRAADVPL
AIDHFRYFAG CIRAEEGAIS TIDDNTIAYH FREPLGVVGQ IIPWNFPLLM AAWKIAPALA
AGNCTVIKPA SQTPLTLLLF AELTADILPP GVLNVLTGPG GTVGRAIAEN PRIAKVSFTG
ETTTGRQIMH YAADHLIPQT MELGGKSANI FLPDVMDQDD RFLDKALEGF ALFAFNKGEV
CTCPSRALVH ESIFDRFMER ALGRVAAIRQ GDPLDPVTQV GAQASEDQLR KILGYVEIGK
AEGAECLIGG ERALPGGELN QGYFVQPTVF VGENRMRIFQ EEIFGPVLSV TRFSSVEEAI
DIANDTPYGL GGGVWSRNGN NAYRVGRALQ AGRVWTNCYH VYPAHAAFGG YKASGFGREN
HKMMLDHYQQ TKNLLVSYDE APLGLF