Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2340 |
Symbol | |
ID | 5899795 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2538297 |
End bp | 2539817 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641562831 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001683965 |
Protein GI | 167646302 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGAGC AAGCCATCAA GACCGTCAAA TCCACGCCGC TCTTCCGCGA GCGCTACGAC AACTGGATTG GTGGACGATG GGTCGCGCCG CTCGGCGGCG CCTCATTCAC CGACCATAGC CCAATCAATG GCCGCGCCAT CGCCACCTTC GCATCGTCCG GGCCTGAAGA CGTGGAACTC GCCCTCGACG CCGCTCACGG CGCCAAGGCG GCCTGGGCCC GGGTCTCGCC CTCGGATCGC GCCCGAATGC TCAACCGCAT CGCCGACCGG CTGGAGGACA ATCTCGAATT CTTCGCCCTC GCCGAGACGA TCGACAACGG CAAGCCCATC CGCGAGACGC GCGCCGCCGA CGTTCCCTTG GCGATCGACC ACTTCCGCTA TTTCGCCGGC TGCATCCGAG CGGAGGAAGG CGCCATCTCG ACCATCGACG ACAACACCAT CGCTTACCAT TTCCGCGAAC CGCTCGGCGT GGTCGGCCAG ATCATCCCGT GGAATTTCCC GCTGCTGATG GCGGCGTGGA AAATCGCCCC GGCGCTGGCG GCGGGCAACT GCACCGTGAT CAAGCCTGCC TCTCAGACGC CGCTGACCCT GCTGTTGTTC GCCGAACTGA CGGCTGACAT CCTGCCGCCG GGCGTCCTCA ACGTCCTGAC GGGTCCCGGC GGGACCGTCG GTCGCGCGAT CGCGGAGAAT CCGCGGATCG CCAAGGTGTC CTTCACCGGG GAGACGACGA CCGGGCGCCA GATCATGCAC TACGCCGCCG ACCATCTGAT CCCCCAGACG ATGGAACTCG GTGGCAAGTC GGCCAACATC TTCCTGCCCG ACGTGATGGA TCAGGACGAT CGCTTCCTCG ACAAGGCCCT GGAGGGGTTT GCGCTGTTCG CCTTCAACAA GGGCGAGGTC TGCACCTGCC CATCACGCGC GCTGGTGCAT GAATCGATCT TCGACAGGTT CATGGAAAGG GCGCTCGGCC GCGTCGCGGC CATCCGTCAG GGCGACCCGC TGGATCCGGT CACTCAGGTC GGCGCTCAGG CCTCCGAGGA CCAACTGCGC AAGATCCTCG GCTATGTCGA GATCGGCAAG GCCGAGGGCG CCGAGTGCCT GATCGGCGGA GAGCGAGCGC TTCCGGGCGG CGAGCTGAAC CAGGGCTATT TCGTCCAACC GACCGTGTTC GTCGGTGAGA ACCGCATGCG GATCTTCCAG GAAGAGATTT TCGGCCCGGT GCTTTCGGTG ACGCGGTTCA GCAGCGTCGA AGAGGCGATC GATATCGCCA ATGACACGCC CTACGGCCTT GGCGGCGGTG TCTGGTCGCG CAACGGCAAC AACGCCTACC GCGTCGGCCG CGCTCTGCAG GCTGGGCGGG TCTGGACGAA CTGCTACCAC GTTTACCCGG CCCACGCCGC GTTCGGCGGC TACAAGGCCT CGGGCTTTGG CCGTGAGAAC CATAAGATGA TGCTGGATCA CTACCAGCAA ACCAAGAACC TCCTCGTCTC CTACGACGAG GCGCCCCTGG GTCTGTTCTG A
|
Protein sequence | MFEQAIKTVK STPLFRERYD NWIGGRWVAP LGGASFTDHS PINGRAIATF ASSGPEDVEL ALDAAHGAKA AWARVSPSDR ARMLNRIADR LEDNLEFFAL AETIDNGKPI RETRAADVPL AIDHFRYFAG CIRAEEGAIS TIDDNTIAYH FREPLGVVGQ IIPWNFPLLM AAWKIAPALA AGNCTVIKPA SQTPLTLLLF AELTADILPP GVLNVLTGPG GTVGRAIAEN PRIAKVSFTG ETTTGRQIMH YAADHLIPQT MELGGKSANI FLPDVMDQDD RFLDKALEGF ALFAFNKGEV CTCPSRALVH ESIFDRFMER ALGRVAAIRQ GDPLDPVTQV GAQASEDQLR KILGYVEIGK AEGAECLIGG ERALPGGELN QGYFVQPTVF VGENRMRIFQ EEIFGPVLSV TRFSSVEEAI DIANDTPYGL GGGVWSRNGN NAYRVGRALQ AGRVWTNCYH VYPAHAAFGG YKASGFGREN HKMMLDHYQQ TKNLLVSYDE APLGLF
|
| |