Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3624 |
Symbol | |
ID | 5901079 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3913780 |
End bp | 3915357 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641564135 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001685249 |
Protein GI | 167647586 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.186491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCCTTA CCGGTGAACT TCTGATCGAC GGCGGCGCTC GCCCCGGAAC CGGCGGCGCG ATCCACGCGG TCGATCCCAG CACGGGCGGC GCGCTGGACG TGACCTTCGG CGGCGCCGCC AAGGCCGACC TGGACGAAGC CTGCGCCCTG GCGGCGGCGG CCTTCGGTCC GTACCGCGCC ACCTCGCCCG AGCAACGCGC CGTGTTCCTG GAAACCGTGG CCGCCAAGAT CGAGGCGATC GGCGACGACC TGATCGTCCG CGCCATGGCC GAGACCGGCC TGCCTCGCGC CCGCCTCGAA GGCGAGCGTG GTCGCACCAT GGGCCAGCTG AAGATGTTCG CCGGCGTGCT GCGCGACGGC GGCTGGCTGG AAGCCCGCAT CGATCCGGCC CAACCGGACC GCAAGCCGAT GGCCCGTCCC GACCTGCGCC TGCGCAACGT GCCGCTGGGC CCGGTGGCCG TGTTCGGCGC CAGCAACTTC CCGCTGGCCT TCTCGGTGGC CGGCGGCGAC ACCGCCTCGG CCCTGGCCGC CGGTTGTCCG GTCGTGGTAA AGGCTCACCC GGCGCACCCC GGCACGTCCG AGCTGGTCGG CCGCGCCGTC CAGGCGGCCG TCAAGGAATG CGGCCTGCCG TCGGGCGTCT TCGCCCTGCT GCACGACGCC GGCATCGAGA TCTCGCAAGG CCTGGTCGCC GACCACCGCA TCAAGGCCGC CGGCTTCACC GGGTCGCGCC GCGCCGGCCT GGCCCTGCTG GCCATCGCCC AGGGCCGCCC CGAGCCGATC CCGTTCTATG CCGAGATGAG CAGCATCAAC CCGGTCCTGC TCTTGCCAAA CGCCCTGAAG GCGCGCGGCC CGGCCATCGC CCCGGACTTC GTCGCCGCCT TGACGCAAGG CGCCGGCCAG TTCTGCACCA ATCCGGGCCT GATCCTGGGC ATCGACGGCC CCGAGCTGGA CGCCTTCCTG GCGGCCACCG CCGCAGTCAT CGCCGAGGCT CCCGCCGGCC AGATGCTGAC CCCCGGGATT TGCAAGGCCT TCGCCGGCGG GGTGAAGGCG CTGGCCGAGA CCGCCGGCGT CACCGAAGTC GCCCACGGCC TGGAAGGCGG TCACGGCCAG GGCCGCGCGT CGCTGTTCAG CGTCGATGCG GCGACCTTCC TGGCCACCCC GCACCTGCAG GACGAGGTGT TCGGCGCCGC CTCGCTGGTC GTTCACGCCA AGGACCTGGC CCAACTGATC CAGGTGATCG CCGCCCTGGA AGGCCAGCTG ACCATCGCCG TGCACATGGA TGACGCCGAC ACGGATCTCG CCCGCGCCCT GCTGCCGGCC CTGGAGCTGA AAGCCGGCCG CGTGCTGATC AACGGCTTCG GCACCGGCGT CGAGGTCGGC CATGCCATGG TCCACGGCGG CCCGTTCCCC TCGACATCGG ACAGCCGCTC GACCTCGGTC GGCTCGCTGG CGATCTTCCG CTTCCTGCGC CCGGTCTCGT ACCAGAACCT GCCGGCGGCG CTGCTGCCGG CGGAACTGCA GGACGCCAAC CCGCTGGGGA TTGCGCGTCG CGTGGATGGG AAGGTTCAGT TGGCCTAG
|
Protein sequence | MTLTGELLID GGARPGTGGA IHAVDPSTGG ALDVTFGGAA KADLDEACAL AAAAFGPYRA TSPEQRAVFL ETVAAKIEAI GDDLIVRAMA ETGLPRARLE GERGRTMGQL KMFAGVLRDG GWLEARIDPA QPDRKPMARP DLRLRNVPLG PVAVFGASNF PLAFSVAGGD TASALAAGCP VVVKAHPAHP GTSELVGRAV QAAVKECGLP SGVFALLHDA GIEISQGLVA DHRIKAAGFT GSRRAGLALL AIAQGRPEPI PFYAEMSSIN PVLLLPNALK ARGPAIAPDF VAALTQGAGQ FCTNPGLILG IDGPELDAFL AATAAVIAEA PAGQMLTPGI CKAFAGGVKA LAETAGVTEV AHGLEGGHGQ GRASLFSVDA ATFLATPHLQ DEVFGAASLV VHAKDLAQLI QVIAALEGQL TIAVHMDDAD TDLARALLPA LELKAGRVLI NGFGTGVEVG HAMVHGGPFP STSDSRSTSV GSLAIFRFLR PVSYQNLPAA LLPAELQDAN PLGIARRVDG KVQLA
|
| |