Gene Information |
Locus tag | Caul_4341 |
Symbol | |
ID | 5901802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4719436 |
End bp | 4720875 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564859 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001685959 |
Protein GI | 167648296 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
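The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length = gene_length(4719436, 4720875)
print(length, protein_length(length))
```

Under these assumptions the result agrees with the Gene Length (1440 bp) and Protein Length (479 aa) fields in the record.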
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.977746 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGACT ACCTGAAGTT CTACATCGAC GGCCAGTGGG TGCAGCCCAA GGGCACGCAA ACGGTCGACG TGATCAATCC GGCCACCGAG GCCGTCGCCG GCCGGGTGAC CCTGGGCACG GCCGAGGACG TCGACCTGGC CGTCCGCGCC GCCCGCAAGG CCTTCGCCAC CTACTCCCTG ACCAGCCGCG AAGAGCGCAT CGACCTGCTC GAGCGGATCA TCGCCGAGTA CCAGAAGCGC TTCGAGGACA TGGCCAAGGC CATCACCGAG GAGATGGGCG CCCCGGCGTG GCTGGCCCAG CGCGCACAGG CCGCCATGGG CATCGCCCAC GTCCAGACCG CGCTTCAGGT TCTCAAGGAC TACAAGTTCG AGGAAGATCG CGGCACGACC CGTCTGGTCA AGGAGCCGAT CGGCGTCTGC GCCTTCATCA CCCCGTGGAA CTGGCCGGTC AACCAGATCG CCTGCAAGGT CGCGCCGGCC CTGGCCGTCG GCTGCACCAT GGTGCTCAAG CCCTCGGAAG TGGCCCCGTT CTCAGCCTGG ATCTGGACCG AGATCCTGGC CGCCGCCGGC GTGCCAGCCG GGGTGTTCAA CCTGGTCAAC GGCGACGGCC CGACCGTGGG CGCGGCGCTG AGCTCGCATC CGGAAGTCGA CATGGTGTCG TTCACCGGCT CGACCCGGGC CGGCATCGAG GTGGCCAAGA ACGCCGCCCC GACCGTCAAG CGCGTGCACC AGGAACTGGG CGGCAAGAGC CCCAACATCA TCCTCGACGA CGCCGACTAC CAGAAGGCGG TCGGCGGCGG CGTGGCCTCG GTGATGATGA ACTCGGGCCA GTCGTGCAAC GCCCCGACCC GGATGCTGGT GCCGCAAAAG CGCATGGACG AGGTGATCGC CATCGCCAAG GCCGCCGCCG ACGCGCACAC CGTGGGCGAC CCCAACGGCA ATTCCAAGCT GGGTCCGGTC GTGTCGGAAG TGCAGTGGAA CAAGATCCAG GGCCTGATCC AGAAGGGCGT CGACGAGGGC GCGACCCTGG TCTCCGGCGG TCCCGGCAAG CCCGAGGGCC TGGACGTCGG CTACTATGTG AAGCCGACGG TGTTCGCCAA TGTCACCCCG GACATGACCA TTGCCAAGGA GGAGATCTTT GGCCCGGTGC TGGCCATCCT CGGCTATGAT GGCGTCGACC AGGCGGTCGA GATCGGCAAC GACACCGAAT ACGGCCTGGC CGCCTACGTG TCCGGCAACG ACGAGTCCCA GGTGCGAGCC GTGGCGTCCA AGCTTCGGGC CGGCCAGGTG ATCCTCAACG GCGCCGGTCC CGACCTGATG GCCCCGTTCG GCGGCTACAA GATGAGCGGC AACGGCCGCG AATGGGGCGA CCACGCGTTC GGCGAGTTCC TCGAGACCAA GGCGATCCTG GGCTACGGCG CCAAGATCGC GGCGGAGTAA
|
Protein sequence | MRDYLKFYID GQWVQPKGTQ TVDVINPATE AVAGRVTLGT AEDVDLAVRA ARKAFATYSL TSREERIDLL ERIIAEYQKR FEDMAKAITE EMGAPAWLAQ RAQAAMGIAH VQTALQVLKD YKFEEDRGTT RLVKEPIGVC AFITPWNWPV NQIACKVAPA LAVGCTMVLK PSEVAPFSAW IWTEILAAAG VPAGVFNLVN GDGPTVGAAL SSHPEVDMVS FTGSTRAGIE VAKNAAPTVK RVHQELGGKS PNIILDDADY QKAVGGGVAS VMMNSGQSCN APTRMLVPQK RMDEVIAIAK AAADAHTVGD PNGNSKLGPV VSEVQWNKIQ GLIQKGVDEG ATLVSGGPGK PEGLDVGYYV KPTVFANVTP DMTIAKEEIF GPVLAILGYD GVDQAVEIGN DTEYGLAAYV SGNDESQVRA VASKLRAGQV ILNGAGPDLM APFGGYKMSG NGREWGDHAF GEFLETKAIL GYGAKIAAE
|
| |
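The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch rather than the database's own method: the helper names are hypothetical, and the codon table is built from the standard NCBI base ordering. Translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. The gene sequence is shown in the record as the coding strand (it begins with ATG and ends with TAA), so no reverse complement is taken here even though the gene lies on the minus strand.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
head = clean("ATGCGCGACT ACCTGAAGTT CTACATCGAC")
print(translate(head))
```

Passing the full gene sequence through `clean` and then `translate` or `gc_fraction` gives values that can be compared with the Protein sequence and GC content fields.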