Gene Information |
Locus tag | Caul_2375 |
Symbol | |
ID | 5899830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2580567 |
End bp | 2582084 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562866 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001684000 |
Protein GI | 167646337 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCAGG ATCTGGCGAC CGAAACCCGC GCCCTGCTCG CCGATCTCGG CGTCGATCCT GCGCGACTGG GGGGCGGATC CCTGACCGTC CGCTCGCCGA TCACGGGCGA CATCCTGGCC CAGGTCCGCG AGACCAGTGT CGCCGAGGTC GGCTATGAGA TCGCCCGGGC CGAGCAGGCC TTCCAGATCT GGCGGCGGGT CCCGGCGCCC CGCCGCGGCG AGTTCGTGCG CCTGCTGGGT GAGGAACTGC GCCGCAGCAA GGAAGCCCTC GGCCAACTGG TGTCGATCGA GGTCGGCAAG GTTCTGTCCG AGGGCCTGGG CGAGGTCCAG GAGATGATCG ACATCTGCGA TTTCGCCGTT GGGCTGTCGC GCCAGCTCCA GGGACTGTGC CTTCCGTCCG AGCGCCGCGA TCACCGCATC ACCGAACAGT GGCATCCAAT CGGCCCGGTC GGGGTGATCT CCGCTTTCAA CTTCCCGGTG GCGGTGTGGA GCTGGAACGC CGCGCTGGCC TTCATTTGCG GCGACAGCGT GATTTGGAAG CCGTCCGAGA AGGCGCCGCT GACGGCGCTC GCGGTCAGCG CCCTGGCGGC GCGGGCCTGC AAGGCCTTTG GCGACGAGGC GCCCGATGGG CTGGCGACGT TGATCATTGG CGGTCGCGAG GCCGGTCGGA CGCTCGTGGA TGATCCGCGC GTGCCGGTGA TCTCGGCGAC CGGGTCGACG CGGATGGGCC AGACCGTCGG CGAACGTGTC GCACGCCGGT TTGGCAAGGC GATCCTTGAG CTCGGCGGGA ACAACGCTTC GATCGTCACG CCGTCCGCCG ATCTCGATCT GACGCTTCGC GCGGTCGCAT TCGCCGCCAT GGGGACCGCC GGCCAACGCT GTACGACGCT CCGCCGACTG CTGGTGCACG ACACCGTCTA TGATGCGCTT GTCCCAAGAC TCGCCGCCGT CTACGGCAAG ATCGCAGTGG GTGATCCCCG CGAAGACGGC AATCTCGTCG GTCCGCTCAT CGACGCCGAG GCCTTCACCG CCATGGAACG CGCACTAGAC GCCGCGCGCA CGGCGGGCGG TCGCGTTCAC GGCGGCGGTC GCGTCGATGT CAACGGCGAG AACTCCTTCT ACGCCCGACC TGCCCTGATC GAGATGAGCC AACATGCCGA GTGCGTCCGT GCGGAGACGT TCGCGCCGAT CCTCTATGTC TTCCGCTACG AAACACTCGA AGAGGCGATC GCGCTTCAGA ACGATGTGCC GCAGGGTCTG TCCTCTTCGA TCTTCGCCAC GGACATGCGC GAGGTCGAGC AGTTCCTCTC GGCCACCGGC TCCGATTGCG GCATCGCCAA CGTCAATATG GGGACGTCGG GCGCCGAGAT TGGCGGTGCT TTCGGTGGCG AGAAGGAGAC GGGCGGCGGA CGCGAAAGCG GTTCGGACAG CTGGAAGGCC TACATGCGCC GTCAGACCAA TGCGATCAAC TATGGCCGCA CGCTGCCGTT GGCCCAGGGC GTCAGGTTCG ACGTCTGA
|
Protein sequence | MTQDLATETR ALLADLGVDP ARLGGGSLTV RSPITGDILA QVRETSVAEV GYEIARAEQA FQIWRRVPAP RRGEFVRLLG EELRRSKEAL GQLVSIEVGK VLSEGLGEVQ EMIDICDFAV GLSRQLQGLC LPSERRDHRI TEQWHPIGPV GVISAFNFPV AVWSWNAALA FICGDSVIWK PSEKAPLTAL AVSALAARAC KAFGDEAPDG LATLIIGGRE AGRTLVDDPR VPVISATGST RMGQTVGERV ARRFGKAILE LGGNNASIVT PSADLDLTLR AVAFAAMGTA GQRCTTLRRL LVHDTVYDAL VPRLAAVYGK IAVGDPREDG NLVGPLIDAE AFTAMERALD AARTAGGRVH GGGRVDVNGE NSFYARPALI EMSQHAECVR AETFAPILYV FRYETLEEAI ALQNDVPQGL SSSIFATDMR EVEQFLSATG SDCGIANVNM GTSGAEIGGA FGGEKETGGG RESGSDSWKA YMRRQTNAIN YGRTLPLAQG VRFDV
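The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length (the gene sequence listed above ends in TGA). The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table of this record.
start_bp = 2580567
end_bp = 2582084
reported_gene_length = 1518    # bp
reported_protein_length = 505  # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# The gene is not spliced, so the CDS is one contiguous run of codons.
# Assumption: the last codon is a stop codon and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print("gene length (bp):", gene_length)
print("leftover bases after whole codons:", remainder)
print("protein length (aa):", protein_length)
print("matches record:",
      gene_length == reported_gene_length
      and protein_length == reported_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields reported in the record.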