Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3955 |
Symbol | |
ID | 5901417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4283766 |
End bp | 4285253 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564476 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001685578 |
Protein GI | 167647915 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.768123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.683477 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCCTGC TGTCTCCCAT CGACGCGCTG AAGACGTTGC CGCTGCCCGG CCGGGCGGTG ATCGACGGCG CGTTGGTCGA GGCGGCTTCG GGGGCGACGT TCCACAACGT CTCGCCGCGC GACGGGGCGG TGATCAACCA GGTCGCCGCC TGCCAGGCCG AGGACGTCGA CCGCGCCGTG GCCAGCGCCC GCGCAGCCTT TGAGGATGGC CGGTGGCGCG ACCAGGGTCC GCGCGACAAG AAGCGGGTGC TGTTCAGGCT GGCCGAGCTG ATGGAGCGCG ACGCCGAACA GCTGGCCCTG CTGGAGAGCC TGGACACCGG CAAGCCGATC CGCGACGCCC GGGCGGTCGA CGTGCCCCTG TCGATCGGCA CAACCCGCTG GTACGCCGAG GCCCTGGACA AGATTTATGG CGAGGTGGGC GCCTCGCCGA TCGATCGCCT GAGCTGGGCC ACGCATGAGC CGCTGGGGGT GATCGGCGCC ATCGTGCCGT GGAACTTCCC CCTGCACATG GCGATGTGGA AGGCGGCCCC GGCCCTGGCC ATGGGCAACA GCGTCGTCCT CAAGCCCGCC GAGCAGTCGC CGCTGACCGC CCTGAAGCTG GGCGAACTGG CGCTGGAGGC TGGCCTGCCG CCCGGCGTGC TGAACGTGGT TCCGGGCCTG GGCGCCACGG CCGGCGAGGC GCTCGCCCTG TCGATGGACG TCGACATGAT CGCCTTCACC GGCTCGGGTC CGGTGGGCCG GCGGCTGATG GAATATTCGG CGCGGAGCAA CCTCAAGCGC GTGTCGCTGG AGCTGGGCGG CAAGTCGCCC CAGATCGTCT TCGCCGACTG CCCGGACCTG GACGCCGCCG CCCAGGCCGC CGCCTGGGGC GTGTTCTATA ACCAGGGCGA GGTCTGCACC GCCGCGTCCC GCTTGCTGGT CGAGGCCTCG ATCAAGGACG CCTTCCTCGA GAAGGTGATC GCGGTGGCCA AGACCATGGT CCCCGGCGAC CCGCTGGATC CCGACACCGT GTTCGGGGCC ATGGTCAGCG AGCGGCAGAT GAACACCGCC CTGGACTACA TCGCCACCGC CGACAGCCAG GGCGCCCGCC GGCTGCTGGG CGGCAAGCAA GTACGTCGGG AGACGGGCGG CTTCTATGTC GAGCCCACCA TCTTCGATCG GCTGGAGCCC GACCACACCC TGGCCCGCGA AGAGGTGTTC GGGCCGGTGC TAGGCGTGCA GACCTTCAAG ACCCAGGACG AGGCGATCGC CCTGGCCAAC GACACCGTCT ACGGCCTGGC CGCCGGCCTG TGGACCAGCG ACATCAACCG CGCCCTCACC GCCGCGCGGC GGCTGAAGGC CGGCCTGGTG TGGATTAACG GCTGGGACGC CTGCGACATC ACCATGCCGT TCGGCGGCTT CAAGCAGTCG GGTTTCGGTC GCGACCGCAG CCTTCACGCG TTGCACAAAT ATGCCGACCT GAAGTCGGTG TCCGTGACGC TGAGGTGA
|
Protein sequence | MTLLSPIDAL KTLPLPGRAV IDGALVEAAS GATFHNVSPR DGAVINQVAA CQAEDVDRAV ASARAAFEDG RWRDQGPRDK KRVLFRLAEL MERDAEQLAL LESLDTGKPI RDARAVDVPL SIGTTRWYAE ALDKIYGEVG ASPIDRLSWA THEPLGVIGA IVPWNFPLHM AMWKAAPALA MGNSVVLKPA EQSPLTALKL GELALEAGLP PGVLNVVPGL GATAGEALAL SMDVDMIAFT GSGPVGRRLM EYSARSNLKR VSLELGGKSP QIVFADCPDL DAAAQAAAWG VFYNQGEVCT AASRLLVEAS IKDAFLEKVI AVAKTMVPGD PLDPDTVFGA MVSERQMNTA LDYIATADSQ GARRLLGGKQ VRRETGGFYV EPTIFDRLEP DHTLAREEVF GPVLGVQTFK TQDEAIALAN DTVYGLAAGL WTSDINRALT AARRLKAGLV WINGWDACDI TMPFGGFKQS GFGRDRSLHA LHKYADLKSV SVTLR
|
| |