Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1547 |
Symbol | |
ID | 5899002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1638019 |
End bp | 1639065 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641562035 |
Product | pyridoxal 4-dehydrogenase |
Protein accession | YP_001683175 |
Protein GI | 167645512 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.977746 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCTCG CCCCCGCTGG ACGGATGACT GCTTCCCCTG GACCAGCGGA GGGACGCGGC CGGCTGAACC TGCCGCCGCT GGGCTTCGGC GCCGGAGGAA TCGGCAATCT CTACGCCGCC ATGAGCGACG CCGCCGCCCG CGAAGCGATC GAAGCGGCCC TGGCCACCGG CCTGGCCTAT TTCGACACCG CGCCCCACTA TGGCTTTGGT CTGAGCGAAA CGCGCCTGGG CGCTGCCCTG CCGCCGGAGG CCAAGGTCTC CACCAAGGTG GGCCGGCTGC TGCGTCCAGC GCCCGAAGTC GATCCGGCGG CCGAGCGCCA CGGTTTCGTC GGCGCAGCGC CGTTCGAGCC GGTGTTCGAT TATTCCTATG ACGGAGTCAT GCGATCCTTC GAGGCCAGCC TGGAGCGCCT GAACCGCGAT CACGTCGAGG TGCTGCTGGC CCACGACCTG GGTCAGGCCA CCCACGGCGC CGATGACGCG GCGCGACGGC GGCAGTTCTT CGACGGCGGC TATAGGGCGA TGCGCGCCCT GCAGGACGCC GGCGCCGTCG ACGCCATCGG CCTGGGCGTC AACGAGTGGG AGATCTGCGA CGCGGCGCTG GACGAGGCCG ACTTCGATGT TTTCCTGCTA GCGGGTCGCT ACACCTTGCT GGAACAGACG GCGCTGGACC GTTTCCTGCC ACGCTGCGCC GCCCGCGACG TGTCGATCAT CGTCGGCGGC CCCTTCAATT CCGGCGTGCT GGTCGAGGGC GTCCGGCCGG GCGCCCACTA CAACTACGGT CCGGCCCCGG CCGAGGTCCT TGACCGTGTT GGAAGGCTGG AGGCGGTCTG TCTGGCCCAT GCCACGCCCC TGGCCGCGGC GGCCCTGCAA TTCCCGCTTG CTCATCCGCA AGTGGCCAGC GTCATCCCGG GCCTGTCCAG CCCGGACCAG GTGCGCCAAG CGCTGGCCTG GGCCGCGCAC GCCGTTCCCG ACGCCCTGTG GGACGATCTC CGCTCGGAGG GCCTGTTGCA TCCGGACGCC CCGACACCCC GCGCGGTCAA AGCGTGA
|
Protein sequence | MTLAPAGRMT ASPGPAEGRG RLNLPPLGFG AGGIGNLYAA MSDAAAREAI EAALATGLAY FDTAPHYGFG LSETRLGAAL PPEAKVSTKV GRLLRPAPEV DPAAERHGFV GAAPFEPVFD YSYDGVMRSF EASLERLNRD HVEVLLAHDL GQATHGADDA ARRRQFFDGG YRAMRALQDA GAVDAIGLGV NEWEICDAAL DEADFDVFLL AGRYTLLEQT ALDRFLPRCA ARDVSIIVGG PFNSGVLVEG VRPGAHYNYG PAPAEVLDRV GRLEAVCLAH ATPLAAAALQ FPLAHPQVAS VIPGLSSPDQ VRQALAWAAH AVPDALWDDL RSEGLLHPDA PTPRAVKA
|
| |