Gene Information |
Locus tag | Caul_1133 |
Symbol | |
ID | 5898588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1200440 |
End bp | 1201900 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641561615 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001682761 |
Protein GI | 167645098 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
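The coordinate and length fields above can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Caul_1133.
length = gene_length(1200440, 1201900)
print(length, protein_length(length))
```

Under these assumptions the coordinates give 1461 bp and 486 aa, matching the Gene Length and Protein Length fields.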
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.809245 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.000145932 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGACG TCGCCCAAAT CCACCTGTTG ATCGACAACC AGGCGCGCCC CGCCCATGGC GGCGCCACCT TCGATCGCCT TGATCCGATC ACCGGCGCGG TCGCCACCCG CGCCGCGGCC GCCAGCCCCG GCGACGCGCG GGCCGCGGTC GACGCGGCGG CGGCGGCCTT CGTCGCCTGG TCGGAAACCG GCCCCAACAC CCGTCGCGCC CTGCTGGCCA AGGCCGCGGA CAGGCTGGAG GGCCTGGCTG ACGATTTCGT CGTCGCCATG CGTGACGAAA TCGGCGCCAC CGAGGGCTGG GCGCGCTTCA ATGTCATGCT GGCCGCCGGC ATGATCCGCG AGGCCGCCGC CATGACCACC CAGATCAGCG GCGAGGTCAT CCCTTCCGAC AAGCCTGGCT GTGTCGCCAT GGCCGTGCGC CAGCCCGCCG GCGTGGTGGT GGGGATCGCG CCGTGGAACG CGCCCGTGAT CCTGGGCGTG CGCGCCATCG CCACGCCCCT GGCCTGCGGC AACACCGTGG TGCTGAAGGC CTCCGAGACC TGCCCGCGCA CCCACGCCCT GATCGCGCGA GCCTTCCAGG AGGCCGGCCT GCCGCCCGGC GTCGTCAACG CGATCACCAA TGCGCCGGCC GACGCCGCCG CCGTGGTCGA GGCGCTCATC GCCCATCCGG CGGTCAAGCG GATCAACTTC ACCGGCTCGA CCAAGGTCGG CAAGATCATC GCCCGGCTGG GCGCCGAGCA CATGAAGCCT GTGCTGCTGG AACTGGGCGG CAAGGCCCCG CTGCTGGTGC TCGACGACGC CGACCTCGAC GAGGCGGTCA AGGCCGCCGC GTTCGGCGCC TTCATGAACC AGGGCCAGAT CTGCATGTCG ACCGAGCGGA TCGTGGTGGT CGAGTCCGTG GCCGACGCCT TCGTCGAGAA GTTCGCCGCC AAGGCCAGGA CTCTGGTCGC CGGCGACCCG CGCGAGGGCA AGACCCCGCT CGGCGCCCTG GTCGACAAGG CCGCCGCCCA AAAGGTCCAG CGCCTGATCG ACGACGCCGT CGCCAAGGGT GCGCGCCAGG TGGCCGGCGG CGGCGCCGAG GGCGTGCTGA TGTCGGCCGT GGTGCTGGAC GGGGTCAGGC CGGACATGGA GATCTATGCC GAAGAGTCGT TTGGTCCGTC GGTCAGCATC ATCCGCGTCA AGGACGAGGC CGAAGCGATC GCCGTGGCCA ATGACACCGA ATATGGCCTG TCGGCGGCGG TCTTCACCCG CGACATCGCG CGAGGCCTGA AGGTCGCCAA GCAGATCCAG TCGGGCATCT GCCACATCAA CGGCCCCACC GTGCACGACG AGGCCCAGAT GCCGTTCGGC GGCGTCAAGG CCAGCGGCTG GGGCCGCTTC GGCGGCAAGG CCGGGATCAA CGAATTTACC GACCTGCGCT GGATCACCTT CGAGACCCAG CCCGGCCACT TTCCCATCTG A
|
Protein sequence | MADVAQIHLL IDNQARPAHG GATFDRLDPI TGAVATRAAA ASPGDARAAV DAAAAAFVAW SETGPNTRRA LLAKAADRLE GLADDFVVAM RDEIGATEGW ARFNVMLAAG MIREAAAMTT QISGEVIPSD KPGCVAMAVR QPAGVVVGIA PWNAPVILGV RAIATPLACG NTVVLKASET CPRTHALIAR AFQEAGLPPG VVNAITNAPA DAAAVVEALI AHPAVKRINF TGSTKVGKII ARLGAEHMKP VLLELGGKAP LLVLDDADLD EAVKAAAFGA FMNQGQICMS TERIVVVESV ADAFVEKFAA KARTLVAGDP REGKTPLGAL VDKAAAQKVQ RLIDDAVAKG ARQVAGGGAE GVLMSAVVLD GVRPDMEIYA EESFGPSVSI IRVKDEAEAI AVANDTEYGL SAAVFTRDIA RGLKVAKQIQ SGICHINGPT VHDEAQMPFG GVKASGWGRF GGKAGINEFT DLRWITFETQ PGHFPI
|
| |
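The gene sequence above is printed in space-separated blocks of 10 bases, so the spaces must be removed before any analysis. The following is a minimal sketch of that clean-up, with a GC-fraction calculation and a basic CDS sanity check. The helper names are hypothetical. The start-codon test is a simplification: it accepts only ATG, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at end."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First three blocks of the gene sequence from the record, as a short demo.
fragment = clean_sequence("ATGGCCGACG TCGCCCAAAT CCACCTGTTG")
print(len(fragment), gc_fraction(fragment))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the listed GC content. The listed sequence begins with ATG and ends with TGA, which suggests it is shown in coding orientation even though the gene lies on the minus strand.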