Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5442 |
Symbol | |
ID | 5897159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010333 |
Strand | + |
Start bp | 154583 |
End bp | 156043 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641550729 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001672215 |
Protein GI | 167621707 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0274338 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTTT ATGCAATCAC TGCGACCACT GAAGCACCGG TTCATCCCTG CGGCGAGGGG GCGATCCGAT CCCTGTTCGC GTCGCAGCGC CGATCAGCCT TGGAGAACAG GACGAAATTC ACGCTGAAGG CGCGACTGGC TATGTTGTCG CGACTAAAGG CGACGATGAA GAGCCGGGAA GACGAGATTA TCCGAGCTCT CTGCACGGAT TTCAGGAAGC CTGAATCCGA GGTGCGCCTG ACCGAACTGT TCCCGGTCTA TCAGGAGATA TCGCATGCCC GGCGCCACCT CCGATCCTGG CTGAGACCGC ACCGGGTTCA CGACTCTTTG GGGATGTTCG GAATCGCTGC GGAGGTCCGC TATCAGGCCA AGGGCGTCTG CCTGATAATT TCCCCGTGGA ACTATCCGGT CAATCTCAGT TTCGGGCCAC TGGTGTCCGC GCTGGCAGCC GGAAACACCG TCATCATCAA GCCTTCCGAA CTGACGCCGG CGACGTCCGC CCTGGTCAGG GACATCGTCG AGCAGACCTT CCCCCGGGAT CTCGTCGCCG TCTGCGAAGG CGACGCCGAG GTTTCGCAGG CCCTGCTGGA TCTACCCTTC GACCACATCT TCTTCACGGG CAGTCCCCAG GTCGGCAAGA TCGTGATGGC GGCCGCAGCG AAACATTTAA CATCCGTGAC GCTTGAACTC GGGGGCAAGT CCCCGACCAT CGTCGATTCG ACCGCGAATA TCGAGCAAGC CGCCTGCAAG ATCGTCTGGG GCAAGTTCGC CAATAACGGC CAGACCTGCA TTGCTCCGGA TCATGTCTAT GTTGCTCGCG ACCAGGCCTC GGCGCTGGTC GATGCGCTGC GGCATGAGAT CAGGCGGGTC TACGGGCAGA CGGACGGCGA GCAGAAAGCC GGGCCGGACT ATTGCCGGAT CGTGAACCGG CGGCATTTCG ATCGTCTGAC CGCCCTGGCC GACGACGCCA CATCGCGCGG TGCGACCCTC CTGGAAGGTG GGGCGCGAGA TTCAGACCAG AACTATTTCG CGCCGACCAT ACTCGGCGGA ACGACGCCGC AGATGGCGAT TTCCCAGGAA GAAATATTCG GTCCGCTTCT TCCGATCATC GAATATGACG ACATCAGCGT CGTCATCGAC GCGATCAACG CGGGCCCAAA GCCGCTTGCC ATGTATGTCT TCAGCAACGA CGCCGCCGCC CGCGAGGATA TCATCCTTAG GACGAGTTCC GGTGGTGTCT GCGTCAACAA CAATGTCGTC CAATTCTTGC ATCCAAACCT GCCGTTTGGC GGAGTCAACA ACAGCGGCAT TGGCGCTGCA CACGGTTTCT ATGGCTTTAA AGCCTTCTCC CATGAACGTG CGATTCTAAG AGACAAATTC TCCGTCCTGC GTCTTCTTTT CCCGCCGTAC ACCCCGACCG TAAAGAAACT CATCAATCTA ATCGTCCGTC TTTTGGGTTG A
|
Protein sequence | MNFYAITATT EAPVHPCGEG AIRSLFASQR RSALENRTKF TLKARLAMLS RLKATMKSRE DEIIRALCTD FRKPESEVRL TELFPVYQEI SHARRHLRSW LRPHRVHDSL GMFGIAAEVR YQAKGVCLII SPWNYPVNLS FGPLVSALAA GNTVIIKPSE LTPATSALVR DIVEQTFPRD LVAVCEGDAE VSQALLDLPF DHIFFTGSPQ VGKIVMAAAA KHLTSVTLEL GGKSPTIVDS TANIEQAACK IVWGKFANNG QTCIAPDHVY VARDQASALV DALRHEIRRV YGQTDGEQKA GPDYCRIVNR RHFDRLTALA DDATSRGATL LEGGARDSDQ NYFAPTILGG TTPQMAISQE EIFGPLLPII EYDDISVVID AINAGPKPLA MYVFSNDAAA REDIILRTSS GGVCVNNNVV QFLHPNLPFG GVNNSGIGAA HGFYGFKAFS HERAILRDKF SVLRLLFPPY TPTVKKLINL IVRLLG
|
| |