Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2747 |
Symbol | |
ID | 5900202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2984288 |
End bp | 2985820 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641563239 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001684372 |
Protein GI | 167646709 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.120582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0601156 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCGCATCG CCCAGGTCGG CTTCATGGTG ATCGCTGATA AGATGAGCCG CCAGGCCGCT TGGGAGGACC TAACCTTGAA CGCGCCCGCC AATCTCAACC TTGACCTGCA CAAAGCCTCG CTCTCGAAAA TTCTCGAAGC GCAGAAGGCC GCGCACCTGC GACAAGGCGC CCCAACGGCG GCCCGTCGCA TCGATTGGCT GGACCGTTGC ATCGGCCTGC TGGTCGATCA CCAGGTCGAG ATCGCCGACG CCCTGAACAC CGATTTTGGC GCGCGCTCCA AGGACGCCAC GGGCCTGACC GACATCGCCG GCTCGATCGG CCCGCTGAAG TACGCCAAGG AGAATGTCGC CAAGTGGATG CGCCCCGAAA AGCGCAAGAC CACCCCGGCG ATCCTGGGCC TGTTCGGCGC CAAGGCCGAG GTCCACTATC AGCCCAAGGG CGTGGTCGGG GTGATCAGCC CGTGGAACTT CCCGGTCAAC CTGACCTTCG CGCCGCTGGC CGGCGTGTTG GCGGCCGGCA ACCGGGCGAT GATCAAGCCG TCCGAGTTCA CGCCGATCAC CTCCGAACTG ATGAAGACGA TGTTCGCCAA GGCCTTTTCC GAGGAGGAGA TCGCGGTGAT CACTGGCGGC CCCGACGTGG GCCAGGCCTT CACCAGCCTG CCGTTCGACC ACCTGGTCTT CACCGGCGCG ACGTCGGTGG CGCGTCATGT GATGCGAGCG GCGGCCGAGA ACCTGGTGCC GGTGACCCTG GAGCTGGGCG GCAAGAGCCC GGTGATTCTG TCGCGGGGGG CCGACATGGC CACGGCGGCG GCGCGGATCA TGAACGGCAA GACCCTCAAC GCCGGCCAGA TCTGCCTGGC GCCCGACTAT GTGCTGGCGC CGGCCGACCA GATCGACAGC TTCGTCGCCG AGGCCAAGGC CGCCGTGGCG CGGACCTTCC CGACCCTCAA GGACAATCCC GACTATACGG CCGTGGTCGC CCAGCGCCAC TATGACCGGA TCAAGGGCCA TGTGGACGAC GCCCGGGCCA AGGGCGCGAC GATCATCGAG ATCAACCCGG CCGGCGAGGA TCTGAGCCAG CAGGAGCATC GCAAGATCGC CCCGACCCTG ATCCTCAACC CGACCGACGA CATGACGGTG ATGCAGGACG AGATCTTCGG TCCCGTCCTG CCGGTGAAGA CCTACGGCAA GGTCGAGGAG GCGGTAAACT ATATCAACGC CCACGATCGG CCCCTGGGGC TCTACTGGTT CGGGACCGAC GACGCCGAGC GCGACATGGT GCTGAACCGC ACGACCAGCG GCGGGGTGAC GGTCAATGAC GTGATTTTCC ACGTCGCCCA GGAGGATCTG CCGTTCGGCG GCGTCGGGCC GGCCGGCATG GGCTCGTACC ATGGCCGTGA CGGCTTCATG GAGTTCAGCC ACCGCAAGGC GGTGTTCCAT CAGCTGAAGA AGGACATCGC GCCCATGCTG GCCCTGCGGC CGCCCTATGG CGCGGGGATC CGCAAGTATC TGGCGAGTCA GATCAAGAAG TAG
|
Protein sequence | MRIAQVGFMV IADKMSRQAA WEDLTLNAPA NLNLDLHKAS LSKILEAQKA AHLRQGAPTA ARRIDWLDRC IGLLVDHQVE IADALNTDFG ARSKDATGLT DIAGSIGPLK YAKENVAKWM RPEKRKTTPA ILGLFGAKAE VHYQPKGVVG VISPWNFPVN LTFAPLAGVL AAGNRAMIKP SEFTPITSEL MKTMFAKAFS EEEIAVITGG PDVGQAFTSL PFDHLVFTGA TSVARHVMRA AAENLVPVTL ELGGKSPVIL SRGADMATAA ARIMNGKTLN AGQICLAPDY VLAPADQIDS FVAEAKAAVA RTFPTLKDNP DYTAVVAQRH YDRIKGHVDD ARAKGATIIE INPAGEDLSQ QEHRKIAPTL ILNPTDDMTV MQDEIFGPVL PVKTYGKVEE AVNYINAHDR PLGLYWFGTD DAERDMVLNR TTSGGVTVND VIFHVAQEDL PFGGVGPAGM GSYHGRDGFM EFSHRKAVFH QLKKDIAPML ALRPPYGAGI RKYLASQIKK
|
| |