Gene Information |
Locus tag | Caul_3617 |
Symbol | |
ID | 5901072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3903614 |
End bp | 3905152 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641564128 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001685242 |
Protein GI | 167647579 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
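The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon (the gene sequence in this record ends in TAA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if includes_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(3903614, 3905152)
    print(length, protein_length_aa(length))  # prints "1539 512"
```

Under these assumptions the computed values agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields of the record.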
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGAAG CCGCCCCCCA AGCCGCTTTG CGGGCCCTCA ACCCCGCCAC CAACGAACAC TTTGGTCCCA GCTTCCCAGA GCCCAGCGCC GCCCAGATCG AGGCGGCCTG CGCCGCCGCC GCGGCCGCGT TCGACGCCTA TCGCGAGACC GACCTGGAAA CCCGCGCGGC CTTCCTCGAG GGAATCGCCA CCGAGATCGA GGCCCTGGGC GACGCGTTGA TCCAGACCGC CATGGCCGAG ACCGGCCTGC CCCAGGCCCG CATCACCGGC GAGCGCGGCC GCACCTGCGG CCAGCTGCGC CTGTTCGCCC AGGTCGTGCG CCGCGGCGAC TGGATCGGCG CGCGGATCGA CCCGGCCATG CCCGAGCGCA CGCCCCTGCC CCGCGCCGAC CTGCGCCAGC GCTTCATCCC GCTGGGTCCG GTCGTGGTGT TCGGAGCCAG CAACTTCCCA CTGGCCTTCT CGACGGCCGG CGGCGACACC GCCTCGGCCC TGGCGGCCGG TTGCCCGGTG ATCGTCAAGG GCCACTCGGC CCACCCCAAC ACCGGCGCGA TGATCGGCGG CGCGATCGAC AAGGCGGTCA AGGCCGCCGG CCTGCCCGCC GGGGTCTTCG CCATCCTGAT CGGCCAGCAG CGCACCCTGG GCGCCGGCCT GGTCGCCGAT CCGCGCATCA AGGCCGTGGG CTTCACCGGT TCTCGCGCCG GCGGCGTCGC CTTCATGCGG ATCGCCGCGG GGCGTCCCGA GCCGATCCCG GTCTTCGCCG AGATGAGCAG CATCAACCCG GTGGTCCTCA TGCCCGCCGC CCTGGCCGCC CGGGCCGAGG CCCTGGGGAC GGCCTTCGTC GGCTCGCTGA CGATGGGCGC GGGCCAGTTC TGCACCAATC CCGGCCTGGT CTTCGCCCTG GGCGGCCCCG ACCTGGATCG TTTCGAAGCC GCCGCCGTCG CCGCCCTGAC CGCCGCCCAG CCGCAGGTCA TGCTGACGCC CGGCATCTTC GGCGCCTATG AGCAGGGGGT GAACCAATTG CTCGACCGCG ACGGCGTCAC GCTGCTGGCG CGCGGCTGCG TCGGCGACGG CGTCAACCAG GCGGTCGGCG CGCTGTTCTC GGTCGATGTC GAGACCTTCC AGCGCGACGC GGTGCTGAGC CATGAGGTGT TCGGCTCGTC GTCGCTGATC GTGCGGGTGT CGGACGCCGC CCAACTGGCC GGCGCGTTGG AAGGGCTGGA GGGCCAACTG ACCGCCACCC TGCAGATGGA TCCCGCCGAC GCCGAGGCCG CGCGCGGCCT GATGCCGATC CTGGAGCGCA AGGCCGGCCG CATCCTGGCC AATGGCTGGC CGACCGGGGT CGAGGTCTCG CACGCCATGG TCCACGGCGG CCCGTTCCCG GCCACGTCCG ACCCGCGCGG AACGTCGGTG GGCACGCGGG CCATCGAGCG GTTCCTGCGG CCGGTCTGCT ACCAGGACAT CCCCGATACG CTGCTGCCGC CAGCCCTGAA GGCGGACAAT CCGCTGGGCG TGCGGCGGGC CGTGGACGGG GTGCTGTAA
Protein sequence | MAEAAPQAAL RALNPATNEH FGPSFPEPSA AQIEAACAAA AAAFDAYRET DLETRAAFLE GIATEIEALG DALIQTAMAE TGLPQARITG ERGRTCGQLR LFAQVVRRGD WIGARIDPAM PERTPLPRAD LRQRFIPLGP VVVFGASNFP LAFSTAGGDT ASALAAGCPV IVKGHSAHPN TGAMIGGAID KAVKAAGLPA GVFAILIGQQ RTLGAGLVAD PRIKAVGFTG SRAGGVAFMR IAAGRPEPIP VFAEMSSINP VVLMPAALAA RAEALGTAFV GSLTMGAGQF CTNPGLVFAL GGPDLDRFEA AAVAALTAAQ PQVMLTPGIF GAYEQGVNQL LDRDGVTLLA RGCVGDGVNQ AVGALFSVDV ETFQRDAVLS HEVFGSSSLI VRVSDAAQLA GALEGLEGQL TATLQMDPAD AEAARGLMPI LERKAGRILA NGWPTGVEVS HAMVHGGPFP ATSDPRGTSV GTRAIERFLR PVCYQDIPDT LLPPALKADN PLGVRRAVDG VL
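The GC content field (reported as 73% in this record) could in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_percent` is a hypothetical helper, and it assumes the sequence is supplied as plain text in whitespace-separated blocks, as shown above, with only unambiguous bases.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (such as the 10-base block separators used in this record)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Toy input: 6 of 8 bases are G or C.
    print(gc_percent("ATGC GGCC"))  # prints 75.0
```

Passing the full gene sequence string from the Sequence table to `gc_percent` and rounding the result gives a value to compare against the reported GC content.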