Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3778 |
Symbol | |
ID | 5901240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4094551 |
End bp | 4096020 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564301 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001685403 |
Protein GI | 167647740 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0616063 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCCCCA CGCAGGCCCG CCGGGCGACC CTGACGCGAC CGGAGCTGCT CCGCTCGCAA GTCTATTATG CGGGGGCGTG GCGCGGCGCC GGCTCCGGCG AGACCGTCCC GGTGATCGAT CCGTTCTCGG GTGAAGCGCT TGGCGAGGTC GCGTCGCTGG GCGAGGGCGA GATCCACGCC GCGATCGAGG CCGCCCAGGC GGCGTTCCCG CGCTGGTCGC GGACGCCCCA CCGCGAACGC GGCGCCTTGC TGCGCCGCTG GCTCGAGCTG ATCGAGCGCG ACAAGGAAGA TCTGGCCCGG CTGATCACCC TGGAGAACGG CAAGCCGCTG AAGGAAGCGC GCGCGGAAGT AGCCTATGGT TCGGGCTTCA TCGAGGTCTA TGCCGAGGAG GCGGGTCGCA TCCTTGGCGA AATCCTTCCG CCCAACATGC CCGGACGCCG CCTGCTGGTC GAACGCGAGC CGATCGGCGT CTGCGCGGCG ATCACCCCCT GGAACTTCCC GATGGCCATG CTGACGCGCA AGATCGCGCC GGCGCTGGCG GCGGGCTGCA CGATCGTCTG CAAGCCGGCC AGCGAGACGC CGCTGACCGC GCTGGCCCTG GCCCTCCTCG CGCAAGAGGC CGGCATTCCG GCCGGCGTGC TGAGCGTCGT GGTCAGCGCG CCGGCGCTGT TTGGCGACAT CGTCACGGCC TCCAGCGTGG TGCGCAAGAT CACCTTCACC GGGTCCACGC CGGTCGGGGC GCGGCTGATG GCGGCGTCGG CCCCGACCAT CAAGCGGCTG TCGCTGGAAC TGGGCGGCAA CGCCCCCCTG CTGGTCTTCG ACGACGCCGA TCTGGAGGTG GCGGTCGAGA CCGCGATGGT GGCCAAGTTC CGCAACGGCG GGCAAAGCTG CATCGCGGCC AACCGCCTGT ACGTCCAGCG CGGGATCTAC GAGGCGTTCC TGTCGGCGTT CCAGGCGCGA GTCGCCGCGC TGCGGGTCGG CGACGGCCTT GATCCCGAGA CCGATATCGG GCCGCTGATC AGCGCCCGCG CGGTGGAGAA GGTCGAACGC CACCTCGACG ACGCCCTGGC CGGCGGCGCG CGCCTGATCA GCGGCGGCAA GAGCGACGGC TCGCTGCTGT CACCGGCGAC CTTGCTCGGC GACGTGGCGC CCGACGCCCT TCTGACCCGG GAAGAGACCT TCGGGCCGAT GGCCGGCGTC ATTCCGTTCG AGACCTACGA CCAGGCCGTC ACGATGGCCA ACGACACGCC GTTTGGCCTG GCCGCCTATG TCTGCTCCAC CCGCCAGGAC ACCATCGCCC GCGCCGGTCG CGACCTGGAG ACCGGGATGG TCGGCGTCAA TACCGGCCTG ATCTCGACGG CCGCCGCGCC GTTCGGCGGG GTTAAGCTGT CCGGCGTCGG CCGCGAGGGC TCGCATCACG GCATCTCGGA ATACTTGAAC TACAAGTACC TCTGCCAGGC AGGACTCTAG
|
Protein sequence | MTPTQARRAT LTRPELLRSQ VYYAGAWRGA GSGETVPVID PFSGEALGEV ASLGEGEIHA AIEAAQAAFP RWSRTPHRER GALLRRWLEL IERDKEDLAR LITLENGKPL KEARAEVAYG SGFIEVYAEE AGRILGEILP PNMPGRRLLV EREPIGVCAA ITPWNFPMAM LTRKIAPALA AGCTIVCKPA SETPLTALAL ALLAQEAGIP AGVLSVVVSA PALFGDIVTA SSVVRKITFT GSTPVGARLM AASAPTIKRL SLELGGNAPL LVFDDADLEV AVETAMVAKF RNGGQSCIAA NRLYVQRGIY EAFLSAFQAR VAALRVGDGL DPETDIGPLI SARAVEKVER HLDDALAGGA RLISGGKSDG SLLSPATLLG DVAPDALLTR EETFGPMAGV IPFETYDQAV TMANDTPFGL AAYVCSTRQD TIARAGRDLE TGMVGVNTGL ISTAAAPFGG VKLSGVGREG SHHGISEYLN YKYLCQAGL
|
| |