Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3954 |
Symbol | |
ID | 5901416 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4282315 |
End bp | 4283760 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641564475 |
Product | succinic semialdehyde dehydrogenase |
Protein accession | YP_001685577 |
Protein GI | 167647914 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | [TIGR01780] succinate-semialdehyde dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.0655928 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.638974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGG AACTCGTCGA AACCGCCGCC TTCATCGACG GCCTCTGGAT CGAAGCCGAC GCCACCTTCG AGGTGTTCAA CCCCGCCGAC GGCTCGGTGA TCGCCCAGGT CGCCAACCTG GGCGCGTCGG AAACCAAGCT CGCCATCGAG GCGGCCCACC GCGCCTTCCC GGCCTGGGCC GCGCGCACCG CCAAGGACCG CGGGGCGATC CTGCGCCGGT GGTCCGACCT GATGCTGCTG CACGCCGAGG CCCTGGCCCG GCTGATGACC GCCGAGCAGG GCAAGCCGCT GGCGGAGTCC CGGGGCGAGG TGGCCTACGG CGCGGCGTTC ATCGACTGGT TCGCCGACGA GGCCAAGCGG GCCTACGGCC ATGCCATCCC CAGTCCCATG CCCGGCAAGA GATTGGTCTC GATCAAGCAG CCGGTCGGGG TGTGCGCGGC CATCGCGCCG TGGAACTTCC CGATCGCCAT GATCACCCGC AAGGTCGGCC CGGCCCTGGC GGCGGGCTGC ACCGTGGTGG TCAAGCCGGC GGCCGAGACC CCGCTGTGCG CCCTGGCCAT CGCCCGCCTG GCGGTGGAGG CGGGCGTGCC GGCCGGGGTG CTCAATGTCG TCACCGGCAA GGACAGCGCC GCCATCGGCA AGGCCCTGTG CGAGGATGCA AGGGTGCGCA AGCTGTCGTT CACGGGCTCG ACCCCGGTGG GCAAGACCCT CTACGCCCAG TGCGCCGGCA CCATGAAGAA GCTGTCGCTG GAGCTGGGCG GCAATGCGCC GTTCATCGTC TTCGACGACG CCGATCTCGA GGCCGCCGTC GATGGGGCCA TCGCCAGCAA GTACCGCAAC ACCGGCCAGA CCTGCGTCTG CGCCAATCGC CTGCTGGTGC AGTCCGGCAT CCACGACGCC TTCGTCGCGC GGCTGACCGA AAAGGTCGCG GCGATGAAGG TCGGGCCGGG CACAGGCGAG GGCGTGACCA TCGGCCCGCT GATCAACGAC AAGGCCATTG CCAAGGTCGA AAAGCTGGTG CGTGAAGCGG TCGAGCAGGG CGCCAAGGCC ACGGTCGGCG GCGATCGTCA TGCGCTGGGC GGCCTGTTCT GGCAGCCCAC GGTGCTGACC GGCGCGACGC CCGACATGCG GCTGTTCCAG GAGGAGATCT TCGGCCCGGT CGCGCCGATC GTGAAGTTCG ACACCGAGCA GGAGGCCATC GACCTGGCCA ACGCCACGCC ATTTGGTCTC GCCTCGTACT TCTACAGCCG CGACGTTGGC CGCTGCTGGC GGGTGGCCGA GGCGATCGAG GCGGGGATGG TCGGGATCAA CGAAGGGATC ATCTCCACCG AGGTGGCGCC GTTCGGCGGC GTCAAGGATT CGGGCCTGGG CCGCGAGGGG GCGTCCGAGG GTTTGGACGA GTATCTGGAG ACCAAGTACC TGTGCTTTGG CGGGGTGGGG GTGTGA
|
Protein sequence | MTLELVETAA FIDGLWIEAD ATFEVFNPAD GSVIAQVANL GASETKLAIE AAHRAFPAWA ARTAKDRGAI LRRWSDLMLL HAEALARLMT AEQGKPLAES RGEVAYGAAF IDWFADEAKR AYGHAIPSPM PGKRLVSIKQ PVGVCAAIAP WNFPIAMITR KVGPALAAGC TVVVKPAAET PLCALAIARL AVEAGVPAGV LNVVTGKDSA AIGKALCEDA RVRKLSFTGS TPVGKTLYAQ CAGTMKKLSL ELGGNAPFIV FDDADLEAAV DGAIASKYRN TGQTCVCANR LLVQSGIHDA FVARLTEKVA AMKVGPGTGE GVTIGPLIND KAIAKVEKLV REAVEQGAKA TVGGDRHALG GLFWQPTVLT GATPDMRLFQ EEIFGPVAPI VKFDTEQEAI DLANATPFGL ASYFYSRDVG RCWRVAEAIE AGMVGINEGI ISTEVAPFGG VKDSGLGREG ASEGLDEYLE TKYLCFGGVG V
|
| |