Gene Caul_3954 details


Gene Information

Locus tag: Caul_3954
Symbol:
ID: 5901416
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4282315
End bp: 4283760
Gene Length: 1446 bp
Protein Length: 481 aa
Translation table: 11
GC content: 70%
IMG OID: 641564475
Product: succinic semialdehyde dehydrogenase
Protein accession: YP_001685577
Protein GI: 167647914
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01780] succinate-semialdehyde dehydrogenase
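
The length fields in this record agree with its coordinates. A minimal sketch of that arithmetic follows, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon. The record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4282315, 4283760)
print(length, protein_length(length))  # 1446 481
```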


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0655928
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.638974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTGG AACTCGTCGA AACCGCCGCC TTCATCGACG GCCTCTGGAT CGAAGCCGAC 
GCCACCTTCG AGGTGTTCAA CCCCGCCGAC GGCTCGGTGA TCGCCCAGGT CGCCAACCTG
GGCGCGTCGG AAACCAAGCT CGCCATCGAG GCGGCCCACC GCGCCTTCCC GGCCTGGGCC
GCGCGCACCG CCAAGGACCG CGGGGCGATC CTGCGCCGGT GGTCCGACCT GATGCTGCTG
CACGCCGAGG CCCTGGCCCG GCTGATGACC GCCGAGCAGG GCAAGCCGCT GGCGGAGTCC
CGGGGCGAGG TGGCCTACGG CGCGGCGTTC ATCGACTGGT TCGCCGACGA GGCCAAGCGG
GCCTACGGCC ATGCCATCCC CAGTCCCATG CCCGGCAAGA GATTGGTCTC GATCAAGCAG
CCGGTCGGGG TGTGCGCGGC CATCGCGCCG TGGAACTTCC CGATCGCCAT GATCACCCGC
AAGGTCGGCC CGGCCCTGGC GGCGGGCTGC ACCGTGGTGG TCAAGCCGGC GGCCGAGACC
CCGCTGTGCG CCCTGGCCAT CGCCCGCCTG GCGGTGGAGG CGGGCGTGCC GGCCGGGGTG
CTCAATGTCG TCACCGGCAA GGACAGCGCC GCCATCGGCA AGGCCCTGTG CGAGGATGCA
AGGGTGCGCA AGCTGTCGTT CACGGGCTCG ACCCCGGTGG GCAAGACCCT CTACGCCCAG
TGCGCCGGCA CCATGAAGAA GCTGTCGCTG GAGCTGGGCG GCAATGCGCC GTTCATCGTC
TTCGACGACG CCGATCTCGA GGCCGCCGTC GATGGGGCCA TCGCCAGCAA GTACCGCAAC
ACCGGCCAGA CCTGCGTCTG CGCCAATCGC CTGCTGGTGC AGTCCGGCAT CCACGACGCC
TTCGTCGCGC GGCTGACCGA AAAGGTCGCG GCGATGAAGG TCGGGCCGGG CACAGGCGAG
GGCGTGACCA TCGGCCCGCT GATCAACGAC AAGGCCATTG CCAAGGTCGA AAAGCTGGTG
CGTGAAGCGG TCGAGCAGGG CGCCAAGGCC ACGGTCGGCG GCGATCGTCA TGCGCTGGGC
GGCCTGTTCT GGCAGCCCAC GGTGCTGACC GGCGCGACGC CCGACATGCG GCTGTTCCAG
GAGGAGATCT TCGGCCCGGT CGCGCCGATC GTGAAGTTCG ACACCGAGCA GGAGGCCATC
GACCTGGCCA ACGCCACGCC ATTTGGTCTC GCCTCGTACT TCTACAGCCG CGACGTTGGC
CGCTGCTGGC GGGTGGCCGA GGCGATCGAG GCGGGGATGG TCGGGATCAA CGAAGGGATC
ATCTCCACCG AGGTGGCGCC GTTCGGCGGC GTCAAGGATT CGGGCCTGGG CCGCGAGGGG
GCGTCCGAGG GTTTGGACGA GTATCTGGAG ACCAAGTACC TGTGCTTTGG CGGGGTGGGG
GTGTGA
 
Protein sequence
MTLELVETAA FIDGLWIEAD ATFEVFNPAD GSVIAQVANL GASETKLAIE AAHRAFPAWA 
ARTAKDRGAI LRRWSDLMLL HAEALARLMT AEQGKPLAES RGEVAYGAAF IDWFADEAKR
AYGHAIPSPM PGKRLVSIKQ PVGVCAAIAP WNFPIAMITR KVGPALAAGC TVVVKPAAET
PLCALAIARL AVEAGVPAGV LNVVTGKDSA AIGKALCEDA RVRKLSFTGS TPVGKTLYAQ
CAGTMKKLSL ELGGNAPFIV FDDADLEAAV DGAIASKYRN TGQTCVCANR LLVQSGIHDA
FVARLTEKVA AMKVGPGTGE GVTIGPLIND KAIAKVEKLV REAVEQGAKA TVGGDRHALG
GLFWQPTVLT GATPDMRLFQ EEIFGPVAPI VKFDTEQEAI DLANATPFGL ASYFYSRDVG
RCWRVAEAIE AGMVGINEGI ISTEVAPFGG VKDSGLGREG ASEGLDEYLE TKYLCFGGVG
V
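
The gene and protein sequences can be cross-checked against each other and against the GC content field. The sketch below is one way to do that in plain Python, not the method used to generate this record. It assumes the sequences are pasted in as strings with the block spacing intact. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
# Codon-to-amino-acid map in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same codon assignments as the standard table; the two
# differ only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises ValueError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        first, second, third = (BASES.index(base) for base in dna[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * first + 4 * second + third]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence in this record.
first_line = "ATGACCCTGG AACTCGTCGA AACCGCCGCC TTCATCGACG GCCTCTGGAT CGAAGCCGAC"
print(translate(first_line))  # MTLELVETAAFIDGLWIEAD
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the GC content field.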