Gene Caul_3778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3778 
Symbol 
ID5901240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4094551 
End bp4096020 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content70% 
IMG OID641564301 
Productaldehyde dehydrogenase 
Protein accessionYP_001685403 
Protein GI167647740 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0616063 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCCCCA CGCAGGCCCG CCGGGCGACC CTGACGCGAC CGGAGCTGCT CCGCTCGCAA 
GTCTATTATG CGGGGGCGTG GCGCGGCGCC GGCTCCGGCG AGACCGTCCC GGTGATCGAT
CCGTTCTCGG GTGAAGCGCT TGGCGAGGTC GCGTCGCTGG GCGAGGGCGA GATCCACGCC
GCGATCGAGG CCGCCCAGGC GGCGTTCCCG CGCTGGTCGC GGACGCCCCA CCGCGAACGC
GGCGCCTTGC TGCGCCGCTG GCTCGAGCTG ATCGAGCGCG ACAAGGAAGA TCTGGCCCGG
CTGATCACCC TGGAGAACGG CAAGCCGCTG AAGGAAGCGC GCGCGGAAGT AGCCTATGGT
TCGGGCTTCA TCGAGGTCTA TGCCGAGGAG GCGGGTCGCA TCCTTGGCGA AATCCTTCCG
CCCAACATGC CCGGACGCCG CCTGCTGGTC GAACGCGAGC CGATCGGCGT CTGCGCGGCG
ATCACCCCCT GGAACTTCCC GATGGCCATG CTGACGCGCA AGATCGCGCC GGCGCTGGCG
GCGGGCTGCA CGATCGTCTG CAAGCCGGCC AGCGAGACGC CGCTGACCGC GCTGGCCCTG
GCCCTCCTCG CGCAAGAGGC CGGCATTCCG GCCGGCGTGC TGAGCGTCGT GGTCAGCGCG
CCGGCGCTGT TTGGCGACAT CGTCACGGCC TCCAGCGTGG TGCGCAAGAT CACCTTCACC
GGGTCCACGC CGGTCGGGGC GCGGCTGATG GCGGCGTCGG CCCCGACCAT CAAGCGGCTG
TCGCTGGAAC TGGGCGGCAA CGCCCCCCTG CTGGTCTTCG ACGACGCCGA TCTGGAGGTG
GCGGTCGAGA CCGCGATGGT GGCCAAGTTC CGCAACGGCG GGCAAAGCTG CATCGCGGCC
AACCGCCTGT ACGTCCAGCG CGGGATCTAC GAGGCGTTCC TGTCGGCGTT CCAGGCGCGA
GTCGCCGCGC TGCGGGTCGG CGACGGCCTT GATCCCGAGA CCGATATCGG GCCGCTGATC
AGCGCCCGCG CGGTGGAGAA GGTCGAACGC CACCTCGACG ACGCCCTGGC CGGCGGCGCG
CGCCTGATCA GCGGCGGCAA GAGCGACGGC TCGCTGCTGT CACCGGCGAC CTTGCTCGGC
GACGTGGCGC CCGACGCCCT TCTGACCCGG GAAGAGACCT TCGGGCCGAT GGCCGGCGTC
ATTCCGTTCG AGACCTACGA CCAGGCCGTC ACGATGGCCA ACGACACGCC GTTTGGCCTG
GCCGCCTATG TCTGCTCCAC CCGCCAGGAC ACCATCGCCC GCGCCGGTCG CGACCTGGAG
ACCGGGATGG TCGGCGTCAA TACCGGCCTG ATCTCGACGG CCGCCGCGCC GTTCGGCGGG
GTTAAGCTGT CCGGCGTCGG CCGCGAGGGC TCGCATCACG GCATCTCGGA ATACTTGAAC
TACAAGTACC TCTGCCAGGC AGGACTCTAG
 
Protein sequence
MTPTQARRAT LTRPELLRSQ VYYAGAWRGA GSGETVPVID PFSGEALGEV ASLGEGEIHA 
AIEAAQAAFP RWSRTPHRER GALLRRWLEL IERDKEDLAR LITLENGKPL KEARAEVAYG
SGFIEVYAEE AGRILGEILP PNMPGRRLLV EREPIGVCAA ITPWNFPMAM LTRKIAPALA
AGCTIVCKPA SETPLTALAL ALLAQEAGIP AGVLSVVVSA PALFGDIVTA SSVVRKITFT
GSTPVGARLM AASAPTIKRL SLELGGNAPL LVFDDADLEV AVETAMVAKF RNGGQSCIAA
NRLYVQRGIY EAFLSAFQAR VAALRVGDGL DPETDIGPLI SARAVEKVER HLDDALAGGA
RLISGGKSDG SLLSPATLLG DVAPDALLTR EETFGPMAGV IPFETYDQAV TMANDTPFGL
AAYVCSTRQD TIARAGRDLE TGMVGVNTGL ISTAAAPFGG VKLSGVGREG SHHGISEYLN
YKYLCQAGL