Gene Caul_0580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0580 
Symbol 
ID5898035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp632054 
End bp633496 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content63% 
IMG OID641561062 
Productsuccinic semialdehyde dehydrogenase 
Protein accessionYP_001682211 
Protein GI167644548 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID[TIGR01780] succinate-semialdehyde dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGCTAA GGGATCCGTC CCTCCTCAAA GATCAATGCT TCGTCGGCGG CGCGTGGATT 
GGTTTGCCCC AAGTTGATGT CATCGACCCG GCGAGCGGCG AGAAGATCGC GGCGGTGCCA
AATTGTGGGG CGAACGAGAC GCAGCAGGCC ATCCAGGCAG CCGATGCCGC GCTGCCCGGC
TGGCGCGGCC GCACCGCGGC GCAGCGTTCG ACGATCATGC GGCGCTGGTT TGAGGCCACG
ATCGAGGCGA CCGAAGATCT CGCGCTTATC CTGAGCAGCG AGCAGGGCAA ACCGATCGCC
GAGGCGCGCG CTGAAATCAT CTACGCCGCC AGCTTCATCG AATGGTTCGC AGAAGAGGCT
AAGCGAACCT ATGGCGAGGT CATCCCCAGT CCGCGCGCCG ACGCACGGAT CGTCGTGATC
CAGCAGCCGA TCGGCGTGAC GGCCGCGATC ACGCCGTGGA ATTTCCCGGC CGCCATGATC
ACGCGCAAGG CTGGTCCGGC GCTGGCGGCG GGTTGCACGA TGGTGTTGAA ACCGGCGATG
CAGACGCCGT TGACGGCCTT GGCGCTGGCT GCGCTCGCGC AACGCAGCGG CGTCCCTGAT
GGTGTCTTCA ATGTCGTGAC CGGGAGCGCA CGCGACATCG GCGGGGAACT GACCTCGAAC
CCCATCGTGC GCAAGATCAG CTTTACCGGA TCGACCGAGA TCGGCCGCCT GTTGATGCGA
CAGGGCGCTG CGACGGTGAA GAAGATGTCT CTGGAATTGG GAGGAAACGC GCCCTTTATC
GTGTTCGATG ACGCCGATGT CGAAGCTGCG GTCGAAGGCG CGATGTTGTC CAAATACCGC
AACAGCGGCC AAACCTGCGT ATGCGTCAAT CGCATATATG TTCAGCGCGG CGTCGCCGAA
GCGTTCGTCG AGAAGCTGGC CAAGGCGGCG GCGGATCTGC GCGTCGGACG CGGCACGGAC
GAGGGCGTGA CACAAGGGCC CTTGATTGAC GCCGCGGCGG TGGAGAAGGT CGAAGAGCAT
GTGGCCGATG CGCTGGCCAA GGGGGCAAGG CTCGTCCTGG GCGGAGCCCG TCATGCCTTG
GGGGGCACGT TCTTCGAACC GACAATTCTG ACGAACTGTT CGGCGGACAT GCTCGTCGCG
CATGAGGAGA CGTTCGGTCC TGTGGCGTCG GTCTTCGTAT TCGACGAGGA AGACGAGGCG
ATCGGCTTGG CGAACGCCAG CGAGTTTGGC TTGGCCGGAT ATTTCTACAG CCGTGACCTT
GGCCGGGTGT GGCGTGTGGC CGAAGCACTC GAATGCGGGA TGGTCGGCAT CAACACCGGC
CTGATTTCGA ATGAAGTCGC ACCCTTTGGC GGGATCAAGC AATCGGGCCT GGGACGGGAG
GGCTCGTCAC ACGGGATCAC CGACTATCTC GAACTGAAAT ATCTCTGCAT GGCCGGCCTC
TGA
 
Protein sequence
MSLRDPSLLK DQCFVGGAWI GLPQVDVIDP ASGEKIAAVP NCGANETQQA IQAADAALPG 
WRGRTAAQRS TIMRRWFEAT IEATEDLALI LSSEQGKPIA EARAEIIYAA SFIEWFAEEA
KRTYGEVIPS PRADARIVVI QQPIGVTAAI TPWNFPAAMI TRKAGPALAA GCTMVLKPAM
QTPLTALALA ALAQRSGVPD GVFNVVTGSA RDIGGELTSN PIVRKISFTG STEIGRLLMR
QGAATVKKMS LELGGNAPFI VFDDADVEAA VEGAMLSKYR NSGQTCVCVN RIYVQRGVAE
AFVEKLAKAA ADLRVGRGTD EGVTQGPLID AAAVEKVEEH VADALAKGAR LVLGGARHAL
GGTFFEPTIL TNCSADMLVA HEETFGPVAS VFVFDEEDEA IGLANASEFG LAGYFYSRDL
GRVWRVAEAL ECGMVGINTG LISNEVAPFG GIKQSGLGRE GSSHGITDYL ELKYLCMAGL