Gene Caul_1409 details

Gene Information

Locus tag: Caul_1409
Symbol:
ID: 5898864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 1498790
End bp: 1500286
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 66%
IMG OID: 641561896
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_001683037
Protein GI: 167645374
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
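
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_length_aa` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 1498790
END_BP = 1500286
GENE_LENGTH_BP = 1497
PROTEIN_LENGTH_AA = 498


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1497
print(coding_length_aa(GENE_LENGTH_BP))  # 498
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields.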


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.671216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGACCA TCAGCCATTT CGTGAACGGA CAAACCTTTG AAGGGGCGTC GGGTCGCTTT 
GGCGACGTGT TCAATCCCAA CACCGGCGAG GTCCAGGCCC GCGTCCAGTT GGCCACCGAC
GCCGAGCTCG ACGCCGCCGT ACAGGCCGCC GCCGCCGCCC AGATCGGCTG GGCCGCCACC
AACCCGCAGC GCCGCGCCCG GGTGATGTTC GAGTTCAAGC GCCTGATCGA GCGCGACATG
AACAGCCTAG CCGAGATCCT GTCGTCCGAG CACGGCAAGG TGGTCGCCGA CAGCAAGGGC
GACATCCAGC GCGGCCTGGA GGTGATCGAG TTCGCCTGCG GCATCCCCCA CATCCTGAAG
GGCGAATATA CCGAGGGCGC GGGCCCCGGC ATCGACGTCT ATTCAATGCG CCAGCCGCTG
GGCGTCTGCG CCGGCATCAC CCCGTTCAAC TTCCCGGCCA TGATCCCGAT GTGGATGTTC
GGCATCAGCA TCGCCGTGGG CAACACCTTC ATCCTCAAGC CGTCGGAGAA GGATCCGACG
GTGCCGGTCA AGCTGGCCGA GCTGATGATG GAAGCCGGGG CTCCGGCCGG CGTGCTGAAC
GTGGTGCACG GCGACAAGGT CTGCGTCGAC GCGATCCTGA CCCATCCGCT GATCCGCGCC
GTCAGCTTCG TCGGTTCGTC GGACATCGCC CACTACGTCT ACCAGACCGG CACGGCGCAC
GGTAAACGTG TCCAGGCCAT GGGCGGCGCC AAGAACCACG GCATTGTCCT GCCCGACGCC
GACCTCGACC AGGTGGTCAA GGACTTGTCG GGCGCGGCCT TTGGTTCGGC GGGCGAGCGC
TGCATGGCCC TGCCGGTGGT GGTTCCGGTC GGCCAGAAGA CCGCTGACGA ACTGCGCGAA
CGGATGGTCG CCGAGATCGA GACGCTGCGG GTCGGCGTCT CCAGCGACCC GGCCGCCCAC
TACGGCCCGG TGGTCAGCGC CCAGCACCGC GCCAAGATCG CGGACTACAT CCGTCTTGGC
GTTGAAGAGG GCGCGGACTT GGTGGTCGAT GGCCGCGACT TTTCCATGCA GGGCTTCGAG
AAGGGCTTCT TCATCGGCCC GTCGCTGTTC GACGGCGTCA AGAAGGGCAT GAAGACCTAT
CAGGAAGAGA TCTTCGGACC GGTGTTGCAG ATCGTCCGCG CCGAGACCTT CGAAGAAGCC
TTGGCCCTGC CGTCCGAGCA TCAGTACGGC AACGGCGTGG CGATCTTCAC CCGCAACGGC
CGGGCGGCGC GCGAGTTCGC CAGCCGCGTC AATGTCGGCA TGGTCGGCAT CAACGTGCCG
ATCCCGGTGC CGGTGGCCTA CCACACCTTC GGCGGCTGGA AGCGCAGCGC CTTTGGCGAC
ACCAACCAGC ACGGCGTCGA GGGCGTGAAA TTCTACACCA AGGTCAAGAC GATCACCGCG
CGGTGGCCCG AGGGCGACCA CGAGGGCGAC GCCTTCGTCA TTCCGACGAT GAAATAG
 
Protein sequence
MRTISHFVNG QTFEGASGRF GDVFNPNTGE VQARVQLATD AELDAAVQAA AAAQIGWAAT 
NPQRRARVMF EFKRLIERDM NSLAEILSSE HGKVVADSKG DIQRGLEVIE FACGIPHILK
GEYTEGAGPG IDVYSMRQPL GVCAGITPFN FPAMIPMWMF GISIAVGNTF ILKPSEKDPT
VPVKLAELMM EAGAPAGVLN VVHGDKVCVD AILTHPLIRA VSFVGSSDIA HYVYQTGTAH
GKRVQAMGGA KNHGIVLPDA DLDQVVKDLS GAAFGSAGER CMALPVVVPV GQKTADELRE
RMVAEIETLR VGVSSDPAAH YGPVVSAQHR AKIADYIRLG VEEGADLVVD GRDFSMQGFE
KGFFIGPSLF DGVKKGMKTY QEEIFGPVLQ IVRAETFEEA LALPSEHQYG NGVAIFTRNG
RAAREFASRV NVGMVGINVP IPVPVAYHTF GGWKRSAFGD TNQHGVEGVK FYTKVKTITA
RWPEGDHEGD AFVIPTMK
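
The gene and protein sequences can be compared by translating the nucleotide sequence codon by codon. The sketch below is a minimal, dependency-free illustration rather than the method used to produce this record. It builds the codon-to-amino-acid mapping shared by the standard code and translation table 11 (the two differ in permitted start codons, not in amino-acid assignments), and applies it to the first 30 and the last 57 nucleotides of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-nt blocks and the final line of the gene sequence.
first_30 = "ATGCGGACCA TCAGCCATTT CGTGAACGGA"
last_57 = "CGGTGGCCCG AGGGCGACCA CGAGGGCGAC GCCTTCGTCA TTCCGACGAT GAAATAG"

print(translate(first_30))  # MRTISHFVNG
print(translate(last_57))   # RWPEGDHEGDAFVIPTMK*
```

The two translated fragments match the first and last blocks of the protein sequence, with the trailing `*` corresponding to the TAG stop codon.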