Gene Acid345_1533 details


Gene Information

Locus tag: Acid345_1533
Symbol:
ID: 4073021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1870788
End bp: 1872260
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 59%
IMG OID: 637983542
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_590609
Protein GI: 94968561
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
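
The coordinate and length fields above are mutually consistent, which a short check can make explicit. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1870788
end_bp = 1872260

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1473 bp) and Protein Length (490 aa).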


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.84974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACCA CAGCACAAAC CGAGCTTCTC CAGGTCCAGA ATTTTGTCAA CGGCGAGTGG 
CGGACCAGCC ACTCTGGCGA TGTTCTCGAG ATTTATAACC CCGCCACGGC TGAGCCGTTG
GCGCACGTGC CCCTCTCCGG CGCCGACGAA GTGAATGAAG CGGTGCGCGC GGCCGCTGCC
GCGTGGCCCG CGTGGCGCGA GACGCCGCCC GGCGACCGCA TTCAATACAT CTTCAAGCTG
AAACAGCTAA TGGAGGAGCA CTTCGAAGAA ATCGCCCGAA CGGTGACGAT CGAGAACGGC
AAGACGCTCA CCGAAGCACG AGGAGAAGTG CGTCGCGGCA TTGAGAACGT TGAAGTCGCC
TGCGGAATTC CGCTGATGAT GCAGGGCTAC AACCTCGAAA ACATATCGCG TGGCATTGAC
GAAATCATGT ATCGCCATCC AATCGGAGTC GTAGCCGCGA TCACACCGTT CAACTTTCCG
GCAATGATTC CCTTCTGGTA CCTGCCCTAC GCAATCGCTA CTGGGAACTG TTTCATCCTG
AAGCCAAGTG AGCGCGTACC GTTCACAATG CAGAAGGTTT TCGAACTGAT TCATCAGATA
GGATTGCCGA AGGGCGTCAT CAACCTGCTG AACGGAGGTA AACCCGCGGT GGACGCTCTG
CTCGACCACC CTGAGGTGCG CGCCATCAGC TTCGTCGGGT CGACGCCCGT AGCCCGTTAC
ATCTACGAGC GCGCTGCAAA GAACGGCAAG CGCGTGCAAT GCCAGGGCGG CGCAAAGAAC
TACGCGGTCA TCCTGCCCGA TGCCGACATG AAAGTGGCGA CGAACATCGT GGGCGAGAGC
GCCTTCGGTT GTGCGGGACA ACGATGCCTG GCATTGAGTG TTGGAGTCAC TGTCGGTGAG
GCCCAGAAGG GTTTCCGCGA AGCCGTCTCT GAGTTCGCCG CGCACCTCAA GACCGGCAAC
GGACTCGAAG CAGGGACACA GATGGGGCCC GTGATCACCG CGCAGAGCAA ATCACGAATC
GAAGAAGTTA TTGACCATGC TGTGAAGCAA GGTGCAAAAG CCGTGACCGA TGGCCGCGGC
TACAGGGTTG CGAATCATGA GCGCGGCAAC TTCCTTGCGC CGACGATTCT CGATGAAGTG
CCCGCCGACA GCGATGTGCC ACAGACTGAA ATCTTCGGCC CCGTGTTGAG CCTGGTGCAC
GCCGACAGTC TCGAGCATGC GATTGAGCTG CTTTCCAAGA GCGCGTACGG TAATGCCGCA
TCTCTCTTCA CCACCAATGG AGCGCACGCG CGACGTTTCC GCCATGAAGC GCCAGCTGGA
AACATTGGCA TCAATATCGG TGTCCCTGCA CCTGTCGCCT ACTTCCCTTT CAGCGGCTGG
AAGGAGAGCT TCTTCGGCGA CCTCCACGGC CAAGGTCGTG ATGCGATCGA GTTTTACACC
GACAAGAAAG TCGTCATCGA GCGCTGGAGC TAA
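
The GC content field in this record can, in principle, be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the full sequence length; the record does not state how its 59% figure was derived or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (spaces and newlines, as used in the 10-base blocks above)
    is ignored, and the input is treated case-insensitively.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Small self-check on a toy sequence: 2 of 4 bases are G or C.
print(gc_content("AT GC"))  # 0.5
```

Pasting the full gene sequence into `gc_content` and multiplying by 100 gives a percentage that can be compared with the listed value.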
 
Protein sequence
MSTTAQTELL QVQNFVNGEW RTSHSGDVLE IYNPATAEPL AHVPLSGADE VNEAVRAAAA 
AWPAWRETPP GDRIQYIFKL KQLMEEHFEE IARTVTIENG KTLTEARGEV RRGIENVEVA
CGIPLMMQGY NLENISRGID EIMYRHPIGV VAAITPFNFP AMIPFWYLPY AIATGNCFIL
KPSERVPFTM QKVFELIHQI GLPKGVINLL NGGKPAVDAL LDHPEVRAIS FVGSTPVARY
IYERAAKNGK RVQCQGGAKN YAVILPDADM KVATNIVGES AFGCAGQRCL ALSVGVTVGE
AQKGFREAVS EFAAHLKTGN GLEAGTQMGP VITAQSKSRI EEVIDHAVKQ GAKAVTDGRG
YRVANHERGN FLAPTILDEV PADSDVPQTE IFGPVLSLVH ADSLEHAIEL LSKSAYGNAA
SLFTTNGAHA RRFRHEAPAG NIGINIGVPA PVAYFPFSGW KESFFGDLHG QGRDAIEFYT
DKKVVIERWS
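
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling.

```python
from itertools import product

# Compact codon table: bases in TCAG order, first base varying slowest.
# Amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks above can be pasted directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAGTACCA CAGCACAAAC CGAGCTTCTC CAGGTCCAGA ATTTTGTCAA CGGCGAGTGG"
print(translate(first_line))  # MSTTAQTELLQVQNFVNGEW
```

The output matches the first 20 residues of the protein sequence listed above.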