Gene Acel_1799 details

Gene Information

Locus tag: Acel_1799
Symbol:
ID: 4485698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2039068
End bp: 2040558
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 66%
IMG OID: 639730589
Product: methylmalonate-semialdehyde dehydrogenase (acylating)
Protein accession: YP_873557
Protein GI: 117929006
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
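
The coordinate and length fields above agree with one another, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


length = gene_length(2039068, 2040558)
print(length, protein_length(length))  # prints "1491 496"
```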


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.688459
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGA TCGAGCACTG GGTAGACGGT AAATCGACGA CGGGCAGCGG AACGCGGACC 
GGGCCGGTGT TCAACCCAGC GACCGGTGAA CAGACCGGTG TCGTGGTTTT CGCTGCCGCC
GAAGACGTCG ACGCCGCCGT CAGGTCGGCA ACAGCGGCAT TCGAGCAGTG GTCGCAAACC
TCCCTCGCCG CCCGCACAAA AATCCTCTTC GAATTCCGGC GTCTGGTGGC CGACCACATG
GATGAGCTTG CCCGCATTAT TTGCGAGGAG CACGGCAAAG TCCTGGCGGA CGCCCGCGGT
GAAGTCCAAC GGGGGCTGGA GGTCGTCGAA CTCGCCTGCG GAATCCCGAC CCTGCTCAAG
GGTGACTATT CCGATCAGGT CTCGACCGGT GTCGATGCCT TCTCGTTCCG CCAGCCGCTG
GGCGTGGTCG CCGGTATCAC GCCGTTCAAT TTTCCGGTGA TGGTGCCGAT GTGGATGCAC
CCGATCGCGA TCGCCTGCGG AAATGCCTTC ATTCTGAAGC CGAGTGAACG CGATCCCAGC
GCGTCGCAAA TGGTCGCCCG GCTCTGGCAG GAGGCCGGAC TGCCCGACGG GGTGTTCACC
GTCATCAACG GTGACCGGGA GGCGGTGGAC GCCCTGCTCG ACCATCCAGG GATCGCGGCG
ATCTCATTCG TTGGTTCGAC ACCGGTCGCC CGCTACGTGC ATGCCCGGGC GACCGCCGCG
GGCAAGCGGG TCCAGGCCCT CGGCGGGGCG AAGAATCACG CGGTGGTGTT GCCGGACGTC
GACCCCGGCT ACGCCGCCGA ACATGTGGCC GCTGCGGCGT TCGGCTCCGC CGGCGAACGG
TGCATGGCCA TCTCCGTCGC GGTGGCCGTC GGCGACGGAC AGGTCGTTGA CGCCATAACC
GAGGAGGCAC GGAAAATCCG GGTCGGACCG GGGTGGGAGC CGGAGAGCCA GATGGGTCCC
GTGATCACAG CTGCCGCCAA GGAGCGCATT ACCGGTCTGG TCAACCGCGG TGTGGAGCAG
GGCGCCCGAC TTCTCGTCGA CGGGCGGAGC CACGTGGTGC CCGGATACGA GAAGGGGTTC
TTCCTTGGCC CGACGGTGCT CGACGAGGTC ACCCCGGCGA TGGACGTGTA CCGCGAGGAA
ATCTTCGGCC CGGTGCTCTC GGTGGTCCGT GTCGGGACAA TCGATGAGGC GATCCGGCTG
GTGAACGCCA ATCCGTACGG CAACGGCGCC GCGATTTTCA CGTCCAGCGG AGCGGCGGCT
CGGCGGTTCC AGCGGGAGGT CACCGCCGGG ATGATCGGCA TCAACGTTCC CATTCCAACG
CCGATGGCCT ACTACTCATT CGGCGGCTGG AAAGACTCGC TCTTCGGCGA GCGCCACGTG
CACGGACCGG AGGGCGTCGC GTTCTACACG CGCCTGAAGG CGGTGACCAG CAGGTGGCCC
CAGGTCGAGG CGGCGCAGCA GGCGAGTTTC CACTTCCCGA CGGCGACGTG A
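
The GC content field in Gene Information is the percentage of G and C bases in a nucleotide sequence such as the one above. As a minimal sketch, the hypothetical helper below computes that percentage for any sequence pasted in the same space-separated layout. It is demonstrated only on the first ten bases of the gene, not on the full sequence.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace and letter case are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First block of the gene sequence: 3 G + 1 C out of 10 bases.
print(gc_content("ATGAAGCAGA"))  # prints "40.0"
```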
 
Protein sequence
MKQIEHWVDG KSTTGSGTRT GPVFNPATGE QTGVVVFAAA EDVDAAVRSA TAAFEQWSQT 
SLAARTKILF EFRRLVADHM DELARIICEE HGKVLADARG EVQRGLEVVE LACGIPTLLK
GDYSDQVSTG VDAFSFRQPL GVVAGITPFN FPVMVPMWMH PIAIACGNAF ILKPSERDPS
ASQMVARLWQ EAGLPDGVFT VINGDREAVD ALLDHPGIAA ISFVGSTPVA RYVHARATAA
GKRVQALGGA KNHAVVLPDV DPGYAAEHVA AAAFGSAGER CMAISVAVAV GDGQVVDAIT
EEARKIRVGP GWEPESQMGP VITAAAKERI TGLVNRGVEQ GARLLVDGRS HVVPGYEKGF
FLGPTVLDEV TPAMDVYREE IFGPVLSVVR VGTIDEAIRL VNANPYGNGA AIFTSSGAAA
RRFQREVTAG MIGINVPIPT PMAYYSFGGW KDSLFGERHV HGPEGVAFYT RLKAVTSRWP
QVEAAQQASF HFPTAT
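
The protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a small pure-Python translator, not the database's own tooling. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Because this gene begins with ATG, no special start-codon handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):  # a trailing partial codon is ignored
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAGCAGA TCGAGCACTG GGTAGACGGT"))  # prints "MKQIEHWVDG"
```

Pasting the full gene sequence into `translate` in place of the 30-base fragment should reproduce the protein sequence listed above, given that the final codon TGA is a stop codon.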