Gene Acel_1100 details


Gene Information

Locus tag: Acel_1100
Symbol:
ID: 4485763
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 1217860
End bp: 1219341
Gene Length: 1482 bp
Protein Length: 493 aa
Translation table: 11
GC content: 65%
IMG OID: 639729875
Product: betaine-aldehyde dehydrogenase
Protein accession: YP_872858
Protein GI: 117928307
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID:
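
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon; both assumptions are consistent with the values listed here but are not stated by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


length = gene_length(1217860, 1219341)
print(length, protein_length(length))  # 1482 493
```

Both results agree with the Gene Length and Protein Length fields.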


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.486917
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCGTA CGACGTCCGT CAGCGGAATT GAGGTTTCCG TTGACCACTG GGTGGGCGGC 
GAACGCCTCG CGTCGGATGC GACGTTCCCT GACATTTCAC CGCTTGATCA GCAGGTTCTC
GCCAATGTCG CCCGCGGCGG ACCGCGGGAA GTGAGCGCCG CGGTCGACGC CGCTTCGGCG
GCGTTTCCGC AATGGGCGGC GACACCGCGC ACCGAGCGGG CCGCCCTCCT GCACGCCGTA
GCGGAAGGCA TCGAGAAGCG TGTCGACGAT CTCGCACTCG TCGAAACGTT GGACAACGGG
GGGTTGCTCC GGTCCCATCG TCGGAGTGTG ATTCCCCGCG CCGCGTATAA CTTTCATTTC
TTTGCGGATT TTCTCTTGCA GCTCGGTCAT GAGGATTTCG AGACACGAGG ACATAGAAAC
CACATTTCGT GGCAACCGGC AGGAGTCACG GCGGTCATCA CGCCGTGGAA TGCACCGTTG
ATGCTCGCCA CCTGGCGGGT CGCACCCGCC CTAGCCGCCG GCAATACCGT CGTGCTCAAG
CCCCCGGAGT GGGCGCCGTT GACGGCATCA CTCCTCGCCG ACATTACCGC TGAAGCCGGG
CTGCCGCCAG GCGTCTTTAA CGTCGTGCAG GGTATTGGGG AGGAGGCCGG TGCCGCACTG
GTGCGCGATC CCCGGGTACG CCGTATCGCC TTCACCGGCT CGGTTGCGAC CGCGCGCGCC
ATCGGTCACG CGGCAGCGGA GAACGTCATC CCCGTCTCCT TCGAGCTGGG AGGCAAGAAT
CCGTTCATTG TCTTTCCCGA CGCCGACCTG GACCTCGCTG TACGACACGC GGTGGATCAG
TACGACAACG CCGGCCAGGT CTGTCTCGCC GGCACGCGAT TGTACGTCGC CGACGCCGTC
TACGACGAAT TCCTTGAGCG GTTTCTCCAG GCGGCTGCGG CGTGGCGGGT AGGGGACCCG
CGCAGCGAAG ACGTCGACAT GGGCCCGCAG ATTCATCCCG ACCATCTTGC GCGCATTGAC
GGATACGTCC GCCGCGCGAA AGCCGCCGGG GCGACGGTCC TACTCGGCGG CGGCCCGCAT
CCGGAGCTGG GCGGTCTGTA CTACCAACCC ACCTTATTGA CGAATGTTGC CGATGACAGT
GAGATCAACC GCGAAGAAGT CTTCGGTCCT GTCATTGTCC TGCATCGATT TACGGACGAA
GACGAAGTCA TCCGGCGTGC GAACGACAAT ATCTATGGGC TCGCGGCGAT GGTCTTTACC
GGCGACCGGT CACGGGCGGA GCGCGTCGCG GACCGGCTGG TCGCCGGCAC CGTCTGGGTG
AACTGCTTCT ACGTCCGCGA CTTGCGGGCG CCGTTCGGCG GCGCGCGGTT GTCCGGTATC
GGCCGGGAAG GCGGCACCTG GTCGTTCGAC TTCTACGCGG AGGTCAAGAA CACGGTGACC
GCCCCGAGCG GCTGGTTGAT AAAGGAGGCG AATGGTGGGT GA
 
Protein sequence
MPRTTSVSGI EVSVDHWVGG ERLASDATFP DISPLDQQVL ANVARGGPRE VSAAVDAASA 
AFPQWAATPR TERAALLHAV AEGIEKRVDD LALVETLDNG GLLRSHRRSV IPRAAYNFHF
FADFLLQLGH EDFETRGHRN HISWQPAGVT AVITPWNAPL MLATWRVAPA LAAGNTVVLK
PPEWAPLTAS LLADITAEAG LPPGVFNVVQ GIGEEAGAAL VRDPRVRRIA FTGSVATARA
IGHAAAENVI PVSFELGGKN PFIVFPDADL DLAVRHAVDQ YDNAGQVCLA GTRLYVADAV
YDEFLERFLQ AAAAWRVGDP RSEDVDMGPQ IHPDHLARID GYVRRAKAAG ATVLLGGGPH
PELGGLYYQP TLLTNVADDS EINREEVFGP VIVLHRFTDE DEVIRRANDN IYGLAAMVFT
GDRSRAERVA DRLVAGTVWV NCFYVRDLRA PFGGARLSGI GREGGTWSFD FYAEVKNTVT
APSGWLIKEA NGG
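
The gene and protein sequences above can be checked against each other, and the GC content recomputed, with a short script. This is a minimal sketch, not the method used by the database: the helper names are hypothetical, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as starts). Whitespace in the 10-base display blocks is stripped before processing.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGCCGCGTA CGACGTCCGT CAGCGGAATT"
print(translate(first_blocks))  # MPRTTSVSGI
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.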