Gene Acel_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1850 
Symbol 
ID4486648 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp2091700 
End bp2093061 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content66% 
IMG OID639730640 
Productbetaine-aldehyde dehydrogenase 
Protein accessionYP_873608 
Protein GI117929057 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.92194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGATA TATCAGTAAT CGACCCGGCG ACCGAACAAG TTATTGACAC GGTGCCGGCG 
GCCGACGAAG AGGCGGTGGA TGCCGCTGTC GCCCGCGCCT CTCGTGCGTT TGCCGAGTGG
CGTCGGGTGA CACCCGCCGA TCGCAGCCGG TTGCTGCGCC GGTTCGCGGA GGTCGTGGAC
GGGCATCTGG AGGAACTTGC CCGCCTGGAG GTGCGCAACG CCGGTCATAC GATCCGCAAC
GCGCTCGGCG AGGCCGCCAA TGTGCGCGAC GTCCTTGCCT ACTACGCCGG TGCACCGGAG
CGGCTCCTCG GTGAGCAGAT TCCCGTCGCA GGCGGCGTTG ACGTTACTTT TCATGAGCCG
CTCGGCGTGG TCGGAATCAT CGTTCCGTGG AACTTTCCGA TGCCGATCGC CGCATGGGGC
TTTGCCCCTG CGCTGGCCGC CGGCAATACC GTCGTGCTCA AACCCGCCGA ACTGACGCCG
CTGACCGCGC TGCGGCTCGG TGAACTCGCG CTCGAGGCGG GAATCCCCGA AGGCGTTTTC
ACCGTCCTGC CGGGCAAGGG TTCGGTGGCC GGCGAACGGT TGGTCTGCCA CCCGCTGGTC
AGGAAGATCT GCTTCACCGG TTCGACGGAG GTGGGCAAGC GCATCATGCG GCTGGCCGCG
GACGGCGTGA AGCGCATCAC CTTGGAGCTT GGCGGAAAGA GCGCAAACAT TGTCTTTGCC
GACGCGGATC TCGAGCGTGC GGCGGCGGCG GCGCCGTACG CGGTCTTTGA CAATGCCGGC
CAGGATTGCT GCGCCCGCAG TCGGATTCTC GTCCAGCGCC GTGTGTACGA CGAATTCCTC
GCCCTGTTCC AGAAAGCGGT GGCCGGCGTC GTGGTTGGGC CGCCCGGCGA CGAGCGGACC
GAGATGGGAC CACTCATTTC CGCGCAGCAG CGCGACCGCG TCGCACGCTT CGTCGTCGAG
GATCACGTGT TGTTCCGCGG CACGGCGCCT GCGGGAGCCG GATTCTGGTT TCCGCCGACG
GTGGTGGCAC CCGCCGGTAC CGACGATCCC GTCTGGCGGG AAGAAGTTTT CGGGCCCGTC
GTCGCCGTCC TGCCGTTCGA TGATGAGGAC GACGCGATCC GGATGGCGAA CGACACGGCG
TACGGGTTAT CCGGCTCGAT CTGGACCCGC GACGTCGGCC GGGCCTTCCG CGTAGCGCGC
GGCGTTGAGT CCGGAAATCT GTCGGTCAAT TCCAACACCT CGGTGCGGTA CAACACGCCG
TTCGGTGGTT TCAAGCAATC GGGACTCGGG CGGGAGCTCG GGCCCCATGC GTTGGAGAGT
TTCACGGAGA TCAAGAACGT TTTCATCGCA ACGGAGGAGT GA
 
Protein sequence
MTDISVIDPA TEQVIDTVPA ADEEAVDAAV ARASRAFAEW RRVTPADRSR LLRRFAEVVD 
GHLEELARLE VRNAGHTIRN ALGEAANVRD VLAYYAGAPE RLLGEQIPVA GGVDVTFHEP
LGVVGIIVPW NFPMPIAAWG FAPALAAGNT VVLKPAELTP LTALRLGELA LEAGIPEGVF
TVLPGKGSVA GERLVCHPLV RKICFTGSTE VGKRIMRLAA DGVKRITLEL GGKSANIVFA
DADLERAAAA APYAVFDNAG QDCCARSRIL VQRRVYDEFL ALFQKAVAGV VVGPPGDERT
EMGPLISAQQ RDRVARFVVE DHVLFRGTAP AGAGFWFPPT VVAPAGTDDP VWREEVFGPV
VAVLPFDDED DAIRMANDTA YGLSGSIWTR DVGRAFRVAR GVESGNLSVN SNTSVRYNTP
FGGFKQSGLG RELGPHALES FTEIKNVFIA TEE