Gene BCG9842_B1655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1655 
SymboldhaS 
ID7181556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3491462 
End bp3492946 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content41% 
IMG OID643551386 
Productaldehyde dehydrogenase (NAD) 
Protein accessionYP_002447056 
Protein GI218898645 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.0330388 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAC TAGCTGTAAA TCTTCATGAA AAGGTAGAAA ATTTTCTTCA AGGTACAAAA 
AAGTTATATG TGAATGGATC TTTCATTGAA AGCGCTTCCG GAAAAACATT TAAAACACCT
AACCCAGCAA CTGGTGAAAC ACTTGCCGTC GTTTCTGAAG CTGGTCGTGA AGATATTCAT
AAAGCTGTAG TCGCAGCTCG TATGGCTTTT GACGAAGGTC CTTGGTCTCG TATGAGCACT
GCTGAGCGAA GCCGTCTCAT GTACAAGTTA GCTGATTTAA TGGAAGAACA TAAAGAAGAG
CTTGCACAGC TCGAAACATT AGATAACGGA AAGCCAATCC GTGAAACAAT GGCAGCAGAC
ATACCACTTG CAATTGAGCA TATGCGCTAT TACGCTGGCT GGGCTACGAA AATCGTTGGT
CAAACAATTC CTGTTTCCGG TGATTACTTT AACTATACAC GCCATGAAGC TGTTGGTGTC
GTTGGTCAAA TTATCCCTTG GAACTTCCCG CTTCTTATGG CAATGTGGAA AATGGGAGCA
GCGCTTGCTA CAGGATGTAC AATCGTTTTA AAACCTGCAG AACAAACTCC ACTATCTGCT
CTATACTTAG CTGAATTAAT TGAAGAAGCT GGATTCCCGA AAGGTGTTAT TAATATCGTA
CCTGGATTCG GTGAATCAGC TGGACAAGCT CTCGTTAATC ATCCACTCGT TGATAAAATT
GCATTTACCG GTTCTACTCC TGTCGGTAAA CAAATTATGC GACAAGCATC CGAATCATTA
AAACGCGTTA CACTTGAGTT AGGCGGTAAA TCACCAAATA TCATCTTGCC AGATGCTGAT
TTATCTCGCG CGATTCCTGG TGCACTTTCT GGTGTTATGT TTAACCAAGG ACAAGTATGC
TCTGCTGGAT CACGCTTATT TGTTCCGAAG AAAATGTATG ATAATGTCAT GGCTGATCTC
GTCCTTTATT CTAAAAAATT AAATCAAGGC GCTGGTCTAA GTCCAGAAAC TACAATCGGT
CCTCTCGTTT CCGAAGAACA ACAAAAACGT GTAATGGGCT TCATTGAAAA AGGGATTGAA
GAAGGCGCTG AAGTACTTTG CGGAGGAAAT AATCCATTCG ATCAAGGCTA CTTCGTTTCT
CCTACAGTAT TCGCTGACGT AAATGACGAA ATGACGATCG CAAAAGAAGA AATTTTCGGT
CCAGTTATTT CTGCAATACC GTTTAACGAT ATTGATGAAG TAATTGAACG TGCGAATAAA
TCTCAATTTG GCTTAGCTGC TGGTGTATGG ACAGAAAATG TTAAAACTGC ACACTATGTT
GCAAGTAAAG TACGTGCAGG TACAGTATGG GTAAACTGTT ATAACGTCTT TGATGCAGCA
TCTCCATTTG GAGGATTTAA ACAATCTGGT CTCGGCCGTG AAATGGGATC TTACGCATTA
AATAACTATA CAGAAGTGAA GAGCGTTTGG CTTAACTTAA ATTAA
 
Protein sequence
MSQLAVNLHE KVENFLQGTK KLYVNGSFIE SASGKTFKTP NPATGETLAV VSEAGREDIH 
KAVVAARMAF DEGPWSRMST AERSRLMYKL ADLMEEHKEE LAQLETLDNG KPIRETMAAD
IPLAIEHMRY YAGWATKIVG QTIPVSGDYF NYTRHEAVGV VGQIIPWNFP LLMAMWKMGA
ALATGCTIVL KPAEQTPLSA LYLAELIEEA GFPKGVINIV PGFGESAGQA LVNHPLVDKI
AFTGSTPVGK QIMRQASESL KRVTLELGGK SPNIILPDAD LSRAIPGALS GVMFNQGQVC
SAGSRLFVPK KMYDNVMADL VLYSKKLNQG AGLSPETTIG PLVSEEQQKR VMGFIEKGIE
EGAEVLCGGN NPFDQGYFVS PTVFADVNDE MTIAKEEIFG PVISAIPFND IDEVIERANK
SQFGLAAGVW TENVKTAHYV ASKVRAGTVW VNCYNVFDAA SPFGGFKQSG LGREMGSYAL
NNYTEVKSVW LNLN