Gene BCG9842_B4008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4008 
Symbol 
ID7181502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp1237812 
End bp1239179 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content38% 
IMG OID643549057 
Productaldehyde dehydrogenase (NAD) family protein 
Protein accessionYP_002444727 
Protein GI218896316 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.148119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00000000000192548 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTATTT CCTCTATTGT AAGTAGGCAA AAGGAATATT TTTTAAAAGG GCATACGAGA 
AGCATCGAAA TGAGAAAGAA TAATTTGAAG AGGCTTTATG AAGGCATTCA GCGTTTTGAA
GAAGAAATAT TTCAGGCATT GAAATTAGAT TTAAATAAGT CAGTTCACGA GTCGTTTACA
ACGGAAGTTG GATATGTATT AAAAGAAATT TCTTTTCAAT TGAAACATAT GTCATCGTGG
AGTAAACCAA AGCGAGTTCG AACAGCACTG ACTCATTTTG GATCAAAAGG AAAAGTAGTG
CCAGAACCGT ATGGTGTTAC GCTTATTATT GCACCGTGGA ACTATCCGTT CCAATTAGCA
ATTGCACCAC TTGTAGGAGC ACTGGCAGCT GGAAATACAA TCGTTTTAAA GCCGTCAGAG
TTAACGCCAA GCGTTTCAAA AGTGCTTAAG AGAATGTTAG GTGAGTTATT CCCAGAAGAG
CTTGTAGCGG TAGTAGAAGG TGGCGTTGAA GAGAGTACAT CTTTGCTGAG GGAACCGATT
GATTATATTT TCTTTACTGG TAGTGTTGGC GTTGGAAAAG TTGTAATGGA AGCAGCAGCG
AAACAGTTGA CGCCGCTTAC GTTAGAACTT GGCGGGAAAA GTCCTTGTAT TGTACATAAA
GATGCAAAGA TAGAGATGAC AGCAAGAAGA ATTGTTTGGG GTAAGTTTTT AAATGCAGGG
CAGACGTGTG TAGCGCCTGA TTATATGTAC GTGCATTCTT CCGTGAAAGA AAAGCTAATT
GAGGCAATGC GACATGAAAT TACAGAGCAG TATAGTAAAG AACCTTTGCA AAATGAAAAT
TACGTGCGTA TTGTAAGTGA GCGTCATTTT GAACGATTAT GTCGATTTTT ACAAGATGGT
CAAGTCGTAA TTGGTGGAAA CTATAAGAAA GATACATTAC ATATTGAGCC GACAGTACTA
GCGGATACTA CATGGCAAGA TGCTGTTATG GAAGATGAAA TTTTTGGCCC GATTTTACCA
ATCATAGAGT ACGACAATAT AGAAGATGTA ATTGGCACAA TTCAGCAACA TCCGAAGCCG
TTAGCGTTAT ATGTATTTTC TGAAGATAAA GAAGTACAAA AGAAAGTGAC GAGTAATATT
TCATATGGTG GAGGCTGTAT TAATGATGTT GTCTATCATC TTGCCACGCC ATATTTACCT
TTTGGGGGTG TTGGAAGTAG TGGATTAGGG GGTTATCATG GGAAAGAAAG TTTTCGGACT
TTTTCACATT ATAAAAGCAT TTTAGCCCAA TCTACAGCAT TCGACATGAA AATTCGTTAC
TCTTCTACAA AAAGTGCTTT AAAATTCATA CGAAAGTTGT TAAAATGA
 
Protein sequence
MSISSIVSRQ KEYFLKGHTR SIEMRKNNLK RLYEGIQRFE EEIFQALKLD LNKSVHESFT 
TEVGYVLKEI SFQLKHMSSW SKPKRVRTAL THFGSKGKVV PEPYGVTLII APWNYPFQLA
IAPLVGALAA GNTIVLKPSE LTPSVSKVLK RMLGELFPEE LVAVVEGGVE ESTSLLREPI
DYIFFTGSVG VGKVVMEAAA KQLTPLTLEL GGKSPCIVHK DAKIEMTARR IVWGKFLNAG
QTCVAPDYMY VHSSVKEKLI EAMRHEITEQ YSKEPLQNEN YVRIVSERHF ERLCRFLQDG
QVVIGGNYKK DTLHIEPTVL ADTTWQDAVM EDEIFGPILP IIEYDNIEDV IGTIQQHPKP
LALYVFSEDK EVQKKVTSNI SYGGGCINDV VYHLATPYLP FGGVGSSGLG GYHGKESFRT
FSHYKSILAQ STAFDMKIRY SSTKSALKFI RKLLK