Gene BCG9842_B5531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5531 
Symbol 
ID7183229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp5170739 
End bp5171977 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content46% 
IMG OID643553194 
ProductNADH dehydrogenase subunit C 
Protein accessionYP_002448835 
Protein GI218900424 
COG category[C] Energy production and conversion 
COG ID[COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit 
TIGRFAM ID[TIGR01961] NADH (or F420H2) dehydrogenase, subunit C 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.000346603 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTAATC CAAACAAAGA CTTAGAGGAT CTGAAAAAAG AAGCAGCTAG GCGTGCAAAA 
GAAGAAGCGA GAAAACGCCT TGTAGCGAAA CAAGAGGCGG AAATAAGTGA GCTGGAGGCA
GAAAATCAAG AAAAAGAGAA AGCGCTACCA AAAAATAATG ATATTACTAT AGAAGAAGCA
AAACGACGTG CAGAAGCGGC GGTGTTAGCG AAGCAGAAAA GAGAAGGAAC AGAAGAGGTA
ACGGAAGAAG AAAAAGCGAA AGCAAAGGCA GCAGCAGCGG CAGCAAAAGC AAAAGCGGCG
GCGTTAGCGA AGCAGAAAAG AGAAGGAACA GAAGAAGTAA CGGAAGAAGA AAAAGCAAAG
GCAAAGGCAG CGGTAGCAGC AAAAGCAAAA GCGGCGGCGT TAGCGAAGCA GAAAAGAGAA
GGAACAGAAG AGGTAACGGA AGAAGAAAAA GCGAAAGCAA AGGCAAAGGC AGCGGCAGCA
GCAAAAGCAA AAGCGGCGGC GTTAGCGAAG CAGAAAAGAG AAGGAACAGA AGAAGTAACG
GAAGAAGAAA AAGCGAAAGC AAAGGCAAAG GCGGTGGCAG CAGCCAAGGC AAAAGCGGCA
GCATTAGCGA AGCAGAAAGC TTCGCAAGGT GATGGGGATT CGGGAGATGA AAAGGCAAAG
GCAATTGCAG CAGCAAAAGC GAAAGCAGCA GCGGCTGCAA GAGCGAAGAC AAAGGGAGCT
GAAGGTAAGA AAGAGGATGA GCCGAAGCGG GAAGAAACGT CCGTAAATCA GCCGTATTTA
AATCAGTATG TTGAGGCTAT TAGGGAGAAG GTAGGAGAGG GTGCATTAGT AGATTCCTAC
ATTAATAAAC TGTCAAAGGA TGTGCCGACT CTTGTGGTGG ATCCCGAAAA ATATTATGAA
GTGATGGAGT CACTGCGATT CCATGAAGGA CTTGCTTTTG ATTACATGTC AGAGCTACAT
GCGACGGATT TTGTGACACA TATGGAAGTA TATGTTCATT TGTTTTCATA TGGTAAGAAA
CAATCGGTAG CGGTGAAGGT AAAGCTAGAC CGGGAAGAAC CGCAAGTTGA ATCTGTGACA
GCGCTTTGGA AAGGGGCTGA CTGGCCGGAG CGAGAAGCAT ACGATTTGCT CGGCATTGTA
TTTAAAGGGC ATCCGAATTT AACGCGTATT TTAATGCCAG ATGATTGGGT AGGACATCCG
CTTAGAAAAG ACTATGAACC GTATGATGTG GAGGTGTAG
 
Protein sequence
MSNPNKDLED LKKEAARRAK EEARKRLVAK QEAEISELEA ENQEKEKALP KNNDITIEEA 
KRRAEAAVLA KQKREGTEEV TEEEKAKAKA AAAAAKAKAA ALAKQKREGT EEVTEEEKAK
AKAAVAAKAK AAALAKQKRE GTEEVTEEEK AKAKAKAAAA AKAKAAALAK QKREGTEEVT
EEEKAKAKAK AVAAAKAKAA ALAKQKASQG DGDSGDEKAK AIAAAKAKAA AAARAKTKGA
EGKKEDEPKR EETSVNQPYL NQYVEAIREK VGEGALVDSY INKLSKDVPT LVVDPEKYYE
VMESLRFHEG LAFDYMSELH ATDFVTHMEV YVHLFSYGKK QSVAVKVKLD REEPQVESVT
ALWKGADWPE REAYDLLGIV FKGHPNLTRI LMPDDWVGHP LRKDYEPYDV EV