Gene BCG9842_B4033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4033 
Symbol 
ID7186215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp1212077 
End bp1214026 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content39% 
IMG OID643549032 
Producthypothetical protein 
Protein accessionYP_002444703 
Protein GI218896292 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCAAA ATATATTGCT TATAGGGGAT GGCCTTCTTG CAGACTATGT ACATGATCAA 
TTATGCAAAC AATATTCCAT CATTCGTCAG CATACCCTTA CAGAAGAACT TCCTGAAAAT
ATTTATCTCG CTCTCGTATT ACATGATGGA TCTCCTTCTC CTATTCATCA TGATGCTGAG
CTAATTTTTC GCTCAAATCA TATTCCGTGG CTTCGTGGTT TCACTTCATT TGGCGAAGGT
ATTATCGGTC CTTACATTCA TCCTCTTACG CCCGGATGTT CCCATTGTTC TGATGGACGC
CGTTTTATCG CTGGCTTTGA TCAAAAAGAA ATGTGGGAAC TACAGCGGAA ATATGCCTTT
AAAGCCGAAA ATGGAACGAG GCGTGATGTA CGTGCCACCC AAAATGGCAT GTTACAAATG
TGCCATCTTA TTTGCGCAGA AACAGAAAAA ATATTAACTC ATAACCATTC TTCTTTAGAA
AATGAACTTA TTTTACTAAA CTTACAAACA TTACAATGTA CGCGGCATTC TTTTCTTCCA
GATCCTCTCT GTCCTGTATG TAGTAGTTTA CCTGATGATA CTGCGGATGC GGCGGAAGTT
TCTTTACAAC CGAGTTTAAA AGTAAGCACC GAAACGTATC GCTGCCGTTC CATTCATGAA
TTAAACACAT TTTTAACGAA AGACTATTTA GATTATCGGA TCGGTATGTT GAACGGCAAA
ATGCAGCATT CTTTATTGCC ATTCGCTGAC GTCATTATAA ACATGCCATT AATGTTTGGA
AATGAAGGTG TTGCAGGCCG AACTCATTCA TTTGCAGTGA GTGAAGCAAC TGCTATTTTA
GAAGGTTTAG AACGATATTG CGGTATGTCG CCTCGCGGGA AAAAGACAAA TGTGCATGGT
AGTTTTCATG ATTTAGAGGA ACATGCACTA AATCCCCTTA CACTCGGTGT GCATACAAAT
GAACACTACA ATCGTGATAA TTTTCCATTT AAGCCGTTTG ATCCTGATTA TGAGCAAAAC
TGGGTATGGG GATATTCTTT ATCACAAAAC AGACCGCTTT TAGTTCCTGA ATCAATTGCT
TATTATAGCC TTGGTCATCG AGATGCTTTC GTGTATGAAA CATCAAATGG ATGTGCAATT
GGCGGTAGTT TAGAAGAAGC GATTTTTCAC GGCATTTTAG AAATTGTAGA GCGTGATGCC
TTCTTACTCA CTTGGTATGC CGAATTACCT CTTCCCCGCC TTGATCTTAG TTCAGCAAAT
GATACAGAAT TACAATTAAT GATCCAGCGG TTACGTACGA TTACTGGATA TGAATTACAC
GCTTTTAACG CGACGATGGA ACACGGCATC CCGAGCTTAT GGGTAATTGC AAAGAATACA
CGTGAAAATG GAATGAACGT TGTTTGTGCG GGAGGCGCTC ATTTAGATCC TATTCGAGCT
TTAAAAAGTG CCATTCAAGA AATAGCAGGC ATGTTACTTA TAACAGACGA TGAACTCGAG
CACAAAAGAG AATACTATGA AAAATGCTTA CAAGATCCTT ATTTCGTTAA TAAAATGGAA
GACCACAGTA TGCTGTACGG ATTGAAAGAA ACGGAAGAAC GTCTTCACTT CCTTTTACGA
GAAGATGCAC CAGTGCAAAC GTTCCAAGAA ATGAATGTAT CACAGTCATT TGATATGGAT
TTAACATCCG ATCTCCACCA ACTTTTAAAT CGTTTGCATC AATCTAATCT TGAAATAATC
GTTGTAGATC AAACCGTTCC CCTTATAGAA AAGAACGGAC TACACTGTGT AAAAGTCATT
ATTCCAGGCA TGTTACCGAT GACATTCGGC CATCACCTTA CTCGTGTTAC AGGATTAGAT
AGAGTCTATA CCGTACCGAT GACACTTGGA TATAGCACTG AACCGTTAAC AAATGAACAA
TTAAATCCAC ATCCGCACCC GTTTCCATAG
 
Protein sequence
MTQNILLIGD GLLADYVHDQ LCKQYSIIRQ HTLTEELPEN IYLALVLHDG SPSPIHHDAE 
LIFRSNHIPW LRGFTSFGEG IIGPYIHPLT PGCSHCSDGR RFIAGFDQKE MWELQRKYAF
KAENGTRRDV RATQNGMLQM CHLICAETEK ILTHNHSSLE NELILLNLQT LQCTRHSFLP
DPLCPVCSSL PDDTADAAEV SLQPSLKVST ETYRCRSIHE LNTFLTKDYL DYRIGMLNGK
MQHSLLPFAD VIINMPLMFG NEGVAGRTHS FAVSEATAIL EGLERYCGMS PRGKKTNVHG
SFHDLEEHAL NPLTLGVHTN EHYNRDNFPF KPFDPDYEQN WVWGYSLSQN RPLLVPESIA
YYSLGHRDAF VYETSNGCAI GGSLEEAIFH GILEIVERDA FLLTWYAELP LPRLDLSSAN
DTELQLMIQR LRTITGYELH AFNATMEHGI PSLWVIAKNT RENGMNVVCA GGAHLDPIRA
LKSAIQEIAG MLLITDDELE HKREYYEKCL QDPYFVNKME DHSMLYGLKE TEERLHFLLR
EDAPVQTFQE MNVSQSFDMD LTSDLHQLLN RLHQSNLEII VVDQTVPLIE KNGLHCVKVI
IPGMLPMTFG HHLTRVTGLD RVYTVPMTLG YSTEPLTNEQ LNPHPHPFP