Gene BCG9842_B1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B1389 
Symbol 
ID7181675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp3755226 
End bp3757643 
Gene Length2418 bp 
Protein Length805 aa 
Translation table11 
GC content38% 
IMG OID643551651 
Productphage protein 
Protein accessionYP_002447321 
Protein GI218898910 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00005153 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGATA ATCTCTTAGA CGGGCTGAAA GAAGGTAGGG TCCAATTAAC TGAATTCGCA 
CAAGGAGCTG ATAAGGCTTT AAAAGAAGCG CTTGACGGTT CTGGTATTGC AACTGGACAA
ATAGAAAAAT GGGGAGCATC TGTCGCTAAA GGCGGAAGAG ATGGCGCAGC AGCGATGGTA
GAAGTAGCTA AAGCTATTGA CGGAATAGAA GACCCAGTTA AGAAAAATCA GGTTGGGGTT
AAAGTTCTAG CCACTATGTT TGAAGATCAA GGTCAAAATT TAACAAACAC TTTAATTGAA
GCTTCTAAGA AAACAAAGGA TCTTCAACAA AACCAAGACA ACTTAAATGA ATCTGTTAAA
AAATTAGATG CAAATCCAGC TGTAAAGTTC CAAAAAGCGA TGGGAGATTT ACAAATGGCT
CTTGAACCTA TACTAGGAGT AATTGCTGAT GTTGTTGCTA GTATTGCTGA TTGGATTTCT
AATAATCCAG AATTAGCAGC GACATTAGCA GCAGTTGCAA CGGCTATTGG AGTAATTTCA
GGGGCACTTA TGGCTATTGC ACCAATTGTT GTATCGGTCA TGGGGGTATT TGAAATTGGG
GCCGCCGCGG CACTAGGTAT AGTTGCTATT GTTCCTATTA TCATAGCCGC TATAGTTGCT
CTTGGAGTGG CTATTTATAA AAACTGGGAT GATATTAAAA ATTGGACAAT AGAAGCATGG
GATTCTATTA AAGAGTACTT AGTAGAGCTT TGGGAGGGGA TATCCCAATC CTGTAGTGAA
GCATGGTCTT CATTTTTAGA AGCAATGCAT GAATTTTTTG ATCCGATAGG TCAATTTTTT
AGTGATTTAT GGGAGGGTGT AAAGCAGGCG TGTAGCGATG CATGGAATTC TACTGTTGAA
TTCTTTTCTG AAGCATGGTC TTCTTTCGTA GAAATGATGC ATAGTTTCTT TGATCCGATA
GGTGAATTCT TTAGTAGTTT ATGGTCTGGC ATTGTTGAAA CTGCTTCCTC CTGGTGGTCC
TCTTTAGTTG AAACAGCATG TGAATTGTGG GGAACATTAA CGCAAGCATG GCAAGAAACA
TGGGATACAA TTCTTACTGT TTTAGATCCA ATTATTTCGG CAGTTTCTAC CGTTTTAGAA
GCTGGTTGGT TGTTAATACA GGCAGGTGCA CAAATTGCAT GGGCGGCAAT CTGTCAATAT
ATTATTCAAC CGATTCAGGA AGCCTACGAC TGGGTAAATA CACAAATCGG TGAAATGGTC
ACTTGGCTTG GTACACAATG GGAAATTGCA AAAGCTATGG CACAAATTGC TTGGGGACTA
TTTAAGCAAT ATATTATTCA ACCTGTTCTA GACACTTGGA ACTTAGTAAA AGAAAAGTTC
AGTGATTTAG TTTCTTGGCT AAATTCACAA TGGGAGACAG TTAAATCATA TACATCAGCA
GCATGGGGGT TATTTAAACA ATATATTATA AAACCTGTAC AAGATACTTG GAATTTAGTA
AAAGAAAAGT TTAGTGATTT ATCCAATTGG ATGTTAGGAA TTTGGGCGAA AATAAAAGGC
TATACACTTG AAGCGTGGAA GATGGTTTAC ACATACATCG TTCAACCAGT TATTTCAGCT
TATAATTCTG CAAAAGAGAA ATTCAATGAT ATGTACAATA TAGCACGGGA AAAATTTGAT
TCTGTTAAGA ATGCAGCTCA AGAAAAATTT GAAGCGGCAA AACATTTCAT TATAGATCCA
ATTAAAGATG CAGTTGACAG TATAGAAAAA TTCATTGGAA AGATTAAAGG ATTCTTTAGT
GACTTGAAGT TGAAAATTCC AAAACCAGAA ATGCCACCTC TTCCACACTT CAGCTTACAA
ACAAGCACGA AAAATGTTTT AGGTAAAGAT ATTACATTTC CGTCAGGAAT TAATATTGAT
TGGCGTGCAA AAGGCGGTAT CTTTACTAAA CCAACTATCT TTGGAATGAA TGGCGGAAAC
TTGCAAGGTG CAGGAGAAGC GGGGCGAGAA GCAGTGCTTC CTCTGAATAA AAAGACACTT
GGAGATATTG GTGCAGGAAT TGTGGCAGCC ATGCCACGAC AACAATTTGC AACGCCGAGA
GAAATAAATC AACTAATGGG TGACATGAGC CGTATGATGG CTAGTTCTGT GAGTCAATTA
TCAGGATTAA AGAGTGTCAT GAGTGGTGTG TATGGAAGTA TGTCAAATAG TAGACAAGCT
ATGGCAAGCA GCGTATCAAA TCAAGTGATT AATTACGGAT CTAGTTCATC TTCTAGTGGT
GAAGTTATTC CAATGCTTGG GGGAGATTTA GTTATTGAAG TGCCTGTTAA TTTAGAAGGA
AGAGACGTGG CACGCGGTAC TTATCGCTAT ACAACCGAGT ATCAAGAAAG AGAAGCAAAA
AGAAACTCAG CCTTTTAG
 
Protein sequence
MIDNLLDGLK EGRVQLTEFA QGADKALKEA LDGSGIATGQ IEKWGASVAK GGRDGAAAMV 
EVAKAIDGIE DPVKKNQVGV KVLATMFEDQ GQNLTNTLIE ASKKTKDLQQ NQDNLNESVK
KLDANPAVKF QKAMGDLQMA LEPILGVIAD VVASIADWIS NNPELAATLA AVATAIGVIS
GALMAIAPIV VSVMGVFEIG AAAALGIVAI VPIIIAAIVA LGVAIYKNWD DIKNWTIEAW
DSIKEYLVEL WEGISQSCSE AWSSFLEAMH EFFDPIGQFF SDLWEGVKQA CSDAWNSTVE
FFSEAWSSFV EMMHSFFDPI GEFFSSLWSG IVETASSWWS SLVETACELW GTLTQAWQET
WDTILTVLDP IISAVSTVLE AGWLLIQAGA QIAWAAICQY IIQPIQEAYD WVNTQIGEMV
TWLGTQWEIA KAMAQIAWGL FKQYIIQPVL DTWNLVKEKF SDLVSWLNSQ WETVKSYTSA
AWGLFKQYII KPVQDTWNLV KEKFSDLSNW MLGIWAKIKG YTLEAWKMVY TYIVQPVISA
YNSAKEKFND MYNIAREKFD SVKNAAQEKF EAAKHFIIDP IKDAVDSIEK FIGKIKGFFS
DLKLKIPKPE MPPLPHFSLQ TSTKNVLGKD ITFPSGINID WRAKGGIFTK PTIFGMNGGN
LQGAGEAGRE AVLPLNKKTL GDIGAGIVAA MPRQQFATPR EINQLMGDMS RMMASSVSQL
SGLKSVMSGV YGSMSNSRQA MASSVSNQVI NYGSSSSSSG EVIPMLGGDL VIEVPVNLEG
RDVARGTYRY TTEYQEREAK RNSAF