Gene BCG9842_B4386 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B4386 
Symbol 
ID7183188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp883700 
End bp885289 
Gene Length1590 bp 
Protein Length529 aa 
Translation table11 
GC content36% 
IMG OID643548680 
Productoligopeptide ABC transporter, oligopeptide-binding protein 
Protein accessionYP_002444351 
Protein GI218895940 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.775513 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.14637e-22 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGA AGTTATTTGC ATTACCATTT GTTCTAATAC TACTTATTGC ATTAGCGGCT 
TGTTCTGGGG AAAAAGATTC TTCGAAGCAA GCGGGCACTA GTAAATCAGG GACTCCGAAA
GATGGAGGAA TGTTAACGAT TGGTGTTAGC GATAACCCAG ATACGATGAA TCCGCTTTAT
GCGAATGATC GTGTGTCATT AACTGTGCAG CAAGCTTTAT ATGCGCCGCT ATATCATATG
GAAGACGGTA AGAAAAAGTT TGTTCTTGCT GAGAGCTTTA CGCCTTCAGA AGATCAATTA
ACGTGGACAT TGAAATTAAA AGATAATTTG AAATGGCATG ATGGTAAGAA AATTACATCA
GACGATATAG CATTTACATT CCAATCTATT TTGGATGAGA AGCAAAATAG CTCAAGTCGT
GAAAACTTTA TTTTTAAAGG AAAGCCGCTT GAGGTGAAAA AGGTAGATGA GTTAACAACT
CAATTCGTTT TACCACAAGT AAGCGCATCT TTTGAAGGTG TGATGAATGA TTTCTTCCCA
ATTCCGAAAC ATGTATTTGA AGGGGAAGCA GATTTAGCGA AGAGTAAGAA AAACTTACAG
CCTGTAGGAT CAGGGCCGTT TAAGTTTAAA GAGTATAAAT CAGATGAGTA CGTTGCATTA
GATCGATTTG ATGATTATGT AGGTGGGAAA GCTAAATTAG ACTCTATCGT ATACCGCGTT
GTAAAAGACC GTAATACAGC AAATGTTTCA CTGCAAAATG GTCAAATCAA CATGAAGATG
ATTGAGCCGC AAGACTTTAA GAAACTAGAT AGCACTGGGA ACTTCTCAAT GGTGACATTC
CCTGAAGGTA GATTATTCTA CTTATCTTAT AACATGAATA CTGATCTTAT GAAGAAAAAA
GAAGTGCGCC AAGCAATTGC ACATGCGTTA GATAAGAAAG AAATGATTAA CTCAGCATTC
GTTTCGGGTG AATTTGCAGA ACCAGCAAAT TCAATCTTAA CGCCAGACGC TATGTATTAT
GCGAAAGATA TTAAAGACTA TAAGTATGAT AAAAAAGAAG CAAAAGATTT ATTAGCAAAA
GCTGGCGTGA AAGATAAGGA AAAAGTACGT GTGATGTATG TAACGAATAA TAAAATTATG
GAAAGCTTAG CGTTATATAC ACAACAAAAA TTAAAAGAAG TTGGCTTAGA AGTTGAACTA
AATGCATTAG ATGCTAGTGC GGCAAGTGAA AAAGGCTTAG ATAAAGAGAA TAAAGAATAT
GACATTACAT TTGGTGGTTA CATAATGGGA CCTGAGCCAG ATTCATATAA GAGCTTATTC
TTAAGCAATG CTGAATACAA TTATGCACGA TATAAAAACG CTGATTTCGA TAAGTTATGG
GAAGAAGCTG CAGTTGAGAC AGATAAAACA AAACGCGCAG AGCTATACCA TAAAATTCAA
GAGACAGCTA GAGAAGACGT ACCTTATCTA CCAATCGCGT ATCCGAAAGC AGTTATTGCA
GTAGATAAAA AGTTTGATGG ATTAAAAGAA GCGAAAGCAA TTCCTGTCAC AATGTTTGAA
GATCTATCTA AGATTTATGA AGTGAAATAA
 
Protein sequence
MKQKLFALPF VLILLIALAA CSGEKDSSKQ AGTSKSGTPK DGGMLTIGVS DNPDTMNPLY 
ANDRVSLTVQ QALYAPLYHM EDGKKKFVLA ESFTPSEDQL TWTLKLKDNL KWHDGKKITS
DDIAFTFQSI LDEKQNSSSR ENFIFKGKPL EVKKVDELTT QFVLPQVSAS FEGVMNDFFP
IPKHVFEGEA DLAKSKKNLQ PVGSGPFKFK EYKSDEYVAL DRFDDYVGGK AKLDSIVYRV
VKDRNTANVS LQNGQINMKM IEPQDFKKLD STGNFSMVTF PEGRLFYLSY NMNTDLMKKK
EVRQAIAHAL DKKEMINSAF VSGEFAEPAN SILTPDAMYY AKDIKDYKYD KKEAKDLLAK
AGVKDKEKVR VMYVTNNKIM ESLALYTQQK LKEVGLEVEL NALDASAASE KGLDKENKEY
DITFGGYIMG PEPDSYKSLF LSNAEYNYAR YKNADFDKLW EEAAVETDKT KRAELYHKIQ
ETAREDVPYL PIAYPKAVIA VDKKFDGLKE AKAIPVTMFE DLSKIYEVK