Gene pE33L466_0350 details

Gene Information

Locus tag: pE33L466_0350
Symbol: bgaC
ID: 3399919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_007103
Strand:
Start bp: 346449
End bp: 348245
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 34%
IMG OID: 637660167
Product: beta-galactosidase
Protein accession: YP_245831
Protein GI: 67078211
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
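
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 346449
END_BP = 348245


def gene_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by a CDS, excluding the terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints "1797 598"
```

Both results agree with the Gene Length and Protein Length fields of this record.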


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000121293
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCAT TTGAAATTGG GAAAGATTTT ATGCTAGATG GTGAGCCTAT AAAGATTATA 
TCAGGTGCTC TTCACTATTT CAGGATTGTT CCAGAATACT GGGATCATAG TCTTTATAAC
TTGAAGGCAC TAGGATGTAA TACAGTAGAG ACGTATGTTC CGTGGAATAT GCATGAACCA
AAAGAAGGGA TATTTAATTT TGAAGGTATA GCAGATCTTG TGAAATATGT ACAATTAGCT
CAAAAATATG GATTAATGGT TATTTTACGT CCTACTCCTT ATATTTGTGC AGAATGGGAG
TTCGGTGGAT TACCTGCTTG GTTACTAAAG TATAAAGATA TAAGGGTTCG AAGTAATACG
AACTTGTTCC TAAATAAAGT GGAAAACTTT TATAAAGTTC TCCTACCAAT GGTAACCCCA
TTACAAGTAG AGAATGGTGG ACCAATTATT ATGATGCAAG TAGAAAATGA ATATGGTTCA
TTTGGAAATG ACAAAGAGTA TGTAAGAAAT ATAAAAAAAT TAATGAGAGA TTTAGGCGTT
ACGGTTCCAC TTTTTACTTC AGATGGTGCA TGGCAAGAGG CTTTAGAATC AGGGAGTCTG
ATTGATGACG ATGTTTTAGT AACGGGTAAT TTTGGCTCAC GCTCTAATGA AAATTTAAAT
GAGCTTGAAA GTTTCATTAA AGAAAATAAA AAAGAATGGC CATTAATGTG TATGGAATTT
TGGGATGGCT GGTTTAATCG ATGGGGTATG GAAATTATTC GGCGTGATGG CAGTGAATTA
GCAGAAGAAG TCAAGGAACT ATTGAAGAGA GCTAGTATTA ATTTTTACAT GTTTCAAGGT
GGTACTAATT TTGGATTTAT GAATGGTTGT TCATCGAGGG AAAATGTGGA TTTACCACAA
ATTACTTCAT ATGATTATGA TGCATTACTG ACAGAATGGG GAGAGCCAAC GTCAAAATAT
TATGCTGTTC AGAGAGCTAT TAAAGAAGTT TGTTCAGATG TAGAGCAATT TGAACCGCGA
ATTTTACCAC GTGCAAATTA TGGGGAAATT AAATTGAATC GTAAAGTATC ACTATTTTCA
ACATTAGAGA AAATAGCGAA GAAGAGACAC AATTCTTATA CATTAACAAT GGAAGATATG
GATCAGCAGT ATGGTTATAT TTTATATCGA ACATTTTTAA AAGGACCAAA AAATATAGAA
AAATGTAAGG TAGTAGATGC TAGAGATCGT GTACATTTGT TCCTTAATGA ACAATTAGTA
GACACCCAAT ATAGGGATGA GATTGGTAGA GAGGTATCGT TAGATTTGAC TAAGGAAGAA
AATACGTTGG ATATTTTAGT AGAAAATATG GGACGTGTTA ATTATGGGGC AAGATTGTTA
TCTCCAACGC AAAGAAAAGG TATCTCCTCA GGTGTAATGA TTGATATACA TTTACAATCA
AACTGGGAGC ATTATGCACT TGAATTTGAT AATCTTGATG AGATTGATTT CAACGGTCAA
TGGGAACCTA ATACACCTAG CTTTTATGAA TACACATTTA ATGTACAAGA ATTAAATGAT
ACATTTTTAG ACTGTAGTAA GTTGGGAAAA GGATTTGTTG TTTTGAATGG GTTTAATTTA
GGAAAGTATT GGGATGTAGG TCCAACGGGT TATTTATATA TCCCGGCTCC TTTATTAATA
AAGGGAGAGA ATAATTTGAT TGTCTTCGAA ACGGAAGGGA ATTATGAGGA AGAACTTTAT
TTAAGGGAAA ATCCAATTTA TTTAGACGTG AACTACTCGC CACTTAGCAA AGCTTGA
 
Protein sequence
MKSFEIGKDF MLDGEPIKII SGALHYFRIV PEYWDHSLYN LKALGCNTVE TYVPWNMHEP 
KEGIFNFEGI ADLVKYVQLA QKYGLMVILR PTPYICAEWE FGGLPAWLLK YKDIRVRSNT
NLFLNKVENF YKVLLPMVTP LQVENGGPII MMQVENEYGS FGNDKEYVRN IKKLMRDLGV
TVPLFTSDGA WQEALESGSL IDDDVLVTGN FGSRSNENLN ELESFIKENK KEWPLMCMEF
WDGWFNRWGM EIIRRDGSEL AEEVKELLKR ASINFYMFQG GTNFGFMNGC SSRENVDLPQ
ITSYDYDALL TEWGEPTSKY YAVQRAIKEV CSDVEQFEPR ILPRANYGEI KLNRKVSLFS
TLEKIAKKRH NSYTLTMEDM DQQYGYILYR TFLKGPKNIE KCKVVDARDR VHLFLNEQLV
DTQYRDEIGR EVSLDLTKEE NTLDILVENM GRVNYGARLL SPTQRKGISS GVMIDIHLQS
NWEHYALEFD NLDEIDFNGQ WEPNTPSFYE YTFNVQELND TFLDCSKLGK GFVVLNGFNL
GKYWDVGPTG YLYIPAPLLI KGENNLIVFE TEGNYEEELY LRENPIYLDV NYSPLSKA
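
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA codon by codon. The following is a minimal sketch, not the method used by the source database. It relies on the fact that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code and differs only in the permitted alternative start codons. Since this gene begins with ATG, the sketch does not handle alternative starts. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
# Codon table in the usual TCAG ordering (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 60 bases of the gene sequence listed above.
first_line = (
    "ATGAAATCAT TTGAAATTGG GAAAGATTTT ATGCTAGATG GTGAGCCTAT AAAGATTATA"
)
print(translate(first_line))  # prints "MKSFEIGKDFMLDGEPIKII"
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.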