Gene BCZK0411 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK0411 
SymbolnagE 
ID3024782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp475506 
End bp476999 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content38% 
IMG OID637544611 
ProductPTS system, N-acetylglucosamine-specific EIIBC component 
Protein accessionYP_082020 
Protein GI52144809 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000040679 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCAGT TTCTACAACG TATTGGTAAA GCGTTAATGC TTCCAATCGC CGTACTACCA 
GCAGCAGGAT TATTGCTTCG TTTAGGACAA GAAGACGTAT TTAACATTCC TGTTATGGCA
CAGGCCGGTG CAGCAATTTT TGATAATTTA GCACTTATTT TTGCAATTGG TGTTGCAATC
GGTTTGTCTG TTGACGGTAG TGGAGCAGCT GGACTTGCCG GAGCAATCGG ATATCTTGTT
TTACAAAATA CAACGAATGC TCTAAGTAAG ACGTATTCAG CAGCAGAGTT AAATGATAAA
TTAAAAAGTG TTCAAGATTT AGTCGGTTCA GTAGATCCAA CTAAATTAGC AGATACAATG
ACAAAGGTTT CAAAAGCAGC GGCGTTAACG CCAAAAATAA ATATGGCCAT ACTCGGTGGT
ATTATTGCAG GGGTTGTTGC GGGATTACTA TACAACAAAT TCCATAAGAT TAAACTACCA
GAATGGTTAG GATTCTTTGC AGGAAAACGC TTCGTACCAA TCATTACTTC AATCGTAATG
TTACTTTTAG GATTGGTATT CGGTCAAATT TGGCCAACGA TTCAAAGTGG TATTGATGCA
GTGGCACATG GTATCGTGAA CTTAGGTTCA ATTGGTGCTG GTTTATTTGG ATTATTAAAC
CGTTTATTAA TTCCAATTGG TTTACACCAC GTAATGAACA CATACTTCTG GTTCGTACTT
GGTGACTTTA CAAATGCAGC TGGCGATATT GTTCATGGTG ATATCGCACG TTTCTTTGCA
AAAGATCCAT CAGCAGGTAT GTTTATGACT GGTTTCTTCC CAGTTATGAT GTTCGGTTTA
CCAGCAGCAT GTTTCGCAAT GATTGCAGCT GCTAAACCAG AAAAACGTAA AATGGTTACA
GGTATGTTAG GTGGTCTAGC ATTAACTTCA TTCTTAACTG GTATTACAGA GCCAATTGAA
TTCTCATTCA TGTTCTTATC ACCAGTACTA TATGGAATTC ATGCTGTATT AACAGGTCTG
TCTCTATTCA TTACAACAAC ACTTGGCATT CATGATGGTT TCTCATTTAG TGCCGGGGCA
ATCGATTACG TCTTAAACTT CGGTATTGCA ACAAAACCAT TGTTACTAGC AGGAATCGGT
TTAATTTACG CAGCAATTTA CTTTGTAGTA TTCTACTTCT TAATTAAGAA GTTTGACCTA
AAAACTCCTG GTCGTGAAGA TGAAGAGGAA ATGGCTGAAG GCGAAGAAGC TCCAGTTGCA
GGTTCAATTG GTGAAACTTA CGTAGCAGCT TTAGGTGGAA AAGAAAACTT AACAGTTATT
GATAACTGTG CAACACGTCT ACGCTTACAA GTGAAAGATG CTGGTCAAGT AAACGAAGCA
GCATTAAAAC GTGCTGGTGC AAAAGGTGTT ATGAAATTAA GTAACACGAG TGTCCAAGTT
ATCGTAGGTA CAAATGTTGA ATCTGTTGCC GATGATATGA AAAAACACGT ATAA
 
Protein sequence
MLQFLQRIGK ALMLPIAVLP AAGLLLRLGQ EDVFNIPVMA QAGAAIFDNL ALIFAIGVAI 
GLSVDGSGAA GLAGAIGYLV LQNTTNALSK TYSAAELNDK LKSVQDLVGS VDPTKLADTM
TKVSKAAALT PKINMAILGG IIAGVVAGLL YNKFHKIKLP EWLGFFAGKR FVPIITSIVM
LLLGLVFGQI WPTIQSGIDA VAHGIVNLGS IGAGLFGLLN RLLIPIGLHH VMNTYFWFVL
GDFTNAAGDI VHGDIARFFA KDPSAGMFMT GFFPVMMFGL PAACFAMIAA AKPEKRKMVT
GMLGGLALTS FLTGITEPIE FSFMFLSPVL YGIHAVLTGL SLFITTTLGI HDGFSFSAGA
IDYVLNFGIA TKPLLLAGIG LIYAAIYFVV FYFLIKKFDL KTPGREDEEE MAEGEEAPVA
GSIGETYVAA LGGKENLTVI DNCATRLRLQ VKDAGQVNEA ALKRAGAKGV MKLSNTSVQV
IVGTNVESVA DDMKKHV