Gene BCZK2677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK2677 
SymbolaroA 
ID3025316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp2791329 
End bp2792405 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content38% 
IMG OID637546898 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_084264 
Protein GI52142565 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1605] Chorismate mutase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.907033 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATC ATGAATTAGA TCAATTACGT AAACAGGTAG ATGAAATTAA CTTACAACTA 
TTACACCTTT TAAACAAACG CGGTGAAATC GTTCAAAAAA TTGGGGAACA AAAGCAAGTA
CAAGGTACAA AACGTTTTGA TCCAGTACGT GAGCGTGAAG TGCTTGATAT GATTGCAGAG
AATAACGAAG GACCATTCGA AACATCAACA GTTCAACATA TTTTCAAAAC AATCTTCAAA
GCTAGCTTAG AATTACAAGA AGATGATAAC CGTAAAGCAT TACTAGTATC ACGTAAAAAG
AAACAAGAAA ATACAATCGT TGATGTAAAA GGTGAATTGA TTGGTAACGG CACACAAACG
TTCATCATGG GACCTTGTGC GGTAGAAAGC TTAGAGCAAG TTCGCCAAGT AGGGCAAGCG
ATGAAAGACC AAGGCTTAAA ATTAATGCGC GGTGGTGCTT TCAAACCGAG AACATCTCCA
TACGATTTCC AAGGTTTAGG AGTAGAAGGG CTACAAATTT TACGCCAAGT AGCAGATGAG
TTCGACTTAG CGATCATCAG TGAGATTTTA AATCCAAACG ATGTTGAAAT GGCATTAGAC
TACGTTGATG TAATTCAAGT TGGTGCACGT AACATGCAAA ACTTCGATTT ACTACGAGCT
GTAGGTAAAG TTAACAAGCC AGTATTATTA AAACGTGGAT TAGCAGCAAC AATTGATGAG
TTCATTAACG CAGCTGAATA CATCATTGCA CAAGGTAACG ACCAAATTAT TCTATGTGAG
CGTGGTATCC GCACATACGA AAGAGCAACA CGTAACACAT TAGACATTTC TGCTGTACCG
ATTTTAAAGA AAGAAACACA TTTACCAGTT ATCGTTGACG TAACGCATTC AACTGGACGT
AGAGATTTAT TATTACCAAC GGCGAAAGCA GCACTTGCAA TTGGTGCAGA TGCAGTAATG
GCTGAAGTAC ACCCAGACCC AGCAGTGGCA CTATCTGATT CTGCACAACA AATGGATATT
CCAGAATTCC ATAGATTCAT GGATGAGTTA AAAGGTTTCA AAAATAAATT ATCTTAA
 
Protein sequence
MANHELDQLR KQVDEINLQL LHLLNKRGEI VQKIGEQKQV QGTKRFDPVR EREVLDMIAE 
NNEGPFETST VQHIFKTIFK ASLELQEDDN RKALLVSRKK KQENTIVDVK GELIGNGTQT
FIMGPCAVES LEQVRQVGQA MKDQGLKLMR GGAFKPRTSP YDFQGLGVEG LQILRQVADE
FDLAIISEIL NPNDVEMALD YVDVIQVGAR NMQNFDLLRA VGKVNKPVLL KRGLAATIDE
FINAAEYIIA QGNDQIILCE RGIRTYERAT RNTLDISAVP ILKKETHLPV IVDVTHSTGR
RDLLLPTAKA ALAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFHRFMDEL KGFKNKLS