Gene Bcer98_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_2014 
Symbol 
ID5347493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp2116891 
End bp2117967 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content38% 
IMG OID640839556 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_001375282 
Protein GI152975765 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1605] Chorismate mutase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00348159 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATC ATGAATTAGA ACAATTACGT AAACAGGTGG ATGAAATTAA TTTACAGCTA 
TTAAAGCTTT TAAATGAAAG AGGTAGAATT GTTCAAAAAA TTGGTGAACA AAAGCAACTG
CAAGGAACGA AGCGCTTTGA TCCCGTTCGC GAGCGCGAAG TACTAGATAT GATTGCGGAG
CGTAATGAAG GGCCATTTGA GACATCGACG GTTCAACACA TTTTCAAAAC GATTTTCAAA
GCAAGCTTAG AGCTTCAAGA GGACGATAAC CGTAAAGCGC TTCTTGTTTC CCGTAAGAAA
AAACAAGAAA ATACAATTGT GGATGTAAAA GGTGAGCGAT TAGGTAGTGG AACACAATCA
TTCATTATGG GACCTTGTGC GGTAGAAAGT TTAGAGCAAG TTCGTCAAGT TGCGCAAGCG
ATAAAAGAGC AAGGATTAAA ATTAATGCGC GGGGGAGCGT TCAAACCGAG AACATCACCA
TATGATTTCC AAGGTTTAGG GGTAGAAGGA TTACAAATTT TACGACAAGT GGCTGATGAG
TTTGATTTAG CGATTATTAG TGAAATTTTA AATCCGAACG ATGTGGAAAT GGCTCTTGAT
TACGTTGATG TCATTCAAGT TGGGGCACGA AATATGCAAA ACTTTGACTT ATTAAGAGCT
GTAGGAAAAG TGAATAAACC TGTATTGCTA AAAAGAGGTT TAGCAGCGAC AATTGATGAG
TTCATGCATG CAGCTGAATA CATTATTGCA CAAGGTAATG ATCAAATTAT TTTATGTGAA
CGCGGCATTC GCACATACGA AAAAGCAACT CGTAATACGT TAGATATTTC AGCAGTACCA
ATTTTGAAAA AAGAAACACA TTTGCCAGTT GTTGTGGATG TAACACATTC AACAGGACGC
AGAGATTTAT TATTACCGAC TGCAAAAGCA GCGTTAGCAA TTGGAGCAGA TGCAGTAATG
GCAGAAGTAC ATCCAGACCC AGCTGTTGCA TTATCTGACT CAGCACAGCA AATGGATATT
CCAGAATTCC ATAGATTCAT GGAAGAGTTA AAAGAATTCA AAAATAAATT ATCGTAA
 
Protein sequence
MANHELEQLR KQVDEINLQL LKLLNERGRI VQKIGEQKQL QGTKRFDPVR EREVLDMIAE 
RNEGPFETST VQHIFKTIFK ASLELQEDDN RKALLVSRKK KQENTIVDVK GERLGSGTQS
FIMGPCAVES LEQVRQVAQA IKEQGLKLMR GGAFKPRTSP YDFQGLGVEG LQILRQVADE
FDLAIISEIL NPNDVEMALD YVDVIQVGAR NMQNFDLLRA VGKVNKPVLL KRGLAATIDE
FMHAAEYIIA QGNDQIILCE RGIRTYEKAT RNTLDISAVP ILKKETHLPV VVDVTHSTGR
RDLLLPTAKA ALAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFHRFMEEL KEFKNKLS