Gene Bcer98_3343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcer98_3343 
Symbol 
ID5343818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cytotoxicus NVH 391-98 
KingdomBacteria 
Replicon accessionNC_009674 
Strand
Start bp3411257 
End bp3412330 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content38% 
IMG OID640840830 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_001376553 
Protein GI152977036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1605] Chorismate mutase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0172834 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCAC AAGAATTAGA TCGTTTACGT TCTCAAATTG ATGAAATAAA TATGCAAATG 
TTAGAGCTTT TAAATGAAAG GGGCCGTCTT GTTCAAGAAG TCGGTAAAGT AAAAGAAGAG
CAAGGGATTA TGAAATTTGA TCCTGTACGT GAAAGAAATA TGCTCGATTT AATTGCACAA
CATAATAATG GCCCGTTTGA AACATCAACA CTTCAACATA TTTTTAAACA AATTTTTCAG
ATGAGCTTAG AGTTACAAGA AGATGATCAT CGCAAGGCGC TTCTTGTTTC TCGTAAGAAA
AAGCCAGAGG ATACAATTGT TACGATTAAA GGTGAGAGAA TTGGTGATGG GAATCCGCAC
TTTATTATGG GACCATGTGC TGTAGAGAGT TATGAACAAG TGCGTCAAGT AGCGGAAGCG
ATAAAGGAAC AAGGATTAAA ATTAATGCGC GGTGGTGCAT TTAAACCTCG TACATCTCCT
TACGATTTCC AAGGTCTTGG TTTAGAGGGA TTACAAATTT TGCGACAAGT TGCCGATGAA
TATGATTTAG CTGTCATTAG CGAAATTTTA AATCCAAACG ATATGGAAAT GGCTCTTGAT
TATGTAGATG TAATTCAAAT TGGGGCTCGC AATATGCAAA ACTTTGAACT ATTAAAAGCT
GCCGGTGCTG TAAAGAAACC AGTATTATTA AAACGAGGTT TATCAGCTAC TATTGAAGAA
TTTATTTATG CCGCAGAATA TATTATGGCA CAAGGGAATG GAGATATTAT TTTATGTGAA
CGAGGCATTC GAACGTATGA GAAGGCAACT CGTAACACGC TAGATATTTC CGCTGTGCCG
ATTTTGAAGA AGGAGACACA TTTACCTGTT GTAGTAGATG TAACGCATTC TACAGGGCGC
CGTGACCTTC TATTACCAAC TGCAAAAGCA GCAATGGCAA TCGGTGCTGA TGCTGTTATG
GCTGAAGTGC ATCCGGATCC AGCTGTTGCA TTATCGGATT CAGCACAGCA AATGGATATT
CCGGAGTTTA ATGAGTTTAT GAAAGAACTA AAAGCGTTTC GTGGTAGATC GTAA
 
Protein sequence
MASQELDRLR SQIDEINMQM LELLNERGRL VQEVGKVKEE QGIMKFDPVR ERNMLDLIAQ 
HNNGPFETST LQHIFKQIFQ MSLELQEDDH RKALLVSRKK KPEDTIVTIK GERIGDGNPH
FIMGPCAVES YEQVRQVAEA IKEQGLKLMR GGAFKPRTSP YDFQGLGLEG LQILRQVADE
YDLAVISEIL NPNDMEMALD YVDVIQIGAR NMQNFELLKA AGAVKKPVLL KRGLSATIEE
FIYAAEYIMA QGNGDIILCE RGIRTYEKAT RNTLDISAVP ILKKETHLPV VVDVTHSTGR
RDLLLPTAKA AMAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFNEFMKEL KAFRGRS