Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcer98_3343 |
Symbol | |
ID | 5343818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | - |
Start bp | 3411257 |
End bp | 3412330 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640840830 |
Product | bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase |
Protein accession | YP_001376553 |
Protein GI | 152977036 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1605] Chorismate mutase [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase [TIGR01801] chorismate mutase domain of gram positive AroA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0172834 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCAC AAGAATTAGA TCGTTTACGT TCTCAAATTG ATGAAATAAA TATGCAAATG TTAGAGCTTT TAAATGAAAG GGGCCGTCTT GTTCAAGAAG TCGGTAAAGT AAAAGAAGAG CAAGGGATTA TGAAATTTGA TCCTGTACGT GAAAGAAATA TGCTCGATTT AATTGCACAA CATAATAATG GCCCGTTTGA AACATCAACA CTTCAACATA TTTTTAAACA AATTTTTCAG ATGAGCTTAG AGTTACAAGA AGATGATCAT CGCAAGGCGC TTCTTGTTTC TCGTAAGAAA AAGCCAGAGG ATACAATTGT TACGATTAAA GGTGAGAGAA TTGGTGATGG GAATCCGCAC TTTATTATGG GACCATGTGC TGTAGAGAGT TATGAACAAG TGCGTCAAGT AGCGGAAGCG ATAAAGGAAC AAGGATTAAA ATTAATGCGC GGTGGTGCAT TTAAACCTCG TACATCTCCT TACGATTTCC AAGGTCTTGG TTTAGAGGGA TTACAAATTT TGCGACAAGT TGCCGATGAA TATGATTTAG CTGTCATTAG CGAAATTTTA AATCCAAACG ATATGGAAAT GGCTCTTGAT TATGTAGATG TAATTCAAAT TGGGGCTCGC AATATGCAAA ACTTTGAACT ATTAAAAGCT GCCGGTGCTG TAAAGAAACC AGTATTATTA AAACGAGGTT TATCAGCTAC TATTGAAGAA TTTATTTATG CCGCAGAATA TATTATGGCA CAAGGGAATG GAGATATTAT TTTATGTGAA CGAGGCATTC GAACGTATGA GAAGGCAACT CGTAACACGC TAGATATTTC CGCTGTGCCG ATTTTGAAGA AGGAGACACA TTTACCTGTT GTAGTAGATG TAACGCATTC TACAGGGCGC CGTGACCTTC TATTACCAAC TGCAAAAGCA GCAATGGCAA TCGGTGCTGA TGCTGTTATG GCTGAAGTGC ATCCGGATCC AGCTGTTGCA TTATCGGATT CAGCACAGCA AATGGATATT CCGGAGTTTA ATGAGTTTAT GAAAGAACTA AAAGCGTTTC GTGGTAGATC GTAA
|
Protein sequence | MASQELDRLR SQIDEINMQM LELLNERGRL VQEVGKVKEE QGIMKFDPVR ERNMLDLIAQ HNNGPFETST LQHIFKQIFQ MSLELQEDDH RKALLVSRKK KPEDTIVTIK GERIGDGNPH FIMGPCAVES YEQVRQVAEA IKEQGLKLMR GGAFKPRTSP YDFQGLGLEG LQILRQVADE YDLAVISEIL NPNDMEMALD YVDVIQIGAR NMQNFELLKA AGAVKKPVLL KRGLSATIEE FIYAAEYIMA QGNGDIILCE RGIRTYEKAT RNTLDISAVP ILKKETHLPV VVDVTHSTGR RDLLLPTAKA AMAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFNEFMKEL KAFRGRS
|
| |