Gene Information |
Locus tag | Bcer98_2014 |
Symbol | |
ID | 5347493 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | - |
Start bp | 2116891 |
End bp | 2117967 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640839556 |
Product | bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase |
Protein accession | YP_001375282 |
Protein GI | 152975765 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1605] Chorismate mutase; [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase; [TIGR01801] chorismate mutase domain of gram positive AroA protein |
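The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2116891
end_bp = 2117967
gene_length_bp = 1077
protein_length_aa = 358

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the final codon is a stop codon and is not counted in the protein length.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # prints: 1077 0 358
```

Under these assumptions the computed span matches the listed gene length (1077 bp), and 359 codons minus one stop codon matches the listed protein length (358 aa).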
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00348159 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAAATC ATGAATTAGA ACAATTACGT AAACAGGTGG ATGAAATTAA TTTACAGCTA TTAAAGCTTT TAAATGAAAG AGGTAGAATT GTTCAAAAAA TTGGTGAACA AAAGCAACTG CAAGGAACGA AGCGCTTTGA TCCCGTTCGC GAGCGCGAAG TACTAGATAT GATTGCGGAG CGTAATGAAG GGCCATTTGA GACATCGACG GTTCAACACA TTTTCAAAAC GATTTTCAAA GCAAGCTTAG AGCTTCAAGA GGACGATAAC CGTAAAGCGC TTCTTGTTTC CCGTAAGAAA AAACAAGAAA ATACAATTGT GGATGTAAAA GGTGAGCGAT TAGGTAGTGG AACACAATCA TTCATTATGG GACCTTGTGC GGTAGAAAGT TTAGAGCAAG TTCGTCAAGT TGCGCAAGCG ATAAAAGAGC AAGGATTAAA ATTAATGCGC GGGGGAGCGT TCAAACCGAG AACATCACCA TATGATTTCC AAGGTTTAGG GGTAGAAGGA TTACAAATTT TACGACAAGT GGCTGATGAG TTTGATTTAG CGATTATTAG TGAAATTTTA AATCCGAACG ATGTGGAAAT GGCTCTTGAT TACGTTGATG TCATTCAAGT TGGGGCACGA AATATGCAAA ACTTTGACTT ATTAAGAGCT GTAGGAAAAG TGAATAAACC TGTATTGCTA AAAAGAGGTT TAGCAGCGAC AATTGATGAG TTCATGCATG CAGCTGAATA CATTATTGCA CAAGGTAATG ATCAAATTAT TTTATGTGAA CGCGGCATTC GCACATACGA AAAAGCAACT CGTAATACGT TAGATATTTC AGCAGTACCA ATTTTGAAAA AAGAAACACA TTTGCCAGTT GTTGTGGATG TAACACATTC AACAGGACGC AGAGATTTAT TATTACCGAC TGCAAAAGCA GCGTTAGCAA TTGGAGCAGA TGCAGTAATG GCAGAAGTAC ATCCAGACCC AGCTGTTGCA TTATCTGACT CAGCACAGCA AATGGATATT CCAGAATTCC ATAGATTCAT GGAAGAGTTA AAAGAATTCA AAAATAAATT ATCGTAA
Protein sequence | MANHELEQLR KQVDEINLQL LKLLNERGRI VQKIGEQKQL QGTKRFDPVR EREVLDMIAE RNEGPFETST VQHIFKTIFK ASLELQEDDN RKALLVSRKK KQENTIVDVK GERLGSGTQS FIMGPCAVES LEQVRQVAQA IKEQGLKLMR GGAFKPRTSP YDFQGLGVEG LQILRQVADE FDLAIISEIL NPNDVEMALD YVDVIQVGAR NMQNFDLLRA VGKVNKPVLL KRGLAATIDE FMHAAEYIIA QGNDQIILCE RGIRTYEKAT RNTLDISAVP ILKKETHLPV VVDVTHSTGR RDLLLPTAKA ALAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFHRFMEEL KEFKNKLS
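The relationship between the two sequences above can be spot-checked by translating the first codons of the gene sequence. The listed gene sequence begins with ATG and ends with TAA, so it appears to be given in coding orientation even though the gene lies on the minus strand. The sketch below is an illustration, not the database's own method: the helper `translate` and the variable names are hypothetical, and it relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the tables differ in permitted start codons).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# Amino-acid assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored,
    and any trailing bases that do not form a full codon are skipped.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence and first 10 residues of the protein,
# both copied from the record above.
gene_prefix = "ATGGCAAATC ATGAATTAGA ACAATTACGT"
protein_prefix = "MANHELEQLR"

print(translate(gene_prefix))  # prints: MANHELEQLR
```

The same function can be applied to the full gene sequence by pasting it in as a string; the space-separated 10-base groups do not need to be removed first.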