Gene BAS2748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2748 
Symbol 
ID2852601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2727743 
End bp2728819 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content38% 
IMG OID637505993 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_029006 
Protein GI49185754 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1605] Chorismate mutase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.531887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATC ATGAATTAGA TCAATTACGT AAACAGGTAG ATGAAATTAA CTTACAACTA 
TTACACCTTT TAAACAAACG CGGTGAAATC GTTCAAAAAA TTGGGGAACA AAAGCAAGTA
CAAGGTACAA AACGTTTTGA TCCAGTACGT GAGCGTGAAG TGCTTGATAT GATTGCAGAG
AATAACGAAG GACCATTCGA AACATCAACA GTTCAACATA TTTTCAAAAC AATCTTCAAA
GCTAGCTTAG AATTACAAGA AGATGATAAC CGTAAAGCAT TACTAGTATC ACGTAAAAAG
AAACAAGAAA ACACAATCGT TGATGTAAAA GGTGAATTGA TTGGTAACGG CACACAAACG
TTCATCATGG GACCTTGCGC GGTAGAAAGC TTAGAGCAAG TTCGCCAAGT AGGGCAAGCG
ATGAAAGACC AAGGCTTAAA ATTAATGCGC GGTGGTGCTT TCAAACCGAG AACATCTCCA
TACGATTTCC AAGGTTTAGG AGTAGAAGGG CTACAAATTT TACGTCAAGT AGCAGATGAG
TTCGACTTAG CGATCATTAG TGAGATTTTA AATCCAAACG ATGTTGAAAT GGCATTAGAC
TACGTTGATG TAATTCAAGT TGGTGCACGT AACATGCAAA ACTTCGATTT ACTACGAGCT
GTAGGTAAAG TTAACAAGCC AGTATTATTA AAACGTGGAT TAGCAGCAAC AATTGATGAG
TTCATTAATG CAGCGGAATA CATCATTGCA CAAGGTAATG ACCAAATTAT TCTATGTGAG
CGCGGTATTC GCACATACGA AAGAGCAACA CGTAACACAT TAGACATTTC AGCAGTACCG
ATCTTAAAGA AAGAAACACA TTTACCAGTT GTTGTTGACG TAACGCATTC AACTGGACGT
AGAGATTTAT TATTACCAAC AGCGAAAGCG GCTCTTGCAA TTGGTGCAGA TGCAGTAATG
GCTGAAGTAC ATCCAGACCC AGCAGTTGCA TTATCAGATT CTGCACAACA AATGGATATT
CCGGAATTCC ATAGATTCAT GGAAGAGTTA AAAGGTTTCA AAAATAAATT ATCTTAA
 
Protein sequence
MANHELDQLR KQVDEINLQL LHLLNKRGEI VQKIGEQKQV QGTKRFDPVR EREVLDMIAE 
NNEGPFETST VQHIFKTIFK ASLELQEDDN RKALLVSRKK KQENTIVDVK GELIGNGTQT
FIMGPCAVES LEQVRQVGQA MKDQGLKLMR GGAFKPRTSP YDFQGLGVEG LQILRQVADE
FDLAIISEIL NPNDVEMALD YVDVIQVGAR NMQNFDLLRA VGKVNKPVLL KRGLAATIDE
FINAAEYIIA QGNDQIILCE RGIRTYERAT RNTLDISAVP ILKKETHLPV VVDVTHSTGR
RDLLLPTAKA ALAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFHRFMEEL KGFKNKLS