Gene GBAA_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_2958 
Symbol 
ID2819977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp2727408 
End bp2728484 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content38% 
IMG OID637789764 
Productbifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase 
Protein accessionYP_019601 
Protein GI47528252 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1605] Chorismate mutase
[COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase
[TIGR01801] chorismate mutase domain of gram positive AroA protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0331892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATC ATGAATTAGA TCAATTACGT AAACAGGTAG ATGAAATTAA CTTACAACTA 
TTACACCTTT TAAACAAACG CGGTGAAATC GTTCAAAAAA TTGGGGAACA AAAGCAAGTA
CAAGGTACAA AACGTTTTGA TCCAGTACGT GAGCGTGAAG TGCTTGATAT GATTGCAGAG
AATAACGAAG GACCATTCGA AACATCAACA GTTCAACATA TTTTCAAAAC AATCTTCAAA
GCTAGCTTAG AATTACAAGA AGATGATAAC CGTAAAGCAT TACTAGTATC ACGTAAAAAG
AAACAAGAAA ACACAATCGT TGATGTAAAA GGTGAATTGA TTGGTAACGG CACACAAACG
TTCATCATGG GACCTTGCGC GGTAGAAAGC TTAGAGCAAG TTCGCCAAGT AGGGCAAGCG
ATGAAAGACC AAGGCTTAAA ATTAATGCGC GGTGGTGCTT TCAAACCGAG AACATCTCCA
TACGATTTCC AAGGTTTAGG AGTAGAAGGG CTACAAATTT TACGTCAAGT AGCAGATGAG
TTCGACTTAG CGATCATTAG TGAGATTTTA AATCCAAACG ATGTTGAAAT GGCATTAGAC
TACGTTGATG TAATTCAAGT TGGTGCACGT AACATGCAAA ACTTCGATTT ACTACGAGCT
GTAGGTAAAG TTAACAAGCC AGTATTATTA AAACGTGGAT TAGCAGCAAC AATTGATGAG
TTCATTAATG CAGCGGAATA CATCATTGCA CAAGGTAATG ACCAAATTAT TCTATGTGAG
CGCGGTATTC GCACATACGA AAGAGCAACA CGTAACACAT TAGACATTTC AGCAGTACCG
ATCTTAAAGA AAGAAACACA TTTACCAGTT GTTGTTGACG TAACGCATTC AACTGGACGT
AGAGATTTAT TATTACCAAC AGCGAAAGCG GCTCTTGCAA TTGGTGCAGA TGCAGTAATG
GCTGAAGTAC ATCCAGACCC AGCAGTTGCA TTATCAGATT CTGCACAACA AATGGATATT
CCGGAATTCC ATAGATTCAT GGAAGAGTTA AAAGGTTTCA AAAATAAATT ATCTTAA
 
Protein sequence
MANHELDQLR KQVDEINLQL LHLLNKRGEI VQKIGEQKQV QGTKRFDPVR EREVLDMIAE 
NNEGPFETST VQHIFKTIFK ASLELQEDDN RKALLVSRKK KQENTIVDVK GELIGNGTQT
FIMGPCAVES LEQVRQVGQA MKDQGLKLMR GGAFKPRTSP YDFQGLGVEG LQILRQVADE
FDLAIISEIL NPNDVEMALD YVDVIQVGAR NMQNFDLLRA VGKVNKPVLL KRGLAATIDE
FINAAEYIIA QGNDQIILCE RGIRTYERAT RNTLDISAVP ILKKETHLPV VVDVTHSTGR
RDLLLPTAKA ALAIGADAVM AEVHPDPAVA LSDSAQQMDI PEFHRFMEEL KGFKNKLS