Gene BAS4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4040 
SymbolargC 
ID2848270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3976266 
End bp3977303 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content38% 
IMG OID637507277 
ProductN-acetyl-gamma-glutamyl-phosphate reductase 
Protein accessionYP_030290 
Protein GI49187038 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0002] Acetylglutamate semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTCG CGATTATTGG AGCAACTGGG TATGGAGGTA TTGAGTTAAT TCGGTTATTA 
GAACAACATC CATATTTTTC GATAGCATCT CTCCATTCTT TTTCACAAGT TGGCGAGTGT
ATAACAAATG TATATCCGCA TTTTCAAAAT GTTCTTGTTC ATACGTTACA AGAAATTGAT
GTGGAGGAAA TAGAGAAGGA AGCAGAAATT GTATTTTTAG CAACCCCAGC AGGAGTATCA
GCAGAGTTAA CTCCCAAATT ATTAGCAGTA GGCTTAAAAG TAATTGACCT ATCTGGAGAC
TTTCGTATGA AAGATCCTTT CATATATGAA CAGTGGTATA AAAGGGCAGC TGCAAAAGAA
GGAGTCCTTA GGGAAGCTGT ATATGGGTTA AGTGAATGGA AAAGGTCCGA AATTCAAAAG
GCAAATTTAA TTGCAAACCC GGGATGTTTT GCTACAGCTG CATTATTAGC GATATTACCG
TTAGTTCGTA GCGGCATAAT TGAGGAAGAC TCAATTATTA TTGATGCGAA ATCAGGAGTA
TCTGGAGCAG GCAAAACGCC AACAACGATG ACTCACTTTC CTGAGTTATA TGATAACTTG
CGTATTTATA AAGTAAATGA GCATCAACAC ATTCCTGAGA TTGAGCAAAT GCTCGCGGAG
TGGAATAGAG AAACGAAGCC AATCACGTTT AGTACACATT TAATACCGAT ATCACGTGGG
ATTATGGTTA CACTGTATGC GAAAGTAAAG CGAGAAATGG AAATAGAACA ACTTCAACAA
TTATATGAAG AAGCGTATGA ACAATCGGCT TTTATTCGAA TTCGCATGCA AGGAGAGTTT
CCAAGTCCGA AAGAAGTGAG AGGCTCAAAT TATTGTGATA TGGGGATAGC TTACGATGAA
AGAACAGGAA GAGTGACAAT TGTTTCTGTT ATAGACAATA TGATGAAAGG TGCGGCTGGT
CAAGCGATTC AAAATGCAAA TATAGTAGCG GGACTAGAAG AAACGACAGG TTTACAACAT
ATGCCGCTTT ATCTATAA
 
Protein sequence
MKVAIIGATG YGGIELIRLL EQHPYFSIAS LHSFSQVGEC ITNVYPHFQN VLVHTLQEID 
VEEIEKEAEI VFLATPAGVS AELTPKLLAV GLKVIDLSGD FRMKDPFIYE QWYKRAAAKE
GVLREAVYGL SEWKRSEIQK ANLIANPGCF ATAALLAILP LVRSGIIEED SIIIDAKSGV
SGAGKTPTTM THFPELYDNL RIYKVNEHQH IPEIEQMLAE WNRETKPITF STHLIPISRG
IMVTLYAKVK REMEIEQLQQ LYEEAYEQSA FIRIRMQGEF PSPKEVRGSN YCDMGIAYDE
RTGRVTIVSV IDNMMKGAAG QAIQNANIVA GLEETTGLQH MPLYL