Gene BAS4039 details


Gene Information

Locus tag: BAS4039
Symbol: argJ
ID: 2850280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3975031
End bp: 3976257
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 38%
IMG OID: 637507276
Product: bifunctional ornithine acetyltransferase/N-acetylglutamate synthase protein
Protein accession: YP_030289
Protein GI: 49187037
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1364] N-acetylglutamate synthase (N-acetylornithine aminotransferase)
TIGRFAM ID: [TIGR00120] glutamate N-acetyltransferase/amino-acid acetyltransferase
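
The start, end, gene length and protein length fields above are related by simple arithmetic, and the sketch below checks that they agree. It assumes the coordinates are 1-based and inclusive and that the final codon is an untranslated stop codon; neither is stated in the record, although both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for BAS4039.
length = gene_length_bp(3975031, 3976257)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```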


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGATTA AAGTAGCGTC TATTACAAAA GTAGAAGATG GTTCGATTGT AACGCCAAAA 
GGTTTCTCGG CCATTGGCAC TGCAATTGGT CTGAAAAAGG GGAAAAAGGA TTTAGGGGCA
ATCGTTTGTG ATGTACCGGC ATCATGTGCT GCTGTTTATA CAACAAATCA AATACAAGCA
GCCCCGTTGC AAGTGACGAA GGATAGTATA ACGACTGAGG GGAAACTACA AGCTATTATC
GTTAATAGTG GAAATGCAAA TGCTTGTACA GGAATGAAAG GGTTGCAAGA TGCTTACGAG
ATGCGTGCAT TAGGGGCGGA ACATTTTGGA TTGAAAGAAA AGTATGTTGC AGTAGCTTCA
ACAGGTGTAA TTGGTGTTCC GCTGCCGATG GATATAATCC GAAAGGGAAT TGTAACTCTT
ATACCGGCGA AGGAAGAAAA TGGAGCTCAT TCTTTTTCTG AAGCAATTTT AACGACGGAT
CTTATAACGA AAGAAACTTG CTATGAAATG ATTATTGATG GGAAGAAAGT GATGATTGCT
GGTGTTGCGA AAGGTTCAGG GATGATTCAT CCAAATATGG CAACGATGCT AAGTTTTATT
ACGACAGACG CTCGTATAGA GCATGACGTA TTGCAAACAG CATTATCACA AATAACGAAT
CATACATTTA ATCAAATTAC AGTAGATGGA GATACTTCTA CGAATGATAT GGTCATCGCT
ATGGCAAGTG GATTATCAGA AACGAAACCA ATCGATATGG AACATGCAGA TTGGGAAACT
TTCGTATTTG CTTTACAGAA GGTATGTGAA GATTTAGCCA AAAAAATTGC ACAAGATGGT
GAAGGTGCTA CGAAGTTAAT AGAAGTAAAT GTGCTAGGAG TTCAAACAAA TGAAGAGGCA
AAGAAAATCG CAAAGCAAAT AGTCGGTTCA AGTCTTGTGA AAACAGCAAT ACATGGTGAA
GACCCAAATT GGGGGCGAAT TATTAGCAGT ATTGGACAAA GTGAAGTAGC AATTAATCCG
AATACAATTG ACATTACTCT TCAATCTATA TCGGTATTAA AAAATAGTGA GCCTCAAACA
TTTTCTGAAG AAGAAATGAA AGAGAGATTA CAAGAAGATG AAATAGTCAT TAATGTGTAT
TTACATTTAG GTAAAGAGAC AGGATCAGCT TGGGGCTGTG ACTTAAGCTA TGAATATGTG
AAAATAAACG CTTGTTATCG TACATAA
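
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following minimal sketch shows one way to do that and to compute a GC percentage that could be compared with the "GC content" field. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration; paste the full sequence to check the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as printed in the record.
first_line = "ATGATGATTA AAGTAGCGTC TATTACAAAA GTAGAAGATG GTTCGATTGT AACGCCAAAA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```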
 
Protein sequence
MMIKVASITK VEDGSIVTPK GFSAIGTAIG LKKGKKDLGA IVCDVPASCA AVYTTNQIQA 
APLQVTKDSI TTEGKLQAII VNSGNANACT GMKGLQDAYE MRALGAEHFG LKEKYVAVAS
TGVIGVPLPM DIIRKGIVTL IPAKEENGAH SFSEAILTTD LITKETCYEM IIDGKKVMIA
GVAKGSGMIH PNMATMLSFI TTDARIEHDV LQTALSQITN HTFNQITVDG DTSTNDMVIA
MASGLSETKP IDMEHADWET FVFALQKVCE DLAKKIAQDG EGATKLIEVN VLGVQTNEEA
KKIAKQIVGS SLVKTAIHGE DPNWGRIISS IGQSEVAINP NTIDITLQSI SVLKNSEPQT
FSEEEMKERL QEDEIVINVY LHLGKETGSA WGCDLSYEYV KINACYRT
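
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a plain-Python illustration, not the database's own method. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does and differs only in which codons may act as start codons; since this gene begins with ATG, alternative start codons are not handled here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, as printed in the record.
print(translate("ATGATGATTA AAGTAGCGTC TATTACAAAA GTAGAAGATG GTTCGATTGT AACGCCAAAA"))
# MMIKVASITKVEDGSIVTPK
print(translate("AAAATAAACG CTTGTTATCG TACATAA"))
# KINACYRT
```

Both outputs match the first twenty and last eight residues of the protein sequence listed above. The last line can be translated on its own only because the preceding 1200 bases are a whole number of codons.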