Gene BAS3440 details


Gene Information

Locus tag: BAS3440
Symbol:
ID: 2851420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3412169
End bp: 3413440
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 40%
IMG OID: 637506683
Product: imidazolonepropionase
Protein accession: YP_029696
Protein GI: 49186444
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
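
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The record does not state either convention, so both are assumptions. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


length_bp = gene_length(3412169, 3413440)
print(length_bp)                  # 1272
print(protein_length(length_bp))  # 423
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.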


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGGACA CTTTACTAAT CAATATCGGT CAATTACTAA CAATGGATCA AGAAGATGGC 
TTGTTAAGAC GGGAAGCGAT GAACACGCTT CCTGTTATCG AAAATGGTGC GGTTGGAATT
GAAAATGGTG TAATCACTTT CGTTGGAACA GCGGAAGAAG CGAAAGGATT ACAAGCGAAA
GAGGTTATTG ATTGCGGCGG AAAAATGGTT TCTCCTGGCC TTGTTGACCC GCATACTCAT
CTTGTATTTG GTGGATCTCG CGAAAATGAA ATCGCACTAA AATTACAAGG AGTTCCGTAC
TTAGAAATTT TAGAACAAGG CGGAGGTATT CTTTCAACTG TAAATGCAAC GAAACAAGCG
TCGAAAGAAG AGCTTGTTCA AAAAGCGAAA TTCCATTTAG ACCGTATGCT ATCTTTCGGA
GTTACAACTG TAGAAGCGAA GAGTGGTTAT GGATTAGATG ATGAGACGGA ATGGAAACAA
TTAGAGGCAA CTGCACAATT ACAAAAAGAA CATCCAATCG ATTTAGTGTC AACGTTTTTA
GGTGCTCATG CAGTTCCGAA GGAGTACAAA GGTAGATCAA AAGAATTTTT ACAATGGATG
TTAGACCTAC TACCAGAAAT GAAAGAGAAG CAATTAGCAG AATTCGTTGA TATTTTCTGC
GAAACAGGTG TGTTCTCTGT CGAAGAATCA AAAGAGTTTT TATTAAAAGC GAAAGAGCTT
GGCTTTGATG TGAAAATTCA TGCGGATGAA ATTGATCCTC TTGGTGGTGC GGAAGCAGCA
GCTGAAATTG GTGCAGCATC AGCGGACCAT TTAGTTGGTG CTTCTGATAA AGGAATTGAA
ATGCTTGCAA ACTCTAATAC AGTAGCCACT TTATTACCAG GAACAACCTT CTATTTAAAT
AAAGAAAGCT TTGCTCGCGG TCGTAAAATG ATTGATGAAG GTGTTGCGGT AGCTTTAGCC
ACAGACTTTA ACCCAGGCAG CTGCCCAACT GAAAACATTC AGCTTATTAT GAGCATCGCA
ATGCTGAAAT TGAAAATGAC ACCAGAGGAA GTTTGGAATG CTGTAACAGT TAACTCTTCT
TATGCTATTA ATCGAGGCGA TGTAGCTGGG AAAATTAGAG TGGGTCGTAA GGCAGATTTA
GTTTTATGGG ATGCTTACAA TTATGCTTAC GTACCGTATC ATTACGGTGT AAGTCATGTA
AATACAGTGT GGAAGAATGG TAATATCGCA TATACAAGAG GTGAACAATC GTGGAGCACG
GCCACTATTT AA
 
Protein sequence
MLDTLLINIG QLLTMDQEDG LLRREAMNTL PVIENGAVGI ENGVITFVGT AEEAKGLQAK 
EVIDCGGKMV SPGLVDPHTH LVFGGSRENE IALKLQGVPY LEILEQGGGI LSTVNATKQA
SKEELVQKAK FHLDRMLSFG VTTVEAKSGY GLDDETEWKQ LEATAQLQKE HPIDLVSTFL
GAHAVPKEYK GRSKEFLQWM LDLLPEMKEK QLAEFVDIFC ETGVFSVEES KEFLLKAKEL
GFDVKIHADE IDPLGGAEAA AEIGAASADH LVGASDKGIE MLANSNTVAT LLPGTTFYLN
KESFARGRKM IDEGVAVALA TDFNPGSCPT ENIQLIMSIA MLKLKMTPEE VWNAVTVNSS
YAINRGDVAG KIRVGRKADL VLWDAYNYAY VPYHYGVSHV NTVWKNGNIA YTRGEQSWST
ATI
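
The gene and protein sequences above are printed in blocks of ten residues. The following minimal sketch shows one way to strip that formatting, translate the nucleotide sequence, and compute GC content. It is not the database's own method. It uses the standard codon assignments, which translation table 11 shares for internal codons. It does not handle alternative start codons, which is adequate here only because this gene begins with ATG. All function names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for internal codons;
# it differs in which codons are allowed to act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that separate 10-residue blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    Alternative start codons are NOT converted to M (simplification).
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCTGGACA CTTTACTAAT CAATATCGGT"
print(translate(first_blocks))  # MLDTLLINIG
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would give a consistency check between the two listings.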