Gene BAS4359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4359 
Symbol 
ID2850992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4270518 
End bp4271507 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content39% 
IMG OID637507594 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_030606 
Protein GI49187354 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTCTT TACAATTTAA TCGTCATCGT CGCTTAAGAC AAAGCGGGGG CATGCGTGCG 
CTTGTGCGTG AAACGTTTTT ACATACGGAA GATTTTATTT ACCCTATTTT TGTATTAGAA
GGTGAAAATG TTCGTAATGA AGTACCTTCT ATGCCAGGCG TATATCAAAT GTCTTTAGAT
TTATTGCAAG CTGAAATGCA AGAAGTTGTT GATTTAGGTA TTCGTTCTGT TATTGTATTT
GGTTTACCTG CTGAAAAAGA TGAAGTTGGA TCATCAGCAT ATTGTGATCA TGGAATTGTG
CAACGCGCGA TTCAGCAAAT TAAAGGTGAA TTCCCCGATC TAGTAGTAGT TGCGGATACA
TGTTTATGTC AATTTACAAG CCATGGTCAT TGCGGTGTAA TTGAAGATGG TATTATTTTA
AATGACGAGT CTCTTGCAGT TCTTGCAAAA ACAGCTGTAA GCCAAGCGAA AGCAGGAGCG
GACATTATTG CGCCATCAAA CATGATGGAC GGATTCGTAA CAGCAATTCG CCACGCATTA
GATGAAAATG GTTTTGGACA TGTACCAGTT ATGTCGTACG CTGTGAAATA TTCATCAGCA
TTTTATGGAC CATTCCGTGA TGCGGCACAC GGTGCACCGC AATTTGGTGA TCGTAAAACA
TATCAAATGG ACCCAGCGAA CCGCATGGAA GCATTCCGTG AAGCAGAATC AGATGTAATG
GAAGGGGCAG ATTTCTTAAT TGTAAAACCA GCTCTTTCTT ATTTAGATAT CGTTCGTGAT
GTGAAAAATA ACTTTAATTT ACCAGTCGTT GCTTATAACG TAAGCGGTGA ATATTCAATG
ATTAAAGCGG CAGCGCAAAA TGGTTGGATT AATGAAAAAG AAGTTGTACT TGAAAAATTA
ATTAGTATGA AACGTGCAGG AGCAGATTTA ATTATTACGT ATCATGCAAA AGATGCAGCA
AGATGGTTAC AAGAAGGAGG CGCTAAATAA
 
Protein sequence
MNSLQFNRHR RLRQSGGMRA LVRETFLHTE DFIYPIFVLE GENVRNEVPS MPGVYQMSLD 
LLQAEMQEVV DLGIRSVIVF GLPAEKDEVG SSAYCDHGIV QRAIQQIKGE FPDLVVVADT
CLCQFTSHGH CGVIEDGIIL NDESLAVLAK TAVSQAKAGA DIIAPSNMMD GFVTAIRHAL
DENGFGHVPV MSYAVKYSSA FYGPFRDAAH GAPQFGDRKT YQMDPANRME AFREAESDVM
EGADFLIVKP ALSYLDIVRD VKNNFNLPVV AYNVSGEYSM IKAAAQNGWI NEKEVVLEKL
ISMKRAGADL IITYHAKDAA RWLQEGGAK