Gene BAS4563 details

Gene Information

Locus tag: BAS4563
Symbol:
ID: 2850479
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4467874
End bp: 4469040
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 43%
IMG OID: 637507800
Product: acetoin utilization protein AcuC
Protein accession: YP_030810
Protein GI: 49187557
COG category: [B] Chromatin structure and dynamics
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0123] Deacetylases, including yeast histone deacetylase and acetoin utilization protein
TIGRFAM ID:
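
The listed gene and protein lengths can be cross-checked from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (which is consistent with the listed 1167 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the database.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates.
start_bp = 4467874
end_bp = 4469040

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Split the gene into codons; a clean CDS should leave no remainder.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

With the values in this record, the computed lengths agree with the Gene Length and Protein Length fields.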


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAGCG CGTTTATTTA TTCGGATGAC TTTCGGGGCT ATTCATTTAG CCCTGATCAT 
CCCTTTAACC AACTGCGCGT CACACTTACG TATGATTTAT TACAAAAGGG CGGTTTCATC
TCTCCTTCCC AAATCATCTC ACCACGGATG GCTACAGATG AAGAGATTGC CTACATTCAT
ACAGAGGAGT ACATAAATGC GGTAAAACGT GCTGGAGAAG GTAAGTTAGA AAAATCAATT
GCGATGACAT ATGGACTCGG AACAGAAGAT ACACCAATGT TTCCAAATAT GCACGAAGCA
AGCGCATTAC TCGTTGGCGG TACGTTAACC GCTGTCGATG CTGTTCTTTC TGGGAAAGTA
AAACACGCTC TCAATTTAGG TGGTGGCTTA CATCATGGCT TCCGTGGCAA AGCATCTGGC
TTTTGCATTT ATAACGATAG TTCCATCGCA ATGAAATATA TTCAAAAGAA GTACGGTTTA
CGCGTTTTAT ATATTGATAC GGATGCTCAT CACGGTGATG GTGTACAGTG GTCCTTTTAT
GACGATCCTA ACGTATGCAC CATTTCACTA CATGAAACTG GTCGATATTT ATTCCCTGGA
ACTGGCGCTG TAAACGAACG CGGACAAGGT AATGGCTATA GTTATTCTTT TAACGTTCCA
CTCGATGCTT TTACAGAAGA CGAATCGTTT TTAGATTCCT ATCGAACTGT TGTAAAAGAA
GTGGCCGCAT ACTTTAAACC GGATATTATT TTAACGCAAA ATGGTGCTGA CGCACATTAC
TACGACCCAC TTACACACCT TTGCGCAACG ATGAATATTT ACCGCGAGAT ACCAAAGCTC
GCTCGCGAAA TCGCTAACGA ATATTGCGAA GGTCGCTGGA TTGCTGTCGG CGGCGGTGGC
TATGACCACT GGCGTGTCGT CCCAAGAGCT TGGGCACTCA TTTGGCTCGA AATGAACAAC
ATCCAAAACA TCTCAGGTTA TCTCCCTCCA GAATGGATTG ACGCTTGGAA AGGACAAGCT
GAAACAGAAC TTCCTCTCAC ATGGGAAGAT CCAAACAACA TGTATAAACC TATCCCCCGC
AAACCAGAAA TTGAAGAAAA GAACGCATTA ACTGTAGCAA AATCCCTTGA AATTATTCGG
AATAATATGA AAAAATCTTT GTACTAA
 
Protein sequence
MSSAFIYSDD FRGYSFSPDH PFNQLRVTLT YDLLQKGGFI SPSQIISPRM ATDEEIAYIH 
TEEYINAVKR AGEGKLEKSI AMTYGLGTED TPMFPNMHEA SALLVGGTLT AVDAVLSGKV
KHALNLGGGL HHGFRGKASG FCIYNDSSIA MKYIQKKYGL RVLYIDTDAH HGDGVQWSFY
DDPNVCTISL HETGRYLFPG TGAVNERGQG NGYSYSFNVP LDAFTEDESF LDSYRTVVKE
VAAYFKPDII LTQNGADAHY YDPLTHLCAT MNIYREIPKL AREIANEYCE GRWIAVGGGG
YDHWRVVPRA WALIWLEMNN IQNISGYLPP EWIDAWKGQA ETELPLTWED PNNMYKPIPR
KPEIEEKNAL TVAKSLEIIR NNMKKSLY
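
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, applied to the first and last lines of the gene sequence only. The helper names clean_sequence and gc_fraction are hypothetical, and the GC value computed here covers only the sample line, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGTAGCG CGTTTATTTA TTCGGATGAC TTTCGGGGCT ATTCATTTAG CCCTGATCAT"
last_line = "AATAATATGA AAAAATCTTT GTACTAA"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

# A typical bacterial CDS starts with ATG and ends with a stop codon.
starts_with_atg = first.startswith("ATG")
ends_with_taa = last.endswith("TAA")

# GC fraction of the 60-base sample line only.
sample_gc = gc_fraction(first)

print(len(first), starts_with_atg, ends_with_taa, round(sample_gc, 3))
```

Passing the full gene sequence through clean_sequence in the same way would allow its length and GC content to be compared with the Gene Length and GC content fields.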