Gene BAS5048 details


Gene Information

Locus tag: BAS5048
Symbol:
ID: 2853024
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4920849
End bp: 4921964
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 39%
IMG OID: 637508303
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_031287
Protein GI: 49188034
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
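
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this explicitly) and that the protein length excludes the stop codon; the variable names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based inclusive.
start_bp = 4920849
end_bp = 4921964

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is a stop and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1116
print(protein_length)  # 371
```

Under these assumptions the computed values agree with the listed Gene Length (1116 bp) and Protein Length (371 aa).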


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTGAAC GTTTAAAAGT AATGACAATT TTCGGGACAC GTCCAGAAGC AATTAAAATG 
GCACCTCTTG TATTAGAGTT GCAAAAGCAT CCAGAGAAAA TTGAGTCGAT TGTGACTGTA
ACAGCGCAAC ATCGTCAAAT GTTAGACCAA GTATTAAGTA TCTTTGGAAT TACACCAGAT
TTCGATTTGA ATATTATGAA GGATCGCCAA ACTTTAATTG ATATTACAAC ACGAGGTTTA
GAAGGTTTGG ATAAAGTAAT GAAAGAAGCA AAGCCGGATA TCGTACTTGT ACATGGTGAT
ACAACGACAA CGTTTATCGC AAGCTTAGCT GCTTTCTATA ATCAAATTCC AGTAGGTCAT
GTCGAGGCGG GACTTCGTAC ATGGGATAAA TATTCTCCAT ACCCAGAAGA GATGAATCGT
CAATTAACAG GCGTAATGGC GGACCTTCAT TTCTCACCTA CAGCAAAATC GGCAACGAAC
TTACAGAAAG AAAATAAAGA TGAGTCACGC ATTTTCATAA CAGGAAATAC AGCGATTGAC
GCACTAAAAA CGACTGTAAA AGAAACATAT AGTCATCCTG TACTAGAGAA ACTTGGAAAT
GATCGTCTTG TACTTATGAC AGCTCACCGT CGTGAAAACT TAGGAGAGCC AATGCGTAAT
ATGTTCCGTG CAATTAAGCG TCTTGTTGAT AAGCATGAAG ACGTACAAGT TGTATATCCT
GTTCATATGA ATCCTGTTGT TCGTGAAACT GCAAATGATA TTTTAGGCGA TCATGGCCGC
ATTCATTTAA TTGAGCCGTT AGATGTAATT GATTTCCACA ATGTTGCAGC TCGTTCATAC
TTAATGTTAA CTGATTCTGG TGGGGTACAA GAGGAAGCTC CGTCACTTGG TGTACCGGCT
CTTGTTCTTC GTGATACAAC AGAGCGCCCT GAAGGTATTG AAGCAGGTAC GTTGAAATTA
GCGGGAACAG ACGAAGAGAC AATCTTTAGT CTTGCTGATG AGTTGTTATC AGACAAAGAA
GCTCATGATA AAATGTCAAA AGCATCTAAC CCGTACGGTG ATGGCCGTGC ATCAGAGCGT
ATTGTAGAAG CAATTTTAAA ACACTTTAAT AAGTAA
 
Protein sequence
MTERLKVMTI FGTRPEAIKM APLVLELQKH PEKIESIVTV TAQHRQMLDQ VLSIFGITPD 
FDLNIMKDRQ TLIDITTRGL EGLDKVMKEA KPDIVLVHGD TTTTFIASLA AFYNQIPVGH
VEAGLRTWDK YSPYPEEMNR QLTGVMADLH FSPTAKSATN LQKENKDESR IFITGNTAID
ALKTTVKETY SHPVLEKLGN DRLVLMTAHR RENLGEPMRN MFRAIKRLVD KHEDVQVVYP
VHMNPVVRET ANDILGDHGR IHLIEPLDVI DFHNVAARSY LMLTDSGGVQ EEAPSLGVPA
LVLRDTTERP EGIEAGTLKL AGTDEETIFS LADELLSDKE AHDKMSKASN PYGDGRASER
IVEAILKHFN K
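
As a rough consistency check between the two sequences, the first and last blocks of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it relies on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG).

```python
from itertools import product

# NCBI-style layout: bases in TCAG order, third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both start on a codon boundary
# because every full line holds 60 bases.
first_block = "ATGACTGAAC GTTTAAAAGT AATGACAATT TTCGGGACAC GTCCAGAAGC AATTAAAATG"
last_block = "ATTGTAGAAG CAATTTTAAA ACACTTTAAT AAGTAA"

print(translate(first_block))  # MTERLKVMTIFGTRPEAIKM
print(translate(last_block))   # IVEAILKHFNK
```

The two results match the first 20 and last 11 residues of the protein sequence, and the final TAA codon accounts for the protein being one residue shorter than the number of codons.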