Gene BAS5117 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5117 
Symbol 
ID2850827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4997513 
End bp4998628 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content39% 
IMG OID637508372 
ProductUDP-N-acetylglucosamine 2-epimerase 
Protein accessionYP_031356 
Protein GI49188103 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0381] UDP-N-acetylglucosamine 2-epimerase 
TIGRFAM ID[TIGR00236] UDP-N-acetylglucosamine 2-epimerase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.200272 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAC GTTTAAAAGT AATGACGATT TTTGGGACAC GTCCAGAAGC AATTAAAATG 
GCACCTCTTG TATTAGAGTT GCAAAAGCAT CCAGAGAAAA TTGAGTCAAT TGTGACTGTA
ACAGCGCAAC ATCGTCAAAT GTTAGACCAA GTATTAAGTA TCTTTGGAAT TACACCAGAT
TTCGATTTGA ATATTATGAA GGATCGCCAA ACTTTAATTG ATATTACAAC GCGTGGTTTA
GAAGGTTTGG ATAAAGTAAT GAAAGAAGCA AAGCCGGATA TCGTACTTGT ACATGGTGAT
ACAACGACAA CGTTTATCGC AAGCTTAGCT GCTTTCTATA ATCAAATTCC AGTAGGTCAT
GTCGAGGCGG GACTTCGTAC ATGGGATAAA TATTCTCCAT ACCCAGAAGA GATGAATCGT
CAATTAACAG GCGTAATGGC GGACCTTCAT TTCTCACCTA CAGCAAAATC GGCAACGAAC
TTACAGAAAG AAAATAAAGA TGAGTCACGC ATTTTCATAA CAGGAAATAC AGCGATTGAC
GCACTAAAAA CGACTGTAAA AGAAACATAT AGTCATCCCG TACTAGAGAA ACTTGGAAAT
AATCGTCTTG TACTTATGAC AGCTCACCGT CGTGAAAACT TAGGAGAGCC AATGCGTAAT
ATGTTCCGTG CAATTAAGCG TCTTGTTGAT AAGCATGAAG ACGTACAAGT TGTATATCCT
GTTCATATGA ATCCTGTTGT TCGTGAAACT GCAAATGATA TTTTAGGCGA TTATGGCCGC
ATTCATTTAA TTGAGCCGTT AGATGTAATT GATTTCCACA ATGTTGCAGC TCGTTCATAC
TTAATGTTAA CTGATTCTGG TGGGGTACAA GAGGAAGCAC CGTCACTTGG TGTACCGGTT
CTTGTTCTTC GTGATACAAC GGAGCGTCCA GAAGGTATTG AAGCAGGTAC GTTGAAATTA
GCGGGAACAG ACGAAGAGAC AATCTTTAGT CTTGCTGATG AGTTGTTATC AGACAAAGAA
GCTCATGATA AGATGTCAAA AGCATCTAAC CCGTACGGTG ATGGCCGTGC ATCAGAGCGT
ATTGTAGAAG CAATTTTAAA ACACTTTAAT AAGTAA
 
Protein sequence
MTERLKVMTI FGTRPEAIKM APLVLELQKH PEKIESIVTV TAQHRQMLDQ VLSIFGITPD 
FDLNIMKDRQ TLIDITTRGL EGLDKVMKEA KPDIVLVHGD TTTTFIASLA AFYNQIPVGH
VEAGLRTWDK YSPYPEEMNR QLTGVMADLH FSPTAKSATN LQKENKDESR IFITGNTAID
ALKTTVKETY SHPVLEKLGN NRLVLMTAHR RENLGEPMRN MFRAIKRLVD KHEDVQVVYP
VHMNPVVRET ANDILGDYGR IHLIEPLDVI DFHNVAARSY LMLTDSGGVQ EEAPSLGVPV
LVLRDTTERP EGIEAGTLKL AGTDEETIFS LADELLSDKE AHDKMSKASN PYGDGRASER
IVEAILKHFN K