Gene BAS5273 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5273 
Symbol 
ID2852411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5155664 
End bp5156764 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content34% 
IMG OID637508527 
Productglycosyl transferase, group 1 family protein 
Protein accessionYP_031511 
Protein GI49188258 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0104828 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTGT TGCATATGAA TGCTGGTGCG GAAGAAGGTG GGGGAAAAAC ACATATTATT 
TCACTTTTAT CTCAATTTTC AAAGGAAGAA GTGGAATTAA TGGTATTTGA AGAAGGTGCG
ATTGCTAGAG AGGCGAGAAA CCTTGGTATT CAAGTACATG TTTTTACCCA ATCATCTCGG
TACGACCTAT CAATTCTTTC AAAAATAAAA GCATTTATTA ATGAAAATCA ATTTGACATT
GTGCATACAC ATGGCGCACG AGCAAATTTC TATCTCTCCC TCTTGAAAAA AGGTATAAAA
GCGAAATGGA TAATGACTGT CCACAGTGAT CCAACTTTGG ATTTTATGAA GAGGGGATTA
AAAGGATGGG TATTTACGAA GTTAAATTTA CGTTCTTTCA GGAAGGTAGA TTTATTCTTT
GCAATTACGG AGAACTTTAA GAGAAATATA ATAAAACTAG GTGTACCAGA AGAGAAGATT
TGTACTGTTT ATAATGGAAT TGAGTATGAT AGTAATCCGG CAAAACCTTA TGATAAGAGT
GAATTTGGCA TCGATGAAGG AATATTTACA GCCATTCAAG TAGCACGTCT TCATCCTGTT
AAAGGTCATG ATATTTTATT TGAAGCATTA CAAAAAATTA AAATTCCCAA TATAAAGGTA
CTCTTGCTTG GTGATGGTCC TATAGAAGCA GAATTAAAAG AGATGGTGAA ACAAAAGGGT
CTAGAGGATA AAGTAATGTT TCTAGGTTTT CGTATAGATT CAAAGGAATT ATATGCGTCT
GCACACATTA ATTTGTTAAC CTCTTATAGC GAAAGTTTCC CTCTCGTTTT ATTAGAAGCG
GCTAATCAAC GCTTAACATC TATTGCAACA AATGTAGGTG ATATGAAAAA GTTAATAGTT
GATGATACGT ATGGATGGAT TGTACCGATT GGTGATGCAG ACTCGTTAGC AAATGCATTA
GAAAATGCTT ATGAAAAATG GTTGAATGGT GAATTAGAAG CGATGGGAAA TCGTTTATAT
ACTCACGCAT CTACTCACTT CTCGCTAAAG AATTTGTATG AAGATACTTA TAATGCATAT
AAAACACTTT TACTGAAATA G
 
Protein sequence
MKVLHMNAGA EEGGGKTHII SLLSQFSKEE VELMVFEEGA IAREARNLGI QVHVFTQSSR 
YDLSILSKIK AFINENQFDI VHTHGARANF YLSLLKKGIK AKWIMTVHSD PTLDFMKRGL
KGWVFTKLNL RSFRKVDLFF AITENFKRNI IKLGVPEEKI CTVYNGIEYD SNPAKPYDKS
EFGIDEGIFT AIQVARLHPV KGHDILFEAL QKIKIPNIKV LLLGDGPIEA ELKEMVKQKG
LEDKVMFLGF RIDSKELYAS AHINLLTSYS ESFPLVLLEA ANQRLTSIAT NVGDMKKLIV
DDTYGWIVPI GDADSLANAL ENAYEKWLNG ELEAMGNRLY THASTHFSLK NLYEDTYNAY
KTLLLK