Gene BAS5126 details


Gene Information

Locus tag: BAS5126
Symbol:
ID: 2847878
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 5010632
End bp: 5011906
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 32%
IMG OID: 637508381
Product: glycosyl transferase, group 1 family protein
Protein accession: YP_031365
Protein GI: 49188112
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
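
The coordinate and length fields in this record agree with each other, which can be checked with a few lines of Python. This is only a minimal sketch: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 5010632
end_bp = 5011906

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1275 424"
```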


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACAGA AGAAAGTGTG CATGTTTGTA TGGAATCACT TCACAAATGA TGCACGTGTA 
TTAAGAGAAT GTACAGCCTT AGCAGAATCT GGATATGAGG TTGATTTAAT TTGTATTCAC
GATTGGAAAC AAGAGGATCT TCCTTTATGG GAAAAACGTC AAGAGGGCTT TACAGTAACT
CGTGTGAAGA ATAGAGTACC AGTATTACAG AAAATATTTG GGTTGGCTAA ACGTGCCAAA
AGAGTGGCGA TGAAAAATAT CGCTACAATG ACTGTACTTG GTCTACTATT GGCATTAGGA
ATTTGGAAGT TCCCTATGAT AACAGTTAGT TTATTATTAC TAGCTTTATT ATTTTCGCAG
AGAAAGGTTG CTACTTTATT GGTGAGAGGA GCAATTTTGT TTAGGATGGT TCGAGCGGGA
TTAAAAAAGA AGTATGATAT ATATCACTCG AACGATTTAA ATACATTACC GCAAGGGTTT
ATATGTGCGA AAATATTGAG AAGGAAAAAG TTAATTTATG ATTCTCATGA AGTACAGACG
AGTAGGACGG GATATAACAG TAATATTTAT GGGGTCATGG AGAAATTTTT TATTAAATTT
TGTGATGTAA TGATTATGGA AAATCATACA AGAGCAAAGT ATATAGAAGA ATTATATGGA
TTTTATCCAA AAGTCATTCA TAATTATCCA TTTGTTTCGC GTCCGGAACT GAGTAAATCA
ATTGATTTAC ATGGAATATT AAATATTTCA CAAGATGAAC CGATTCTTTT ATATCAAGGT
GGAATTCAAA TTGGACGTGG TCTTGATAAA TTGGTACAAG CAGTTCCTTT ATTTAAACGA
GGTGTTGTGG TATTCATTGG AGATGGTCGT ATCAAACCTG AATTGAAAAA AATGGTAAAA
GAAATGGAAT TAGAAGATCG AGTAAAGTTT ATACCAAAAG TGCCCGTACA GGATTTAATT
CATTATACAA AAAATGCTTA TTTAGGATTT CAAGTATTAA ATAATGTTTG TTTTAACCAT
TATTCTGCTT CTTCTAATAA ATTATTTGAA TATATGATGA GTGGTGTACC TGTAGTTGCT
TGTAGTTTTC CTGAAATTCA AGGTGTAGTT GAAAAAGAAA ACATAGGAGT TTGTGTTGAC
TCGCATGATC CAGTTTCAAT TGCTGATGGG GTAAACTACT TATTAAATAA TCAGGATGAT
AGGGAAAAAA TGATGGTAAA TTGTTTAAGT GCAAGGGAAA AGTATAATTG GCAAAGAGAA
AAAAGGATTT TATAA
 
Protein sequence
MSQKKVCMFV WNHFTNDARV LRECTALAES GYEVDLICIH DWKQEDLPLW EKRQEGFTVT 
RVKNRVPVLQ KIFGLAKRAK RVAMKNIATM TVLGLLLALG IWKFPMITVS LLLLALLFSQ
RKVATLLVRG AILFRMVRAG LKKKYDIYHS NDLNTLPQGF ICAKILRRKK LIYDSHEVQT
SRTGYNSNIY GVMEKFFIKF CDVMIMENHT RAKYIEELYG FYPKVIHNYP FVSRPELSKS
IDLHGILNIS QDEPILLYQG GIQIGRGLDK LVQAVPLFKR GVVVFIGDGR IKPELKKMVK
EMELEDRVKF IPKVPVQDLI HYTKNAYLGF QVLNNVCFNH YSASSNKLFE YMMSGVPVVA
CSFPEIQGVV EKENIGVCVD SHDPVSIADG VNYLLNNQDD REKMMVNCLS AREKYNWQRE
KRIL
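
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one possible way to do that, to compute a GC fraction, and to translate a nucleotide string with the codon assignments of translation table 11 (which are the same as those of the standard code; the tables differ only in permitted start codons). The helper names `clean`, `gc_fraction` and `translate` are hypothetical and are not part of any database tool.

```python
def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# Codon table in the usual T, C, A, G ordering for all three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of the gene sequence listed above.
first_blocks = clean("ATGTCACAGA AGAAAGTGTG CATGTTTGTA")
last_blocks = clean("AAAAGGATTT TATAA")

print(translate(first_blocks))  # prints "MSQKKVCMFV"
print(translate(last_blocks))   # prints "KRIL"
```

Both fragments match the start and the end of the listed protein sequence. Applying `clean` to the whole gene listing and passing the result to `gc_fraction` and `translate` would check the GC content and protein fields in the same way.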