Gene BAS3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3997 
Symbol 
ID2850274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3936984 
End bp3938168 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content38% 
IMG OID637507234 
Productphosphopentomutase 
Protein accessionYP_030247 
Protein GI49186995 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1015] Phosphopentomutase 
TIGRFAM ID[TIGR01696] phosphopentomutase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAT ATAAACGTAT ATTCCTAGTC GTAATGGACT CTGTTGGAAT CGGTGAAGCA 
CCAGATGCTG AACAATTTGG TGATTTAGGA TCTGATACAA TTGGTCACAT TGCTGAACAT
ATGAATGGAT TACACATGCC TAACATGGTG AAATTAGGTC TTGGTAACAT TCGTGAAATG
AAAGGCATCT CTAAAGTAGA AAAACCACTT GGATATTATA CAAAAATGCA AGAGAAATCT
ACTGGTAAAG ATACAATGAC AGGACACTGG GAAATTATGG GTCTTTACAT TGATACACCA
TTCCAAGTGT TCCCTGAAGG ATTCCCGAAA GAATTACTTG ATGAATTAGA AGAAAAAACA
GGTCGTAAAA TCATCGGTAA TAAACCAGCT TCTGGAACTG AAATTCTTGA TGAACTTGGT
CAAGAACAAA TGGAAACAGG CTCTTTAATT GTTTACACTT CTGCTGATAG CGTTCTGCAA
ATCGCAGCAC ACGAAGAAGT AGTACCGCTT GATGAGTTGT ATAAAATTTG TAAAATTGCA
CGTGAATTAA CGTTAGATGA GAAGTACATG GTAGGTCGCG TTATTGCTCG TCCATTCGTT
GGTGAGCCTG GAAACTTTAC ACGTACACCG AACCGTCATG ACTATGCATT AAAACCATTC
GGCCGTACAG TAATGAATGA ATTAAAAGAT AGTGATTATG ATGTGATTGC TATCGGTAAA
ATCTCTGACA TCTATGATGG TGAAGGCGTA ACTGAATCAC TTCGTACGAA GTCTAACATG
GATGGAATGG ATAAGGTTGT AGATACATTA AATATGGACT TTACAGGTCT TAGCTTCTTA
AACTTAGTTG ACTTTGATGC ACTATTTGGT CACCGTCGTG ACCCACAAGG ATATGGAGAA
GCTCTGCAAG AATATGATGC ACGTCTTCCA GAAGTATTCG AAAAACTAAA AGAAGATGAT
CTATTATTAA TTACAGCAGA CCACGGTAAT GACCCAGTTC ACCACGGTAC TGACCATACA
CGTGAATATG TACCGTTATT AGCATATAGC CCAAGCATGA AAGAAGGCGG ACAAGAGTTA
CCACTTCGTC AAACATTTGC TGATATTGGT GCAACTGTAG CAGAAAACTT CGGTGTGAAA
ATGCCAGAAT ACGGAACAAG CTTCTTAAAC GAGCTAAAGA AATAG
 
Protein sequence
MNKYKRIFLV VMDSVGIGEA PDAEQFGDLG SDTIGHIAEH MNGLHMPNMV KLGLGNIREM 
KGISKVEKPL GYYTKMQEKS TGKDTMTGHW EIMGLYIDTP FQVFPEGFPK ELLDELEEKT
GRKIIGNKPA SGTEILDELG QEQMETGSLI VYTSADSVLQ IAAHEEVVPL DELYKICKIA
RELTLDEKYM VGRVIARPFV GEPGNFTRTP NRHDYALKPF GRTVMNELKD SDYDVIAIGK
ISDIYDGEGV TESLRTKSNM DGMDKVVDTL NMDFTGLSFL NLVDFDALFG HRRDPQGYGE
ALQEYDARLP EVFEKLKEDD LLLITADHGN DPVHHGTDHT REYVPLLAYS PSMKEGGQEL
PLRQTFADIG ATVAENFGVK MPEYGTSFLN ELKK