Gene BAS1756 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1756 
Symbol 
ID2851407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1778928 
End bp1780229 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content37% 
IMG OID637505007 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_028020 
Protein GI49184768 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.212389 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGG TAGATATTAT TGCAAAAAAA CGTGACGGTA AAGAATTAAC AACTGAAGAA 
ATCAAATTCT TTATTAATGG TTATACAGAC GGAAGTATTC CTGATTATCA AGTAAGTGCA
CTTGCAATGG CAATCTTCTT TAAAGATATG ACAGATCGTG AACGTGCAGA TTTAACGATG
GCAATGGTGG AGTCTGGAGA AACAATCGAC TTATCTGCAA TTGAAGGAAT TAAAGTAGAC
AAACATTCAA CTGGTGGTGT TGGTGATACA ACAACATTAG TATTAGGACC ATTAGTAGCT
GCTTTAGATG TACCAGTAGC AAAAATGTCT GGTCGTGGTT TAGGACATAC AGGCGGAACA
ATTGATAAAT TAGAAGCAGT AGAAGGATTC CACGTTGAAA TTACGAAAGA GCAGTTCATT
GATATTGTAA ACCGTGACAA AGTAGCTGTT ATTGGACAAA CAGGAAACTT AACACCTGCA
GATAAAAAGA TTTATGCATT ACGCGATGTA ACAGGAACAG TAAACTCAAT TCCTTTAATC
GCAAGTTCAA TTATGAGTAA AAAAATTGCA GCTGGTGCAG ATGCAATCGT ACTTGATGTA
AAAACAGGTG CTGGCGCATT TATGAAAACA GAAGAAGATG CAAAAGAATT AGCACATGCG
ATGGTACGTA TCGGAAATAA TGTAGGACGT CAAACTATGG CTGTTATTTC AGACATGTCA
CAACCGCTTG GATTTGCGAT TGGTAACGCA CTAGAAGTGA AAGAAGCAAT TGATACGTTA
AAAGGTGAAG GTCCAGAAGA TTTAACAGAA TTAGTACTCG TATTAGGAAG TCAGATGGTT
GTACTTGCGA AAAAAGCTAA TACATTAGAA GAAGCGCGTG AAATGTTAAT TGAAGTGATG
AAGAACGGAA AAGCAACTGA GAAGTTTAAA GAATTCTTAA ACAATCAAGG CGGAGATAGC
TCAATTGTAG ACAATCCAGA AAAAATGCCA CAGGCGAAGT ATGTAATTGA TGTACCTGCT
AAAACTTCAG GTGTTATTTC TAACATTGTT GCAGATGAAA TCGGTATCGC AGCTATGCTA
CTTGGTGCTG GTCGTGCAAC AAAAGAAGAT GAAATTGATT TAGCAGTAGG ATTAATGTTA
CGTAAAAAAG TGGGCGATGC AGTAAAAGAA GGCGAGCCAT TCGTAACGAT TTATGCAAAT
CGCGAAAATG TAGAAGATGT AAAAGCTAAA ATTTATGAGA ACATTTCTAT CGCTGAAACA
GCAGTGGCTC CTAAATTAGT TCATACAGTT ATTACTGACT AA
 
Protein sequence
MRMVDIIAKK RDGKELTTEE IKFFINGYTD GSIPDYQVSA LAMAIFFKDM TDRERADLTM 
AMVESGETID LSAIEGIKVD KHSTGGVGDT TTLVLGPLVA ALDVPVAKMS GRGLGHTGGT
IDKLEAVEGF HVEITKEQFI DIVNRDKVAV IGQTGNLTPA DKKIYALRDV TGTVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT EEDAKELAHA MVRIGNNVGR QTMAVISDMS
QPLGFAIGNA LEVKEAIDTL KGEGPEDLTE LVLVLGSQMV VLAKKANTLE EAREMLIEVM
KNGKATEKFK EFLNNQGGDS SIVDNPEKMP QAKYVIDVPA KTSGVISNIV ADEIGIAAML
LGAGRATKED EIDLAVGLML RKKVGDAVKE GEPFVTIYAN RENVEDVKAK IYENISIAET
AVAPKLVHTV ITD