Gene BAS3995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3995 
Symbol 
ID2849826 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3934838 
End bp3936142 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content39% 
IMG OID637507232 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_030245 
Protein GI49186993 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0822094 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA 
ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA
CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCAGA TTTAACGATG
GCAATGGTAA ATAGTGGTGA TACAATCGAC TTATCAGCTA TTGAAGGTGT AAAAGTAGAT
AAGCACTCAA CAGGTGGCGT TGGTGATATA ACGACACTTG TATTAGGTCC ATTAGTAGCC
GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGCGGTACA
ATTGATAAAT TAGAAGCAGT TCCAGGGTTC CATGTTGAAA TCGAAAATGA TGAATTCATG
CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAGA GTGGAAACTT AACACCTGCG
GATAAAAAGT TATATGCACT CCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT
GCAAGCTCGA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA
AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA
ATGGTACGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT
CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA
CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CACTTGGAAG TCAAATGGTA
TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCGCGTG AGAAATTAAT TGAAGTAATG
AACAACGGTA AAGCGCTAGA ATCATTTAAA ACATTCTTAT CAGCGCAAGG CGGCGATGCA
TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG
AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TTGGAACAGC AGCAATGCTT
TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT
CGCAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC
CGTGAAAATG TAGAAGATGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG
CATGTAGATG CACCGACATT AGTGCATGGC ATCGTTACTG AATAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM 
AMVNSGDTID LSAIEGVKVD KHSTGGVGDI TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM
NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE
HVDAPTLVHG IVTE