Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS3995 |
Symbol | |
ID | 2849826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 3934838 |
End bp | 3936142 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637507232 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_030245 |
Protein GI | 49186993 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0822094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCAGA TTTAACGATG GCAATGGTAA ATAGTGGTGA TACAATCGAC TTATCAGCTA TTGAAGGTGT AAAAGTAGAT AAGCACTCAA CAGGTGGCGT TGGTGATATA ACGACACTTG TATTAGGTCC ATTAGTAGCC GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGCGGTACA ATTGATAAAT TAGAAGCAGT TCCAGGGTTC CATGTTGAAA TCGAAAATGA TGAATTCATG CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAGA GTGGAAACTT AACACCTGCG GATAAAAAGT TATATGCACT CCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT GCAAGCTCGA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA ATGGTACGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CACTTGGAAG TCAAATGGTA TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCGCGTG AGAAATTAAT TGAAGTAATG AACAACGGTA AAGCGCTAGA ATCATTTAAA ACATTCTTAT CAGCGCAAGG CGGCGATGCA TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TTGGAACAGC AGCAATGCTT TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT CGCAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC CGTGAAAATG TAGAAGATGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG CATGTAGATG CACCGACATT AGTGCATGGC ATCGTTACTG AATAA
|
Protein sequence | MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM AMVNSGDTID LSAIEGVKVD KHSTGGVGDI TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE HVDAPTLVHG IVTE
|
| |