Gene Information |
Locus tag | BCAH187_A4218 |
Symbol | pyn2 |
ID | 7074920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH187 |
Kingdom | Bacteria |
Replicon accession | NC_011658 |
Strand | - |
Start bp | 3927741 |
End bp | 3929045 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643452636 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_002340149 |
Protein GI | 217961579 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
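The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3927741
end_bp = 3929045

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1305 434
```

Under these assumptions the computed values agree with the listed Gene Length (1305 bp) and Protein Length (434 aa).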
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000180142 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA ATCAACTTTA TTGTTGAAGG ATATACGAAT GGTGATATTC CTGATTATCA AGTAAGTTCA CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCTGA TTTAACGATG GCAATGGTCA ATAGCGGTGA TACAATCGAC TTATCAGCTA TTGAAGGGGT AAAAGTAGAT AAGCACTCCA CAGGTGGCGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCC GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGTGGTACA ATTGATAAAT TAGAAGCAGT TCCAGGATTT CATGTTGAAA TCGAAAATGA TGAATTCATG CGTCTTGTAA ATGAAAACAA AATCGCAGTT ATTGGTCAAA GTGGAAACTT AACACCTGCG GATAAAAAAT TGTATGCACT TCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT GCAAGCTCAA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA AAAACTGGAG CAGGTGCATT TATGAAAACA GATGAAGATG CAAAACGTTT AGCAGAAGCA ATGGTACGCA TTGGGAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC GGATATGAGT CAACCACTTG GTGAGGCGAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CACTTGGAAG TCAAATGGTA TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCACGTG AGAAATTAAT TGAAGTAATG AACAACGGTA AAGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG CGGAGATGCA TCTGTTGTTG ATGATCCTTC TAAGTTACCA CAAGCACAGT TTAAAGTTGA AGTGGAAGCG AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TCGGAACAGC AGCAATGCTT TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT CGTAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC CGTGAAAATG TAGAAGACGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG CATGTAGATG CACCAACATT AGTGCACGGC ATCGTTACTG AATAA
|
Protein sequence | MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKVEVEA KEDGYVSEIV ADEIGTAAML LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE HVDAPTLVHG IVTE
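The gene and protein sequences can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method. It assumes the listed gene sequence is already in coding orientation (it begins with ATG and ends with TAA, although the gene lies on the minus strand). It also relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code, differing only in which start codons are permitted. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence, copied from the record.
head = "ATGAGAATGG TGGACCTAAT TGCAAAAAAA"
tail = "ATCGTTACTG AATAA"

print(translate(head))  # MRMVDLIAKK
print(translate(tail))  # IVTE
```

The two fragments translate to the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` in the same way would allow a complete comparison.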