Gene Information |
Locus tag | BCG9842_B1042 |
Symbol | pyn2 |
ID | 7181268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 4063871 |
End bp | 4065175 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643551989 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_002447659 |
Protein GI | 218899248 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
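The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record itself does not state either convention. The function names `span_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4063871
END_BP = 4065175
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codon count matches protein length:",
          protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1305 bp span corresponds to 435 codons, that is 434 residues plus one stop codon.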
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00204257 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0000153983 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA CTTGCAATGG CAATTTTCTT CCAAGATATG AACGACCAAG AGCGTGCTGA TTTAACGATG GCAATGGTAA ATAGCGGCGA TACAATTGAC TTATCAGCTA TTGAAGGTGT AAAAGTAGAT AAGCATTCGA CAGGTGGTGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCT GCTTTAGACG TACCAGTAGC GAAAATGTCT GGACGCGGTT TAGGACATAC TGGTGGTACA ATTGATAAAT TAGAAGCAGT TCCAGGATTC CATGTGGAAA TTGAAAACGA TGAATTCATG CGACTTGTAA ATGAAAACAA AATCGCTGTT ATCGGCCAAA GTGGTAACTT AACACCTGCT GATAAAAAGT TATATGCACT TCGTGATGTA ACAGCAACAG TAAACTCAAT TCCGCTTATT GCAAGTTCGA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA ATGGTACGCA TCGGTAATAA CGTTGGTCGT AATACGATGG CAGTTATTTC TGATATGAGT CAACCACTCG GTGAAGCTAT CGGTAACGCA CTAGAAGTAC AAGAAGCAAT TGATACATTA CAAGGTAAAG GGCCAAAAGA TTTAGAAGAG CTATGTTTAA CACTTGGAAG CCAAATGGTG TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCACGTG AAAAGCTAAT TGAAGTAATG AACAACGGAA AAGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG CGGCGACGCA TCTGTTGTTG ATGACCCTTC TAAATTGCCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG AAAGAAGACG GATATGTATC AGAAATCGTT GCAGACGAAA TCGGAACAGC AGCAATGCTT TTAGGAGCAG GACGTGCAAC GAAAGAGTCA GAAATTGATT TAGCAGTTGG TCTAATGCTG CGCAAAAAAG TAGGCGATAG CGTGAAGCAA GGTGAATCCC TTGTTACAAT TTACGCGAAC CGCGAAAATG TTGAAGATGT AAAAGCGAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAA CACGTAGATG CACCAACTTT AGTACACGGT ATCGTAACGA AGTAA
|
Protein sequence | MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALDVPVAKMS GRGLGHTGGT IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML LGAGRATKES EIDLAVGLML RKKVGDSVKQ GESLVTIYAN RENVEDVKAK IYENMKISKE HVDAPTLVHG IVTK
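Both sequences above are displayed in space-separated blocks of ten characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation and a plain codon-by-codon translation. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11's alternative start codons are not modelled here.

```python
from itertools import product

# Standard codon table in TCAG order; translation table 11 uses the same
# amino-acid assignments (its extra start codons are not handled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown in this record.
    head = clean_sequence("ATGAGAATGG TGGACCTAAT TGCAAAAAAA")
    print(translate(head))
```

Passing the complete gene sequence through `clean_sequence` and `translate` gives a way to compare the result with the listed protein sequence, and `gc_fraction` can be compared with the GC content field in the same way.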