Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCE_4154 |
Symbol | pyn |
ID | 2749358 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus ATCC 10987 |
Kingdom | Bacteria |
Replicon accession | NC_003909 |
Strand | - |
Start bp | 3878247 |
End bp | 3879551 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637280951 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | NP_980447 |
Protein GI | 42783200 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00147082 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA CTTGCAATGG CAATTTTCTT CCAAAATATG AACGATCAAG AGCGTGCAGA TTTAACGATG GCAATGGTAA ATAGCGGTGA TACAATCGAC TTATCAGCTA TTGAAGGAGT AAAAGTAGAT AAGCACTCGA CAGGTGGCGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCC GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGTGGTACA ATTGATAAAT TAGAAGCAGT TCCAGGATTC CATGTTGAAA TCGAAAATGA TGAATTCATG CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAAA GTGGAAACTT AACACCTGCG GATAAAAAAT TGTATGCACT TCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT GCAAGCTCAA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA AAAACTGGAG CAGGTGCATT TATGAAAACA GATGAAGATG CAAAACGTTT AGCAGAAGCA ATGGTACGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACGTTA CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CGCTTGGAAG TCAAATGGTA TACCTTGCTG GACAAGCTTC ATCTCTAGAA GATGCACGTG AGAAATTAAT TGAAGTAATG AACAACGGTA AGGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG CGGCGATGCA TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TCGGAACAGC AGCAATGCTT TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCTGTTGG CTTAATGCTT CGTAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC CGTGAAAATG TAGAAGACGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG CATGTAGATG CACCAACATT AGTGCACGGC ATCGTTACTG AATAA
|
Protein sequence | MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQNM NDQERADLTM AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE HVDAPTLVHG IVTE
|
| |