Gene Information |
Locus tag | BCZK3842 |
Symbol | pdp |
ID | 3027250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | - |
Start bp | 3971835 |
End bp | 3973139 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637548056 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_085422 |
Protein GI | 52141406 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
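The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, neither of which the record states explicitly. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(3971835, 3973139)
print(length, protein_length(length))  # prints: 1305 434
```

Under those assumptions the computed values agree with the listed Gene Length (1305 bp) and Protein Length (434 aa).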
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00792291 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCTGA TTTAACGATG GCAATGGTCA ATAGCGGTGA TACAATTGAC TTATCAGCTA TTGAAGGAGT AAAAGTAGAT AAGCACTCAA CAGGTGGCGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCC GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGTGGTACA ATTGATAAAT TAGAAGCAGT TCCAGGATTC CATGTTGAAA TCGAAAATGA TGAATTCATG CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAAA GTGGAAACTT AACACCTGCG GATAAAAAAT TGTATGCACT TCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT GCAAGCTCAA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA ATGGTGCGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CGCTTGGAAG TCAAATGGTA TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCACGCG AGAAATTAAT TGAAGTAATG AACAACGGTA AAGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG TGGCGATGCA TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG AAGGAGGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TTGGAACAGC AGCAATGCTT TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT CGCAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACAAT TTACGCAAAC CGTGAAAATG TAGAAGATGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG CATGTAGATG CACCGACATT AGTGCATGGC ATCGTTACTG AATAA
Protein sequence | MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE HVDAPTLVHG IVTE
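The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip that spacing, compute a GC fraction, and translate the coding strand. It is a minimal illustration, not the database's own method: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes the sequence is given 5' to 3' in the coding orientation. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons; this gene starts with ATG, so no special start handling is included.

```python
# Codon table built from the NCBI base ordering T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from the record above.
fragment = "ATGAGAATGG TGGACCTAAT"
print(translate(fragment))    # prints: MRMVDL
print(gc_fraction(fragment))  # prints: 0.4
```

The translated fragment matches the first six residues of the listed protein sequence. Passing the full gene sequence to the same functions would allow a comparison against the listed GC content and Protein Length fields.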