Gene BCZK3842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCZK3842 
Symbolpdp 
ID3027250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_006274 
Strand
Start bp3971835 
End bp3973139 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content38% 
IMG OID637548056 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_085422 
Protein GI52141406 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00792291 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA 
ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA
CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCTGA TTTAACGATG
GCAATGGTCA ATAGCGGTGA TACAATTGAC TTATCAGCTA TTGAAGGAGT AAAAGTAGAT
AAGCACTCAA CAGGTGGCGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCC
GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGTGGTACA
ATTGATAAAT TAGAAGCAGT TCCAGGATTC CATGTTGAAA TCGAAAATGA TGAATTCATG
CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAAA GTGGAAACTT AACACCTGCG
GATAAAAAAT TGTATGCACT TCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT
GCAAGCTCAA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA
AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA
ATGGTGCGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT
CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA
CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CGCTTGGAAG TCAAATGGTA
TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCACGCG AGAAATTAAT TGAAGTAATG
AACAACGGTA AAGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG TGGCGATGCA
TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG
AAGGAGGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TTGGAACAGC AGCAATGCTT
TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT
CGCAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACAAT TTACGCAAAC
CGTGAAAATG TAGAAGATGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG
CATGTAGATG CACCGACATT AGTGCATGGC ATCGTTACTG AATAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM 
AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM
NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE
HVDAPTLVHG IVTE