Gene BCE_4154 details


Gene Information

Locus tag: BCE_4154
Symbol: pyn
ID: 2749358
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus ATCC 10987
Kingdom: Bacteria
Replicon accession: NC_003909
Strand:
Start bp: 3878247
End bp: 3879551
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 39%
IMG OID: 637280951
Product: pyrimidine-nucleoside phosphorylase
Protein accession: NP_980447
Protein GI: 42783200
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
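
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3878247
end_bp = 3879551

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codon_count - 1

print(gene_length)     # 1305
print(remainder)       # 0
print(protein_length)  # 434
```

Under these assumptions the computed values agree with the reported Gene Length (1305 bp) and Protein Length (434 aa).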


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00147082
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA 
ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA
CTTGCAATGG CAATTTTCTT CCAAAATATG AACGATCAAG AGCGTGCAGA TTTAACGATG
GCAATGGTAA ATAGCGGTGA TACAATCGAC TTATCAGCTA TTGAAGGAGT AAAAGTAGAT
AAGCACTCGA CAGGTGGCGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCC
GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGTGGTACA
ATTGATAAAT TAGAAGCAGT TCCAGGATTC CATGTTGAAA TCGAAAATGA TGAATTCATG
CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAAA GTGGAAACTT AACACCTGCG
GATAAAAAAT TGTATGCACT TCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT
GCAAGCTCAA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA
AAAACTGGAG CAGGTGCATT TATGAAAACA GATGAAGATG CAAAACGTTT AGCAGAAGCA
ATGGTACGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT
CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACGTTA
CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CGCTTGGAAG TCAAATGGTA
TACCTTGCTG GACAAGCTTC ATCTCTAGAA GATGCACGTG AGAAATTAAT TGAAGTAATG
AACAACGGTA AGGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG CGGCGATGCA
TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG
AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TCGGAACAGC AGCAATGCTT
TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCTGTTGG CTTAATGCTT
CGTAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC
CGTGAAAATG TAGAAGACGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG
CATGTAGATG CACCAACATT AGTGCACGGC ATCGTTACTG AATAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQNM NDQERADLTM 
AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM
NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE
HVDAPTLVHG IVTE
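
The two sequences above are related by translation under table 11. The following is a minimal sketch of how that relationship, and a GC percentage, could be checked. It assumes the sequence is pasted with the space-separated 10-character blocks shown on this page. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. The codon table is built from the standard amino acid assignments, which table 11 shares with the standard code; table 11 differs only in which codons may act as start codons, and that does not matter here because this gene begins with ATG.

```python
# Codon table built from the standard ordering of bases (T, C, A, G).
# Table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Return the percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as shown on this page.
first_line = "ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA"
print(translate(first_line))  # MRMVDLIAKKRDGHALTTEE
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` in the same way would allow the whole protein and the reported GC content to be compared.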