Gene BCG9842_B1042 details


Gene Information

Locus tag: BCG9842_B1042
Symbol: pyn2
ID: 7181268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 4063871
End bp: 4065175
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 39%
IMG OID: 643551989
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_002447659
Protein GI: 218899248
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
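
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4063871
end_bp = 4065175

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1305 434
```

Under these assumptions the result agrees with the reported Gene Length (1305 bp) and Protein Length (434 aa).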


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00204257
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 1.53983e-05
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA 
ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA
CTTGCAATGG CAATTTTCTT CCAAGATATG AACGACCAAG AGCGTGCTGA TTTAACGATG
GCAATGGTAA ATAGCGGCGA TACAATTGAC TTATCAGCTA TTGAAGGTGT AAAAGTAGAT
AAGCATTCGA CAGGTGGTGT TGGTGATACA ACAACACTTG TATTAGGTCC ATTAGTAGCT
GCTTTAGACG TACCAGTAGC GAAAATGTCT GGACGCGGTT TAGGACATAC TGGTGGTACA
ATTGATAAAT TAGAAGCAGT TCCAGGATTC CATGTGGAAA TTGAAAACGA TGAATTCATG
CGACTTGTAA ATGAAAACAA AATCGCTGTT ATCGGCCAAA GTGGTAACTT AACACCTGCT
GATAAAAAGT TATATGCACT TCGTGATGTA ACAGCAACAG TAAACTCAAT TCCGCTTATT
GCAAGTTCGA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA
AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA
ATGGTACGCA TCGGTAATAA CGTTGGTCGT AATACGATGG CAGTTATTTC TGATATGAGT
CAACCACTCG GTGAAGCTAT CGGTAACGCA CTAGAAGTAC AAGAAGCAAT TGATACATTA
CAAGGTAAAG GGCCAAAAGA TTTAGAAGAG CTATGTTTAA CACTTGGAAG CCAAATGGTG
TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCACGTG AAAAGCTAAT TGAAGTAATG
AACAACGGAA AAGCGCTAGA ATCATTTAAA ACGTTCTTAT CAGCGCAAGG CGGCGACGCA
TCTGTTGTTG ATGACCCTTC TAAATTGCCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG
AAAGAAGACG GATATGTATC AGAAATCGTT GCAGACGAAA TCGGAACAGC AGCAATGCTT
TTAGGAGCAG GACGTGCAAC GAAAGAGTCA GAAATTGATT TAGCAGTTGG TCTAATGCTG
CGCAAAAAAG TAGGCGATAG CGTGAAGCAA GGTGAATCCC TTGTTACAAT TTACGCGAAC
CGCGAAAATG TTGAAGATGT AAAAGCGAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAA
CACGTAGATG CACCAACTTT AGTACACGGT ATCGTAACGA AGTAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM 
AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALDVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM
NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKQ GESLVTIYAN RENVEDVKAK IYENMKISKE
HVDAPTLVHG IVTK
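
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is printed in the coding orientation starting at the first codon, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1 (table 11 differs in its permitted start codons, which does not matter here because the sequence begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI layout: bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks of the listing
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First and last lines of the gene sequence listing, copied from the record.
first_line = "ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA"
last_line = "CACGTAGATG CACCAACTTT AGTACACGGT ATCGTAACGA AGTAA"

print(translate(first_line))  # MRMVDLIAKKRDGHALTTEE
print(translate(last_line))   # HVDAPTLVHGIVTK
```

The last line can be translated on its own because the 21 preceding lines hold 60 bases each, so it starts on a codon boundary. Both outputs match the corresponding ends of the protein sequence listed above.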