Gene BCAH820_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4109 
Symbolpyn2 
ID7188336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3926552 
End bp3927856 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content39% 
IMG OID643557520 
Productpyrimidine-nucleoside phosphorylase 
Protein accessionYP_002453059 
Protein GI218905225 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02644] pyrimidine-nucleoside phosphorylase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value9.99872e-53 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCAAAAAAA CGTGACGGAC ATGCATTAAC GACAGAAGAA 
ATTAACTTTA TTGTTGAAGG ATATACAAAT GGTGATATTC CTGATTATCA AGTAAGTTCA
CTTGCAATGG CAATTTTCTT CCAAGATATG AACGATCAAG AACGTGCAGA TTTAACGATG
GCAATGGTAA ATAGTGGTGA TACAATCGAC TTATCAGCTA TTGAAGGTGT AAAAGTAGAT
AAGCACTCAA CAGGTGGCGT TGGTGATACA ACGACACTTG TATTAGGTCC ATTAGTAGCC
GCTTTAGGTG TACCGGTTGC AAAAATGTCT GGACGTGGTC TAGGACATAC TGGGGGTACA
ATTGATAAAT TAGAAGCAGT TCCAGGGTTC CATGTTGAAA TCGAAAATGA TGAATTCATG
CGTCTTGTAA ATGAAAATAA AATCGCAGTT ATTGGTCAGA GTGGAAACTT AACACCTGCG
GATAAAAAGT TATATGCACT CCGTGATGTA ACGGCAACAG TAAACTCAAT TCCGCTTATT
GCAAGCTCGA TTATGAGTAA AAAAATTGCT GCTGGTGCAG ATGCAATTGT TCTTGATGTA
AAAACTGGAG CAGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTTT AGCAGAAGCA
ATGGTACGCA TTGGTAATAA CGTTGGTCGT AATACGATGG CTGTTATTTC TGATATGAGT
CAACCACTTG GTGAGGCTAT TGGTAACGCA CTTGAAGTAC AAGAAGCAAT TGATACATTA
CAAGGTAAAG GACCGAAAGA TTTAGAAGAG TTATGTTTAA CACTTGGAAG TCAAATGGTA
TACCTTGCTG GACAAGCTTC ATCTTTAGAA GATGCGCGTG AGAAATTAAT TGAAGTAATG
AACAACGGTA AAGCGCTAGA ATCATTTAAA ACATTCTTAT CAGCGCAAGG CGGCGATGCA
TCTGTTGTTG ATGATCCTTC TAAATTACCA CAAGCACAAT TTAAAATTGA AGTGGAAGCG
AAGGAAGACG GTTATGTATC AGAAATCGTT GCAGATGAAA TTGGAACAGC AGCAATGCTT
TTAGGAGCAG GACGTGCGAC GAAGGAATCT GAAATTGATT TAGCAGTTGG CTTAATGCTT
CGCAAAAAAG TAGGGGACAG CGTGAAAAAA GGTGAATCCC TTGTTACCAT TTACGCAAAC
CGTGAAAATG TAGAAGATGT AAAAGCAAAA ATTTATGAGA ACATGAAGAT CTCTAAAGAG
CATGTAGATG CACCGACATT AGTGCATGGC ATCGTTACTG AATAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGYTN GDIPDYQVSS LAMAIFFQDM NDQERADLTM 
AMVNSGDTID LSAIEGVKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENDEFM RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR NTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGQASSLE DAREKLIEVM
NNGKALESFK TFLSAQGGDA SVVDDPSKLP QAQFKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKK GESLVTIYAN RENVEDVKAK IYENMKISKE
HVDAPTLVHG IVTE