Gene Information |
Locus tag | Bcer98_2784 |
Symbol | |
ID | 5343786 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | - |
Start bp | 2864223 |
End bp | 2865527 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 640840286 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_001376012 |
Protein GI | 152976495 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
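The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive (which matches the listed gene length), and the helper names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
START_BP = 2864223
END_BP = 2865527


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1305
print(protein_length(length_bp))  # 434
```

Both results agree with the Gene Length (1305 bp) and Protein Length (434 aa) fields. The strand field does not affect this calculation, since length is independent of orientation.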
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000152618 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATGG TGGACCTAAT TGCCAAAAAA CGTGATGGAC ATGCGCTAAC AACAGAAGAA ATTAATTTTA TTGTTGAAGG ATTTACAAAC GGTGATATTC CTGATTATCA AATGAGCTCT TTTGCAATGG CGATTTTCTT TCAAGATATG AATGAACAAG AGCGTGCTGA TTTAACAATG GCAATGGTAA ATAGCGGTGA TACAATTGAT CTCTCAGCAA TTGAAGGGAT AAAGGTAGAT AAGCATTCTA CAGGTGGCGT TGGGGATACA ACTACGCTTG TACTAGGTCC GTTAGTAGCT GCTTTAGGTG TACCAGTTGC AAAAATGTCT GGACGTGGTT TAGGACATAC TGGTGGTACA ATTGATAAAT TAGAAGCTGT GCCAGGATTC CATGTGGAAA TTGAAAATGA AGAGTTTATT CGCCTTGTAA ATGAAAATAA AATTGCTGTT ATTGGACAAA GTGGGAACTT AACACCTGCT GATAAGAAGT TATACGCACT TCGTGATGTA ACAGCGACGG TAAACTCTAT ACCACTTATC GCAAGTTCTA TTATGAGTAA AAAAATTGCT GCAGGTGCAG ATGCGATCGT TCTAGATGTA AAAACTGGTG CGGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTCT TGCAGAAGCA ATGGTGCGTA TTGGAAATAA TGTAGGCCGT AAGACAATGG CAGTTATTTC GGATATGAGT CAACCGCTTG GTGAAGCAAT CGGTAACGCG TTGGAAGTAC AAGAAGCAAT TGATACATTG CAAGGTAAAG GTCCAAAAGA TTTAGAAGAG CTATGTTTAA CACTTGGAAG TCAAATGGTA TACCTTGCTG GTAAAGCATC TTCTTTAGAA GATGCACGTA ATAAACTTAT TGAAGTAATG AATAATGGAA AAGCGTTAGA CACATTTAAA TTATTTTTAG CAGCGCAAGG CGGAGATGCT TCAGTTATTG ATGACCCTTC TAAATTGCCA CAAGCTAAAT ATAAAATTGA AGTTGAAGCA AAAGAAGACG GATATGTGTC TGAGATTGTG GCAGATGAAA TCGGGACAGC AGCAATGCTT TTAGGTGCTG GACGTGCAAC GAAAGAATCT GAGATTGATT TAGCGGTTGG CCTTATGCTT CGGAAAAAAG TTGGCGATAG CGTAAAACAA GGTGAATCGC TTGTAACAAT TTATGCAAAC CGTGAAAATG TAGAAGATGT GAAGACGAAG ATTTATGAGA ATATAAAAAT TACAAAAAAT CATGTCAAAG CACCTACATT AGTACATGGT ATTGTAACGA AATAA
|
Protein sequence | MRMVDLIAKK RDGHALTTEE INFIVEGFTN GDIPDYQMSS FAMAIFFQDM NEQERADLTM AMVNSGDTID LSAIEGIKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT IDKLEAVPGF HVEIENEEFI RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR KTMAVISDMS QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGKASSLE DARNKLIEVM NNGKALDTFK LFLAAQGGDA SVIDDPSKLP QAKYKIEVEA KEDGYVSEIV ADEIGTAAML LGAGRATKES EIDLAVGLML RKKVGDSVKQ GESLVTIYAN RENVEDVKTK IYENIKITKN HVKAPTLVHG IVTK
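The gene and protein sequences above can be cross-checked with a short script. The following is a minimal sketch under stated assumptions, not the database's own tooling: the function names (`clean`, `translate`, `gc_fraction`) are hypothetical, the sequence is assumed to be given in coding orientation (it begins with ATG even though the gene lies on the minus strand), and the codon-to-residue mapping used is the standard one, which translation table 11 shares for internal codons. Table 11 also permits alternative start codons that are read as methionine at the initiation position; that case is not handled here because this gene starts with ATG.

```python
# Standard codon mapping in TCAG order; translation table 11 uses the same
# mapping for internal codons and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence above.
print(translate("ATGAGAATGG TGGACCTAAT TGCCAAAAAA"))  # MRMVDLIAKK
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the GC content field to be checked in the same way.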