Gene Bcer98_2784 details

Gene Information

Locus tag: Bcer98_2784
Symbol:
ID: 5343786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 2864223
End bp: 2865527
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 38%
IMG OID: 640840286
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_001376012
Protein GI: 152976495
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02644] pyrimidine-nucleoside phosphorylase
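
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable and function names are hypothetical, not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2864223
end_bp = 2865527
gene_length_bp = 1305
protein_length_aa = 434

# Assumption: coordinates are inclusive at both ends,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the last one is assumed
# to be the stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1


def record_is_consistent():
    """Return True if the length fields agree with the coordinates."""
    return (
        span == gene_length_bp
        and span % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span, expected_protein_length, record_is_consistent())
```

Under these assumptions the span is 1305 bp and the expected protein length is 434 aa, which agrees with the Gene Length and Protein Length fields.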


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000152618
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATGG TGGACCTAAT TGCCAAAAAA CGTGATGGAC ATGCGCTAAC AACAGAAGAA 
ATTAATTTTA TTGTTGAAGG ATTTACAAAC GGTGATATTC CTGATTATCA AATGAGCTCT
TTTGCAATGG CGATTTTCTT TCAAGATATG AATGAACAAG AGCGTGCTGA TTTAACAATG
GCAATGGTAA ATAGCGGTGA TACAATTGAT CTCTCAGCAA TTGAAGGGAT AAAGGTAGAT
AAGCATTCTA CAGGTGGCGT TGGGGATACA ACTACGCTTG TACTAGGTCC GTTAGTAGCT
GCTTTAGGTG TACCAGTTGC AAAAATGTCT GGACGTGGTT TAGGACATAC TGGTGGTACA
ATTGATAAAT TAGAAGCTGT GCCAGGATTC CATGTGGAAA TTGAAAATGA AGAGTTTATT
CGCCTTGTAA ATGAAAATAA AATTGCTGTT ATTGGACAAA GTGGGAACTT AACACCTGCT
GATAAGAAGT TATACGCACT TCGTGATGTA ACAGCGACGG TAAACTCTAT ACCACTTATC
GCAAGTTCTA TTATGAGTAA AAAAATTGCT GCAGGTGCAG ATGCGATCGT TCTAGATGTA
AAAACTGGTG CGGGTGCATT TATGAAAACG GATGAAGATG CAAAACGTCT TGCAGAAGCA
ATGGTGCGTA TTGGAAATAA TGTAGGCCGT AAGACAATGG CAGTTATTTC GGATATGAGT
CAACCGCTTG GTGAAGCAAT CGGTAACGCG TTGGAAGTAC AAGAAGCAAT TGATACATTG
CAAGGTAAAG GTCCAAAAGA TTTAGAAGAG CTATGTTTAA CACTTGGAAG TCAAATGGTA
TACCTTGCTG GTAAAGCATC TTCTTTAGAA GATGCACGTA ATAAACTTAT TGAAGTAATG
AATAATGGAA AAGCGTTAGA CACATTTAAA TTATTTTTAG CAGCGCAAGG CGGAGATGCT
TCAGTTATTG ATGACCCTTC TAAATTGCCA CAAGCTAAAT ATAAAATTGA AGTTGAAGCA
AAAGAAGACG GATATGTGTC TGAGATTGTG GCAGATGAAA TCGGGACAGC AGCAATGCTT
TTAGGTGCTG GACGTGCAAC GAAAGAATCT GAGATTGATT TAGCGGTTGG CCTTATGCTT
CGGAAAAAAG TTGGCGATAG CGTAAAACAA GGTGAATCGC TTGTAACAAT TTATGCAAAC
CGTGAAAATG TAGAAGATGT GAAGACGAAG ATTTATGAGA ATATAAAAAT TACAAAAAAT
CATGTCAAAG CACCTACATT AGTACATGGT ATTGTAACGA AATAA
 
Protein sequence
MRMVDLIAKK RDGHALTTEE INFIVEGFTN GDIPDYQMSS FAMAIFFQDM NEQERADLTM 
AMVNSGDTID LSAIEGIKVD KHSTGGVGDT TTLVLGPLVA ALGVPVAKMS GRGLGHTGGT
IDKLEAVPGF HVEIENEEFI RLVNENKIAV IGQSGNLTPA DKKLYALRDV TATVNSIPLI
ASSIMSKKIA AGADAIVLDV KTGAGAFMKT DEDAKRLAEA MVRIGNNVGR KTMAVISDMS
QPLGEAIGNA LEVQEAIDTL QGKGPKDLEE LCLTLGSQMV YLAGKASSLE DARNKLIEVM
NNGKALDTFK LFLAAQGGDA SVIDDPSKLP QAKYKIEVEA KEDGYVSEIV ADEIGTAAML
LGAGRATKES EIDLAVGLML RKKVGDSVKQ GESLVTIYAN RENVEDVKTK IYENIKITKN
HVKAPTLVHG IVTK
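
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the tool used to produce this record: it builds the standard codon assignments (which translation table 11 shares for internal codons), ignores table 11's alternative start codons because this gene begins with ATG, and strips the spaces that separate the 10-base groups. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both start on a codon
# boundary because each full line holds 60 bases.
first_line = "ATGAGAATGG TGGACCTAAT TGCCAAAAAA CGTGATGGAC ATGCGCTAAC AACAGAAGAA"
last_line = "CATGTCAAAG CACCTACATT AGTACATGGT ATTGTAACGA AATAA"

print(translate(first_line))  # MRMVDLIAKKRDGHALTTEE
print(translate(last_line))   # HVKAPTLVHGIVTK
```

The two fragments translate to the first 20 and last 14 residues of the protein sequence listed above; the final TAA codon is the stop and contributes no residue.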