Gene BCG9842_B5704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCG9842_B5704 
Symbol 
ID7186467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus G9842 
KingdomBacteria 
Replicon accessionNC_011772 
Strand
Start bp5002173 
End bp5003123 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content38% 
IMG OID643553024 
Productinosine-uridine preferring nucleoside hydrolase family protein 
Protein accessionYP_002448666 
Protein GI218900255 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00697028 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAAA AAGTTCTCAT TTTTTGTGAT CCTGGGATTG ATGATACGAT GGCTCTCCTC 
TTAGCTTTCT TTATTGATGA AATAGAAATT ATCGGTATTG TTGCTGATTA CGGCAATGTC
CCGAAAAAAA TGGCCGTACA AAATGCTCAT TTTTTTAATA ACGAAACAAA GAATAGAAAT
ATCAAGATAT TCGGTGGTTC AGAACGTCCT CTTACTGGTG CCCCACCTGC GTTTTTTACG
GATGTACATG GGAAACAGGG GCTCGGGCCA ATTATTCCAA AGGTAAATGT GACTAACGGA
GAAATGGAGA ATTTTTTTGA AGTTATTCCT CTTATTGAGC AGTATAAAGA TGAATTAATC
ATTGTAAGTT TAGGAAGACT TACCTCCCTA GCAATTTTAT TCATCGTATG TAAACAGTTA
ATGAAGCAAG TTAAATCTTA CTACGTAATG GGCGGTGCCT TTTTACACCC TGGTAATGTT
ACCCCTATTT CCGAAGCAAA CTTTTATGGC GATCCTACTG CTGCTAATAT AGTCCTCCAA
TCTACAGCTA ACATGTACAT ATACCCATTA AACGTCACCC AATATTCCGT CATTACACCC
GAGATGGCGG AGTATATTGA AACAAAAGGA AAAGCCCCAC TTGTCAAACC TTTATTCGAT
CATTATTACT ACGGATATTA TAAAGACGCC CTACCACATT TAAAGGGTAG CCCCTTCCAT
GACACAATGC CAATACTCGC TTTACTTGAT AACTCTATGT TTACCTATCA CAAATCACCT
ATCGTTGTCA TGACAGAATC TTATGCGCAG GGGGCAAGCA TTGGAGAATT TCGCTCTTTA
GGAGAATCTA AGCCATTTAT TGATTGGCCG AGTCATCAAA TCGCAATTGA TTTTGATTAT
AACCGCTTCT TCAAACATTT CATGTCACTT ATGACGGGCG AACAATTTTA G
 
Protein sequence
MPKKVLIFCD PGIDDTMALL LAFFIDEIEI IGIVADYGNV PKKMAVQNAH FFNNETKNRN 
IKIFGGSERP LTGAPPAFFT DVHGKQGLGP IIPKVNVTNG EMENFFEVIP LIEQYKDELI
IVSLGRLTSL AILFIVCKQL MKQVKSYYVM GGAFLHPGNV TPISEANFYG DPTAANIVLQ
STANMYIYPL NVTQYSVITP EMAEYIETKG KAPLVKPLFD HYYYGYYKDA LPHLKGSPFH
DTMPILALLD NSMFTYHKSP IVVMTESYAQ GASIGEFRSL GESKPFIDWP SHQIAIDFDY
NRFFKHFMSL MTGEQF