Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B5704 |
Symbol | |
ID | 7186467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 5002173 |
End bp | 5003123 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643553024 |
Product | inosine-uridine preferring nucleoside hydrolase family protein |
Protein accession | YP_002448666 |
Protein GI | 218900255 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00697028 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 88 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAA AAGTTCTCAT TTTTTGTGAT CCTGGGATTG ATGATACGAT GGCTCTCCTC TTAGCTTTCT TTATTGATGA AATAGAAATT ATCGGTATTG TTGCTGATTA CGGCAATGTC CCGAAAAAAA TGGCCGTACA AAATGCTCAT TTTTTTAATA ACGAAACAAA GAATAGAAAT ATCAAGATAT TCGGTGGTTC AGAACGTCCT CTTACTGGTG CCCCACCTGC GTTTTTTACG GATGTACATG GGAAACAGGG GCTCGGGCCA ATTATTCCAA AGGTAAATGT GACTAACGGA GAAATGGAGA ATTTTTTTGA AGTTATTCCT CTTATTGAGC AGTATAAAGA TGAATTAATC ATTGTAAGTT TAGGAAGACT TACCTCCCTA GCAATTTTAT TCATCGTATG TAAACAGTTA ATGAAGCAAG TTAAATCTTA CTACGTAATG GGCGGTGCCT TTTTACACCC TGGTAATGTT ACCCCTATTT CCGAAGCAAA CTTTTATGGC GATCCTACTG CTGCTAATAT AGTCCTCCAA TCTACAGCTA ACATGTACAT ATACCCATTA AACGTCACCC AATATTCCGT CATTACACCC GAGATGGCGG AGTATATTGA AACAAAAGGA AAAGCCCCAC TTGTCAAACC TTTATTCGAT CATTATTACT ACGGATATTA TAAAGACGCC CTACCACATT TAAAGGGTAG CCCCTTCCAT GACACAATGC CAATACTCGC TTTACTTGAT AACTCTATGT TTACCTATCA CAAATCACCT ATCGTTGTCA TGACAGAATC TTATGCGCAG GGGGCAAGCA TTGGAGAATT TCGCTCTTTA GGAGAATCTA AGCCATTTAT TGATTGGCCG AGTCATCAAA TCGCAATTGA TTTTGATTAT AACCGCTTCT TCAAACATTT CATGTCACTT ATGACGGGCG AACAATTTTA G
|
Protein sequence | MPKKVLIFCD PGIDDTMALL LAFFIDEIEI IGIVADYGNV PKKMAVQNAH FFNNETKNRN IKIFGGSERP LTGAPPAFFT DVHGKQGLGP IIPKVNVTNG EMENFFEVIP LIEQYKDELI IVSLGRLTSL AILFIVCKQL MKQVKSYYVM GGAFLHPGNV TPISEANFYG DPTAANIVLQ STANMYIYPL NVTQYSVITP EMAEYIETKG KAPLVKPLFD HYYYGYYKDA LPHLKGSPFH DTMPILALLD NSMFTYHKSP IVVMTESYAQ GASIGEFRSL GESKPFIDWP SHQIAIDFDY NRFFKHFMSL MTGEQF
|
| |