Gene Information |
Locus tag | GBAA_5338 |
Symbol | |
ID | 2814533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 4840837 |
End bp | 4841787 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637792012 |
Product | inosine-uridine preferring nucleoside hydrolase family protein |
Protein accession | YP_021997 |
Protein GI | 47530648 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.637949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAAAA AAGTTCTTAT TTTTTGTGAT CCTGGGATTG ATGATACGAT GGCTCTCCTC TTAGCATTCT TTATCGATGA AATAGAAATC ATCGGCATCG TTGCTGATTA CGGCAATGTT CCAAAGAAGA TGGCCGTACA AAATGCTCAT TTCCTTAAGA ATGAAACAAG GAATAGAAAT ATTAAGATAT TCGGTGGTTC AGAACGCCCT CTTACTGGTG CCCCGCCTGC TTTTTTTACA GAAGTACACG GGAAACAAGG GCTTGGGCCA ATTATTCCAA ATGGGAATGT GACTAACGGA GAAATGGAGA ATTTCTTTGA AGTCATTCCT CTTATTGAAC AGTATAAAGA TGAATTAATC ATTGTAAGTT TAGGAAGACT TACCTCTCTA GCAATTTTAT TTATTATGTG CAAACAGCTA ATGAAACAAA TTAAATCTTA TTACGTAATG GGCGGTGCCT TTTTACATCC TGGTAACGTT ACCCCTATTT CCGAAGCTAA CTTTTATGGC GACCCTACTG CTGCTAACAT TGTTCTTCAA TCCGCAGCTA ACATGTACAT ATACCCGTTA AACGTTACCC AATACTCCGT CATTACACCA GAAATGGCCG AGTACATTGA AGCAAAAGGA AAAGTCCCAC TCGTCAAACC ATTATTCGAT CATTATTACT ATGGATATTA TAAAAATGCC CTGCCAGATT TAAAAGGTAG CCCCTTCCAT GACACAATGC CAATACTCGC TTTACTTGAT AACTCTATGT TTACGTATCA CAAATCACCT ATCGTTGTCA TGGCAGAATC TTATGCACAA GGAGCAAGCA TTGGAGAATT TCGTTCCTTA GGAAAACCTA AACCATTTAT GGATTGGCCG AGTCATCAAA TCGCAATTGA TTTTGATTAT AACCGTTTCT TTAAACATTT CATGTCACTT ATGACAGGTG AGCAATTTTA A
Protein sequence | MPKKVLIFCD PGIDDTMALL LAFFIDEIEI IGIVADYGNV PKKMAVQNAH FLKNETRNRN IKIFGGSERP LTGAPPAFFT EVHGKQGLGP IIPNGNVTNG EMENFFEVIP LIEQYKDELI IVSLGRLTSL AILFIMCKQL MKQIKSYYVM GGAFLHPGNV TPISEANFYG DPTAANIVLQ SAANMYIYPL NVTQYSVITP EMAEYIEAKG KVPLVKPLFD HYYYGYYKNA LPDLKGSPFH DTMPILALLD NSMFTYHKSP IVVMAESYAQ GASIGEFRSL GKPKPFMDWP SHQIAIDFDY NRFFKHFMSL MTGEQF
| |
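The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length. The function names `gene_length`, `expected_protein_length`, and `ungroup` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def ungroup(sequence: str) -> str:
    """Remove the whitespace that splits a displayed sequence into blocks of 10."""
    return "".join(sequence.split())


if __name__ == "__main__":
    # Coordinates taken from the Start bp and End bp fields of this record.
    length = gene_length(4840837, 4841787)
    print(length, expected_protein_length(length))
    # Block-formatted sequences need ungrouping before their length is measured.
    print(len(ungroup("MPKKVLIFCD PGIDDTMALL")))
```

Under these assumptions the coordinates give 951 bp and 316 residues, which agrees with the Gene Length and Protein Length fields above.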